You're seeing this page as if you were . The main menu is still yours, though. Exit from immersion
Younes K.YK

Younes K.

Senior AI System Architect

€750/day
Paris, FR
8-15 years

Average response time: 1 hour

About Younes

PhD in Computer Science / Telco (IMT Atlantique). Technical co-founder with a decade of experience designing and shipping production AI systems: from ML pipelines at Sanofi to co-founding Namla.cloud, a Kubernetes-native, multi-tenant orchestration platform for cloud-and-edge workloads, with NVIDIA partnerships and SD-WAN integration. Prior to that, I worked several years as a low level Telco Research Engineer developing 4G/5G technical systems.

Core areas of expertise:

LLM systems & agent architecture: agent orchestration, multi-layer memory design, RAG and GraphRAG (cuGraph, NeMo Retriever, Nemotron Embed/Rerank)
Inference & performance engineering: Paged Attention, Flash Attention, KV cache optimization, speculative decoding, context-window strategy, cost-tiered model routing
Low-level systems: transformer internals (attention mechanics, KV cache layout), Linux systems and networking, performance profiling, kernel-level reasoning about throughput and latency
AI infrastructure & edge: Kubernetes, multi-tenant SaaS, GPU orchestration, edge fleets on Jetson, NVIDIA stack (NIMs, NemoClaw, Jetson, Brev)
Production stack: Python 3.12 / FastAPI, Postgres + pgvector, Next.js 15, SSE, APScheduler + Redis, Cloudflare Tunnel, VPS-to-scale architectures

How I work:

Comfortable as a Forward Deployed Engineer, Solutions Architect, or technical lead — equally at home architecting AI systems end-to-end, shipping production code, and translating between business stakeholders and deep technical teams. French/English bilingual, based in Paris, Open to remote engagements.
  • French

    Native or bilingual

  • English

    Native or bilingual

  • Arabic

    Native or bilingual

Remote only
Primarily works remotely

Experience

  • Namla
    Technical Co-founder
    SOFTWARE PUBLISHING
    January 2022 - Today (4 years and 5 months)
    Paris, France
    Co-founded Namla.cloud and led the technical build-out of a Kubernetes-native, multi-tenant orchestration platform for cloud and edge workloads, with integrated SD-WAN networking and an NVIDIA partnership.

    Defined the platform architecture end-to-end: multi-tenant control plane, distributed agent runtime, networking stack, and edge-to-cloud orchestration model.
    Led a team of 4 engineers building the Namla Orchestrator backend (microservices architecture, Kubernetes operators, gRPC/REST APIs); owned engineering roadmap, technical hiring, and code review culture.
    Owned the NVIDIA Jetson support layer — adapting the networking stack and agent runtime across multiple vendor hardware form factors and Jetson SKUs; shipped GPU-aware workload scheduling and remote lifecycle management for edge fleets.
    Drove the technical relationship with NVIDIA Jetson and Metropolis teams (platform alignment, joint roadmap, co-marketing); platform architecture mentored by Sébastien Pahl (Docker co-founder), investor and board member.
    Anchored technical credibility on strategic accounts: architecture reviews, deep-dive workshops, and on-site deployment with enterprise customers in telco, industrial, and defense.
    Kubernetes Edge Computing Linux LLM Python
  • Sanofi Pasteur
    Lead ML Engineer
    PHARMACEUTICALS INDUSTRY
    September 2020 - June 2022 (1 year and 9 months)
    Rouen, France
    Embedded ML engineering lead in pharmaceutical vaccine manufacturing — a regulated, high-stakes (GxP) environment requiring rigorous validation and production-grade reliability. End-to-end ownership of ML pipelines from raw industrial data through model training to production serving.

    Designed and shipped production ML pipelines (Python, TensorFlow, OpenCV) for visual quality inspection and process optimization on the vaccine manufacturing line.
    Applied NLP to virus sequence analytics for vaccine manufacturing optimization — processing large biological datasets to surface insights that directly informed production decisions.
    Integrated ML workflows with AWS and enterprise data systems; operated autonomously across manufacturing, quality assurance, and data engineering stakeholders under GxP regulatory constraints.
    Machine learning Python TensorFlow Computer Vision NLP
  • Mantu
    Machine Learning Engineer
    DIGITAL AND IT
    April 2019 - September 2020 (1 year and 5 months)
    Nice, France
    ML engineer in Mantu's innovation lab, owning the full ML lifecycle from training through production API serving for large-scale document intelligence.

    Built an end-to-end NLP pipeline processing 100k+ resumes at scale: document ingestion, embedding generation, semantic matching (vector similarity search), and candidate–job relevancy scoring — deployed to production as RESTful inference services.
    Designed asynchronous data pipelines (RabbitMQ) and a graph-based matching system (Neo4j) to power candidate–job recommendations.
    Iterated rapidly with business stakeholders to align model outputs with real operational requirements.
    Machine learning Python Deep Learning MongoDB Neo4j

Recommendations

Be the first to recommend Younes

Help this freelancer shine by sharing your experience working together.

These freelancer profiles also match your criteria

AgathaA

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

2

BaptisteB

Baptiste Duhen

Fullstack developer

4.6

(4)

5

AmedA

Amed Hamou

Senior Lead Developer

4

(2)

7

AudreyA

Audrey Champion

Web developer

4.3

(3)

4

Education

  • Doctorat
    IMT Atlantique
    2016

Skill set

Categories