About Julien
- **Backend & Infra**: Python, FastAPI, Pydantic v2, asyncio, Kafka, S3, PostgreSQL, Redis.
- **LLM Engineering**: LangGraph, Function Calling, Schema-first JSON, Prompt Engineering, Multi-model routing (Gemini 2.5, OpenAI).
- **LLMOps & Monitoring**: Token/cost tracking, Rate-limiting, DLQ, Prometheus, Grafana.
- **Deployment**: Docker, Kubernetes (K8s), AWS (EKS, Bedrock), vLLM, RunPod, CI/CD.
- **Extraction Pipeline (Scale-up SOLV)**: Replaced an unstable system with a minimalist Kafka architecture. 50k+ docs processed, 99.2% success, API costs divided by 3.
- **AI Constraint Clustering**: Designed a **scalable hybrid algorithm** where DBSCAN/K-Means failed semantically.
- **Automation (Venio AI)**: **Production-delivered** agent platform via OpenAPI spec.
- French: Native or bilingual
- English: Fluent
- Spanish: Fluent
Experience
- **SOLV**, Production LLM Engineer, October 2025 - Today (8 months), Brussels, Belgium
  Belgian scale-up in stakeholder analytics & risk management for complex infrastructure projects.
  - **LLM Document Extraction Pipeline**: Complete reconstruction of an unstable extraction pipeline (Redis + embeddings + RAG + premium models, crashing at 10+ docs) using a minimalist asynchronous Kafka system in Python/FastAPI. Result: 50,000+ documents processed, 99.2% success, cost reduced by 3x.
  - **Constrained Clustering Algorithm**: Design and implementation of a hybrid algorithm: feature extraction via LLM (orientation, entities, nature) injected as penalties into the distance matrix before hierarchical clustering. Solved the limitations of two previous attempts (DBSCAN, HDBSCAN+K-Means).
  - **Multi-model Routing & LLMOps**: Intelligent routing Gemini Flash ↔ Gemini 2.5 Pro (OpenAI fallback), with selection based on complexity/cost. Production Prometheus/Grafana dashboards (p95 latency, costs, extraction density), rate-limiting, exponential-backoff retries, DLQ.
- **Venio AI**, AI Engineer, February 2025 - September 2025 (7 months), Reggio Emilia, Italy
  Startup specializing in AI agent automation for non-tech companies.
  - **Conversational Agent Platform**: Built a platform for LLM agents in Python/FastAPI: the system understands user needs in natural language, generates a suitable agent, and exposes a ready-to-use API endpoint. Automatic generation of agent tools from OpenAPI specs.
  - **Benchmarking & Deployment**: Benchmarking suite (accuracy, cost, latency) to compare LLM models and prompts before production deployment. Automated Docker/Kubernetes deployments via GitLab CI/CD.
- **ONECLICKHIRED**, Founder, January 2025 - September 2025 (8 months)
  AI SaaS: CV parsing + automated personalized outreach. Full stack built solo: React/TS, Fastify, PostgreSQL, Redis/BullMQ, Stripe. Multi-provider LLM integration (Gemini + OpenAI), reliable asynchronous jobs. 150 sign-ups.
Education
- Engineer, AI, EPITA, 2025
- MP, CPGE N.D. de Sion, 2022