You're seeing this page as if you were . The main menu is still yours, though. Exit from immersion
Arnault G.AG

Arnault G.

Data Scientist - Machine Learning - NLP - AI - LLM

€900/day
26 projects
Paris, FR
8-15 years

Average response time: 2 hours

Freelancer profile translated to English.
Back to original language

About Arnault

🔍 Data Scientist & AI / Machine Learning Engineer, 10 years of experience

💶 CII Approval: Benefit from a 20% tax credit on my services related to your R&D projects.

🧠 Specialties: NLP, AI, Data Science, Machine Learning, Deep Learning, Large Language Models (LLM / ChatGPT).

🌐 Experience: Edtech, Legaltech, Smart City, Greentech, Fintech, Medtech. Team Lead.


🌟 Focus:
- Social Impact
- Modeling: R&D, PoC, Monitoring
- Management & Strategy
- Data Project Supervision.

💡 Expertise / Services:
- Classification and prediction
- Data Visualization
- OCR / Document Transcription (digital documents)
- Textual Trend Exploration
- Entity or Information Extraction in texts
- Customer Churn Detection
- Customer Segmentation
- Image Analysis
- Anomaly Detection
- Dynamic Pricing
- Time Series Forecasting
- Prompt Engineering
- LLM/ChatGPT Adaptation / Fine-tuning

🎓 Degrees: ENSAE Paris, Paris Saclay.

🌍 Remote work, available for travel. Weekly follow-up.

🌐 Recent clients: WHO, Silvr, World Bank or Kalent AI.
  • English

    Fluent

  • Spanish

    Fluent

  • French

    Native or bilingual

Remote only
Primarily works remotely

Experience

  • beta.gouv.fr
    AI Engineer - Legal RAG
    LEGAL
    September 2025 - Today (9 months)
    Londres, United Kingdom
    Legal RAG, LLM Agents & Production AI Systems

    I participated in the design and deployment of an AI legal assistant platform for the Conseil d'État (Beta.gouv), combining LLM, retrieval augmented, and language model fine-tuning technologies.

    Key achievements of the Jacepair project:
    • Hybrid RAG Architecture — Search system combining BM25 sparse and dense embeddings with Reciprocal Rank Fusion, on a corpus of 1.6M+ legal articles and 110K+ court decisions
    • Multi-source ingestion pipeline — Automated integration of LEGIFRANCE (codes, laws, decrees), ArianeWeb (administrative case law), and ConsiliaWeb (advisory opinions), with versioning and change tracking
    • Intelligent extraction of legal references — LLM + regex system to automatically identify and structure citations of articles, laws, ordinances, and decisions in legal documents
    • Multi-LLM provider abstraction — Flexible architecture supporting OpenAI, Mistral AI, and Albert (French sovereign LLM) via LiteLLM, with Pydantic validation for structured outputs
    • Production-ready infrastructure — Docker Compose deployment with PostgreSQL/pgvector, Qdrant, async FastAPI, Streamlit, and optimized connection management (AsyncPG, pooling up to 50 connections)
    artificial intelligence LLMs AI Python Data Science
  • kwarto
    Malt logoOn Malt
    AI Engineer - Technical Document Extraction
    MECHANICAL ENGINEERING
    September 2025 - September 2025
    Londres, United Kingdom
    I carried out a mission as a Data Scientist / ML / NLP Engineer. The project aimed to implement a solution for extracting technical entities from PDF documents related to telecommunication installations.

    - Extraction of technical information from PDFs
    - Deployment of open-source LLMs on OVH
    - Implementation of an automatic extraction evaluation system (via MLFlow)
    - Development of the fine-tuning system
    - Development of the annotation/correction platform
    NLP LLM Machine Learning artificial intelligence Data Science
  • LaReserve.tech
    AI Engineer (Volunteering)
    PUBLIC SECTOR
    March 2025 - Today (1 year and 3 months)
    Londres, United Kingdom
    RAG Fact-Checking, Multi-Source Search & Adversarial Evaluation

    Description: Development of production-ready RAG fact-checking systems.

    I participated in the design and deployment of a comprehensive automated fact-checking platform combining multi-source search, structured generation, relevance evaluation, and iterative red teaming.

    Key achievements of the Vera project:
    • Multi-Source RAG Architecture — Dual search pipeline (fact-checking + generic sources) with Google CSE, priority system, automatic fallback, and configurable temporal filtering
    • Structured Query Generation via LLM — Tool-calling architecture designed to maximize recall, multilingual support
    • Retriever Metrics System — LLM evaluation of source relevance, baseline comparison, and tracking over time
    • Source Analysis with Embeddings — Detection of unknown sources (hallucination indicator), measurement of recall and precision
    • Iterative Red Teaming — Orchestration of adversarial attacks in 4 phases (generation → execution → analysis → improvement), automatic refinement over N iterations
    • TypeScript/Python Bridge
    LLMs Python artificial intelligence NLP Machine Learning

Reviews

5.0

Out of 7 ratings

F

Fabrice

EzDEV

Reviewed on 3/18/2024

Very professional, competent and organized.
K

Ksenia

Equanimity

Reviewed on 12/4/2023

Although a freelancer, Arnault was a true member of our team. Available and agile, very thorough, always attentive, but above all Arnault quickly understood our needs and gave us excellent advice on the NLP project we were working on. He managed our English-speaking team very well, both on the business aspects and as a manager.

Recommendations

MN
Marie BeigelmanMB
Arnaud RachezAR
+1
Maksym Nikolayev and 3 other people have recommended Arnault

These freelancer profiles also match your criteria

AgathaA

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

2

BaptisteB

Baptiste Duhen

Fullstack developer

4.6

(4)

5

AmedA

Amed Hamou

Senior Lead Developer

4

(2)

7

AudreyA

Audrey Champion

Web developer

4.3

(3)

4

Education

  • Engineer in Statistics and Economics
    ENSAE ParisTech
    2016
    Cours (Spécialisation) : • Machine Learning • Apprentissage par agrégation • Gestion de données massives • Programmation • Compressed Sensing • Méthodes séquentielles et chaînes de markov cachées • Méthodes statistiques pour l'économétrie • Econométrie Avancée pour les données qualitatives • Soundings • Modèles statistiques dynamiques avec variables cachées • Bootstrap & statistiques asymptotiques • Statistiques Bayesiennes • Machines Learning Avancée • Data Visualization. Cours (Deux premières années) : • Probability Theory • Advanced Statistics • Non-parametric Statistics • Time Series • Game Theory • Econometrics • Data Analysis • Monte-Carlo methods • Dynamic Optimization • Stochastic Calculus • Macroeconomics
  • Master II - Quantitative Economics, specialization in decision theory
    Université Paris-Saclay
    2016
    Cours : • Advances Econometrics for Qualitative Data • Semi Parametric and Non-Parametric Econometrics • Advanced Game Theory • Decision Theory • Monetary Economics • MacroFinance • Information, Transmissions and Communications in Games.

Skill set

Categories