Khalil Sagoumi

Data Scientist | ML Engineer | Python | GenAI

€490/day
Paris, FR
3-7 years

Average response time: 1 hour


About Khalil

Data and statistics engineer, graduate of the Léonard de Vinci Engineering School (Data & Artificial Intelligence major). I have 4 years of experience as a Data Scientist / ML Engineer, first at Orange and then at SAMU Centre 15 des Yvelines (Hôpital André Mignot). I would be happy to help with your projects, from simple data exploration to building complex models.


Technologies and frameworks

Programming languages: Python, PySpark, C, C++, R, SQL
Data Science / ML: Python, PyTorch, TensorFlow, Spark, Elasticsearch, Lightning, ClearML, OpenCV, Scikit-learn, LlamaIndex, LangChain, Neo4j, Multiprocessing, NumPy, SciPy, Pandas, Matplotlib, Selenium, AutoML, H2O, Flask API
Software / Cloud: Git, Azure, GCP, Docker, Kubernetes, Jupyter, VSCode, Databricks
Languages

  • French: native or bilingual
  • English: fluent

Can work on-site
Paris (up to 30km)

Experience

  • Milhano SAS
    Data scientist ML Engineer
    LUXURY GOODS
    November 2024 - April 2025 (5 months)
    Paris, France
    - Development of an AI chatbot (RAG) for salespeople, enriched with product data, that provides instant, detailed product descriptions (composition, leather types, characteristics) via a web API (FastAPI)
    - Development of a data enrichment pipeline that extracts key fields from invoices with Qwen 2.5 (vLLM), processes them with Polars, and loads them into a DuckDB database
    - Creation of dashboards that analyze sales by time of day, season, and product purchased (Power BI)
    RAG FastAPI LLM DuckDB CAG
  • SAMU 78
    Data Scientist / ML Engineer
    HEALTH AND WELLNESS
    December 2022 - July 2024 (1 year and 8 months)
    Le Chesnay, France
    • Definition of the data valorization strategy and implementation of data processing tools with SAMU management (Data Governance)
    • Implementation of the new cloud architecture for the data warehouse on GCP, and data migration (Oracle, BigQuery)
    • Creation of a medical data processing pipeline with anonymization, used to develop an algorithm that predicts the seasonality of certain pathologies (flu, bronchiolitis, psychiatric pathologies) in order to adapt medical resources (Dataflow, time series, LSTM, Gradient Boosting)
    • Extraction, transformation, and loading of data from medical regulation reports to clean, validate, and organize it for statistical studies
    • Construction of algorithms to cluster patient journey phenotypes and predict potential re-hospitalizations (Partitioning Around Medoids (PAM), Python)
    • Creation and deployment of dashboards that analyze the distribution of calls by pathology in the Yvelines department (Tableau / Power BI)
    Cloud GCP Python Machine learning NLP Git GitHub Actions CI/CD Docker
  • Orange SA
    Data Scientist
    TELECOMMUNICATIONS
    October 2020 - October 2022 (2 years)
    Paris, France
    • Implementation of HiveQL queries to process and analyze data from the Data Hub (Apache Hive, Hadoop)
    • Optimization of data pipelines to reduce costs (Airflow)
    • Development of an RNN (LSTM) to detect incidents and malfunctions in technical domains (IoT, Roaming, and Wholesale Offers)
    • Creation of KPIs and operational dashboards (Tableau Software, Grafana)
    • Automation of mobile data dashboards (Qlik Sense)

Education

  • General Engineer
    ESILV
    2022
    Machine Learning, Deep Learning, NLP, NoSQL, Python for Data, DataViz, Data Statistics, Databases and Interoperability, Numerical Probability, Inferential Statistics, Optimization and Operations Research, Cloud and Virtualization Techniques, Graph and Mining. Soft skills: pitching ideas, team building, Agile methodology, and Design Thinking
  • Preparatory class MPSI/MP
    Lycée Jeanne d'Albret
    2019
