Freelancer profile translated to English.

Description

I hold a Ph.D. in Computer Science and am a Databricks Certified Data Engineer Professional. I have over 10 years of experience in analyzing, processing, and deriving value from data, with proven expertise in designing and industrializing big data solutions.

I specialize in the design, modeling, optimization, and orchestration of data pipelines, primarily on Spark, SQL, Python, Scala, Databricks, Azure, AWS, Hadoop, Hive, and Airflow.

Specifically, I work across the entire data chain:

• Medallion Architecture (Bronze / Silver / Gold): design and implementation

• Multi-source ingestion, streaming or batch, to the Bronze layer

• ETL pipelines: modeling and normalization of the Silver / Gold layers, quality controls, and automated, optimized flows that feed BI and analytics systems

• Processing orchestration

Languages

French: Native or bilingual
English: Fluent

Workplace preferences

Can work on-site in Paris (up to 50 km)

Experience

FRAMATOME
Senior Data Engineer
April 2025 - Present (1 year and 4 months)
Courbevoie, France
Senior Data Engineer responsible for ingesting, processing, and deriving value from project planning data from sources such as Primavera P6, Jira, and MS Project:
- Design and implementation of data ingestion pipelines within a Medallion architecture on Databricks
- Data modeling using star or snowflake schemas
- Data normalization to first, second, and third normal form (1NF, 2NF, 3NF)
- Development of KPIs for project performance monitoring (progress, costs, deadlines) and financial indicators
- Implementation of data quality controls
- Optimization of Spark processing performance
- Deployment and maintenance of ETL pipelines in a production environment
- Conversion of Power BI data transformations written in Power Query M into PySpark scripts for optimized execution on Databricks
- Connection of the Gold layer of the Medallion architecture to Power BI, giving dashboards a reliable, secure feed for real-time visualization of project KPIs and metrics
Azure Databricks, Azure DevOps, Azure Data Factory, PySpark
ENGIE SOLUTIONS
Tech Lead Data Engineer
ENERGY AND UTILITIES
January 2023 - December 2024 (2 years)
Bagneux, France
Senior Data Engineer responsible for implementing ingestion flows for electricity and gas consumption data:
- Design and implementation of data processing pipelines
- Processing and ingestion of various file formats (XML, JSON, CSV, Parquet, etc.)
- Implementation of streaming processes for real-time ingestion of data flows
- Implementation of data processing pipeline orchestrators
- Optimization of Spark processing performance: partition management, Spark configuration tuning, parallelization, caching, etc.
- Migration of Oracle data flows to Databricks
- Database management and optimization
- Production deployment of ETL pipelines
Databricks, Airflow, Python, Spark, Scala
Natixis
Senior Data Scientist/Engineer Consultant
BANKING AND INSURANCE
October 2020 - October 2022 (2 years and 1 month)
75013 Paris, France
Data Engineer/Scientist responsible for implementing data solutions that support compliance models for detecting fraud, money laundering, and terrorist financing:
- Implementation of extract, transform, load (ETL) data pipelines
- Implementation of models for suspicious transaction detection
- Implementation of models matching clients against politically exposed persons and persons on sanctions/embargo lists
- Segmentation of countries by money laundering and terrorist financing risk
- Implementation of data processing pipeline orchestrators
- Optimization of Spark processing performance: partition management, Spark configuration tuning, parallelization, caching, etc.
Python, PySpark, Hadoop

Education

Ph.D. in Mathematics / Computer Science
Université Pierre et Marie Curie
2017
Master's Degree in Probability and Random Models
Université Pierre et Marie Curie
2012

Certifications

Databricks Certified Data Engineer Professional
Databricks
https://credentials.databricks.com/de748cb7-2c65-4cee-865b-9eb1898da47a#acc.I9Ty8ijH

Skill set

Python Programming, PySpark, Databricks, Spark Streaming, Apache Spark, SQL

Cheick Sanogo

Senior Data Engineer | Databricks | Spark
