Freelancer profile translated to English.

Description

Data Engineer with over 6 years of experience, I have participated in numerous projects that have allowed me to build solid expertise in the Big Data universe.

I will assist you in the design and development of robust data pipelines, covering the entire lifecycle: from ingestion, cleaning, transformation, modeling, to data exposure for analytical or operational uses.

Industry field of expertise

Languages

French
Native or bilingual

Workplace preferences

Can work on-site

Paris (up to 50km)

DOCAPOSTE
Senior Data Engineer
DIGITAL AND IT
February 2025 - June 2025 (4 months)
Neuilly-sur-Seine, France
Benchmarking of Data Platforms (Databricks, Snowflake)
Objective: Conduct an in-depth comparative study of the main market data platforms, based on performance, cost, and feature criteria.

Definition and validation of evaluation criteria for the benchmark
Selection of a large and representative dataset for business use cases (complex joins, filtering, aggregations)
Design and implementation of reproducible test scenarios for each platform:
ingestion, transformation (ELT), aggregation, analytical querying, load testing.
Writing of a detailed comparative report including summary tables and graphs
Environment: Azure Databricks, Snowflake, Azure DevOps (Repos, Pipeline), ADLS, Azure Data Factory, SQL, Spark, Python
Snowflake Databricks Azure DevOps SQL Spark
Intermarché
Senior Data Engineer
AGRICULTURE
March 2024 - January 2025 (10 months)
Châtillon, France
Migration from Teradata to Azure Databricks - SIC Intermarché France
Development of ingestion pipelines triggered by file drops from source applications into the DataHub (Blob Storage), using a feeding framework designed with Databricks and orchestrated via Azure Data Factory
Writing of interface contracts defining technical specifications and mutual commitments for data exchange between sources and the DataHub
Creation of business table DDLs on Databricks based on existing Teradata DDLs
Migration of business tables from Teradata to Databricks for reporting needs (cash register receipts, revenue, customers, cardholders, Intermarché France points of sale)
Development and population of calculated tables on Databricks, based on Teradata feeding scripts
Orchestration of Databricks notebooks via Azure Data Factory
Environment: Azure Databricks, Azure DevOps (Repos, Pipeline), ADLS, Azure Data Factory, SQL, Spark, Python
Databricks Spark SQL Python Microsoft Azure Azure Data Factory
LA POSTE
Data Engineer
BANKING AND INSURANCE
February 2023 - February 2024 (1 year)
Issy-les-Moulineaux, France
Project 1: Migration of Digicompta (Cloudera on-premise to Databricks)
Resource creation: keyvaults, premium Databricks workspace, ADLS gen 2,
Upgrade of Spark 2 code to Spark 3 to ensure compatibility with Databricks Runtime
Migration of Airflow DAGs to Azure Data Factory to orchestrate our job pipelines, thus replacing the use of Airflow
Implementation of a post-migration testing strategy:
Comparison of results between source (Cloudera) and target (Databricks) environments on representative samples.
Validation of volumes, business rules, and aggregates
Creation of non-regression reports and gap analysis.
Project 2: C3S (development of indicators to evaluate the effectiveness of the call system by factors in the delivery of signed or taxed mail)
Development of an ingestion pipeline on Databricks for daily processing of flat files deposited in a storage account
Implementation of a medallion architecture (Bronze / Silver / Gold):
Bronze: For ingesting raw files as-is into Delta Lake for archiving and traceability.
Silver: For processing, cleaning, and normalizing data.
Gold: Calculation of performance indicators (signed delivery rate, failure rate, average response time), aggregation by geographical area and period.
Development of modular PySpark jobs for each layer
Storage optimization (partitioning, compaction, Z-Ordering) to speed up downstream queries.

Environment: Azure Databricks, Azure DevOps, Spark, ADLS Gen2, Azure Data Factory, SQL, Python
Spark Azure Databricks Azure DevOps Azure Data Factory Azure Data Lake Storage

Check out Ibrahima Matar's experience

Be the first to recommend Ibrahima Matar

Help this freelancer shine by sharing your experience working together.

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

Baptiste Duhen

Fullstack developer

4.6

(4)

Amed Hamou

Senior Lead Developer

(2)

Audrey Champion

Web developer

4.3

(3)

Signup to reveal

Master of Statistics for New Data
Université Paris Est Marne la Vallée
2017
Bachelor's Degree in Mathematics and Computer Science
Université Paris Est Marne la Vallée
2015

Databricks Certified Data Engineer Professional
Databricks
https://credentials.databricks.com/35b722df-039d-43d8-9afe-3579020f7830#acc.FtvTOKN6
Data Pipelines Lakehouse production Alerting Delta Lake Deployment Data Modeling Spark ETL Testing
Databricks Certified Data Engineer Associate
Databricks
https://credentials.databricks.com/aef46aac-8ac7-4bd9-a6ec-e20833c1e4af#acc.AVsaGwBg
Python 3 Data Pipelines Lakehouse Delta Lake Databricks ETL Delta Live Tables Apache Spark SQL

Data Engineer

AI engineer

Ibrahima Matar Gueye

Cloud Data Engineer

About Ibrahima Matar

Experience

Benchmarking of Data Platforms (Databricks, Snowflake)

Migration from Teradata to Azure Databricks - SIC Intermarché France

Project 1: Migration of Digicompta (Cloudera on-premise to Databricks)

Project 2: C3S (development of indicators to evaluate the effectiveness of the call system by factors in the delivery of signed or taxed mail)

Recommendations

These freelancer profiles also match your criteria

Education

Certifications

Skill set

Categories