You're seeing this page as if you were . The main menu is still yours, though. Exit from immersion
Ibrahima Matar GueyeIM

Ibrahima Matar Gueye

Cloud Data Engineer

€600/day
Paris, FR
3-7 years

Average response time: 1 hour

Freelancer profile translated to English.
Back to original language

About Ibrahima Matar

Data Engineer with over 6 years of experience, I have participated in numerous projects that have allowed me to build solid expertise in the Big Data universe.

I will assist you in the design and development of robust data pipelines, covering the entire lifecycle: from ingestion, cleaning, transformation, modeling, to data exposure for analytical or operational uses.

  • French

    Native or bilingual

Can work on-site
Paris (up to 50km)

Experience

  • DOCAPOSTE
    Senior Data Engineer
    DIGITAL AND IT
    February 2025 - June 2025 (4 months)
    Neuilly-sur-Seine, France

    Benchmarking of Data Platforms (Databricks, Snowflake)

    Objective: Conduct an in-depth comparative study of the main market data platforms, based on performance, cost, and feature criteria.

    • Definition and validation of evaluation criteria for the benchmark
    • Selection of a large and representative dataset for business use cases (complex joins, filtering, aggregations)
    • Design and implementation of reproducible test scenarios for each platform:
    • ingestion, transformation (ELT), aggregation, analytical querying, load testing.
    • Writing of a detailed comparative report including summary tables and graphs
    Environment: Azure Databricks, Snowflake, Azure DevOps (Repos, Pipeline), ADLS, Azure Data Factory, SQL, Spark, Python
    Snowflake Databricks Azure DevOps SQL Spark
  • Intermarché
    Senior Data Engineer
    AGRICULTURE
    March 2024 - January 2025 (10 months)
    Châtillon, France

    Migration from Teradata to Azure Databricks - SIC Intermarché France

    • Development of ingestion pipelines triggered by file drops from source applications into the DataHub (Blob Storage), using a feeding framework designed with Databricks and orchestrated via Azure Data Factory
    • Writing of interface contracts defining technical specifications and mutual commitments for data exchange between sources and the DataHub
    • Creation of business table DDLs on Databricks based on existing Teradata DDLs
    • Migration of business tables from Teradata to Databricks for reporting needs (cash register receipts, revenue, customers, cardholders, Intermarché France points of sale)
    • Development and population of calculated tables on Databricks, based on Teradata feeding scripts
    • Orchestration of Databricks notebooks via Azure Data Factory
    Environment: Azure Databricks, Azure DevOps (Repos, Pipeline), ADLS, Azure Data Factory, SQL, Spark, Python
    Databricks Spark SQL Python Microsoft Azure Azure Data Factory
  • LA POSTE
    Data Engineer
    BANKING AND INSURANCE
    February 2023 - February 2024 (1 year)
    Issy-les-Moulineaux, France

    Project 1: Migration of Digicompta (Cloudera on-premise to Databricks)

    • Resource creation: keyvaults, premium Databricks workspace, ADLS gen 2,
    • Upgrade of Spark 2 code to Spark 3 to ensure compatibility with Databricks Runtime
    • Migration of Airflow DAGs to Azure Data Factory to orchestrate our job pipelines, thus replacing the use of Airflow
    • Implementation of a post-migration testing strategy:
    • Comparison of results between source (Cloudera) and target (Databricks) environments on representative samples.
    • Validation of volumes, business rules, and aggregates
    • Creation of non-regression reports and gap analysis.

    Project 2: C3S (development of indicators to evaluate the effectiveness of the call system by factors in the delivery of signed or taxed mail)

    • Development of an ingestion pipeline on Databricks for daily processing of flat files deposited in a storage account
    • Implementation of a medallion architecture (Bronze / Silver / Gold):
    • Bronze: For ingesting raw files as-is into Delta Lake for archiving and traceability.
    • Silver: For processing, cleaning, and normalizing data.
    • Gold: Calculation of performance indicators (signed delivery rate, failure rate, average response time), aggregation by geographical area and period.
    • Development of modular PySpark jobs for each layer
    • Storage optimization (partitioning, compaction, Z-Ordering) to speed up downstream queries.

    Environment: Azure Databricks, Azure DevOps, Spark, ADLS Gen2, Azure Data Factory, SQL, Python
    Spark Azure Databricks Azure DevOps Azure Data Factory Azure Data Lake Storage

Recommendations

Be the first to recommend Ibrahima Matar

Help this freelancer shine by sharing your experience working together.

These freelancer profiles also match your criteria

AgathaA

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

2

BaptisteB

Baptiste Duhen

Fullstack developer

4.6

(4)

5

AmedA

Amed Hamou

Senior Lead Developer

4

(2)

7

AudreyA

Audrey Champion

Web developer

4.3

(3)

4

Education

  • Master of Statistics for New Data
    Université Paris Est Marne la Vallée
    2017
  • Bachelor's Degree in Mathematics and Computer Science
    Université Paris Est Marne la Vallée
    2015

Certifications

Skill set

Categories