You're seeing this page as if you were . The main menu is still yours, though. Exit from immersion
Chafiq M.CM

Chafiq M.

đŸ„‡ Data engineer: Spark, Databricks, Lakehouse, AI

€670/day
Paris, FR
8-15 years

Average response time: 1 hour

Freelancer profile translated to English.
Back to original language

About Chafiq

👋 Welcome to my Malt profile! 🚀

I am a Senior Data Engineer / Tech Lead (8+ years), expert in Databricks, Spark, Kafka, Airflow, and CI/CD (GitLab/Jenkins). I help teams deliver reliable, observable, and cost-effective pipelines, from batch to streaming, on-prem or cloud (Azure/AWS). Areas: banking/insurance, risk, fraud, regulatory, KPIs.

đŸ§‘â€đŸ’» Data Engineering & DevOps
‱ Design & build of end-to-end ETL/ELT pipelines (Delta/SQL/APIs)
‱ Streaming & event-driven with Kafka (Connect/Streams/Schema Registry)
‱ Performance & cost optimization (partitions, cache, AQE, Z-Order)
‱ Data quality, TDD tests, observability (metrics, SLA/SLO)
‱ Airflow orchestration, CI/CD, Docker packaging
‱ Kubernetes deployments, secrets management
‱ Evolutionary/corrective maintenance & debugging of critical jobs

đŸ’» Software Engineering
‱ Python/Scala/Java backends (FastAPI/REST/Spring Boot)
‱ Database & model integration (Delta Lake, Hive, Elasticsearch, Postgres)
‱ Data exposure: APIs, SQL, dashboards & KPIs

🎯 Key Achievements
‱ 🏩 CrĂ©dit Agricole Assurance: x30 performance increase on Spark (hours → minutes) + Spark 3.5 & Airflow migration, automations via GUI.
‱ 🏩 SociĂ©tĂ© GĂ©nĂ©rale: on-prem market fraud migration → Azure/Databricks, execution time Ă·30.
‱ 🏩 Natixis: optimized risk engine (VaR/CVaR), 95% TDD, jobs 8 min → 1 min.

🧰 Stack
Apache Spark 2/3, Databricks (Delta/Unity Catalog), Kafka, Airflow, Hadoop/Hive/HBase, Snowflake, DBT, Python, Scala/Java, SQL, GitLab/Jenkins/Sonar, Docker/Kubernetes, Azure & AWS, Cloudera/MapR.

📈 My requirement: robust, scalable, tested, and observable solutions, with quick wins from the 1st week.

Need a Data Engineering/DevOps/CI expert?
Write to me with use case, volume, stack: I'll come back with a concrete action plan.
I respond quickly (often < 1 hour).

Agencies/Consulting firms:specific conditions, please contact me.
  • English

    Fluent

  • German

    Basic

  • Spanish

    Basic

  • Arabic

    Native or bilingual

  • French

    Native or bilingual

Can work on-site
Paris (up to 50km), Paris (up to 100km)

Experience

  • CrĂ©dit Agricole Assurances
    Tech Lead - Senior Data Engineer
    BANKING AND INSURANCE
    January 2023 - Today (3 years and 5 months)
    Paris, France
    Context:Refactoring, migration, and evolution of several monthly and annual regulatory declarations (FICOVIE, IER, EAI CDC, EAI, FATCA) in a critical and regulated Big Data environment.

    Actions:
    • Coaching and mentoring of a Data Engineers team on Spark/Hadoop.
    • Implementation of development standards (Git, TDD, CI/CD, SonarQube).
    • Management of the Data platform migration: Oozie → Airflow orchestration, Spark 2.4 → Spark 3.5 migration, Java 8 → Java 17, management of compatibility and performance constraints.
    • Active participation in technical and functional specification workshops, as well as migration monitoring committees.
    • Development of a business GUI to automate end-to-end declarative processes.
    • Release management.

    Results:
    • Reduction of Spark processing time from several hours to a few minutes (x30 performance).
    • Successful migration to Spark 3.5 and Airflow, with securing of critical jobs and minimization of regression risks.
    • Acceleration of team skill development (Spark 3.x, Airflow, CI/CD training).
    • Improvement of software quality and pipeline reliability thanks to the implementation of CI/CD and TDD best practices.
    • Automation of regulatory declarations via the GUI → reduction of manual tasks and increased reliability of business processes.

    Technologies:Spark 2.4 & 3.5, Hadoop, Hive, HBase, Kafka, Python, Airflow, Java 8, Scala 2.12, MAPR, Spring Boot, Angular, PostgreSQL, SQL, Kubernetes, JFrog, Jenkins, SonarQube, GitLab, Github Copilot (GPT, Claude, Gemini), IntelliJ, Windows.
    Apache Spark Java Scala Airflow Kubernetes
  • SociĂ©tĂ© GĂ©nĂ©rale
    Senior Data Engineer
    BANKING AND INSURANCE
    August 2021 - December 2022 (1 year and 4 months)
    Fontenay-sous-Bois, France
    Context:Development of a market control and fraud detection platform on massive volumes, in a cloud migration context.

    Actions:
    • Management of the On-Premise → Azure Cloud ecosystem migration (HDInsight, Azure Storage).
    • Spark 2.x → Spark 3.x migration, with performance optimization (AQE, DPP, adaptive joins).
    • Coaching and mentoring of the Data Engineering team (Paris, London, Bangalore).
    • Implementation of development best practices (Git, TDD, CI/CD, SonarQube)
    • Close collaboration with business and support teams to secure production deployments.
    • Release management.

    Results:
    • Reduction of Spark processing time from several hours to a few minutes (x30 performance).
    • Acceleration of fraud analysis thanks to optimized and scalable pipelines in Azure.
    • Successful adoption of Spark 3.x and Azure Cloud → improved platform robustness and flexibility.
    • Significant improvement in the stability and operational costs of risk calculations.
    • Improvement of software quality and pipeline reliability thanks to the implementation of CI/CD and TDD best practices.

    Technologies:Microsoft Azure (HDInsight, Azure Storage Explorer, Databricks), Java 8, Scala 2.11, Apache Spark 2 and 3, Kerberos, Spring Boot, REST API, HDP 2.6, Hive, Hadoop, Windows, IntelliJ.
    Data Engineer Spark Scala Java Cloudera Data Platform
  • BPCE SA
    Data Engineer
    BANKING AND INSURANCE
    February 2019 - June 2021 (2 years and 4 months)
    Paris, France
    Subject: Software for calculating risk indicators and P&L (Profit & Loss).

    • Ensure maintenance and evolution of existing components.
    • Implement calculation processes for HVAR, Sensi x Shocks, Stress Tests.
    • Implementation of the TDD methodology in the team (from 0% to 95% code coverage by UT).
    • Optimization of algorithms and Spark code for greater efficiency in time and memory usage (from 8 mins to 1 min in calculation processes).
    • Analysis, design, and development of solutions and new features.
    • Participation in the development of technical and functional specifications.
    • Participation in deliverable costing meetings.
    • Ensure production deployments.
    • Coordination with support and infrastructure teams.

    Technologies: Windows, IntelliJ, Scala 2.11, Apache Spark 2, Apache Kafka, Kerberos, Spring Boot, Java 8, REST API, HDP 2.6, HBase, Hive, Hadoop.
    Data Engineer Spark Scala Event-driven architecture Cloudera Data Platform (CDP)

Recommendations

FU
FU
FU
+3
Former user and 5 other people have recommended Chafiq

These freelancer profiles also match your criteria

AgathaA

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

2

BaptisteB

Baptiste Duhen

Fullstack developer

4.6

(4)

5

AmedA

Amed Hamou

Senior Lead Developer

4

(2)

7

AudreyA

Audrey Champion

Web developer

4.3

(3)

4

Education

  • Master MIAGE - Big Data for Finance
    Université Paris-Dauphine
    2017
    - Big Data & Data Engineering - Machine Learning & Data Analytics
  • Engineering Degree - Software Engineering (Big Data Option)
    ENSEIRB-MATMECA
    2017
    - Développement Logiciel - Architecture des SystÚme d'information - Développement Web - Bonnes pratiques et méthode Agile - Big Data

Certifications

Skill set

Categories