Freelancer profile translated to English.

Description

👋 Welcome to my Malt profile! 🚀

I am a Senior Data Engineer / Tech Lead (8+ years), expert in Databricks, Spark, Kafka, Airflow, and CI/CD (GitLab/Jenkins). I help teams deliver reliable, observable, and cost-effective pipelines, from batch to streaming, on-prem or cloud (Azure/AWS). Areas: banking/insurance, risk, fraud, regulatory, KPIs.

🧑‍💻 Data Engineering & DevOps

• Design & build of end-to-end ETL/ELT pipelines (Delta/SQL/APIs)

• Streaming & event-driven architectures with Kafka (Connect/Streams/Schema Registry)

• Performance & cost optimization (partitions, cache, AQE, Z-Order)

• Data quality, TDD tests, observability (metrics, SLA/SLO)

• Airflow orchestration, CI/CD, Docker packaging

• Kubernetes deployments, secrets management

• Evolutionary/corrective maintenance & debugging of critical jobs

💻 Software Engineering

• Python/Scala/Java backends (FastAPI/REST/Spring Boot)

• Database & model integration (Delta Lake, Hive, Elasticsearch, Postgres)

• Data exposure: APIs, SQL, dashboards & KPIs

🎯 Key Achievements

• 🏦 Crédit Agricole Assurances: x30 Spark performance gain (hours → minutes), plus migration to Spark 3.5 and Airflow, and automation of regulatory declarations via a GUI.

• 🏦 Société Générale: migration of the market fraud detection platform from on-prem to Azure/Databricks, execution time ÷30.

• 🏦 Natixis: optimized risk engine (VaR/CVaR), 95% unit-test coverage via TDD, jobs 8 min → 1 min.

🧰 Stack

Apache Spark 2/3, Databricks (Delta/Unity Catalog), Kafka, Airflow, Hadoop/Hive/HBase, Snowflake, DBT, Python, Scala/Java, SQL, GitLab/Jenkins/Sonar, Docker/Kubernetes, Azure & AWS, Cloudera/MapR.

📈 My standard: robust, scalable, tested, and observable solutions, with quick wins from the first week.

Need a Data Engineering/DevOps/CI expert?

Write to me with your use case, data volume, and stack: I'll come back with a concrete action plan.

I respond quickly (often < 1 hour).

Agencies/Consulting firms: specific conditions apply, please contact me.

Languages

English: Fluent
German: Basic
Spanish: Basic
Arabic: Native or bilingual
French: Native or bilingual

Workplace preferences

Can work on-site

Paris (up to 50km), Paris (up to 100km)

Crédit Agricole Assurances
Tech Lead - Senior Data Engineer
BANKING AND INSURANCE
January 2023 - Present (3 years and 5 months)
Paris, France
Context: Refactoring, migration, and evolution of several monthly and annual regulatory declarations (FICOVIE, IER, EAI CDC, EAI, FATCA) in a critical and regulated Big Data environment.

Actions:
Coaching and mentoring of a Data Engineers team on Spark/Hadoop.
Implementation of development standards (Git, TDD, CI/CD, SonarQube).
Management of the Data platform migration: Oozie → Airflow orchestration, Spark 2.4 → Spark 3.5 migration, Java 8 → Java 17, management of compatibility and performance constraints.
Active participation in technical and functional specification workshops, as well as migration monitoring committees.
Development of a business GUI to automate regulatory declaration processes end to end.
Release management.

Results:
Reduction of Spark processing time from several hours to a few minutes (x30 performance).
Successful migration to Spark 3.5 and Airflow, securing critical jobs and minimizing regression risk.
Acceleration of team skill development (Spark 3.x, Airflow, CI/CD training).
Improvement of software quality and pipeline reliability thanks to the implementation of CI/CD and TDD best practices.
Automation of regulatory declarations via the GUI → reduction of manual tasks and increased reliability of business processes.

Technologies: Spark 2.4 & 3.5, Hadoop, Hive, HBase, Kafka, Python, Airflow, Java 8, Scala 2.12, MapR, Spring Boot, Angular, PostgreSQL, SQL, Kubernetes, JFrog, Jenkins, SonarQube, GitLab, GitHub Copilot (GPT, Claude, Gemini), IntelliJ, Windows.
Apache Spark, Java, Scala, Airflow, Kubernetes
Société Générale
Senior Data Engineer
BANKING AND INSURANCE
August 2021 - December 2022 (1 year and 4 months)
Fontenay-sous-Bois, France
Context: Development of a market control and fraud detection platform handling massive data volumes, in a cloud migration context.

Actions:
Management of the On-Premise → Azure Cloud ecosystem migration (HDInsight, Azure Storage).
Spark 2.x → Spark 3.x migration, with performance optimization (AQE, DPP, adaptive joins).
Coaching and mentoring of the Data Engineering team (Paris, London, Bangalore).
Implementation of development best practices (Git, TDD, CI/CD, SonarQube).
Close collaboration with business and support teams to secure production deployments.
Release management.

Results:
Reduction of Spark processing time from several hours to a few minutes (x30 performance).
Acceleration of fraud analysis thanks to optimized and scalable pipelines in Azure.
Successful adoption of Spark 3.x and Azure Cloud → improved platform robustness and flexibility.
Significant improvement in the stability of risk calculations and reduction of their operational costs.
Improvement of software quality and pipeline reliability thanks to the implementation of CI/CD and TDD best practices.

Technologies: Microsoft Azure (HDInsight, Azure Storage Explorer, Databricks), Java 8, Scala 2.11, Apache Spark 2 and 3, Kerberos, Spring Boot, REST API, HDP 2.6, Hive, Hadoop, Windows, IntelliJ.
Data Engineer, Spark, Scala, Java, Cloudera Data Platform
BPCE SA
Data Engineer
BANKING AND INSURANCE
February 2019 - June 2021 (2 years and 4 months)
Paris, France
Subject: Software for calculating risk indicators and P&L (Profit & Loss).

Maintenance and evolution of existing components.
Implementation of calculation processes for HVAR, Sensi x Shocks, and Stress Tests.
Introduction of the TDD methodology in the team (unit-test code coverage raised from 0% to 95%).
Optimization of algorithms and Spark code for time and memory efficiency (calculation processes cut from 8 min to 1 min).
Analysis, design, and development of solutions and new features.
Participation in drafting technical and functional specifications.
Participation in deliverable estimation meetings.
Production deployments.
Coordination with support and infrastructure teams.

Technologies: Windows, IntelliJ, Scala 2.11, Apache Spark 2, Apache Kafka, Kerberos, Spring Boot, Java 8, REST API, HDP 2.6, HBase, Hive, Hadoop.
Data Engineer, Spark, Scala, Event-driven architecture, Cloudera Data Platform (CDP)


Former user and 5 other people have recommended Chafiq


Master MIAGE - Big Data for Finance
Université Paris-Dauphine
2017
- Big Data & Data Engineering - Machine Learning & Data Analytics
Engineering Degree - Software Engineering (Big Data Option)
ENSEIRB-MATMECA
2017
- Software Development - Information Systems Architecture - Web Development - Best Practices and Agile Methodology - Big Data

Big Data Analysis with Scala and Spark
École polytechnique fédérale de Lausanne (EPFL)
2017
https://www.coursera.org/account/accomplishments/verify/C78AT8HQV4YD
Scala, Big Data, Data Engineer, Spark
Apache Kafka Series - Core & Internals
Udemy - Stephane Maarek
2019
https://www.udemy.com/certificate/UC-C4YC2LTJ/
Apache Kafka


Data Engineer

Chafiq M.

🥇 Data engineer: Spark, Databricks, Lakehouse, AI
