You're seeing this page as if you were . The main menu is still yours, though. Exit from immersion
Rafik B.RB

Rafik B.

Data Engineer | Databricks | Pyspark | ADF

€700/day
Paris, FR
3-7 years

Average response time: 1 hour

Freelancer profile translated to English.
Back to original language

About Rafik

Passionate about transforming data into business value 📊, I help companies unlock the full potential of their data by implementing scalable, secure, and high-performing architectures.

Certified Databricks and Azure 🎓, I have expertise in Big Data 🌐, data governance 🛠️, and analytical pipeline optimization 📈.

My Data Engineering skills are based on technologies such as:

- Python
- Spark
- SQL
- Azure
- Databricks
  • French

    Native or bilingual

  • English

    Fluent

Can work on-site
Paris (up to 50km), Lille (up to 10km)

Experience

  • KERIALIS
    Cloud Data Engineer
    BANKING AND INSURANCE
    October 2025 - Today (8 months)
    In an environment marked by strong competition and accelerating digital uses, the ambition was to implement a strategic and sustainable data platform on Microsoft Fabric.
    This platform was intended to be a value creation lever, by offering a 360° view of policyholders, beneficiaries, and prospects, in order to strengthen customer knowledge, optimize commercial steering, and support a more targeted offer strategy.
    Tasks:
    • Close collaboration with the Data Architect for the definition of the platform's logical and technical architecture (Lakehouse, Workspaces, domain separation, data ingestion and exposure patterns, capacity sizing).
    • Participation in data modeling workshops with the Data Manager and Data Architect (domain scoping, business object definition, etc.).
    • Development of data orchestration and integration pipelines using Microsoft Fabric Pipelines and Spark Notebooks.
    • Implementation of the Medallion architecture (Bronze / Silver / Gold) in a Data Mesh oriented foundation, with separation of Lakehouses and Workspaces by business domain.
    • Definition and implementation of a data governance strategy:
    • Setup of automated data quality controls and reporting via Soda Core,
    • Data lineage and traceability management via Microsoft Purview, Access security via separation of business Workspaces and Row-Level Security (RLS) implementation.
    • Data historization via implementation of the Data Vault 2.0 model at the Silver level.
    • Industrialization and automation of deployments via Fabric Deployment Pipelines and CI/CD integration with Azure DevOps.
    Microsoft Fabric Microsoft Purview Soda Spark
  • Valiuz
    Cloud Data Engineer
    RETAIL (SMALL BUSINESS)
    August 2025 - October 2025 (2 months)
    As a Databricks partner and member of the Delivery Partner Program, I participated in a mission to migrate the gold layer from Big Query and Amazon RDS to Databricks.
    Tasks:
    • Study of the technical and functional architecture.
    • Participation in data layer modeling – Gold.
    • Migration of data from Big Query and Amazon RDS to Databricks SQL with Lakeflow Connect.
    • Design and development of data pipelines via Lakeflow Declarative Pipelines.
    Databricks Spark Big Query Google Cloud Storage
  • CERBA HEALTHCARE (DECILIA)
    Cloud Data Engineer
    October 2022 - July 2025 (2 years and 9 months)
    Issy-les-Moulineaux, France
    The Cerba Group initiated a hybrid data analysis platform project based on a data mesh architecture to improve data sharing, quality, and security within its entities. This project also aimed to ensure compliance with CNIL (GDPR) regulations for health data.

    My intervention took place in two phases:
    1. Development and production deployment of the first functional batch: implementation of an operational version of the platform.
    2. Extension of capabilities and integration of new data sources: expanding analysis to financial and HR data to obtain a 360° view of the business.

    Missions performed:
    • Gathering technical requirements and participating in architecture workshops.
    • Design and modeling of data layers (Bronze, Silver, Gold).
    • Development of Big Data pipelines on Databricks (PySpark, SparkSQL).
    • Pseudonymization of health data in PySpark.
    • Implementation of a quality and compliance control framework (Pydeequ, Great Expectations).
    • Development of right-to-be-forgotten mechanisms and Delta Sharing between Databricks.
    • Performance optimization (ZOrder, partitioning).
    • Workflow orchestration via Azure Data Factory.
    • Implementation of data governance with Unity Catalog.
    • Technical documentation and operating manuals.

    Technical Environment:
    • Azure (Data Factory, Databricks, DataLake, DevOps)
    • Spark (PySpark, SQL)
    • Databases (Oracle)
    • Data Governance and Quality (Pydeequ, Great Expectations)
    • Networks (Virtual Networks)
    Databricks

Recommendations

FL
IS
Imed BezahafIB
+2
Florian Lamant and 4 other people have recommended Rafik

These freelancer profiles also match your criteria

AgathaA

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

2

BaptisteB

Baptiste Duhen

Fullstack developer

4.6

(4)

5

AmedA

Amed Hamou

Senior Lead Developer

4

(2)

7

AudreyA

Audrey Champion

Web developer

4.3

(3)

4

Education

  • Master in Intelligent Systems Engineering, apprenticeship at Sopra Steria Infrastructure Security & Cloud Services
    Sorbonne Université
    2021
    Master en Ingénierie des Systèmes Intelligents en apprentissage au sein de Sopra Steria Infrastructure Security & Cloud Services
  • Bachelor's degree in Electronics, Electrical Energy, and Automation (EEA)
    Sorbonne Université
    2019
    Licence en Electronique, Energie Electrique et Automatique (EEA)

Certifications

Skill set

Categories