Romain Duval

Data Engineer / Data Architect

€600/day
Paris, FR
8-15 years

Average response time: 1 hour

About Romain

Data Engineer passionate about the cloud. I am open to challenging new missions as a Data Engineer or Data Architect. Outside of work, I am a friendly person and a fan of manga and tennis 😉
  • French

    Native or bilingual

Can work on-site
Paris (up to 50km)

Experience

  • Direct Assurance
    Data Architect
    BANKING AND INSURANCE
    February 2024 - Present (2 years and 4 months)
    Paris, France
    Data architecture design, including:
    - Database and storage system design.
    - Definition of data ingestion and processing pipelines.

    Efficient integration of company data sources by developing:
    - Data flows.
    - ETL (Extract, Transform, Load) processes.

    Data quality management:
    - Implementation of data cleaning and validation processes.
    - Definition of rules to ensure data accuracy and reliability.

    Data security:
    - Implementation of security measures such as data encryption.
    - Access and authorization management.
    - Compliance with data protection regulations.

    Optimization of data infrastructure performance:
    - Ensuring that queries and analysis processes run efficiently and quickly.

    Cross-functional collaboration with other teams:
    - Software development teams.
    - Data analysts.
    - Business managers, to understand needs and design appropriate solutions.

    Implementation of the SAFe methodology.
    Promotion of software craftsmanship practices across teams.
  • TF1
    Cloud Data Engineer
    FILM AND AV
    February 2023 - February 2024 (1 year)
    Boulogne-Billancourt, France
    PROJECT CONTEXT 1:
    Within the BI department, redesign and migration of the on-premise TV advertising datamart to the cloud. The datamart tracks indicators of advertising revenue trends. The project also includes implementing a data quality control process.

    ACTIVITIES:
    - Design of Azure Data Factory pipelines for data ingestion into the different layers of the Data Lake.
    - Implementation and deployment of Apache Spark data transformation jobs with Azure Databricks and ADLS Gen2.
    - Implementation of data quality controls with Spark Scala.
    - Code review and quality control with SonarQube.
    - Writing technical and operational architecture documents.
    - Design of a monitoring and supervision dashboard for components (Azure Databricks cluster, Data Factory pipeline, Azure SQL Database, Azure Blob Storage) with Datadog.

    METHODOLOGY: Agile (Scrum)
    TECHNOLOGIES: Azure Data Lake Storage Gen2, Apache Spark, Scala, Azure Databricks, Azure Data Factory, Azure DevOps, Azure SQL Database, Azure Key Vault, Microsoft Purview

    PROJECT CONTEXT 2: Implementation of a BI data quality management process

    ACTIVITIES:
    - Modeling of an advertising data catalog.
    - Proposal of a plan for defining key data by functional area.
    - Business workshops for defining data quality rules.
    - Definition of data quality KPIs
  • Societe Generale Corporate and Investment Banking - SGCIB
    Cloud Data Engineer
    March 2022 - December 2023 (1 year and 9 months)
    Paris, France
    Data engineer within the Collateral and Risk entity of the Transverse Markets department on the Data Initiative project, which aims to establish a common cloud data platform for all collateral data consumers. This data platform ingests data from multiple sources, transforms it according to a repository, and then exposes it via APIs. Main tasks include:
    - Proposal of an efficient platform architecture (in terms of cost and performance).
    - Implementation of APIs for extracting and loading data from Oracle DB to ADLS Gen2 using Spring Batch and Apache Camel.
    - Design of data lake datasets and cross-functional data lineage. Business workshops for building the data catalog.
    - Supervision and monitoring of APIs and of Azure HDInsight and Azure Kubernetes Service clusters with the Elastic Stack (Elasticsearch, Logstash, Kibana).
    - Implementation and deployment of Apache Spark data transformation jobs with Spark Scala, Azure HDInsight, Azure Kubernetes Service, and ADLS Gen2.
    - Implementation of data quality controls with Spark SQL.
    - Pipeline orchestration with Apache Airflow and continuous delivery with Jenkins, SonarQube, and GitLab.

Education

  • Master's degree (Maîtrise) in Mathematics, Theoretical Mathematics
    UPEC
  • Computer Engineering degree, Networks and Information Systems
    Institute of Mathematics and Physical Sciences
    2013

Skill set (47)

Categories