You're seeing this page as if you were . The main menu is still yours, though. Exit from immersion
Sakaria DiarrassoubaSD

Sakaria Diarrassouba

Consultant Data Engineer

€520/day
Paris, FR
3-7 years

Average response time: 1 hour

Freelancer profile translated to English.
Back to original language

About Sakaria

I have a solid background as a Data Engineer including data migration, development and management of data flows, database management, development and deployment of ML models. My expertise covers Python programming, advanced use of Google Cloud Platform and Hadoop. I also gained insight into the challenges of Big Data development in the cloud by proposing scalable and high-performance solutions to leverage company data.
  • French

    Native or bilingual

  • English

    Conversational

Can work on-site
Paris (up to 50km)

Experience

  • Eviden
    DATA ENGINEER
    CONSULTING AND AUDITS
    November 2024 - May 2025 (6 months)
    End-to-End Automated Implementation of a Data Engineering System on Real Data
    Context:
    Development of an end-to-end solution for real-time collection and analysis of weather and fictitious user data, aiming to create an automated and scalable monitoring system in the cloud. Article here
    Objective:
    • Implement a scalable, reliable modern data architecture
    • Design robust ETL pipelines and visualize data in real-time.
    • Automate workflows without manual intervention and ensure continuous execution
    Achievements:
    - Integration of Weather and Random User APIs to collect weather data and fictitious demographic data, ensuring rapid and continuous ingestion.
    - Development of robust pipelines to transform raw data and ingest it into Bigquery for fast and reliable analytical processing.
    - Automation of workflows with Airflow DAGs, allowing regular and uninterrupted execution.
    - Use of Docker Compose to orchestrate containers, simplifying deployment and maintenance.
    - Full deployment of processes on an automated virtual machine (VM) using Terraform to ensure scalable and reproducible infrastructure management.
    - Creation of interactive dashboards with Looker Studio, providing a clear view of real-time temperatures and user demographic distribution by country.
    - Documentation and knowledge sharing on Medium detailing the entire project.
    Technical Environment:
    Python, SQL, Pyspark, Kafka, Airflow, GCP, Bigquery, Looker Studio, Terraform, Docker
  • HUTCHINSON
    Consultant Data Engineer
    RAW MATERIALS INDUSTRY
    May 2024 - Today (2 years and 1 month)
    Context: As part of its digital transformation, Hutchinson wanted to modernize its manufacturing data management to optimize the collection and analysis of industrial data in the Azure cloud. Objective:
    • Optimize manufacturing data ingestion and transformation processes.
    • Ensure data integrity, consistency, and reliability throughout the pipeline.
    • Analysis of data quality and reliability in the cloud. Achievements:
    • Configuration and deployment of Apache Nifi to automate and ensure a continuous flow of data into the data ingestion system.
    • Integration of a monitoring system to detect anomalies in data flows. Management and optimization of SQL query performance for the data ingestion pipeline configuration. Development of KQL scripts for the data transformation process.
    • Processing of data anomalies in the cloud. Design of Dashboards for data quality monitoring. Technical Environment: SQL, KQL, GitHub, SQL-Server, Git, Nifi, Python, Azure, MQTT Broker, SQL-Server
  • ORANGE CARAÏBES
    Consultant Data Engineer
    July 2023 - April 2024 (9 months)
    Context: As part of its transition to cloud solutions, Orange Caraïbes undertook the complete migration of its BI infrastructure and databases to GCP, aiming to reduce infrastructure costs, improve analytical processing performance, and facilitate the integration of new reporting solutions. Objective:
    • Migrate Oracle databases to Bigquery
    • Convert PL/SQL scripts to Bigquery standard SQL while preserving their functionality and performance.
    • Automate and optimize validation, deployment, and monitoring processes for pipelines in the GCP environment. Achievements: Implementation of an automated pipeline to transfer a portion of critical data to Bigquery.
    • Transcription of complex PL/SQL packages into Bigquery-compatible standard SQL, as well as optimization of complex SQL query performance.
    • Conduct technical tests with the DBA to ensure data consistency and integrity.
    • Complete migration of data from Oracle to Bigquery via continuous flows based on Cloud Function and Workflows, ensuring continuous time synchronization.
    • Contribute to and implement CI/CD pipelines to automatically validate, test, and deploy migrated packages, thereby improving the efficiency and quality of deliverables.
    • Collaboration with BI and business teams to ensure a smooth transition and guarantee reports. Technical Environment: GCP, SQL, KQL, Gitlab, PL/SQL, Git, Bigquery, Python, Cloud Function, Workflows, Cloud Registry

Recommendations

Be the first to recommend Sakaria

Help this freelancer shine by sharing your experience working together.

These freelancer profiles also match your criteria

AgathaA

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

2

BaptisteB

Baptiste Duhen

Fullstack developer

4.6

(4)

5

AmedA

Amed Hamou

Senior Lead Developer

4

(2)

7

AudreyA

Audrey Champion

Web developer

4.3

(3)

4

Education

  • Master 2 Engineering
    Sorbonne University (formerly UPMC) & ISUP
    2020
    Master 2 d'Ingénierie
  • Microsoft Fabric Data Engineer Associate
    Microsoft Fabric Data Engineer Associate

Skill set

Categories