About Bassirou
- Data Migration: on-premises to cloud, cloud to on-premises, technology transitions (e.g., Spark to DBT, NiFi to Airflow)
- Data Technologies: Apache Spark, DBT, Apache NiFi, Kafka, Apache Airflow, Dataproc, Hive, HDFS
- Cloud: GCP, AWS, Cloud Run, AWS Lambda
- Programming Languages: Python, Scala, SQL, Java
- CI/CD & Automation: GitLab CI/CD, GitHub Actions, Ansible, Jenkins, Terraform
- Data Visualization: Power BI, Qlik
- Database Management: BigQuery, Cassandra, Solr, MongoDB, Elasticsearch
- Batch and real-time processing with Spark, DBT, Kafka, Pub/Sub
- Data Architecture
- Data Modeling
Languages
- French: Native or bilingual
- English: Fluent
Experience
- Legallais, Senior Data Engineer, Civil Engineering. November 2023 - February 2024 (3 months), Caen, France.
  Mission Objective: Implementation of Legallais data ingestion and processing flows using NiFi, Spark, and Airflow to populate the Google Cloud Platform Data Lake and Qlik for data visualization.
  Achievements:
- Ingestion of data sources via NiFi into the Data Lake in Parquet format.
- Implementation of the first unit tests for Spark code using the ScalaTest library.
- Configuration of Spark applications in different environments (dev and prod).
- Setup of the entire Logistics flow with the creation of models for data visualization.
- Task scheduling with Airflow (Python).
- Implementation of an Airflow library to manage cluster launches.
- ORANGE, Senior Data Engineer, Telecommunications. May 2024 - Today (2 years and 1 month), Issy-les-Moulineaux, France.
  Mission Objective: Migrate all data pipelines from Cloudera Hive, HDFS, Hadoop to GCP BigQuery, DBT, GCS, Composer (Airflow), Pub/Sub from scratch.
  Achievements:
- Acquisition of source data with Cloud Functions, Pub/Sub, GCS.
- Implementation of a routing pipeline with Airflow (Python) to route source files to destination projects.
- Development of a Python library to load data into BigQuery based on use cases.
- Writing Airflow DAGs in Python to schedule data processing.
- Transformation of raw data into data marts with DBT.
- Development of CI/CD pipelines with GitLab CI/CD.
- Automation of GCP resource builds with Terraform.
- Close collaboration with the business to better understand needs and propose suitable solutions.
- Kering, Kering Big Data Developer, Luxury Goods. December 2020 - April 2024 (3 years and 5 months), Paris, France.
  Mission Objective: Implementation of data ingestion and processing flows via NiFi, Spark, Kafka, Airflow, exposed in caches (Cassandra, Solr) through an API (Node.js).
  Achievements:
- Migration of processing to AWS Cloud.
- Batch and real-time processing via Kafka and Spark.
- Deployment of real-time Spark applications with Jenkins.
- Anonymization of customer information in compliance with GDPR.
- Automation and continuous integration.
Education
- Scientific Preparatory Class, INSA Hauts de France, 2016
- Computer Engineering, Big Data Option, Institut National des Sciences Appliquées (INSA) de Rennes, 2019
Certifications
- Google Cloud Professional Data Engineer, Google, 2021