About Abdelmajid
French
Native or bilingual
English
Fluent
Experience
- ENGIEData & ML EngineerENERGY AND UTILITIESJanuary 2024 - Today (2 years and 5 months)Paris, FranceDesign and development of a robust data processing framework for ENGIE clients on the Databricks platform.
- Development of reusable data processing libraries in Python and PySpark, enabling large-scale and scalable data ingestion and transformation.
- Refactoring and optimization of PySpark jobs on Databricks, with significant performance gains and a notable reduction in execution times for distributed workloads.
- Implementation of CI/CD pipelines to automate the deployment of Databricks jobs via GitLab, ensuring fast, reliable, and traceable updates.
- Design and orchestration of data pipelines for large-scale processing and analysis of gas and electricity consumption data.
- Design and development of a forecasting engine to anticipate customer consumption patterns from historical data.
- Contribution to the design of ENGIE's data lake architecture, ensuring the maintainability and reusability of data pipelines.
- SACEMDATA ARCHITECTFILM AND AVDecember 2021 - November 2023 (1 year and 11 months)Paris, FranceDesign and deployment of a cloud data platform on AWS for processing data streams from major music platforms (Spotify, YouTube, Deezer, iTunes), optimizing business analysis efficiency and decision-making.
- Design and implementation of the complete data processing infrastructure architecture on AWS, using S3, Glue, EMR, Lambda, and Elasticsearch.
- Development of reusable Python libraries to interact with AWS services, promoting standardization of ingestion and transformation processes.
- Automation and scheduling of data ingestion flows for collecting and processing information from multiple streaming platforms, ensuring reliable and continuously updated datasets.
- Migration of IBM DataStage workflows (financial data processing) to AWS Glue.
- Implementation of analytical pipelines on AWS EMR for large-scale analysis of user behavior, listening patterns, and usage statistics.
- Indexing and making data available in Elasticsearch, facilitating its use by Frontend teams to power visualization applications and dynamic dashboards, offering fluid and efficient data analysis.
- Caisse des Dépôts et ConsignationsSoftware & DATA EngineerPUBLIC SECTORNovember 2018 - November 2021 (3 years)Arcueil, Paris, FranceDesign and deployment of the centralized data platform for the Caisse des Dépôts Group (CDC), based on the Cloudera distribution to meet the data storage and processing needs of all subsidiaries. Implementation of a scalable Data Lake, supporting both batch and real-time processing, with the goal of industrializing ingestion flows, ensuring GDPR compliance, and providing reliable data for business teams.
- Design of the data ingestion and processing architecture on the Cloudera environment.
- Automation of HDFS directory and Hive table structure configuration via Shell scripts, reducing environment deployment time.
- Provision of work tools for Data Engineers, including JupyterLab notebooks and ready-to-use Hive/HDFS/HBase environments.
- Implementation of a Kafka-based streaming pipeline for real-time data ingestion.
- Development of an application for managing and processing application logs using the ELK stack (Elasticsearch, Logstash, Kibana), facilitating continuous monitoring and analysis.
- Development of a generic RDBMS ingestion solution with Python and Apache Sqoop for relational data integration.
- Building ETL pipelines for large-scale data processing with PySpark, ensuring robustness and scalability.
- Data modeling and schema denormalization to support high-performance OLAP analytical loads on Hive, improving query speed and scalability on large data volumes.
- Implementation and deployment of GDPR-compliant solutions, including encryption, anonymization, and deletion of sensitive data.
Recommendations
Be the first to recommend Abdelmajid
Help this freelancer shine by sharing your experience working together.
These freelancer profiles also match your criteria
Agatha Frydrych
Backend Java Software Engineer
4.7
(3)
2
Baptiste Duhen
Fullstack developer
4.6
(4)
5
Amed Hamou
Senior Lead Developer
4
(2)
7
Audrey Champion
Web developer
4.3
(3)
4
Education
- Master in Computer Science and Operations ResearchEcole Polytechnique de Paris (l'X)2018Master Informatique et recherche opérationnelle
- State Engineering Diploma in Computer ScienceENSIAS2016Diplôme d'ingénieur d'état en informatique