Freelancer profile translated to English.

Description

Welcome to my Malt profile 🙂!

Data engineer with over 6 years of experience, I have worked for various large companies such as Groupe Seb, Sanofi, FDJ, Atos.

My mission is to help my clients design and develop their data models, data pipelines, and data architecture🛢⚙📊.

What I can do for you in an industrial data context:

👉 Data Ingestion

I implement jobs/pipelines that ingest your data regardless of volume and format (csv, json, avro...), I can also ingest your data in compliance with GDPR.

👉 Data Transformation and Aggregation

I implement jobs/pipelines that filter, clean, transform, and enrich your data to provide usable data.

I also implement aggregation calculations for your KPIs.

👉 Data Exposure

Scripts will be industrialized to expose your data in the form of SQL views or tables, Hive (these tables will constitute your Data Warehouse).

👉 Data Architecture

I advise and support the implementation of your data platform (storage and processing architecture).

👉 Data Science Industrialization

Feel free to contact me 🙂!

Languages

French
Native or bilingual
English
Native or bilingual

Workplace preferences

Can work on-site

Paris (up to 20km), Lyon (up to 20km)

Groupe Seb
Data Engineer
ENTERTAINMENT AND LEISURE
April 2021 - Today (5 years and 2 months)
Within the PCM (Professional Coffee Machine) team, I am responsible for designing and implementing data pipelines and Spark jobs to feed the data warehouse and build data visualization dashboards/reports for the various clients using these professional coffee machines.

👉Data Engineering Tech Lead
• Interface with the backend team to manage changes at the data source level and anticipate technical impacts
• Support for new incoming Data Engineers
• Writing technical stories
• Pair programming
• Pull request reviews
• Release delivery (branch

👉Data Pipelines
• Design and development of Spark ingestion jobs for telemetry, twins, and reference data
• Design and development of Spark jobs (full-process mode) for transformation according to business rules and exposure via SQL/Synapse tables for PBI Dashboards
• Design and development of Spark jobs for the management entity
• Design and Development of transformation Spark jobs (delta-process mode) (initialization pipeline + delta-process pipeline)
• Design and Development of data pipelines for aggregated tables: golden data preparation pipeline, aggregation calculation pipeline (Daily, Monthly, Weekly), aggregated table exposure pipeline
• Optimization of transformation jobs

👉Data Stack
• Integration of Azure Synapse with ARM on the data platform
• Upgrade of Spark versions (Spark project + Spark run-time)
• Sizing of the Spark pool according to the use case for the run-time of data pipelines
• Configuration of Spark log transfer to Log Analytics
• Implementation of monitoring for data/Spark pipelines
• Integration of Delta Lake at the Spark jobs level and implementation of the Vacuum pipeline
Spark Scala Microsoft Azure Azure DevOps PySpark Git Gitflow CI/CD ARM Azure Synapse
Ynov Campus
Jury Member and Evaluator - Final Project (Data & AI)
EDUCATION AND E-LEARNING
August 2023 - September 2023 (1 month)
I acted as a data professional to evaluate the final project defenses (Master's degree) in Data and Artificial Intelligence.
Française des Jeux
Data Engineer
ENTERTAINMENT AND LEISURE
January 2019 - March 2021 (2 years and 2 months)
Data Engineer within the DataLake team

👉Data Pipelines
👉👉Batch Processing:
• Design and development of Spark/Scala ingestion jobs
• Design and development of Spark/Scala jobs for GDPR (key generation, encryption, right to be forgotten)
• Development of Airflow DAGs for ingestion jobs (GDPR compliant)
• Development of Salt formulas for creating Hive Tables and Views
• Development of Salt formulas for creating Phoenix/Hbase Tables
• Development of Spark job for compacting small HDFS blocks
👉👉Streaming Processing:
• Development of Nifi workflows for collecting events (sports reference data) then buffering in Kafka topics and processing in Spark streaming as well as restitution in Hbase
• Development of a Spark streaming job for enriching sports alerts with reference data

👉Data Stack
Modeling and implementation of the DataLake batch layer on the DEV environment
• Modeling and implementation of the data science platform (Jupyter, Hue) on the DEV environment
• Modeling and implementation of the speed layer (nifi/kafka/elastic) of the DataLake on the DEV and Pre-prod environments
• Study and Migration of the Prod DataLake (batch layer) to a new VLAN
• Development/Implementation of Salt Formulas for stopping and starting all batch layer services
• Development/Implementation of Salt formulas for unit testing of DataLake batch layer services
• Troubleshooting and Correction of Anomalies and Incidents

👉Data Science Industrialization
• Development of an industrialized pyspark data science project model (modular project + unit testing + CI/CD)
• Demo for data scientists on data science industrialization.
Spark PySpark Scala setuptools Python Anaconda Hadoop Apache Kafka ELK GDPR GDPR Compliance Openstack Infrastructure as code Saltstack logisland Hive Gitlab CI/CD Git Gitflow Docker

Check out Mohammed's experience

Be the first to recommend Mohammed

Help this freelancer shine by sharing your experience working together.

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

Baptiste Duhen

Fullstack developer

4.6

(4)

Amed Hamou

Senior Lead Developer

(2)

Audrey Champion

Web developer

4.3

(3)

Signup to reveal

Master's degree in Data Mining
Université lyon 2
2017
Computer Engineering degree - Software Engineering option
Ecole nationale d'informatique et d'analyse des systèmes
2016

Check out Mohammed's education

Building Resilient Streaming Systems on Google Cloud Platform
coursera
2018
https://www.coursera.org/account/accomplishments/verify/JXQBX6ANHDG6
Serverless Machine Learning with Tensorflow on Google Cloud Platform
coursera
2018
https://www.coursera.org/account/accomplishments/verify/65WA9H9VRWEK

Mohammed's certifications are only visible to Malt Community members

Mohammed T.

Senior Data Engineer (Python-SQL-Spark-Scala)

About Mohammed

Experience

Recommendations

These freelancer profiles also match your criteria

Education

Certifications

Skill set

Categories