You're seeing this page as if you were . The main menu is still yours, though. Exit from immersion
Mohammed T.MT

Mohammed T.

Senior Data Engineer (Python-SQL-Spark-Scala)

€725/day
Paris, FR
3-7 years

Average response time: 1 hour

Freelancer profile translated to English.
Back to original language

About Mohammed

Welcome to my Malt profile 🙂!

Data engineer with over 6 years of experience, I have worked for various large companies such as Groupe Seb, Sanofi, FDJ, Atos.
My mission is to help my clients design and develop their data models, data pipelines, and data architecture🛢⚙📊.

What I can do for you in an industrial data context:

👉 Data Ingestion
I implement jobs/pipelines that ingest your data regardless of volume and format (csv, json, avro...), I can also ingest your data in compliance with GDPR.

👉 Data Transformation and Aggregation
I implement jobs/pipelines that filter, clean, transform, and enrich your data to provide usable data.
I also implement aggregation calculations for your KPIs.

👉 Data Exposure
Scripts will be industrialized to expose your data in the form of SQL views or tables, Hive (these tables will constitute your Data Warehouse).

👉 Data Architecture
I advise and support the implementation of your data platform (storage and processing architecture).

👉 Data Science Industrialization

Feel free to contact me 🙂!





  • French

    Native or bilingual

  • English

    Native or bilingual

Can work on-site
Paris (up to 20km), Lyon (up to 20km)

Experience

  • Groupe Seb
    Data Engineer
    ENTERTAINMENT AND LEISURE
    April 2021 - Today (5 years and 2 months)
    Within the PCM (Professional Coffee Machine) team, I am responsible for designing and implementing data pipelines and Spark jobs to feed the data warehouse and build data visualization dashboards/reports for the various clients using these professional coffee machines.

    👉Data Engineering Tech Lead
    • Interface with the backend team to manage changes at the data source level and anticipate technical impacts
    • Support for new incoming Data Engineers
    • Writing technical stories
    • Pair programming
    • Pull request reviews
    • Release delivery (branch

    👉Data Pipelines
    • Design and development of Spark ingestion jobs for telemetry, twins, and reference data
    • Design and development of Spark jobs (full-process mode) for transformation according to business rules and exposure via SQL/Synapse tables for PBI Dashboards
    • Design and development of Spark jobs for the management entity
    • Design and Development of transformation Spark jobs (delta-process mode) (initialization pipeline + delta-process pipeline)
    • Design and Development of data pipelines for aggregated tables: golden data preparation pipeline, aggregation calculation pipeline (Daily, Monthly, Weekly), aggregated table exposure pipeline
    • Optimization of transformation jobs

    👉Data Stack
    • Integration of Azure Synapse with ARM on the data platform
    • Upgrade of Spark versions (Spark project + Spark run-time)
    • Sizing of the Spark pool according to the use case for the run-time of data pipelines
    • Configuration of Spark log transfer to Log Analytics
    • Implementation of monitoring for data/Spark pipelines
    • Integration of Delta Lake at the Spark jobs level and implementation of the Vacuum pipeline
    Spark Scala Microsoft Azure Azure DevOps PySpark Git Gitflow CI/CD ARM Azure Synapse
  • Ynov Campus
    Jury Member and Evaluator - Final Project (Data & AI)
    EDUCATION AND E-LEARNING
    August 2023 - September 2023 (1 month)
    I acted as a data professional to evaluate the final project defenses (Master's degree) in Data and Artificial Intelligence.
  • Française des Jeux
    Data Engineer
    ENTERTAINMENT AND LEISURE
    January 2019 - March 2021 (2 years and 2 months)
    Data Engineer within the DataLake team

    👉Data Pipelines
    👉👉Batch Processing:
    • Design and development of Spark/Scala ingestion jobs
    • Design and development of Spark/Scala jobs for GDPR (key generation, encryption, right to be forgotten)
    • Development of Airflow DAGs for ingestion jobs (GDPR compliant)
    • Development of Salt formulas for creating Hive Tables and Views
    • Development of Salt formulas for creating Phoenix/Hbase Tables
    • Development of Spark job for compacting small HDFS blocks
    👉👉Streaming Processing:
    • Development of Nifi workflows for collecting events (sports reference data) then buffering in Kafka topics and processing in Spark streaming as well as restitution in Hbase
    • Development of a Spark streaming job for enriching sports alerts with reference data

    👉Data Stack
    Modeling and implementation of the DataLake batch layer on the DEV environment
    • Modeling and implementation of the data science platform (Jupyter, Hue) on the DEV environment
    • Modeling and implementation of the speed layer (nifi/kafka/elastic) of the DataLake on the DEV and Pre-prod environments
    • Study and Migration of the Prod DataLake (batch layer) to a new VLAN
    • Development/Implementation of Salt Formulas for stopping and starting all batch layer services
    • Development/Implementation of Salt formulas for unit testing of DataLake batch layer services
    • Troubleshooting and Correction of Anomalies and Incidents

    👉Data Science Industrialization
    • Development of an industrialized pyspark data science project model (modular project + unit testing + CI/CD)
    • Demo for data scientists on data science industrialization.
    Spark PySpark Scala setuptools Python Anaconda Hadoop Apache Kafka ELK GDPR GDPR Compliance Openstack Infrastructure as code Saltstack logisland Hive Gitlab CI/CD Git Gitflow Docker

Recommendations

Be the first to recommend Mohammed

Help this freelancer shine by sharing your experience working together.

These freelancer profiles also match your criteria

AgathaA

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

2

BaptisteB

Baptiste Duhen

Fullstack developer

4.6

(4)

5

AmedA

Amed Hamou

Senior Lead Developer

4

(2)

7

AudreyA

Audrey Champion

Web developer

4.3

(3)

4

Education

  • Master's degree in Data Mining
    Université lyon 2
    2017
  • Computer Engineering degree - Software Engineering option
    Ecole nationale d'informatique et d'analyse des systèmes
    2016

Certifications

Skill set

Categories