You're seeing this page as if you were . The main menu is still yours, though. Exit from immersion
Mickaël VitryMV

Mickaël Vitry

Senior Data Engineer

€550/day
Lyon, FR
8-15 years

Average response time: 1 hour

Freelancer profile translated to English.
Back to original language

About Mickaël

Senior data engineering consultant with 26 years of experience in software development, the last 6 of which have been specialized in data engineering. I support companies in the design and implementation of their modern data platforms, from architecture to production.

What I can bring you:
✓ Architecture & Migration: Design of data platforms (data mesh, distributed architecture), migration from batch to real-time streaming, infrastructure modernization
✓ Technical Implementation: Robust data pipelines (batch/streaming), infrastructure as code (Terraform), cloud integration (AWS, Databricks), GDPR compliance
✓ Team Empowerment: Technical training, documentation, implementation of standards and best practices, change management

My added value:
My atypical profile - 16 years of entrepreneurship followed by 6 years in data engineering - allows me to combine product vision with in-depth technical expertise. I understand your business challenges as well as your technical ones.

Concrete results:
In my last mission (streaming platform with 50M+ users), I enabled the launch of 10 new streaming products in production (up from 0), reduced time-to-market to 4 weeks, empowered 5 domain teams on their data products, and implemented an operational data mesh architecture.

Types of missions I handle:
  • Design and implementation of data platforms (data lake, data mesh, lakehouse)
  • Architecture and development of real-time streaming pipelines
  • Cloud migration (AWS, Azure, Databricks)
  • GDPR compliance and data governance
  • Refactoring of legacy architectures and cost optimization
  • Data team upskilling
**Technical Stack**: AWS (Kinesis, Firehose, Lambda, S3, Glue, EMR), Databricks (Unity Catalog, Spark Declarative Pipeline), Terraform, Airflow, Python, Docker, DataHub


Based in Lyon, available remotely or on-site. Let's discuss your data project!
  • French

    Native or bilingual

  • English

    Fluent

Can work on-site
Lyon (up to 30km), Paris (up to 10km)

Experience

  • Bedrock
    Tech Lead | Data Engineer
    SOFTWARE PUBLISHING
    April 2024 - Today (2 years and 2 months)
    Lyon, France
    **Context**: Bedrock Streaming (RTL/Bertelsmann group) operates the M6+, RTL+, Videoland video streaming platforms - 50M+ users.

    **Mission**: Design and implement the new data mesh analytical platform within a team of 5 (flat organization).

    **Challenge**: The centralized data team was a bottleneck: almost zero new data products in 2024, time-to-market of several months, integration of new clients blocked.



    **What I implemented**:
    • **Architecture**: Migration from a centralized data lake to a decentralized data mesh. Streaming architecture design: DynamoDB → Kinesis → Firehose → S3 → Databricks Spark Declarative Pipeline
    • **Infrastructure as Code**: 7 reusable Terraform Modules (AWS: IAM, S3, KMS, Kinesis, Firehose, Lambda + Databricks: Unity Catalog, pipelines, jobs, alerts). Semantic versioning.
    • **GDPR & Governance**: GDPR by design implementation (PII tagging bi-weekly, right to be forgotten within 30 days, audit trail), lineage tracking (DataHub + Unity Catalog), automatic cataloging. Member of the data governance committee.
    • **Enablement**: Support for 4 domain teams (User, Payment, Catalog, Audience, Content) through workshops, pair programming, ADRs, GitHub unbooks, dedicated Slack per domain.
    • **Assisted AI**: Automated doc agent (Claude integrated VSCode), planning complex migrations, PR and IaC review.
    **Measurable Results**:

    ✓ 0->10 new streaming products in 2025
    ✓ Time to market: 4 weeks (vs. several months)
    ✓ 5 domains autonomous in production
    ✓ Daily batch → near real-time
    ✓ 7 reusable Terraform modules delivered
    ✓ Operational data mesh architecture — migration of 5 domains in progress (H1 2026)

    **Stack**: AWS (Kinesis, Firehose, Lambda, S3, KMS, Glue) • Databricks (Unity Catalog, Spark Declarative Pipeline) • Terraform • Python • DataHub • GitHub Actions
    Terraform AWS Databricks Data Mesh Tech Lead
  • Campus Numérique in The Alps
    Data Trainer
    EDUCATION AND E-LEARNING
    January 2024 - May 2025 (1 year and 4 months)
    Grenoble, France
    Trainer at Campus Numérique In the Alps, I support Data Analyst/Engineer learners in their professionalization path:

    **Active Pedagogy**: 10% theory / 90% practice, project-based approach focused on "learning by doing"

    **Taught Subjects**: ETL/ELT (Airflow, Docker), data modeling (UML, normal forms), data warehousing, big data (PySpark, distributed architecture), cloud (Azure Synapse Analytics, Data Factory, Databricks)

    **Personalized Support**: Individual support adapted to levels, mentoring on real projects, job preparation



    **Approach**: Encourage learners to find the solution themselves rather than giving them the answer directly. Develop autonomy and problem-solving skills.

    **Technologies**: Python, SQL, Airflow, Docker, Azure, PySpark
    Python SQL Data Visualization Pedagogy Microsoft Azure
  • TVTY
    Data Engineer
    SOFTWARE PUBLISHING
    March 2021 - September 2023 (2 years and 6 months)
    Lyon, France
    **Context**: Nielsen | TVTY measures the impact of TV campaigns on the web traffic of 100+ international advertisers (Carglass, Dyson, Verisure).

    Data Pipeline: JavaScript tag collection → API enrichment (Google Analytics, Ads) → transformation (spot impact attribution, aggregations) → database exposure.

    **Main Challenge**: Anticipating the deprecation of third-party cookies by browsers, which threatened the core of the collection model. Urgent need to redesign the tracking system while maintaining service continuity for 100+ clients.

    **My Mission**:

    Within a data team of 5 people in Scrum, I ensured the maintenance and evolution of the infrastructure:
    • **Critical Modernization (6 months)**: Complete redesign of the JavaScript collection tag to anticipate the end of third-party cookies. Specifications, JS development, impact study, progressive blue-green rollout with coexistence of old/new solution, client-by-client validation.
    • **Extensibility**: Design and development of custom Airflow connectors for Google Analytics (GA3, GA4) and other sources. Automatic enrichment via multiple APIs.
    • **Distributed Infrastructure**: Maintenance of an AWS EMR Hadoop cluster managing 100+ advertisers. Multi-tenant architecture with an Airflow DAG per advertiser, each DAG with a branch per campaign + an always-on branch for reference data.
    • **IaC & DevOps**: Infrastructure as code management (boto3, Ansible, Docker), test/prod environment deployment, CI/CD with GitHub.
    • **Cross-functional Support**: Data technical referent. Ad hoc analyses with customer service (EDA, Jupyter/Zeppelin visualization), production debugging, query optimization.
    **Achievements**:

    ✓ Successful migration to new tracking system without service interruption
    ✓ Scalable infrastructure supporting 100+ simultaneous advertisers

    **Stack**: Python, JavaScript, Spark, Airflow, AWS EMR, MariaDB, GitLab, Docker
    PySpark Airflow Data Engineering AWS Docker

Recommendations

These freelancer profiles also match your criteria

AgathaA

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

2

BaptisteB

Baptiste Duhen

Fullstack developer

4.6

(4)

5

AmedA

Amed Hamou

Senior Lead Developer

4

(2)

7

AudreyA

Audrey Champion

Web developer

4.3

(3)

4

Education

  • Programme Freelance For Good, Devenir freelance dans le secteur de l'impact
    Social Declik
    Programme Freelance For Good, Devenir freelance dans le secteur de l'impact
  • Master 2 (M2), Mathématiques appliquées
    Université du Maine-Le Mans-Laval
    Master 2 (M2), Mathématiques appliquées

Skill set

Categories