You're seeing this page as if you were . The main menu is still yours, though. Exit from immersion
Arnaud FrancoisAF

Arnaud Francois

Data Engineer

€500/day
Lyon, FR
3-7 years

Average response time: 1 hour

About Arnaud

Data Engineer - Transport & Mobility - GCP - MDS (Modern data stack)

6 years of experience in data engineering and analytics, including 2 years at WanData — a SaaS platform for local authorities and transport operators.

What I do:
- End-to-end data pipelines (raw → analytics/OBT) with the modern data stack: Airbyte · Airflow · dbt · BigQuery
- On-prem → GCP migration (Cloud Run, Functions, Pub/Sub, Dataflow/Beam)
- GDPR compliance: passenger data anonymization
- FastAPI services and dashboards for business teams
Tech stack:
GCP (Cloud Run, BigQuery, Pub/Sub, GCS, IAM, WIF) · dbt · Airflow · Airbyte · Terraform · Terragrunt · Github actions · Streamlit · Looker · FastAPI · Snowflake

What sets me apart:
- Migration to Modern Data Stack
- Google Professional Data Engineer + Astronomer Airflow certified
- Global Top 100 on the DataTalksClub Data Engineering Zoomcamp (Bruin/dlt + Redpanda + Streamlit)
  • French

    Native or bilingual

  • English

    Fluent

  • Russian

    Basic

  • Spanish

    Basic

Can work on-site
Lyon (up to 50km)

Experience

  • Ubitransport
    Data Engineer / Data Analyst
    September 2021 - Today (4 years and 9 months)
    Lyon, France
    WanData Project
    - Context: WanData is an innovative SaaS platform, specifically designed to simplify the process of developing a data / AI strategy for local authorities and transport operators.
    - Summary:
    - KPI development: Implementation using GCS, Pandas, dbt, and BigQuery.
    - Endpoint Exposure : Creation and serving of services via FastAPI.
    - CI/CD Pipeline Setup : CI / CD using GitHub Actions, Artifact Registry, Podman.
    - GCP Infrastructure Setup : Infrastructure design and deployment (Cloud Run, GCS, IAM permissions, Secret Manager, VPC, Terraform, Datadog, etc)
    - Tool modernization : Migration of the dependency management tool (from Poetry to uv, from pylint to ruff).
    - Participation in the modern data stack migration : Transition to a modern architecture (Airbyte, dbt, shifting indicators (KPIs) to OBTs (One Big Table)).
    - PoC Gemini LLM agent using function calling. : Gemini / BigQuery / Jupyter / Streamlit

    DataViz Project
    - Summary: Analyzed trip launch rate and wild stops using data visualization.
    - Technologies Used: GCS (Datalake), Airflow (Pipelines), BigQuery (DW), Tableau online

    Modernization of Exports
    - Context: Modernization of an export system aimed at reducing resource costs, preventing queue blockages, and minimizing support tickets. The exports consist of Excel or CSV files containing data on subjects such as trip histories, sales histories and so on. The system operates in a Serverless environment, with resources provisioned at the time of export.
    - Technologies Used: Cloud Functions / Google Cloud Storage (GCS) / Python / Terraform / Cloud Build

    API calls migration (modernization)
    - Context: Transfer API calls data from the relational database to Pub/Sub and then an analytical database (BigQuery).
    - Technologies Used: Pub/Sub, Dataflow (Apache beam), BigQuery, Java

    Anonymization of usages
    - Context: Anonymization of passenger usages to be GDPR Compliant.
    - Technologies Used: Pandas, Airflow (Composer), Postgres
    bruin DBT Big Query Terraform Google Cloud Platform (GCP)
  • Ubitransport
    Exp. Innovation ALT
    September 2020 - January 2022 (1 year and 4 months)
    Mâcon, France
    Next Trip Prediction Project
    - Context: Designed and implemented predictive models for trip planning.
    - Technologies Used: AI Platform, Google Cloud Storage (GCS), CSV, SKlearn, Weka, Jupyter Notebook. Usage Projection Project
    - Context: Created usage projection models for optimizing transportation operations.
    - Technologies Used: AI Platform, TensorFlow, LSTM, AutoKeras, Facebook Prophet (ARIMA/SARIMA). School Transportation Data Analysis
    - Context: Collaborated on a study (ANATEEP) involving the consolidation of data from approximately twenty different transport networks. The objective was to analyze and provide valuable insights into school transport usage, focusing on indicators such as punctuality, service duration, and user-centric school runs.
    - Technologies Used: Metabase, Snowflake (Data Warehouse), Talend (ETL).

Recommendations

Be the first to recommend Arnaud

Help this freelancer shine by sharing your experience working together.

These freelancer profiles also match your criteria

AgathaA

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

2

BaptisteB

Baptiste Duhen

Fullstack developer

4.6

(4)

5

AmedA

Amed Hamou

Senior Lead Developer

4

(2)

7

AudreyA

Audrey Champion

Web developer

4.3

(3)

4

Education

  • Master's degree, Databases and A.I
    Université de Bourgogne
    2021
    Master's degree, Databases and A.I
  • Master's degree
    NORWEGIAN UNIVERSITY OF SCIENCE AND TECHNOLOGY (NTNU)
    2020
    Master's degree

Certifications

Skill set

Categories