Freelancer profile translated to English.

Description

Are your data pipelines slow, expensive, or difficult to industrialize? I design and deploy robust, production-ready Azure/Databricks architectures.

What I bring concretely:

Design of batch and streaming ETL/ELT pipelines (ADF, Databricks, PySpark)

Delta Lake architecture (Bronze/Silver/Gold), data quality, and cost optimization

DataOps Industrialization: monitoring, partitioning, performance

Experience with Palantir Foundry for demanding environments

Recent projects: Serverless Azure platform for real-time Bloomberg ingestion (Finance/CORUM), multi-source Denodo data virtualization (Orange Business Services), ADF pipelines for Cloud migration (CNAS).

Stack: Databricks, Spark/PySpark, Python, SQL, ADF, Synapse, Azure, Kafka, Denodo, Palantir

Available for build / migration / optimization data platform missions — Paris

Industry field of expertise

Languages

Arabic
Native or bilingual
English
Fluent
French
Fluent

Workplace preferences

Can work on-site

Paris (up to 50km), Lille (up to 10km)

CORUM
Data Engineer
BANKING AND INSURANCE
January 2026 - Today (7 months)
Paris, France
Design and development of an end-to-end serverless Azure platform for ingesting, processing, and exposing market data from Bloomberg Data License API, covering investment, valuation, and portfolio tracking needs.
Development of Azure Functions in Python to automate Bloomberg flows (DataRequest, HistoryRequest), with OAuth2 / JWT HS256 authentication, asynchronous polling, retry policy, and exponential back-off for long-running requests.
Optimization of large financial data retrieval (CSV, CSV.gz, ZIP) with stream reading, Python parsing, schema normalization, and quality controls: missing value detection, forward-fill on business days, and complete traceability of REAL / FORWARD_FILLED / FALLBACK statuses.
Automation of quantitative processing on financial historical data: calculations of return, NAV, valuation, and temporal aggregations in Python, producing datasets directly usable by Finance, Risk, and Investment teams.
Incremental ingestion of financial data: after each Bloomberg execution, data is loaded incrementally into Azure SQL via Azure Data Factory (ADF) pipelines, with delta management and flow orchestration between environments.
Daily feed to an SFTP: a dedicated ADF pipeline consumes data stored in Azure SQL and generates a file daily, automatically deposited on the target SFTP, ensuring reliable and scheduled delivery to consuming systems.
Storage and exposure of data in Azure Cosmos DB and Azure SQL, with collection modeling, SQL queries for interrogation and aggregation, and development of stored procedures to encapsulate critical business logic.
Containerization of Azure Functions with Docker and multi-environment deployment (dev / preprod / prod) via Azure DevOps CI/CD pipelines in YAML.
Microsoft Azure Bloomberg Python SQL Data Engineer
cnas
Data Engineer
TRAVEL AND TOURISM
June 2025 - December 2025 (5 months)
Guyancourt, France
Analysis, redesign, and security of Azure integration flows for the Voyagiste project following the migration of SharePoint sources to SFTP (FileZilla), within an Azure Cloud environment.
Design and development of Azure Data Factory (ADF) pipelines, including Data Flows, for ingestion, transformation, and automatic orchestration of multi-format CSV and TXT files.
Centralization of data in Azure Data Lake Storage Gen2 (ADLS) through the implementation of a standardized landing zone, ensuring schema consistency.
Implementation of data quality rules (cleaning, typing, normalization, consistency checks) directly within ADF Mapping Data Flows to ensure the reliability of the Azure Data Lake.
Advanced management of ingestion errors (inconsistent schemas, corrupted files, missing data) through logging, alerting, and exception handling mechanisms in Azure Data Factory.
Support and maintenance of historical Talend flows, correction of incident tickets, and impact analysis in coordination with the RUN team.
Support for the technical transition from Talend to Azure Data Factory, ensuring service continuity and gradual scaling of Azure processing.
Contribution to the High-Level Design (HLD/HLDF) of the Azure integration architecture, in collaboration with the Data Architect, integrating principles of Cloud scalability, maintainability, and evolvability.
Azure Data Factory DBeaver Talend Azure Databricks Data Engineer
Orange Business Services GmbH
data engineer
DIGITAL AND IT
October 2024 - May 2025 (7 months)
Courbevoie, France
- Migration and unification of heterogeneous data sources (relational databases, files, real-time streams) to a Denodo data virtualization platform (Data Bridge), with a focus on governance and performance.
- Design and implementation of Denodo virtualization layers (Curated and Refined), ensuring data standardization, quality, and reusability for analytical uses.
- Advanced modeling and optimization of Denodo views, including complex joins, business calculations, and transformation rules, to expose BI-ready datasets.
- Integration and exploitation of geospatial data (GPS): latitude, longitude, trajectories, and waypoints, with dataset enrichment to enable spatial and temporal analysis.
- Complete migration of Jaspersoft reports to Power BI, with redesign of semantic models and improvement of performance and user experience.
- Creation and automation of Power BI dashboards, connected to Denodo via custom connectors, ensuring controlled and secure data refresh.
- Analysis and introspection of real-time data streams from Kafka topics, including schema understanding, message quality, and integration into the data ecosystem.
- Training and support for technical teams on Databricks usage (distributed processing, data exploration and preparation) complementing the virtualization layer.
- Detailed documentation of migration processes, data models, and flows, and training of end-users on Denodo and Power BI tools.
- Technical support and cross-functional expertise for other team members, including incident resolution and continuous improvement of pipelines and performance.
Denodo Microsoft Power BI Databricks Python SQL

Check out Toni's experience

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

Baptiste Duhen

Fullstack developer

4.6

(4)

Amed Hamou

Senior Lead Developer

(2)

Audrey Champion

Web developer

4.3

(3)

Signup to reveal

Analysis, Data Management, and Innovation
Université Gustave Eiffel
2022
- Ingestion et transformation de données (ETL / ELT) - Conception de pipelines data batch - Traitements distribués Spark / Databricks - Modélisation analytique (facts, dimensions) - Requêtage et transformations SQL - Data Engineer - Hadoop - Power BI - Scrum - Azure Data Engineering - Databricks - Palantir - Python - SQL

Data Warehousing with Microsoft Azure Synapse Analytics
Coursera
2023
https://www.coursera.org/account/accomplishments/certificate/VY3QJXY9FNM4
Data Engineering with MS Azure Synapse Apache Spark Pools
Coursera
2023
https://www.coursera.org/account/accomplishments/certificate/SF924VX3VKUU

Toni's certifications are only visible to Malt Community members

Data Engineer

Toni Badr

Data Engineer | Azure | Databricks | Palantir

About Toni

Experience

Recommendations

These freelancer profiles also match your criteria

Education

Certifications

Skill set

Categories