About Amelie
French
Native or bilingual
English
Fluent
Experience
- Vizcab
  Data engineer / Developer
  SOFTWARE PUBLISHING
  April 2024 - Today (2 years and 2 months)
  Paris, France
  - Designs and develops new data pipelines in Azure Databricks for data ingestion to/from product applications, Azure Data Lake, and PostgreSQL databases.
  - Implements Datadog metric ingestion pipelines in Databricks, joins this data with other datasets, and exposes insights in Power BI reports.
  - Creates and optimizes models to organize and structure data from various applications and sources, making it usable by end users.
  - Develops and maintains Power BI and Databricks dashboards to visualize information, monitor pipeline performance, and ensure data quality.
  - Improves code quality by applying best practices and establishing robust CI/CD pipelines using Databricks Asset Bundles, GitLab, and SonarQube.
  - Implements unit and integration tests.
  - Develops and implements data contracts as a framework for monitoring data models and defining clear specifications.
  - Collaborates with business teams to identify their needs and provide tailored data solutions that deliver value.
- Cour des comptes, Paris
  Machine learning engineer / Project Lead
  PUBLIC SECTOR
  December 2017 - August 2022 (4 years and 8 months)
  - Designs and supervises the architecture and development of the Court of Auditors' unified search platform based on a Hadoop data lake.
  - Builds Python scraping pipelines to collect HTML pages of reports produced by the Court of Auditors from 1870 to 2022 (180k+).
  - Creates and develops Python projects to extract raw text from 250k+ reports in PDF, Word, HTML, image (OCR), and other formats.
  - Implements Python programs to clean, process, and structure heterogeneous data, and in particular to identify connections between data for indexing (Elasticsearch) and textual analysis.
  - Leads and develops Spark pipelines for ingesting content from various databases (e.g., audits, Court of Auditors' agent registry, ...).
  - Collaboratively develops the search engine's Web platform (React, Django).
  - Conducts an NER (Named Entity Recognition) POC to automatically extract relevant names and expressions from the text of reports (spaCy, deep learning).
  - Organizes and leads manual annotation workshops (Doccano) on reports to build a training set for the NER POC specific to the Court of Auditors' context.
  - Organizes several user workshops to gather internal needs regarding efficient text search, document organization, and logical links between information.
  - Works hand-in-hand with the UX designer to create mockups for the search platform.
- SOLOCAL
  Data Engineer / Full Stack Developer
  E-COMMERCE
  February 2016 - October 2018 (2 years and 9 months)
  Paris, France
  - Develops a data visualization application for Pages Jaunes professionals from scratch. The application provides a 360° view of professionals (subscribed products, audience, click share, reviews, paid/free content, ...).
  - Refactors and develops an application for geographical visualization of Pages Jaunes clients' audiences and activities (migration from Java to React+Node).
  - Develops Spark pipelines for data ingestion.
  - Collects, processes, and loads data into Elasticsearch search engines.
  - Writes technical documentation.
  - Trains a student in Web development (3 months).
  - Trains a group of 10+ professionals in Scala.
Education
- Doctorate
  Université Pierre et Marie Curie - France
  2011
  Subject: Automatic methods for the classification and prediction of network failures
Certifications
- Neural Networks and Deep Learning
  DeepLearning.AI
  2017
- Functional Programming Principles in Scala
  École Polytechnique Fédérale de Lausanne
  2013