About Aargan
English
Conversational
French
Native or bilingual
Experience
- BpifranceData EngineerBANKING AND INSURANCEJune 2022 - August 2024 (2 years and 1 month)Maisons-Alfort, FranceComplete overhaul of the Market Data department at BPI. Retrieve and manage external data used by various trading desks within BPI.
- Migration of Python jobs to PySpark to optimize performance and scalability.
- Implementation of CI/CD pipelines and Infrastructure as Code (IaC) to improve deployment efficiency and reliability.
- Review and optimization of the use of various technologies (AWS Glue, MWAA, AWS Lambda, etc.).
- Creation of BPI-WORKSPACE to facilitate development and access to resources for data engineers within our team (IaC + usage scripts).
- Implementation of Data Vault for better data management and organization.
- Setup and maintenance of observability tools to ensure effective system monitoring and tracking.
- Cost optimization related to cloud service usage.
- Adherence to security doctrines: Migration to HSM and implementation of assumed roles within our jobs.
- Support for new Spark developers for quick and efficient integration.
- Implementation of an internal framework for the department and participation in the development of a PySpark framework for BPI.
- Populating and maintaining Mongo databases and Kafka topics.
- Wigglytrout SoftwareCTOSOFTWARE PUBLISHINGSeptember 2021 - May 2022 (8 months)Creation of a notebook-type platform to assist security teams. All POCs were carried out on GCP with a Dockerized product:- Securityhub is a notebook platform based on Zeppelin that allows creating, scheduling, and sharing notebooks. Installed on Kali Linux, this solution aims to make all tools available to security teams and improve information exchange with other teams.- Integration of Zeppelin with authentication through an AD server via Apache Shiro.- Kraken is a framework based on Trino/Presto, DBT, and Hive Metastore. Its purpose is to allow users to connect to a wide range of data sources: s3, gcp, BigQuery, Hive, HDFS, etc..- Github Action: build and deploy a Docker image into Container Registry (GCP) and Dockerhub, deploy the image on Cloud Run, and perform a series of automated tests on the container.- Data provision via Google Cloud Storage and querying through BigQuery.
- AdaltasStudy EngineerCONSULTING AND AUDITSJuly 2019 - February 2022 (2 years and 7 months)Boulogne-Billancourt, FranceIT consultant with two main missions:DATAKILI Paris – Big Data Engineer- Development of Spark jobs in Scala- Development of Java Spring jobs- Modification of multi-tenant databases with Liquibase- Correction and integration of client files- Testing and deployment of the solution- Implementation of metrics via KibanaEDF R&D Paris/Saclay, Adaltas – InfraOps, Big Data Engineer- Support and training for business teams, project support- Deployment, operation, and supervision of HDP, HDF, Docker, and R clusters- Deployment of new components: Airflow, HDF5, H20 AutoML- Optimization and security of Hadoop clusters- Study and implementation of visualization libraries integrated with Python/PySpark: Streamlit, Geospark- Docker support in a Data Science environment (GPU integration, Conda, Jupyter, R)- Automation of data ingestion pipelines with Airflow, PySpark, and Python- Support on AWS tools
Recommendations
These freelancer profiles also match your criteria
Agatha Frydrych
Backend Java Software Engineer
4.7
(3)
2
Baptiste Duhen
Fullstack developer
4.6
(4)
5
Amed Hamou
Senior Lead Developer
4
(2)
7
Audrey Champion
Web developer
4.3
(3)
4
Education
- Master in Big Data and Artificial IntelligenceESGI2021Prise en main des technologies de big data et de machine learning