About Sarra
French
Native or bilingual
English
Fluent
Experience
- Renault Digital, Senior Data Engineer / GenAI Architect, May 2025 - November 2025 (6 months)
- Diagnosis and redesign of an internal GenAI platform (upload → parsing → embeddings → search)
- Stabilization of the MongoDB Atlas ingestion chain (rebuild of textual and vector indexes, consistency check)
- Design of a target GCP architecture: Cloud Run + Dataflow + Airflow to decouple processing
- Executable documentation and knowledge transfer to Data & Cloud teams
- Stack: Python, GCP (Cloud Run, Dataflow, BigQuery, Airflow), MongoDB, GitLab CI/CD, RAG, OpenAPI, GKE
- Fnac Darty, Data Engineer GCP, August 2023 - December 2024 (1 year and 4 months)
- Migration of Bash scripts (crontab on VM) to Airflow, to make processing reliable and industrialized
- Code refactoring (documentation, factorization, Python upgrade) and implementation of CI/CD via GitLab & Terraform
- Optimization of Looker Studio dashboards (clustering and partitioning of BigQuery tables) to reduce costs
- Automation of customer review moderation with an LLM (Text-Bison) deployed via Cloud Function
- BPCE Infogérance & Technologies, Data Engineer & Tech Lead Streaming Factory, December 2017 - April 2023 (5 years and 4 months)
- Successive roles:
1. Creation & Management of the Streaming Factory (2021 – 2023)
   - Development of real-time pipelines & standardization of best practices
   - Deployment of distributed architectures (Kafka, NiFi)
   - Recruitment and skill development of Data Engineers in streaming
   - Stack: Kafka, NiFi, Solr, Hive, GCP, Java, Python
2. Data Engineer / Tech Lead, BPCE Referential, Financing & Trade (2021 – 2022)
   - Support for business teams on Hadoop, from design to implementation
   - PySpark scripts to analyze XML files and store data in Hive
   - Development and optimization of data flows for the Referential Data Lake
   - Stack: CDP, Hive, Spark, Kafka (Python lib), PySpark, CI/CD (XLDeploy, Jenkins)
3. Industrialization & Best Practices, BPCE Life Insurance (2019 – 2020)
   - Industrialization of data science models (origination score)
   - Implementation of reusable templates (versioning, logging, packaging)
   - Collaboration on Group guidelines for model industrialization
   - Stack: Python, PySpark, Jupyter, Git, Cookie Cutter
4. Data Engineer, Trade & Treasury (2019 – 2020)
   - Data transfer to HDFS and deployment of fraud algorithms (scoring, profiling)
   - Stack: HDP, Hive, Python, PySpark, Git, CI/CD
5. Data Engineer, Full Trade Monitoring (2018 – 2019)
   - Implementation of a Data Lake (Kafka, Hive, Solr, PySpark) and a Flask search engine
   - Close collaboration with Data Science & business teams
Education
- Master in Big Data and Machine Learning, University Paris 8, 2017
- Related coursework: artificial intelligence, Big Data platforms, computer security, advanced algorithms
- Thesis on predicting Parkinson's disease from smartphone data (Kaggle dataset), using SVM, WEKA, and Python (ACM publications)
- Bachelor's degree (Licence) in Application Design, Development and Validation, University Paris 8, 2015
Certifications
- Google Cloud Certified Professional Data Engineer, Google Cloud, 2023