About Sidi
- English: Native or bilingual
- French: Native or bilingual
Experience
- Sanofi Accelerator
  Sanofi - Data & AI Engineer
  PHARMACEUTICALS INDUSTRY
  April 2026 - Today (2 months)
  Paris, France
  Context: GenAI platform for automated generation of regulatory documents (Clinical Trial Documents) in the pharmaceutical industry. Critical production environment with strict traceability, security, and compliance requirements.
  Achievements:
  → Design and development of the end-to-end RAG architecture: document parsing, chunking, embedding, vector store (Pinecone, S3 Vectors), retrieval, and LLM generation
  → Integration of LLM models in production: Azure OpenAI (GPT-4o), AWS Bedrock (Claude)
  → Observability architecture for LLM pipelines with Weave/W&B: step-by-step tracing for Data Science teams
  → Performance optimization: replaced FAISS with pre-computed S3 Vectors, reducing costs by ~70%
  → Refactoring of the backend architecture towards DDD-light: resolved 12 audit findings
  → Writing technical specifications (16-section design doc) aligning Data Science, Data Engineering, and Backend
  → Multi-environment configuration (dev/test/prod) with Pinecone and EventBridge
  Technical Stack: Python 3.12 · FastAPI · AWS (Lambda, Step Functions, ECS, S3, Bedrock) · Azure OpenAI · LangChain · Pinecone · Weave/W&B · Terraform · Docker · GitHub Actions · Snowflake · NestJS · React · TypeScript
- BNP Paribas CIB
  Senior Data & AI Engineer
  BANKING AND INSURANCE
  May 2022 - February 2026 (3 years and 9 months)
  Pantin, France
  Involvement in Data Engineering and Generative AI projects for the IT Trade Finance team, focusing on AML (Anti-Money Laundering) and Fraud Detection.
  📊 Data Project: AML & Fraud Detection Pipelines
  Development of end-to-end pipelines processing millions of transactions: ETL, transformation, scoring, and alert generation.
  → Spark Optimization (advanced tuning, data skew management)
  → Quantexa Integration for relational graphs and contextual alert enrichment
  → Private cloud deployment with Kubernetes, Skaffold, Kustomize
  👥 Establishment and Structuring of a New Data Engineering Team
  Leadership in building a data team from scratch with 7+ members: defining needs, recruitment, onboarding, and skill development.
  → Creation and scaling of an offshore team in India (4 Data Engineers, 1 DevOps, 1 BA, 1 PO)
  → Implementation of development standards, architectural patterns, and best practices
  → Daily technical supervision: code reviews, architectural decisions, mentoring
  🤖 Generative AI Project: Document Assistance RAG Platform
  Design and deployment of a conversational platform for natural language querying of all project documentation (Confluence, Jira, Elasticsearch, emails).
  → 90% reduction in information retrieval time for teams
  → Multi-source vectorization pipeline, vector database, LLM orchestration via LangChain with prompt engineering and optimized retrieval strategies
  → Python/FastAPI backend API, Kubernetes deployment
  Stack: Python, LangChain, LangGraph, FastAPI, Elasticsearch, Vector DB, Scala, Spark, Kafka, Kubernetes, AWS, S3, Quantexa, ELK, RAG
- Bedrock streaming
  Senior Data Engineer
  PRESS AND MEDIA
  January 2022 - May 2022 (4 months)
  Lyon, France
  Freelance mission within the A/B Testing team, on the M6+, RTL+ Hungary, and Videoland streaming platforms.
  📊 Multi-platform Data Pipelines
  Design and development of real-time and batch pipelines for experimentation and analytics across multiple international streaming platforms.
  → Ingestion of high volumes of user events via AWS Glue, EMR, and Athena
  → Scalable workflows with Spark and Databricks to ensure the reliability of experimentation metrics
  → Infrastructure automation via Terraform and CI/CD pipelines (Jenkins, GitHub Actions)
  Stack: AWS (Glue, EMR, Athena), Terraform, Python, Scala, Spark, Databricks, Airflow, Docker, Jenkins, GitHub Actions, Iceberg, dbt
Education
- Computer Science Master's degree, Sorbonne Université (ex Université Pierre et Marie Curie), 2018
Certifications
- Machine Learning, Stanford University - Coursera, 2018
- Hadoop Platform and Application Framework, University of San Diego - Coursera, 2018