You're seeing this page as if you were . The main menu is still yours, though. Exit from immersion
Walid B.WB

Walid B.

DATA ENGINEER ( PYTHON | SQL | ETL | AIRFLOW )

€650/day
Paris, FR
15+ years

Average response time: 1 hour

Freelancer profile translated to English.
Back to original language

About Walid

Senior Data Engineer (16+ years of experience, BNP Paribas) expert in complex banking data architectures. I design robust and scalable ETL/ELT pipelines, ensuring the transition between legacy systems and modern Cloud.

🛠️ Stack : Python, SQL, Oracle, DataStage, Power BI
☁️ Cloud : GCP, IBM Cloud.
⚙️ Ops : Airflow, CI/CD, AzureDevops

Specialized in financial environments with strict constraints (security, massive volumes, governance). Ready to propel your data projects.
  • French

    Native or bilingual

  • English

    Fluent

Can work on-site
Paris (up to 50km)

Experience

  • BNP PARIBAS,
    Lead Data Engineer
    BANKING AND INSURANCE
    June 2023 - Today (3 years)
    Nanterre, France
    Project:
    • Move to cloud : Migration of FinReport to the IBM Cloud platform
    • FinReport Application : Centralization of financial reporting (TCA, RFQ, etc.) and regulatory reporting (MIFID2, BestExecution, etc.)

    Missions:
    • Leadership of the migration of legacy ETL to the Cloud (planning, costing, team coordination, technical workshops)
    • Design and development of ETL pipelines in Python (pandas, PySpark, SQLAlchemy, cx_Oracle) and SQL
    • Workflow orchestration with Apache Airflow (DAGs, scheduling, monitoring)
    • Implementation of a solution (REST API) for exchange with external partners
    • Development of comparison scripts (SQL, Python) for parallel run and migration validation
    • Implementation of automated tests for data pipelines (Pytest)
    • Deployment of jobs on Kubernetes via CI/CD pipelines on Azure DevOps.
    • Performance optimization:
    o Python ETL pipelines (PySpark distributed processing, Airflow parallelization)
    o SQL tuning (query rewriting, execution plans, window functions)
    o Oracle Exadata (indexing, partitioning, SQL tuning)
    • Implementation of a multi-source data virtualization solution with Denodo.
    • Semantic layer modeling (virtual views, derived views, business interfaces)
    • Migration of Power BI reports to Denodo Platform
    • Review and optimization of Datasets and SQL queries
    • Implementation of daily monitoring for activity tracking
    • Maintenance of existing Datastage and resolution of Production incidents

    Environment:

    Python (pandas, PySpark), Oracle Exadata, SQL, PL/SQL, Apache Airflow, Denodo, IBM Cloud (COS, S3, Vault), Shell, Kubernetes, Docker, Git, Azure DevOps, Sentinel, Datastage Px, Power BI
    Python SQL Airflow IBM Cloud Datastage
  • SOCIETE GENERALE,
    Data Engineer
    BANKING AND INSURANCE
    January 2022 - June 2023 (1 year and 5 months)
    Fontenay-sous-Bois, France
    Project:
    • Data Marketing : Redesign of data marketing tools for sending marketing and regulatory campaigns (Migration from UNICA software to Adobe Campaign) as well as all associated processing
    • YOGA : Project to merge data and information systems between Société Générale and Crédit Du Nord

    Missions:
    • Definition of the new Datamart architecture
    • Design and development of ETL feeding pipelines (Python, PL/SQL, Datastage, PostgreSQL, Control-M)
    • Development of Python scripts for XML/JSON file integration
    • Implementation of Python scripts (pandas) for data quality control: duplicate detection, null values, format control, inter-table consistency, and business rule validation
    • Development and optimization of PL/SQL stored procedures (Oracle) and migration to PL/pgSQL (PostgreSQL)
    • Development of Shell scripts for automation and launching
    • Workflow orchestration with Control-M
    • Management of script versioning with Git
    • Deployment of pipelines in testing and production environments
    • Implementation of ETL pipeline monitoring with Grafana
    • Monitoring and resolution of Production incidents
    • Comparative performance studies (Oracle vs PostgreSQL) & optimization of high-volume processing:
    o Massive insertion of large data (Bulk mode)
    o Parallelization of processing
    o Separation of Extractions / Loads
    o Increase in Datastage Nodes
    o Activation / Deactivation of constraints

    Environment:

    PostgreSQL 12, Oracle 12C, Python3 (pandas), Teradata 17.1, DataStage 11.7, Grafana, Shell, PL/SQL, Control-M, Git, JIRA
    Python PostgreSQL Datastage Oracle PL/SQL Git
  • ING DIRECT
    Data Engineer
    BANKING AND INSURANCE
    March 2018 - December 2021 (3 years and 9 months)
    Paris, France
    Project:

    Dare, a global multi-country bancassurance platform providing insurance products and associated services via a central digital insurance platform (Germany, Australia, Italy, France, Czech Republic, and Austria)
    International agile context 100% English

    Missions:
    • Tech Lead of an offshore development team (India): technical supervision, code reviews, and deliverable validation.
    • Architecture and design of the multi-country Data Lake & Data Warehouse strategy
    • Design of POCs and execution of comparative studies for business teams and country management.
    • Development of data pipelines (Batch & Streaming) for feeding the Shared Data Lake with DataStage, Oracle PL/SQL, Python, and Kafka
    • Cloud Migration (On-Premise to GCP): Design and deployment of ELT pipelines from the analytical environment to Google Cloud Platform (Cloud Storage to BigQuery).
    • Data modeling and implementation of dbt models on BigQuery.
    • Creation of multi-source extraction scripts (APIs, databases, files, Cloud Storage).
    • Data Quality Framework: Development of a custom data quality control tool in Python (Pandas) integrating business rule management, anomaly detection, and inter-table consistency.
    • Complete orchestration of pipelines with Cloud Composer (Airflow) / UAC
    • Deployment of CI/CD pipelines on Azure DevOps
    • Performance study and process optimization
    • Creation of reporting dashboards on Power BI

    Environment:

    Python 3, SQL, PL/SQL, Oracle 19C, DataStage 11.7, GCP (BigQuery, Cloud Storage, Cloud Composer/Airflow, Google Kubernetes Engine), Kafka, Docker, Azure DevOps, GitLab, UAC, Grafana, Kibana, JIRA, DBT
    Python SQL Apache Airflow Google Cloud Platform (GCP) DBT

Recommendations

Be the first to recommend Walid

Help this freelancer shine by sharing your experience working together.

These freelancer profiles also match your criteria

AgathaA

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

2

BaptisteB

Baptiste Duhen

Fullstack developer

4.6

(4)

5

AmedA

Amed Hamou

Senior Lead Developer

4

(2)

7

AudreyA

Audrey Champion

Web developer

4.3

(3)

4

Education

  • in Computer Engineering
    National Diploma
    2010
    d'Ingénieur en Informatique
  • Preparatory cycle for
    Grandes Écoles d'Ingénieurs
    2007
    Cycle préparatoire aux

Skill set

Categories