You're seeing this page as if you were . The main menu is still yours, though. Exit from immersion
David Gómez SoriaDG

David Gómez Soria

Data Engineer | Spark, Scala, Airflow & Databricks

€450/day
Madrid, ES
3-7 years

Average response time: 1 hour

About David

Data Engineer specialized in distributed data processing using Apache Spark and Scala.

Experience in designing, developing, and optimizing ETL pipelines on Big Data architectures, working with large-scale datasets in critical production environments.

Specialized in:

  • Spark job optimization and performance tuning
  • Batch ETL pipeline development
  • Workflow orchestration with Airflow
  • Big Data environment migrations
  • Distributed processing and scalability

I have worked on projects focused on financial data processing, system integrations, and data platform modernization, participating in migrations to cloud architectures and Databricks environments.

Main stack:
Scala, Spark, Airflow, Hive, SQL, Databricks, PostgreSQL, Cloudera, CI/CD, and APIs.
  • Spanish

    Native or bilingual

  • English

    Conversational

Remote only
Primarily works remotely

Experience

  • BOSONIT S.L.
    Data Engineer
    BANKING AND INSURANCE
    January 2022 - Today (4 years and 5 months)
    Madrid, Spain
    Desarrollo, mantenimiento y evolución de pipelines ETL para el procesamiento de mensajes de pago (SWIFT, ISO 20022, SEPA, ACH) Procesamiento batch de datos desde capa landing (S3) hasta capa common, aplicando validaciones técnicas y funcionales Normalización de múltiples fuentes de datos en un modelo común para su posterior explotación Optimización de jobs Spark reduciendo tiempos de ejecución de varias horas a minutos mediante mejoras en particionado, configuración y lógica de procesamiento la de procesos
    ETL/ELT Apache Spark Data Engineer Scala Databricks
  • BINAIA
    Big Data Engineering Mentor
    EDUCATION AND E-LEARNING
    July 2023 - Today (2 years and 11 months)
    Madrid, Spain
    • Mentoring new Big Data trainees, providing guidance on both theoretical and practical aspects of Big Data technologies.

    • Conduct bi-weekly follow-ups to ensure learning progress. The mentorship program begins with foundational knowledge in Hadoop, HDFS, Hive, Apache Spark, Scala/Python, followed by practical ETL simulations and hands-on experience with Apache Airflow for building and orchestrating data pipelines.
    Coaching and mentoring Scala Apache Spark ETL/ELT Databricks

Recommendations

Be the first to recommend David

Help this freelancer shine by sharing your experience working together.

These freelancer profiles also match your criteria

AgathaA

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

2

BaptisteB

Baptiste Duhen

Fullstack developer

4.6

(4)

5

AmedA

Amed Hamou

Senior Lead Developer

4

(2)

7

AudreyA

Audrey Champion

Web developer

4.3

(3)

4

Education

  • Grado Superior
    MEDAC
    2022
    Grado Superior
  • Certified Associate Developer for Apache Spark 3.0
    Databricks
    Certified Associate Developer for Apache Spark 3.0

Skill set

Categories