About Houssem
French
Native or bilingual
English
Native or bilingual
Experience
- ContentsquareSenior Data EngineerE-COMMERCEOctober 2021 - Today (4 years and 7 months)Paris, France
- Designed a quota system by presenting different options, trade-offs, technologies, costs, and estimating the time required for production.
- Developed a service to receive, validate, and process quota requests using Scala and Akka Http.
- Developed a streaming service to read and aggregate credit deduction messages from Kafka, and update credits in the PostgreSQL database.
- Created a monitoring system for quota services using Prometheus, Grafana, and Alert Manager.
- Benchmarked services by injecting artificial traffic to estimate the necessary resources for proper functioning (CPU, memory, number of instances…).
- Deployed services on cloud (kubernetes, AWS, and Azure) using Jenkins, Terraform, and Ansible.
- Designed a scraping system composed of 3 services: extractor, scrapper, and provider, capable of managing over 10k resources per second.
- Developed an extractor service to extract URLs from payloads (protobuf) using Scala, Akka stream, and Kafka.
- Developed a scraping service to download resources and store them in a cloud storage system (aws, Azure).
- Managed resource retention using Lifecycle rules on S3 and Azure blob storage.
- Optimized scraping by implementing revisit, caching, and rate limiting strategies.
- Implemented a provider service to retrieve resources from cloud storage.
- Estimated the cost of the scraping service (storage, cloud operations, Kubernetes…).
- Created a metadata validation system for session replays by aggregating a massive amount of data (100k msg/s) and implementing business rules using Flink.
Technologies: scala, golang, kafka, akka, aws, azure, kubernetes, clickhouse, aerospike, promotheus, grafana, jenkins, terraform - Société GénéraleData EngineerSeptember 2019 - October 2021 (2 years and 1 month)
- Developed batch jobs in Spark/Scala to create regulatory and financial reports to meet the needs of the recovery and resilience plan. Inputs included files in HDFS, Hive tables, REST APIs, and Teradata databases.
- Automated and orchestrated the processing workflow using Oozie.
- Executed data analysis queries on Hive.
- Created a CI/CD pipeline using Jenkins, Ansible, and Nexus.
- Migrated to a new Big Data platform (Cloudera).
- Created regulatory reports and dashboards on Power BI.
- Managed the connection with Hive using Presto.
- Implemented a REST API to insert and read regulatory report configurations using Scala, Akka HTTP, and Postgres.
- Deployed the service using Openshift.
Technologies: Scala, Spark, HDFS, Sqoop, Hive, Oozie, Hue, jenkins, ansible, Power BI, Presto, Akka Http, Postgres, Openshift - KayrrosData EngineerENERGY AND UTILITIESJanuary 2019 - September 2019 (8 months)Paris, France
- Developed, automated, and optimized the performance of data pipelines for analyzing satellite images to monitor oil and gas production using Pyspark.
- Scraped information on oil and gas production and stored it in Elasticsearch.
- Industrialized image processing and machine learning mathematical models.
- Deployed services on a Kubernetes cluster with Rancher.
- Manipulated SQL and NoSQL databases.
- Created monitoring dashboards with Kibana.
Technologies: Python, Spark, Pandas, HDFS, airflow, elasticsearch, kibana, docker, rancher
Recommendations
Be the first to recommend Houssem
Help this freelancer shine by sharing your experience working together.
These freelancer profiles also match your criteria
Agatha Frydrych
Backend Java Software Engineer
4.7
(3)
2
Baptiste Duhen
Fullstack developer
4.6
(4)
5
Amed Hamou
Senior Lead Developer
4
(2)
7
Audrey Champion
Web developer
4.3
(3)
4
Education
- Computer Science Engineering Degree (Double Degree)Télécom SudParis2019Diplôme d'ingénieur en informatique (Double diplôme)
- Telecommunications Engineering DegreeEcole supérieure des communications de Tunis2017Diplôme d'ingénieur en Télécommunications