You're seeing this page as if you were . The main menu is still yours, though. Exit from immersion
Souleymane NdoyeSN

Souleymane Ndoye

Statistical Engineer - Senior Data Scientist

€650/day
Paris, FR
8-15 years

Average response time: 1 hour

Freelancer profile translated to English.
Back to original language

About Souleymane

Computer statistician, data scientist, I have worked for over 18 years for major accounts and an international organization (EDF, Autorité des Marchés Financiers, Airbus, Société Générale, UN, ..).
My field of expertise includes: Modeling, data processing in distributed architectures, implementation of complex algorithms, training in mathematics institute.
  • English

    Fluent

  • French

    Native or bilingual

Can work on-site
Paris (up to 50km)

Experience

  • JEMS group
    Data Scientist
    DIGITAL AND IT
    February 2016 - Today (10 years and 4 months)
    Neuilly-sur-Seine, France
    In client at:

    Airbus / Airbus Corporate Governance Product Safety - 9 months - Toulouse, France.

    Mission description: Automated processing of root cause analysis from flight data and maintenance messages from the aircraft.

    Functional analysis
    Data collection and cleaning.
    Algorithmic development:
    Capture of maintenance messages observed in an interval of +/- 3mn around the
    disengagement of the autopilot and a degradation of the flight control laws.
    Implementation of business rules to define failure classes.
    Automation and planning of Root Cause Analysis in a distributed environment.

    Technical environment: Palantir - Skywise (Datalake, Airbus analytics platform) - Python - Pyspark.
    Functional environment: Aviation safety - Avionics - Autopilot.


    Renault / Quality Department - 8 months - Paris, France.

    Mission description: Typology of rolling sequences for the purpose of detecting undesirable events of the autonomous vehicle.
    Translation of business needs into analytical terms
    Analysis of sensor signal data
    Implementation and development of models (Hidden Markov Models, DTW, Andrew Curves, ...)
    Framing of a Big Data project for the collection, storage and preprocessing of data to be analyzed

    Technical environment: Python (Pandas, Numpy, Scipy, Scikit-learn), Notebook Jupyter.
    Functional environment: Quality department, Statistical control.

    General Electric Healthcare - 2 months - Paris, France.

    Mission description: Log file analysis.

    Collection and analysis of log files from angiography systems to improve
    the user interface
    Parsing logs
    Searching for patterns using Awk commands and regular expressions
    Technology recommendations

    Technical environment: UNIX - Awk - Python.
    Functional environment: Research and Development.

    Air Liquide Healthcare - 5 months - Paris, France.

    Mission description: Statistical modeling of sales and turnover.

    Collaboration with business lines (Marketing, Sales) in defining and understanding sales models
    Extraction and preprocessing of marketing and sales action data in SQL Server
    Loading and statistical analysis of data in Pandas (Python): Trend analysis,
    Pareto diagrams, interpretation with business lines, correction and anomaly detection in the
    data, graphical analysis with Bokeh (Python)
    Definition and construction of a marketing typology of customers to qualify a propensity
    to customer churn
    Study and implementation of a mathematical prediction model (SVM, Naïf Bayes) allowing to
    predict customer churn
    Transfer of skills to the marketing teams in charge of data visualization

    Technical environment: UNIX - Python (Pandas, Scikit Learn - Bokeh) - SQL Server
    Functional environment: Marketing, Sales

    IFF (International Flavors & Fragrances) - 4 months - Amsterdam/Paris.

    Mission description: Framing of a Big Data project.

    Organization of launch meetings for a Big Data project
    Conducting workshops to collect needs and analyze existing information systems from Business Units.
    Study of the relevance of an evolution of existing systems towards a Big Data solution by Business Unit.
    Recommendations of architecture and Big Data application tools.

    Technical environment: Talend (ETL) - MapR (Big Data Hadoop distribution) - Spark/R (distributed computing system)
  • MODIS France
    Data Scientist
    DIGITAL AND IT
    January 2015 - July 2015 (7 months)
    Paris, France
    Financial Business Analyst at Modis France (R&D Financial Business Analyst, 4 months):

    Methodological research for the processing and analysis of data in Python.
    Analysis of methods for calculating the return on financial assets.
    Implementation and simulation of the Markov Switching Multifractal Model in Python (Numpy, Pandas, Scipy).
    Academic collaboration (Prof. Laurent Calvet) for the implementation of the Markov Switching Multifractal Model and Realized Volatility for predictive analysis purposes.
    Drafting of technical and functional specifications for the transcoding in Java of the studied models.

    In client at Essilor International (Data Scientist, 3 months):

    Translate business issues into statistical/mathematical problems.
    Find relevant data sources (Open Data, geo-referenced public data; census data).
    Data management and data analysis in massively parallel processing under Netezza (SQL).
    Analysis of price elasticity - Studies of product baskets using Machine Learning techniques and classical data analysis: Neural networks, k-means, factorial methods, short-term forecasting methods, curve classification techniques.

    Technical environment: Windows - LINUX - Python - Matlab/Octave - (Pandas, Scipy, Numpy) - R
    Matlab Python R
  • Societe Generale Corporate & Investment Banking
    Data Scientist
    BANKING AND INSURANCE
    May 2014 - October 2014 (6 months)
    Paris, France
    Information Technology / Financing Income & Currencies Division:

    Analysis of log files and monitoring of a risk analysis application by building KPIs and analytical dashboards.
    Implementation of the Elasticsearch-Logstash-Kibana (ELK) stack.
    Configuration for cluster creation to ensure high availability, performance.

    Technical environment: UNIX - Elasticsearch-Logstash-Kibana
    UNIX Elasticsearch Kibana

Recommendations

Be the first to recommend Souleymane

Help this freelancer shine by sharing your experience working together.

These freelancer profiles also match your criteria

AgathaA

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

2

BaptisteB

Baptiste Duhen

Fullstack developer

4.6

(4)

5

AmedA

Amed Hamou

Senior Lead Developer

4

(2)

7

AudreyA

Audrey Champion

Web developer

4.3

(3)

4

Education

  • Master II in Statistical Engineering
    University of Versailles Saint-Quentin-en-Yvelines
    2006
    • Le M2 Ingénierie de la Statistique est une formation professionnelle en statistique. Son objectif est, grâce à un approfondissement conséquent des méthodes statistiques et à une spécialisation en actuariat et/ou en étude de marché, de former des cadres statisticiens, dotés d’une compétence en actuariat ou en études de marchés, ou d’une double compétence actuariat/ études de marchés.
  • Specialized Master in Engineering of Open Computer Systems
    Ecole Centrale Paris
    Fondamentaux Langage, systèmes et réseaux Nouvelles technologies Leadership, management, systèmes ouverts Professionnalisation

Skill set (13)

Categories