System Reliability Engineer
Sanofi
Barcelona
Posted 4 days ago

JOB OVERVIEW

Within Sanofi's Industrial Affairs division, the Digital 4.0 program aims to improve industrial performance and the agility of operations worldwide.

As an SRE, you will help raise the Industrial Affairs Data Platform, a key foundation supporting the program, to the highest standards of reliability and performance.

Your primary role will be to ensure the operational excellence of the Data Platform modules for data ingestion, processing and exposure to the multiple projects connected to the platform.

The IA Data Platform handles batch ingestion from various sources, including ERP, Quality and Supply Chain systems.

It also covers near-real-time ingestion of IoT and event data from production and laboratory equipment. These data are processed by orchestrated Data Products (pipelines) into advanced business data models serving use cases in dashboarding and reporting, data science, and transactional business applications. The content of the data platform is exposed as a Data Lake, hot storage and APIs.

The IA Data Platform is developed using agile methods as a serverless cloud platform in an AWS environment. It serves GxP processes that are critical to the company.

YOUR DAILY ACTIVITIES WILL BE:

  • Preventive maintenance of the platform
      • Participate in defining the test plans for Data modules
      • Participate in defining the test plans for Data Products
      • Identify potential operational risks affecting:
          • Reliability
          • Scalability
          • Performance
          • Costs
      • Mitigate these risks with the development teams
  • Curative maintenance of the platform
      • Identify & classify recurring operational issues
      • Analyze Data Platform run details:
          • Data quality & quantity
          • Success & latency of Data Product executions
          • Access & usage of data by consumers
          • AWS running costs
          • Hypercare support tickets
          • AWS logs
  • Identify & define improvement plans to solve recurring issues
      • Process improvements
      • Documentation & standards improvements
      • Training modules
      • Data Platform module and Data Product fixes
  • Act as the primary L3 contact for the Hypercare team (support Levels 1 & 2)
  • Lead post-mortem analysis after production crises

You will work in agile mode, in two-week sprints, with:

  • Service owner
  • Hypercare team
  • Scrum master
  • Architects (Cloud, Data, Security)
  • Platform development team (Cloud developers)
  • Data products development teams (Data engineers)
  • Quality team, for GxP validation of modules
  • External partners

KEY ACCOUNTABILITIES

  • Raise the reliability of the data platform operations to the highest standards
  • Contribute to meeting the Service Level Agreements (SLAs) with data consumers
  • Support the Data module and Data Product engineers on reliability improvement plans

REQUIREMENTS

  • 5 years' experience operating a cloud data platform to high quality standards
  • Strong skills and experience with AWS managed services, including IAM, S3, Glue, Containers, MSK, RDS, Lambda, Step Functions, MWAA, CloudFormation
  • Fluent English, as it is the primary working language

LOCATION INFORMATION

This position is based in Barcelona (Spain).