Talent.com
Esta oferta de trabajo no está disponible en tu país.
Site Reliability Engineer

Site Reliability Engineer

Altimetrik MéxicoMexico City, Mexico
Hace 22 días
Descripción del trabajo

Senior Site Reliability Engineer (SRE)

with advanced English skills (B2 / C1) for a full-time position.

Location : Mexico

Job Description :

We are currently seeking a highly skilled SRE Sr Engineer with solid experience to help lead transformational initiatives within IT operations and encompassing development. As a crucial figure in this role, you will participate / help designing, developing, and implementing cutting-edge SRE solutions, driving the transformation of IT operations organizations to adopt an engineering-centric approach.

Key Responsibilities

  • Should be very well equipped with all SRE parameters and key metrics and transformation steps.
  • Drive automation for repetitive operational tasks (toil reduction) through scripts, playbooks, and self-healing workflows.
  • Design and implement automated runbooks, pipelines, and reliability blueprints to accelerate incident mitigation and enhance system resiliency.
  • Knowledge of traditional support to SRE transformation is a great advantage.
  • Worked in large scaled production with ITIL & SRE processes, good understanding on ticket management.
  • Strong understanding on Agile / Waterfall / Scrum / Kanban and leading SRE deliverables.
  • Collaborate with development teams on resiliency to ensure that services and applications are designed with operational reliability in mind.
  • Implement monitoring systems to assess the performance of applications and infrastructure and proactively identifying areas for optimization.
  • Understanding incident and problem management process, post-mortems, and driving improvements to prevent future incidents.
  • Ability to translate technical language from Spanish to English, mainly within Monitoring Dashboards and Alerting.

Required Skills & Experience

  • Around 8-10 years of SRE hands on experience with cloud technologies, development, SRE toolsets and automation.
  • Should have automation (data refresh, releases, DB snapshots) experience using Ansible or any other scripting languages.
  • Solid Experience building AI Workflows / Operations Orchestration for Toil reduction and Issue resolution with Self-Healing.
  • Hands-on experience in AIOPS Tools and Technologies for building AI Agents and Agentic flows.
  • Participate in architecture of reliable, scalable, and high-performance systems and services with a focus on operational excellence, availability, and performance.
  • Hands on experience in building Observability as a service, Telemetry data collection using Open Telemetry, APM, SolarWinds, Open-Source tools (Prometheus and Grafana), Log Aggregations (Kibana or Splunk).
  • Observability Single Pane Dashboarding.
  • Strong hands-on experience with any Cloud Technology (AWS) : Control Tower, Project Setup, Creating Accounts, RDS, SSO.
  • Solid understanding and hands on experience with Docker / Kubernetes.
  • Should have good experience with Linux Commands, GitLab CICD Setup and Terraform (state management, etc).
  • Monitoring & alerting setup experience with Splunk, Prometheus, Grafana, Kibana, ELK etc.
  • Hands on APM Tool / s experience, preferably Datadog or AppDynamics or Dynatrace.
  • Good understanding of Observability Framework leveraging programmatic SLI / SLO blueprints to standardize the collection of golden signals.
  • Experience with following languages (Groovy-DSL, Java, Python, Yaml and microservices architecture).
  • Good understanding and hands on experience with MQ, Kafka.
  • Experience with Databases (Oracle, MySQL)
  • Nice to Have

  • Any of the relevant professional certifications – Certified Site Reliability Engineer (CSRE), Certified Kubernetes Administrator (CKA), AWS Certified DevOps Engineer Professional, Google Cloud Professional; DevOps Engineer, Developer background highly desired.
  • Crear una alerta de empleo para esta búsqueda

    Site Reliability Engineer • Mexico City, Mexico

    Ofertas relacionadas
    • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    Translation Back OfficeCiudad de México, Ciudad de México, Mexico
    Teletrabajo
    We are looking for a highly skilled Site Reliability Engineer (SRE) to join our team and ensure the reliability, scalability, and efficiency of our platforms and services.The ideal candidate will h...Mostrar másÚltima actualización: hace 20 días
    • Oferta promocionada
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    SimNaucalpan de Juárez, Estado de México, Mexico
    Teletrabajo
    Join some of the most innovative thinkers in FinTech as we lead the evolution of financial technology.If you are an innovative, curious, collaborative person who embraces challenges and wants to gr...Mostrar másÚltima actualización: hace 10 días
    • Oferta promocionada
    Site Reliability Engineer (SRE) – Cloud Ops Focus (Mexico Only)

    Site Reliability Engineer (SRE) – Cloud Ops Focus (Mexico Only)

    VaricentCiudad de México, Ciudad de México, Mexico
    Teletrabajo
    Site Reliability Engineer (SRE) – Cloud Ops Focus (Mexico Only).Site Reliability Engineer (SRE) – Cloud Ops Focus (Mexico Only). Be among the first 25 applicants.At Varicent, We’re Not Just Transfor...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    UST España & LatamMexico City, Mexico
    Born digital, UST transforms lives through the power of technology.We walk alongside our clients and partners, embedding innovation and agility into everything they do. We help them create transform...Mostrar másÚltima actualización: hace 5 días
    • Oferta promocionada
    Lead, Site Reliability Engineer

    Lead, Site Reliability Engineer

    Royal Caribbean GroupCiudad de México, Ciudad de México, Mexico
    Journey with us! Combine your career goals and sense of adventure by joining our incredible team of employees at.We are proud to offer a competitive compensation and benefits package, and excellent...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    Tata Consultancy ServicesCiudad de México, Ciudad de México, Mexico
    We are looking for a Site Reliability Engineer (SRE) to join our team and help us ensure seamless, high-performing, and reliable technology operations. Azure DevOps - Pipelines, repositories, and au...Mostrar másÚltima actualización: hace 23 días
    • Oferta promocionada
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    ZillowCiudad de México, Ciudad de México, Mexico
    Teletrabajo
    Senior Site Reliability Engineer.Senior Site Reliability Engineer.Get AI-powered advice on this job and more exclusive features. The FUB+ Infrastructure & Security Team at Zillow Group supports the ...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    CanonicalEcatepec de Morelos, Estado de México, Mexico
    Teletrabajo
    Senior Site Reliability Engineer.Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used i...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Next MatterCiudad de México, Ciudad de México, Mexico
    The FUB+ Infrastructure & Security Team at Zillow Group supports the Follow Up Boss systems, applications, and software engineering teams that power the businesses of tens of thousands of real esta...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Site Reliability Engineer...

    Site Reliability Engineer...

    HCLTechMexico City, Mexico City, MX
    HCLTech is a global technology company, home to more than 223,000 people across 60 countries, delivering industry-leading capabilities centered around digital, engineering, cloud and AI, powered by...Mostrar másÚltima actualización: hace 4 días
    • Oferta promocionada
    Site Reliability Engineering Design & Support Engineer

    Site Reliability Engineering Design & Support Engineer

    PepsiCoCiudad de México, Mexico
    Site Reliability Engineering Design & Support Engineer.Site Reliability Engineering Design & Support Engineer.Being part of PepsiCo means being part of one of the largest food and beverage companie...Mostrar másÚltima actualización: hace 1 día
    • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    HCLTechMexico City, Mexico
    HCLTech is a global technology company, home to more than 223,000 people across 60 countries, delivering industry-leading capabilities centered around digital, engineering, cloud and AI, powered by...Mostrar másÚltima actualización: hace 19 días
    • Oferta promocionada
    • Nueva oferta
    ▷ (Quedan 3 Días) Site Reliability Engineer...

    ▷ (Quedan 3 Días) Site Reliability Engineer...

    EXLMéxico, Mexico, MX
    About the Company : We are seeking a highly motivated and skilled Site Reliability Engineer (SRE) to join our team.The ideal candidate will have a passion for continuous learning, a collaborative mi...Mostrar másÚltima actualización: hace menos de 1 hora
    • Oferta promocionada
    Site Reliability Engineer - AWS / SQL

    Site Reliability Engineer - AWS / SQL

    S&P GlobalMexico City Metropolitan Area, Mexico
    Site Reliability Engineer - Data Support | S&P Dow Jones Indices.We are seeking an Site Reliability Engineer - Data Support to be a key player in the implementation and support of our Global Index ...Mostrar másÚltima actualización: hace 29 días
    • Oferta promocionada
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    SimCorpCiudad de México, Ciudad de México, Mexico
    Teletrabajo
    Lead Site Reliability Engineer (SRE) role at SimCorp.You will lead efforts to maintain and improve the reliability, scalability, and performance of SimCorp products and services, collaborating with...Mostrar másÚltima actualización: hace 8 días
    • Oferta promocionada
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Svitla Systems, Inc.Ciudad de México, Ciudad de México, Mexico
    Senior Site Reliability Engineer for a full-time position (40 hours per week) in Latin America.Our client is a leading expert network, providing business and government professionals opportunities ...Mostrar másÚltima actualización: hace 3 días
    • Oferta promocionada
    Site Reliability Engineer - Remote Work | REF#180173

    Site Reliability Engineer - Remote Work | REF#180173

    BairesDevTlalnepantla, Estado de México, Mexico
    Teletrabajo
    Site Reliability Engineer - Remote Work | REF#180173.Site Reliability Engineer - Remote Work | REF#180173.Site Reliability Engineer - Remote Work | REF#180173. Be among the first 25 applicants.Site ...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Reliability Engineer

    Reliability Engineer

    GivaudanJiutepec, Morelos, Mexico
    At Givaudan, you contribute to delightful taste and scent experiences that touch people’s lives.You work within an inspiring teamwork culture – where you can thrive, collaborate and learn from othe...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Systems Reliability Engineer

    Systems Reliability Engineer

    NutanixCiudad de México, Ciudad de México, Mexico
    Teletrabajo
    We are looking for an accomplished Systems Reliability Engineer to support our cloud solution in the field and provide an enriched and successful product experience to our customers leveraging our ...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    • Nueva oferta
    Site Reliability Engineer

    Site Reliability Engineer

    DematicCentro, Mexico
    The Site Reliability Engineer will be part of the Platform Operations Global Team, responsible for managing cloud-based infrastructure and services for the custom Java-based and third-party applica...Mostrar másÚltima actualización: hace 9 horas