Esta oferta de trabajo no está disponible en tu país.

Site Reliability Engineer

Altimetrik MéxicoMexico, Mexico

Hace 25 días

Descripción del trabajo

Senior Site Reliability Engineer (SRE)

with advanced English skills (B2 / C1) for a full-time position.

Location : Mexico

Job Description :

We are currently seeking a highly skilled SRE Sr Engineer with solid experience to help lead transformational initiatives within IT operations and encompassing development. As a crucial figure in this role, you will participate / help designing, developing, and implementing cutting-edge SRE solutions , driving the transformation of IT operations organizations to adopt an engineering-centric approach.

Key Responsibilities

Should be very well equipped with all SRE parameters and key metrics and transformation steps.
Drive automation for repetitive operational tasks (toil reduction) through scripts, playbooks, and self-healing workflows.
Design and implement automated runbooks , pipelines , and reliability blueprints to accelerate incident mitigation and enhance system resiliency.
Knowledge of traditional support to SRE transformation is a great advantage.
Worked in large scaled production with ITIL & SRE processes , good understanding on ticket management.
Strong understanding on Agile / Waterfall / Scrum / Kanban and leading SRE deliverables.
Collaborate with development teams on resiliency to ensure that services and applications are designed with operational reliability in mind.
Implement monitoring systems to assess the performance of applications and infrastructure and proactively identifying areas for optimization.
Understanding incident and problem management process, post-mortems, and driving improvements to prevent future incidents .
Ability to translate technical language from Spanish to English , mainly within Monitoring Dashboards and Alerting.

Required Skills & Experience

Around 8-10 years of SRE hands on experience with cloud technologies , development, SRE toolsets and automation.

Should have automation (data refresh, releases, DB snapshots) experience using Ansible or any other scripting languages.

Solid Experience building AI Workflows / Operations Orchestration for Toil reduction and Issue resolution with Self-Healing.

Hands-on experience in AIOPS Tools and Technologies for building AI Agents and Agentic flows.

Participate in architecture of reliable , scalable, and high-performance systems and services with a focus on operational excellence, availability, and performance.

Hands on experience in building Observability as a service, Telemetry data collection using Open Telemetry, APM, SolarWinds, Open-Source tools (Prometheus and Grafana) , Log Aggregations (Kibana or Splunk).

Observability Single Pane Dashboarding.

Strong hands-on experience with any Cloud Technology (AWS) : Control Tower, Project Setup, Creating Accounts, RDS, SSO.

Solid understanding and hands on experience with Docker / Kubernetes.

Should have good experience with Linux Commands, GitLab CICD Setup and Terraform (state management, etc).

Monitoring & alerting setup experience with Splunk, Prometheus, Grafana, Kibana, ELK etc.

Hands on APM Tool / s experience , preferably Datadog or AppDynamics or Dynatrace.

Good understanding of Observability Framework leveraging programmatic SLI / SLO blueprints to standardize the collection of golden signals.

Experience with following languages ( Groovy-DSL, Java, Python, Yaml and microservices architecture).

Good understanding and hands on experience with MQ, Kafka.

Experience with Databases (Oracle, MySQL)

Nice to Have

Any of the relevant professional certifications – Certified Site Reliability Engineer (CSRE), Certified Kubernetes Administrator (CKA), AWS Certified DevOps Engineer Professional, Google Cloud Professional; DevOps Engineer, Developer background highly desired .

Crear una alerta de empleo para esta búsqueda

Site Reliability Engineer • Mexico, Mexico

Ofertas relacionadas

Oferta promocionada

Site Reliability Engineer

UST España & LatamMexico, Mexico

Born digital, UST transforms lives through the power of technology.We walk alongside our clients and partners, embedding innovation and agility into everything they do. We help them create transform...Mostrar másÚltima actualización: hace 8 días

Oferta promocionada

Senior Site Reliability Engineer

DuckDuckGoMexico

Teletrabajo

Be among the first 25 applicants.Hi, we're DuckDuckGo, the online protection company and remote-first team of 300+ on a mission to raise the standard of trust online. Founded in 2008 and profitable ...Mostrar másÚltima actualización: hace más de 30 días

Oferta promocionada

Senior Site Reliability Engineer

Zillow GroupMexico

Teletrabajo

About the team The FUB+ Infrastructure & Security Team at Zillow Group supports the Follow Up Boss systems, applications, and software engineering teams that power the businesses of tens of thousan...Mostrar másÚltima actualización: hace 7 días

Oferta promocionada

Sr. Site Reliability Engineer (Remote, Mexico)

NovaMexico

Teletrabajo

Site Reliability Engineer (Remote, Mexico).Site Reliability Engineer (Remote, Mexico).Site Reliability Engineer (Remote, Mexico). Be among the first 25 applicants.Site Reliability Engineer (Remote, ...Mostrar másÚltima actualización: hace más de 30 días

Oferta promocionada

Senior Site Reliability Engineer

SimMexico

Teletrabajo

For over 50 years, we have worked closely with investment and asset managers to become the world’s leading provider of integrated investment management solutions. We are 3,000+ colleagues with a bro...Mostrar másÚltima actualización: hace 9 días

Oferta promocionada

Site Reliability Engineer

Altimetrik MexicoMexico

Senior Site Reliability Engineer (SRE).English skills (B2 / C1) for a full-time position.We are currently seeking a highly skilled. IT operations and encompassing development.As a crucial figure in th...Mostrar másÚltima actualización: hace 24 días

Oferta promocionada

FBS Site Reliability Engineer

CapgeminiMexico

Teletrabajo

Be among the first 25 applicants.This range is provided by Capgemini.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Our Client is one of the Un...Mostrar másÚltima actualización: hace 10 días

Oferta promocionada

Site Reliability Engineer

EXLMéxico, Mexico, Mexico

We are seeking a highly motivated and skilled Site Reliability Engineer (SRE) to join our team.The ideal candidate will have a passion for continuous learning, a collaborative mindset to work with ...Mostrar másÚltima actualización: hace 26 días

Oferta promocionada

Senior Site Reliability Engineer (SRE)

EPAM SystemsMexico

Teletrabajo

Be among the first 25 applicants.EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employ...Mostrar másÚltima actualización: hace más de 30 días

Oferta promocionada

Senior Site Reliability Engineer

PerficientMexico

We currently have a career opportunity for a.Senior Site Reliability Engineer.Mexico or Colombia (only this locations).As a Senior Technical Consultant you will participate in all aspects of the so...Mostrar másÚltima actualización: hace 24 días

Oferta promocionada

Site Reliability Engineer

HCLTechMexico, Mexico

HCLTech is a global technology company, home to more than 223,000 people across 60 countries, delivering industry-leading capabilities centered around digital, engineering, cloud and AI, powered by...Mostrar másÚltima actualización: hace más de 30 días

Oferta promocionada

Site Reliability Engineer

New Era TechnologyMexico

Teletrabajo

Senior IT Recruiter | IT Recruitment, Talent Acquisition Specialist, Recruitment Team Lead.Join our team as a Site Reliability Engineer (SRE) Engineer. We’re looking for someone who has fresh ideas ...Mostrar másÚltima actualización: hace 25 días

Oferta promocionada

Lead Site Reliability Engineer

SimMexico

Teletrabajo

Join some of the most innovative thinkers in FinTech as we lead the evolution of financial technology.If you are an innovative, curious, collaborative person who embraces challenges and wants to gr...Mostrar másÚltima actualización: hace 9 días

Oferta promocionada

Site Reliability Engineer (SRE)

OnHiresMexico, Mexico

Site Reliability Engineer (SRE) .Fully Remote (Offices in Limassol, Kyiv, London, Tbilisi).Availability to work between 5 PM and 8 AM CET, in one of the following shifts : 17 : 00–01 : 00 or 00 : 00–08 : 00...Mostrar másÚltima actualización: hace más de 30 días

Oferta promocionada

Site Reliability Engineer

KI peopleMexico

Teletrabajo

Be among the first 25 applicants.Direct message the job poster from KI people.In Search of the Best Global IT & Digital Talent. The SRE Operations specialist focuses on B2B applications support prov...Mostrar másÚltima actualización: hace más de 30 días

Oferta promocionada

7777 - Site Reliability Engineer Cloud and Infrastructure Mexico Published Today

UnosquareMexico

Teletrabajo

Senior Site Reliability Engineer.We are looking for a Senior Site Reliability Engineer to join our Cloud Infrastructure Engineering division. Cloud Infrastructure Engineering ensures the continuous ...Mostrar másÚltima actualización: hace más de 30 días

Oferta promocionada

Site Reliability Engineer

ConfidencialMexico, Mexico

Estamos en búsqueda de un / a Ingeniero / a SRE senior para potencialmente sumarse a un proyecto de consultoría.El rol tendrá como objetivo fortalecer la confiabilidad, estabilidad y resiliencia de los...Mostrar másÚltima actualización: hace 24 días

Oferta promocionada

7772 - Site Reliability Engineer Cloud and Infrastructure Mexico Published Today

UnosquareMexico