Talent.com
Esta oferta de trabajo no está disponible en tu país.
Senior Site Reliability Engineer (SRE)

Senior Site Reliability Engineer (SRE)

MarvikMexico, MX
Hace más de 30 días
Tipo de contrato
  • Quick Apply
Descripción del trabajo

What’s the opportunity?

We’re looking for a Site Reliability Engineer (SRE) to join our team!

As an SRE, you're expected to ask key questions like :

What data do we need to understand how our systems are performing?

How do we collect that data?

What patterns are we looking for, and what do they mean?

Who needs to be alerted when something isn’t working?

Are there any systems where we need more or better data?An SRE designs systems and processes to answer these questions and automate support and response wherever possible.

Responsibilities :

Own OpenTelemetry Pipelines : Design, implement, and maintain observability pipelines across logs, metrics, and traces, ensuring standardized, scalable, and efficient data ingestion. Optimize ingestion strategies for cost, performance, and usability.

Empower Engineering Teams : Build self-service automation and tooling that lets development teams implement observability without needing manual SRE support. Drive best practices and ensure teams take ownership of their telemetry.

Support Incident Management : Act as the engineering arm of the Incident Management Team—designing playbooks, processes, checklists, and automations to support teams during incidents.

Collaborate Across Teams : Work with teams across the business to understand their monitoring, alerting, and SLO / SLA needs. Design solutions that meet or exceed these requirements and influence architectural decisions from the start to ensure scalability and resilience.

Automate Observability Infrastructure : Use Infrastructure-as-Code (IaC) to manage monitoring tools, alert rules, and observability configurations across OTEL pipelines.

Define Baseline Observability Standards : Create base-level requirements to ensure all infrastructure and code is monitored consistently and accurately.

Own Technical and Security Health : Take full ownership of infrastructure reliability and ensure alignment with key availability and security KPIs.

Optimize Alerting Systems : Continuously fine-tune alerting to reduce noise, ensure alerts are actionable, and improve response efficiency.

If you have

4+ years of experience as an SRE or in a similar observability-focused role.

Strong Kubernetes expertise, including components, deployment practices, and monitoring.

Familiarity with OpenTelemetry—setting up collectors, instrumentation, and pipeline optimization.

Experience with tools like Grafana, Prometheus, Loki, New Relic, or Datadog.

Hands-on experience with Infrastructure-as-Code (Terraform) and GitOps CI / CD (e.g., ArgoCD, GitHub Actions).

Experience integrating incident platforms (PagerDuty, Jira) into alerting workflows.

Strong scripting skills (Python, Go, etc.) to automate observability tasks.

A problem-solving mindset and ability to collaborate across teams to improve reliability.

It’s a plus :

Cloud experience, especially with AWS and ECS workloads.

Experience managing observability pipelines at scale in high-throughput environments.

Familiarity with Configuration-as-Code tools (Ansible, Chef, or SaltStack).

Experience with database performance monitoring in large-scale distributed systems.

Crear una alerta de empleo para esta búsqueda

Reliability Engineer • Mexico, MX

Ofertas relacionadas
Senior SRE (Site Reliability Engineer) - Remote

Senior SRE (Site Reliability Engineer) - Remote

SailPoint(Mexico)
SailPoint is the leader in identity security for the cloud enterprise.Our identity security solutions secure and enable thousands of companies worldwide, giving our customers unmatched visibility i...Mostrar másÚltima actualización: hace 27 días
  • Oferta promocionada
Senior Site Reliability Engineer

Senior Site Reliability Engineer

DuckDuckGoMexico
Be among the first 25 applicants.Hi, we're DuckDuckGo, the online protection company and remote-first team of 300+ on a mission to raise the standard of trust online. Founded in 2008 and profitable ...Mostrar másÚltima actualización: hace más de 30 días
Lead Site Reliability Engineer

Lead Site Reliability Engineer

EpamMexico
Lead Site Reliability Engineer!.In this role, you'll supervise and monitor a variety of projects, set up application Observability and Telemetry, and perform advanced troubleshooting of incidents i...Mostrar másÚltima actualización: hace más de 30 días
  • Oferta promocionada
Site Reliability Engineer

Site Reliability Engineer

RELEX SolutionsMexico
English is our primary business language, we kindly ask that all applications (CVs / resumes and cover letters) be submitted in English. At RELEX, Engineering means to tackle complex challenges with c...Mostrar másÚltima actualización: hace 23 días
Site Reliability Engineer

Site Reliability Engineer

E-SolutionsMexico
Months Contract / Contract to Hire.Bachelor’s degree in CS, CE, SE, CIS, IT or IS.Java and Terraform knowledge will be an added advantage. Working experience in one of the Cloud services – .Deep exper...Mostrar másÚltima actualización: hace más de 30 días
  • Oferta promocionada
Senior Site Reliability Engineer (SRE)

Senior Site Reliability Engineer (SRE)

EPAM SystemsMexico
Be among the first 25 applicants.EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employ...Mostrar másÚltima actualización: hace 16 días
  • Oferta promocionada
Site Reliability Engineer

Site Reliability Engineer

BukMexico
En Buk, estamos en una misión audaz : .Somos un equipo joven, lleno de energía y pasión por revolucionar la gestión de RRHH en Latinoamérica. La tecnología es nuestra herramienta principal, y nos esfo...Mostrar másÚltima actualización: hace más de 30 días
  • Oferta promocionada
Site Reliability Engineer

Site Reliability Engineer

S&P GlobalMexico, Mexico
Principal Site Reliability Engineer.Automotive Insights leverages technology and data science to provide unique insights, forecasts and advisory services spanning every major market and the entire ...Mostrar másÚltima actualización: hace más de 30 días
  • Oferta promocionada
Site Reliability Engineer

Site Reliability Engineer

Gold Light DataMexico, Mexico
Candidates Applying for this position need to have a proeficient level of English Professional (Spoken / Writing).Please" Is Useless to apply if you dont have this level of English.Candidates MUST ...Mostrar másÚltima actualización: hace 11 días
  • Oferta promocionada
SRE Engineer

SRE Engineer

2BrainsMexico
Get AI-powered advice on this job and more exclusive features.Brains es una empresa dedicada a construir y desarrollar el Futuro Digital de nuestros clientes, con una visión excepcional que radica ...Mostrar másÚltima actualización: hace 11 días
  • Oferta promocionada
Senior Site Reliability Engineer

Senior Site Reliability Engineer

Persistent SystemsMexico
Be among the first 25 applicants.Get AI-powered advice on this job and more exclusive features.We are an AI-led, platform-driven Digital Engineering and Enterprise Modernization partner, combining ...Mostrar másÚltima actualización: hace más de 30 días
  • Oferta promocionada
Senior Site Reliability Engineer

Senior Site Reliability Engineer

Turtle TraxMexico
Senior Site Reliability Engineer.Be among the first 25 applicants.Senior Site Reliability Engineer.Job Title : Senior Site Reliability Engineer (SRE). Experience : 5+ years Location : Mexico / LATAM.Enga...Mostrar másÚltima actualización: hace más de 30 días
Site Reliability Engineer

Site Reliability Engineer

Tyk Technologies LtdMX
Quick Apply
Who are Tyk, and what do we do?.The Tyk API Management platform is helping to drive the connected world and power new products and services. We’re changing the way that organisations connect a...Mostrar másÚltima actualización: hace 5 días
  • Oferta promocionada
Site Reliability Engineer

Site Reliability Engineer

Ust GlobalMéxico
Lead I - Software Engineering • •.Born digital, UST transforms lives through the power of technology.We walk alongside our clients and partners, embedding innovation and agility into everything they ...Mostrar másÚltima actualización: hace 1 día
  • Oferta promocionada
Site Reliability Engineer

Site Reliability Engineer

HCLTechMexico, Mexico
HCL is looking for around 7 technical support engineers with the below skills : .Provide exceptional technical support to customers and partners via chat, email, phone, and screen-share sessions.Trou...Mostrar másÚltima actualización: hace 18 días
  • Oferta promocionada
Site Reliability Engineer

Site Reliability Engineer

KI peopleMexico
Be among the first 25 applicants.Direct message the job poster from KI people.In Search of the Best Global IT & Digital Talent. The SRE Operations specialist focuses on B2B applications support prov...Mostrar másÚltima actualización: hace más de 30 días
Senior Site Reliability Engineer

Senior Site Reliability Engineer

Valce Talent SolutionsMexico
Quick Apply
We help our clients enhance their talent attraction capacities, especially in technological profiles.We constantly innovate and actively seek to find the best solutions for clients and professional...Mostrar másÚltima actualización: hace más de 30 días
  • Oferta promocionada
Site Reliability Engineer

Site Reliability Engineer

HcltechMéxico
HCL is looking for around 7 technical support engineers with the below skills : SKILLS : • Provide exceptional technical support to customers and partners via chat, email, phone, and screen-share ses...Mostrar másÚltima actualización: hace 2 días