Esta oferta de trabajo no está disponible en tu país.

Principal Site Reliability Developer

OracleZapopan, Jalisco, Mexico

Hace más de 30 días

Descripción del trabajo

As a senior member of the Site Reliability Engineering (SRE) team, you'll take ownership of highly available systems, influence service design, and work across teams to drive resiliency, automation, and operational excellence. This is a hands-on engineering role where deep infrastructure knowledge meets software engineering expertise, ideal for experienced SREs ready to take the lead.

Qualifications

Career Level - IC4

Responsibilities

What You’ll Do :

Lead the design, automation, and support of OCI services with a focus on resiliency, security, scalability, and performance.
Own and improve the end-to-end reliability metrics (SLOs, SLAs, KPIs) for your services.
Design and implement high-availability architectures and standards for large-scale distributed systems.
Serve as the ultimate escalation point for complex operational issues, using a deep understanding of service topologies and interdependencies.
Architect and build automation and orchestration tools that reduce manual work and prevent problem recurrence.
Collaborate with development teams to improve service designs, optimize deployments, and implement best practices for operational efficiency.
Guide technical decision-making and mentor junior SREs and developers across teams.
Participate in and lead postmortems, root cause analysis, and preventative design changes.
Contribute to capacity planning, demand forecasting, and long-term service scalability strategies.
Participate in a rotational on-call schedule to ensure the health and availability of production services.

What We’re Looking For :

Advanced experience with Linux systems administration

Strong programming skills in Python (with automation libraries)

Advanced Bash / Shell scripting

Deep understanding of distributed systems, networking, and service architecture

Solid knowledge of databases and how they behave in production (SQL or NoSQL)

Strong understanding of CI / CD pipelines, Agile methodologies, and DevOps best practices

Experience writing and maintaining unit tests and production-grade software

Proven ability to lead cross-functional efforts and technical problem-solving in live environments

Nice to Have :

Hands-on experience with monitoring and observability tools (Grafana, Prometheus, New Relic, etc.)

Familiarity with Oracle Cloud Infrastructure (OCI) or other cloud platforms (AWS, Azure, GCP)

Experience with Infrastructure-as-Code (Terraform, Ansible) and container orchestration (Kubernetes)

#J-18808-Ljbffr

Crear una alerta de empleo para esta búsqueda

Site Reliability • Zapopan, Jalisco, Mexico

Ofertas relacionadas

Oferta promocionada

Principal Service Reliability Engineer

OracleZapopan, Jalisco, México

Job DescriptionThis role requires a SRE mindset combined with AI / ML expertise and strong application engineering skills across public and private cloud environments. ResponsibilitiesKey Responsibili...Mostrar másÚltima actualización: hace 3 días

Oferta promocionada
Nueva oferta

Principal I, Application Development (.Net).

HerbalifeTlaquepaque, Jalisco, Mexico

Teletrabajo

Position reports to : Mauricio Gonzalez.Work schedule : Hybrid, going to the office in GDL for 3 days.The Principal of Application Development acts as a technical expert on a specific area in Applica...Mostrar másÚltima actualización: hace 13 horas

Oferta promocionada

Reliability Leader Spicy

The Hershey CompanyEl Salto, Jalisco, Mexico

Vacante : Supervisor de Mantenimiento / Líder de Mantenimiento (Reliability Leader).Ubicación : El Salto, Guadalajara.A cargo de la supervisión del área de mantenimiento en el área de producción, cum...Mostrar másÚltima actualización: hace más de 30 días

Site Reliability Engineer (Middle) ID38916

AgileEngineZapopan, JAL, mx

Quick Apply

Fortune 500 brands and trailblazing startups across 17+ industries.We rank among the leaders in areas like application development and AI / ML, and our people-first culture has earned us multiple Bes...Mostrar másÚltima actualización: hace más de 30 días

Oferta promocionada

Site Reliability Engineer

HcltechZapopan, Jalisco, México

HCLTech is a global technology company, home to more than 223,000 people across 60 countries, delivering industry-leading capabilities centered around digital, engineering, cloud and AI, powered by...Mostrar másÚltima actualización: hace 18 días

Oferta promocionada

Reliability Solutions Architect

BebeeinnovationZapopan, Jalisco, México

HCLTech has over 223,000 people across 60 countries and offers industry-leading capabilities centered around digital, engineering, cloud and AI. We work with clients across various sectors, providin...Mostrar másÚltima actualización: hace 2 días

Oferta promocionada

Site Reliability Engineer

HCLTechZapopan, Jalisco, Mexico

Oferta promocionada

SRE - Remote

USTGuadalajara, Jalisco, Mexico

Site Reliability Engineer (SRE).DevOps expertise to help build, scale, and maintain our infrastructure and services.You will play a critical role in ensuring high availability, performance, scalabi...Mostrar másÚltima actualización: hace 7 días

Oferta promocionada
Nueva oferta

Site Reliability Engineer

Epsilon Solutions Ltd. Sa De Cv.Guadalajara, Jalisco, México

Job profile : -SRE (Site reliability Engineer)Location : Guadalajara, MexicoJob : Full TimeSkills : SRE practices, DevOps (Limited experience), monitoring and alertingTools - Bamboo, Chef, Git, Kubernetes...Mostrar másÚltima actualización: hace 4 horas

Oferta promocionada

Site Reliability Engineer (10 / 09 / 2025)...

Tata Consultancy ServicesGuadalajara, Jalisco, MX

Role : Site Reliability Engineer Location : Guadalajara Work Mode : On-Site Technical : - Experienced in implementing SRE practices to help us setup SLOs within SLO repository for our applications....Mostrar másÚltima actualización: hace 9 días

Oferta promocionada

Technical Construction Projects Lead

DollarcityGuadalajara, Mexico Metropolitan Area, Mexico

Dollarcity sigue rompiendo esquemas en el mundo del retail.Con nuestro innovador modelo de negocio hemos logrado aperturar más de 650 tiendas en 5 países de la región, agregándole valor a nuestros ...Mostrar másÚltima actualización: hace más de 30 días

Oferta promocionada

Engineering Technical Lead

ATG (Auction Technology Group)Guadalajara, Mexico Metropolitan Area, Mexico

Auction Technology Group is expanding its team in Guadalajara, Mexico! We are looking for exceptional engineering talent to join the team and help us transform the Auction industry.The important th...Mostrar másÚltima actualización: hace 16 días

Oferta promocionada

ETL Developer

HCLTechGuadalajara, Jalisco, Mexico

Teletrabajo

Get AI-powered advice on this job and more exclusive features.Sign in to access AI-powered advices.Continue with Google Continue with Google. Continue with Google Continue with Google.Continue with ...Mostrar másÚltima actualización: hace más de 30 días

Oferta promocionada

System Integration Developer / Programmer Analyst

QuantumZapopan, Jalisco, Mexico

Teletrabajo

Quantum delivers end-to-end data management solutions designed for the AI era.With over four decades of experience, our data platform has allowed customers to extract the maximum value from their u...Mostrar másÚltima actualización: hace más de 30 días

Oferta promocionada

Engineer Software - Partner Portal - Remote, Mexico

PaylocityGuadalajara, Jalisco, Mexico

Teletrabajo

Paylocity is an equal opportunity employer.Remote (Must be based in Guadalajara).Paylocity is an award-winning provider of cloud-based HR and payroll software solutions, offering the most complete ...Mostrar másÚltima actualización: hace 29 días

Oferta promocionada

Mid-Level Software Developer

Unkown DestinationGuadalajara, Jalisco, Mexico

Teletrabajo

Be among the first 25 applicants.Direct message the job poster from Unkown Destination.Contract for 12 months with the possibility of extension. At least 3 years of experience.Notes : Please note that...Mostrar másÚltima actualización: hace más de 30 días

Oferta promocionada

Principal ML Engineer, Recommendation Systems

Launch PotatoGuadalajara, Jalisco, Mexico

Teletrabajo

Principal ML Engineer, Recommendation Systems.As The Discovery and Conversion Company, our mission is to connect consumers with the world’s leading brands through data-driven content and technology...Mostrar másÚltima actualización: hace 6 días

Oferta promocionada

Software Development Lead – R01554670

BrillioGuadalajara, Jalisco, Mexico

Software Development Lead - R01554670.Typescript, JavaScript, NodeJS, CSS3, Nestjs, CI / CD Pipeline, Mongo, Docker, HTML5, Express JS, Kubernetes. MEAN Fullstack with Microservices : Senior Software D...Mostrar másÚltima actualización: hace 21 días

Oferta promocionada

Site Reliability Engineer

Tata Consultancy ServicesGuadalajara, Jalisco, Mexico

As a Site Reliability Engineer, you will be responsible for ensuring the reliability, availability, and performance of our systems and services. You will work closely with development and operations...Mostrar másÚltima actualización: hace 23 días

Oferta promocionada

TIDAL Expert - Remote

USTGuadalajara, Jalisco, Mexico

Creates and supports the ETL process to extract the data from source systems and place it into the data warehouse.Performs data warehouse design and testing, including data design, database archite...Mostrar másÚltima actualización: hace 16 días