Talent.com
Principal Site Reliability Developer
Principal Site Reliability DeveloperOracle • Región Centro, Jalisco, Mexico
No se aceptan más aplicaciones
Principal Site Reliability Developer

Principal Site Reliability Developer

Oracle • Región Centro, Jalisco, Mexico
Hace más de 30 días
Descripción del trabajo

Overview

As a senior member of the Site Reliability Engineering (SRE) team, you'll take ownership of highly available systems, influence service design, and work across teams to drive resiliency, automation, and operational excellence. This is a hands-on engineering role where deep infrastructure knowledge meets software engineering expertise, ideal for experienced SREs ready to take the lead.

Responsibilities

  • Lead the design, automation, and support of OCI services with a focus on resiliency, security, scalability, and performance.
  • Own and improve the end-to-end reliability metrics (SLOs, SLAs, KPIs) for your services.
  • Design and implement high-availability architectures and standards for large-scale distributed systems.
  • Serve as the ultimate escalation point for complex operational issues, using a deep understanding of service topologies and interdependencies.
  • Architect and build automation and orchestration tools that reduce manual work and prevent problem recurrence.
  • Collaborate with development teams to improve service designs, optimize deployments, and implement best practices for operational efficiency.
  • Guide technical decision-making and mentor junior SREs and developers across teams.
  • Participate in and lead postmortems, root cause analysis, and preventative design changes.
  • Contribute to capacity planning, demand forecasting, and long-term service scalability strategies.
  • Participate in a rotational on-call schedule to ensure the health and availability of production services.

Qualifications

  • Advanced experience with Linux systems administration
  • Strong programming skills in Python (with automation libraries)
  • Advanced Bash / Shell scripting
  • Deep understanding of distributed systems, networking, and service architecture
  • Solid knowledge of databases and how they behave in production (SQL or NoSQL)
  • Strong understanding of CI / CD pipelines, Agile methodologies, and DevOps best practices
  • Experience writing and maintaining unit tests and production-grade software
  • Proven ability to lead cross-functional efforts and technical problem-solving in live environments
  • Nice to Have

  • Hands-on experience with monitoring and observability tools (Grafana, Prometheus, New Relic, etc.)
  • Familiarity with Oracle Cloud Infrastructure (OCI) or other cloud platforms (AWS, Azure, GCP)
  • Experience with Infrastructure-as-Code (Terraform, Ansible) and container orchestration (Kubernetes)
  • #J-18808-Ljbffr

    Crear una alerta de empleo para esta búsqueda

    Site Reliability • Región Centro, Jalisco, Mexico

    Ofertas relacionadas
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Canonical • Región Centro, Jalisco, Mexico
    Teletrabajo
    We are hiring a Senior Site Reliability Engineer.Next-gen operations at scale, with pure Python infra-as-code, from bare metal to containers and applications. Our goal is to perfect enterprise infra...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Golang Developer - Remote, Latin America

    Golang Developer - Remote, Latin America

    Bluelight • Región Centro, Jalisco, Mexico
    Teletrabajo
    Golang Developer - Remote, Latin America.Bluelight is a leading software consultancy dedicated to designing and developing innovative technology that enhances users’ lives.With a steadfast commitme...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Site Reliability Engineer (Middle / Senior) ID38916

    Site Reliability Engineer (Middle / Senior) ID38916

    AgileEngine • Región Centro, Jalisco, Mexico
    Teletrabajo
    Site Reliability Engineer (Middle / Senior) ID38916 at AgileEngine.Join to apply for the Site Reliability Engineer (Middle / Senior) ID38916 role at AgileEngine. Shift : Monday – Thursday 8AM – 7PM PST (...Mostrar más
    Última actualización: hace 26 días • Oferta promocionada
    Site Reliability Engineer (Hybrid / Flexible)

    Site Reliability Engineer (Hybrid / Flexible)

    Insulet Corporation • Región Centro, Jalisco, Mexico
    Teletrabajo
    Insulet started in 2000 with an idea and a mission to enable our customers to enjoy simplicity, freedom and healthier lives through the use of our Omnipod product platform.In the last two decades w...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Principal ML Engineer

    Principal ML Engineer

    Launch Potato • Región Centro, Jalisco, Mexico
    Teletrabajo
    As The Discovery and Conversion Company, our mission is to connect consumers with the world’s leading brands through data-driven content and technology. Headquartered in South Florida with a remote-...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Site Reliability Engineer ID45689

    Site Reliability Engineer ID45689

    AgileEngine • Región Centro, Jalisco, Mexico
    Site Reliability Engineer ID45689.Fortune 500 brands and trailblazing startups across 17+ industries.We rank among the leaders in areas like application development and AI / ML, and our people-first ...Mostrar más
    Última actualización: hace 1 día • Oferta promocionada
    Software Development Lead - R01554901

    Software Development Lead - R01554901

    Brillio • Región Centro, Jalisco, Mexico
    Typescript, JavaScript, NodeJS, CSS3, Nestjs, React JS, CI / CD Pipeline, Oracle RDBMS, Mongo, Kafka, Docker, HTML5, Jest, Express JS, Kubernetes. Serve as senior full stack engineer developing respon...Mostrar más
    Última actualización: hace 11 días • Oferta promocionada
    Site Reliability Engineer - Engineer II

    Site Reliability Engineer - Engineer II

    FICO • Región Centro, Jalisco, Mexico
    Teletrabajo
    Site Reliability Engineer - Engineer II.Be among the first 25 applicants.Site Reliability Engineer - Engineer II.Get AI-powered advice on this job and more exclusive features.Join our world-class t...Mostrar más
    Última actualización: hace 11 días • Oferta promocionada
    SRE Developer

    SRE Developer

    TouchTunes • Región Centro, Jalisco, Mexico
    Teletrabajo
    As a Site Reliability Engineer (SRE) embedded in our mobile app development squads, you will work side‑by‑side with backend and mobile engineers to ensure new features and services are reliable, sc...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    NTT DATA North America • Región Centro, Jalisco, Mexico
    SRE – Site Reliability Engineer.We are currently seeking a Site Reliability Engineer to join our team in Guadalajara, Jalisco, Mexico. In this role you will perform L1.You will monitor the efficienc...Mostrar más
    Última actualización: hace 2 días • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    Canonical • Región Centro, Jalisco, Mexico
    Teletrabajo
    Be among the first 25 applicants.Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used i...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Site Reliability Engineer - Engineer I

    Site Reliability Engineer - Engineer I

    FICO • Región Centro, Jalisco, Mexico
    Site Reliability Engineer - Engineer I page is loaded## Site Reliability Engineer - Engineer Ilocations : Guadalajara, Mexicotime type : Full timeposted on : Posted Yesterdayjob requisition id : ...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Developer Relations Engineer

    Developer Relations Engineer

    Canonical • Región Centro, Jalisco, Mexico
    Teletrabajo
    Be among the first 25 applicants.As the publisher of Ubuntu we serve millions of developers, building for the cloud, IoT and data science. We aim to make open source easier and more reliable for inn...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Senior Site Reliability Engineer (SRE)

    Senior Site Reliability Engineer (SRE)

    Capgemini • Región Centro, Jalisco, Mexico
    Teletrabajo
    Senior Site Reliability Engineer.Join a collaborative team building and operating large-scale cloud platforms that power next‑generation connectivity and customer experiences.This is a hands‑on rol...Mostrar más
    Última actualización: hace 5 días • Oferta promocionada
    Remote SRE : Build Reliable Cloud, DevSecOps & CI / CD

    Remote SRE : Build Reliable Cloud, DevSecOps & CI / CD

    AgileEngine • Región Centro, Jalisco, Mexico
    A leading tech company in Mexico is seeking a Site Reliability Engineer to design and optimize AWS infrastructure.You will work closely with product and platform teams and drive automation and obse...Mostrar más
    Última actualización: hace 1 día • Oferta promocionada
    Senior Site Reliability Engineer (SRE)

    Senior Site Reliability Engineer (SRE)

    Capgemini Engineering • Región Centro, Jalisco, Mexico
    Teletrabajo
    Senior Site Reliability Engineer (SRE).Engineering and Information Technology.We’re hiring a Senior Site Reliability Engineer to join a major telecom client through Capgemini Engineering.Join a col...Mostrar más
    Última actualización: hace 28 días • Oferta promocionada
    Azure SRE : Cloud Reliability Engineer

    Azure SRE : Cloud Reliability Engineer

    NTT DATA • Región Centro, Jalisco, Mexico
    A global technology services provider is seeking a Site Reliability Engineer in GDL, Jalisco, Mexico.The role involves monitoring and managing Azure cloud systems to prevent outages, executing depl...Mostrar más
    Última actualización: hace 2 días • Oferta promocionada
    Principal ML Engineer, Recommendation Systems

    Principal ML Engineer, Recommendation Systems

    Launch Potato • Región Centro, Jalisco, Mexico
    Teletrabajo
    As The Discovery and Conversion Company, our mission is to connect consumers with the world’s leading brands through data-driven content and technology. Headquartered in South Florida with a remote-...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada