Talent.com
Esta oferta de trabajo no está disponible en tu país.
Principal Site Reliability Developer

Principal Site Reliability Developer

OracleZapopan, Jalisco, Mexico
Hace más de 30 días
Descripción del trabajo

As a senior member of the Site Reliability Engineering (SRE) team, you'll take ownership of highly available systems, influence service design, and work across teams to drive resiliency, automation, and operational excellence. This is a hands-on engineering role where deep infrastructure knowledge meets software engineering expertise, ideal for experienced SREs ready to take the lead.

Qualifications

Career Level - IC4

Responsibilities

What You’ll Do :

  • Lead the design, automation, and support of OCI services with a focus on resiliency, security, scalability, and performance.
  • Own and improve the end-to-end reliability metrics (SLOs, SLAs, KPIs) for your services.
  • Design and implement high-availability architectures and standards for large-scale distributed systems.
  • Serve as the ultimate escalation point for complex operational issues, using a deep understanding of service topologies and interdependencies.
  • Architect and build automation and orchestration tools that reduce manual work and prevent problem recurrence.
  • Collaborate with development teams to improve service designs, optimize deployments, and implement best practices for operational efficiency.
  • Guide technical decision-making and mentor junior SREs and developers across teams.
  • Participate in and lead postmortems, root cause analysis, and preventative design changes.
  • Contribute to capacity planning, demand forecasting, and long-term service scalability strategies.
  • Participate in a rotational on-call schedule to ensure the health and availability of production services.

What We’re Looking For :

  • Advanced experience with Linux systems administration
  • Strong programming skills in Python (with automation libraries)
  • Advanced Bash / Shell scripting
  • Deep understanding of distributed systems, networking, and service architecture
  • Solid knowledge of databases and how they behave in production (SQL or NoSQL)
  • Strong understanding of CI / CD pipelines, Agile methodologies, and DevOps best practices
  • Experience writing and maintaining unit tests and production-grade software
  • Proven ability to lead cross-functional efforts and technical problem-solving in live environments
  • Nice to Have :

  • Hands-on experience with monitoring and observability tools (Grafana, Prometheus, New Relic, etc.)
  • Familiarity with Oracle Cloud Infrastructure (OCI) or other cloud platforms (AWS, Azure, GCP)
  • Experience with Infrastructure-as-Code (Terraform, Ansible) and container orchestration (Kubernetes)
  • #J-18808-Ljbffr

    Crear una alerta de empleo para esta búsqueda

    Site Reliability • Zapopan, Jalisco, Mexico

    Ofertas relacionadas
    • Oferta promocionada
    Principal Service Reliability Engineer

    Principal Service Reliability Engineer

    OracleZapopan, Jalisco, México
    Job DescriptionThis role requires a SRE mindset combined with AI / ML expertise and strong application engineering skills across public and private cloud environments. ResponsibilitiesKey Responsibili...Mostrar másÚltima actualización: hace 3 días
    • Oferta promocionada
    • Nueva oferta
    Principal I, Application Development (.Net).

    Principal I, Application Development (.Net).

    HerbalifeTlaquepaque, Jalisco, Mexico
    Teletrabajo
    Position reports to : Mauricio Gonzalez.Work schedule : Hybrid, going to the office in GDL for 3 days.The Principal of Application Development acts as a technical expert on a specific area in Applica...Mostrar másÚltima actualización: hace 13 horas
    • Oferta promocionada
    Reliability Leader Spicy

    Reliability Leader Spicy

    The Hershey CompanyEl Salto, Jalisco, Mexico
    Vacante : Supervisor de Mantenimiento / Líder de Mantenimiento (Reliability Leader).Ubicación : El Salto, Guadalajara.A cargo de la supervisión del área de mantenimiento en el área de producción, cum...Mostrar másÚltima actualización: hace más de 30 días
    Site Reliability Engineer (Middle) ID38916

    Site Reliability Engineer (Middle) ID38916

    AgileEngineZapopan, JAL, mx
    Quick Apply
    Fortune 500 brands and trailblazing startups across 17+ industries.We rank among the leaders in areas like application development and AI / ML, and our people-first culture has earned us multiple Bes...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    HcltechZapopan, Jalisco, México
    HCLTech is a global technology company, home to more than 223,000 people across 60 countries, delivering industry-leading capabilities centered around digital, engineering, cloud and AI, powered by...Mostrar másÚltima actualización: hace 18 días
    • Oferta promocionada
    Reliability Solutions Architect

    Reliability Solutions Architect

    BebeeinnovationZapopan, Jalisco, México
    HCLTech has over 223,000 people across 60 countries and offers industry-leading capabilities centered around digital, engineering, cloud and AI. We work with clients across various sectors, providin...Mostrar másÚltima actualización: hace 2 días
    • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    HCLTechZapopan, Jalisco, Mexico
    HCLTech is a global technology company, home to more than 223,000 people across 60 countries, delivering industry-leading capabilities centered around digital, engineering, cloud and AI, powered by...Mostrar másÚltima actualización: hace 18 días
    • Oferta promocionada
    SRE - Remote

    SRE - Remote

    USTGuadalajara, Jalisco, Mexico
    Site Reliability Engineer (SRE).DevOps expertise to help build, scale, and maintain our infrastructure and services.You will play a critical role in ensuring high availability, performance, scalabi...Mostrar másÚltima actualización: hace 7 días
    • Oferta promocionada
    • Nueva oferta
    Site Reliability Engineer

    Site Reliability Engineer

    Epsilon Solutions Ltd. Sa De Cv.Guadalajara, Jalisco, México
    Job profile : -SRE (Site reliability Engineer)Location : Guadalajara, MexicoJob : Full TimeSkills : SRE practices, DevOps (Limited experience), monitoring and alertingTools - Bamboo, Chef, Git, Kubernetes...Mostrar másÚltima actualización: hace 4 horas
    • Oferta promocionada
    Site Reliability Engineer (10 / 09 / 2025)...

    Site Reliability Engineer (10 / 09 / 2025)...

    Tata Consultancy ServicesGuadalajara, Jalisco, MX
    Role : Site Reliability Engineer Location : Guadalajara Work Mode : On-Site Technical : - Experienced in implementing SRE practices to help us setup SLOs within SLO repository for our applications....Mostrar másÚltima actualización: hace 9 días
    • Oferta promocionada
    Technical Construction Projects Lead

    Technical Construction Projects Lead

    DollarcityGuadalajara, Mexico Metropolitan Area, Mexico
    Dollarcity sigue rompiendo esquemas en el mundo del retail.Con nuestro innovador modelo de negocio hemos logrado aperturar más de 650 tiendas en 5 países de la región, agregándole valor a nuestros ...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Engineering Technical Lead

    Engineering Technical Lead

    ATG (Auction Technology Group)Guadalajara, Mexico Metropolitan Area, Mexico
    Auction Technology Group is expanding its team in Guadalajara, Mexico! We are looking for exceptional engineering talent to join the team and help us transform the Auction industry.The important th...Mostrar másÚltima actualización: hace 16 días
    • Oferta promocionada
    ETL Developer

    ETL Developer

    HCLTechGuadalajara, Jalisco, Mexico
    Teletrabajo
    Get AI-powered advice on this job and more exclusive features.Sign in to access AI-powered advices.Continue with Google Continue with Google. Continue with Google Continue with Google.Continue with ...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    System Integration Developer / Programmer Analyst

    System Integration Developer / Programmer Analyst

    QuantumZapopan, Jalisco, Mexico
    Teletrabajo
    Quantum delivers end-to-end data management solutions designed for the AI era.With over four decades of experience, our data platform has allowed customers to extract the maximum value from their u...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Engineer Software - Partner Portal - Remote, Mexico

    Engineer Software - Partner Portal - Remote, Mexico

    PaylocityGuadalajara, Jalisco, Mexico
    Teletrabajo
    Paylocity is an equal opportunity employer.Remote (Must be based in Guadalajara).Paylocity is an award-winning provider of cloud-based HR and payroll software solutions, offering the most complete ...Mostrar másÚltima actualización: hace 29 días
    • Oferta promocionada
    Mid-Level Software Developer

    Mid-Level Software Developer

    Unkown DestinationGuadalajara, Jalisco, Mexico
    Teletrabajo
    Be among the first 25 applicants.Direct message the job poster from Unkown Destination.Contract for 12 months with the possibility of extension. At least 3 years of experience.Notes : Please note that...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Principal ML Engineer, Recommendation Systems

    Principal ML Engineer, Recommendation Systems

    Launch PotatoGuadalajara, Jalisco, Mexico
    Teletrabajo
    Principal ML Engineer, Recommendation Systems.As The Discovery and Conversion Company, our mission is to connect consumers with the world’s leading brands through data-driven content and technology...Mostrar másÚltima actualización: hace 6 días
    • Oferta promocionada
    Software Development Lead – R01554670

    Software Development Lead – R01554670

    BrillioGuadalajara, Jalisco, Mexico
    Software Development Lead - R01554670.Typescript, JavaScript, NodeJS, CSS3, Nestjs, CI / CD Pipeline, Mongo, Docker, HTML5, Express JS, Kubernetes. MEAN Fullstack with Microservices : Senior Software D...Mostrar másÚltima actualización: hace 21 días
    • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    Tata Consultancy ServicesGuadalajara, Jalisco, Mexico
    As a Site Reliability Engineer, you will be responsible for ensuring the reliability, availability, and performance of our systems and services. You will work closely with development and operations...Mostrar másÚltima actualización: hace 23 días
    • Oferta promocionada
    TIDAL Expert - Remote

    TIDAL Expert - Remote

    USTGuadalajara, Jalisco, Mexico
    Creates and supports the ETL process to extract the data from source systems and place it into the data warehouse.Performs data warehouse design and testing, including data design, database archite...Mostrar másÚltima actualización: hace 16 días