Talent.com
Senior Site Reliability Engineer

Senior Site Reliability Engineer

ZillowCiudad de México, Mexico
Hace 14 días
Tipo de contrato
  • Teletrabajo
Descripción del trabajo

About The Team The Touring and Connections (TCE) EngOps team drives reliability, scalability, and operational excellence across our engineering platforms. We partner with product, infrastructure, and development teams to ensure systems are performant, observable, secure, and cost‑efficient. This team resides within the Touring & Connections organization, which develops consumer‑facing features that are central to Zillow’s growth strategy. The development teams we support are responsible for Real‑time Tours (RTT), Zillow In‑App Messaging (ZIM), and agent–buyer connections. EngOps supports Zillow development teams in delivering quality, reliable software quickly and confidently.

About The Role As a Senior Site Reliability Engineer joining TCE EngOps, you will design, build, and operate the systems and tooling that ensure the availability and reliability of critical services. You’ll lead initiatives in observability, incident management, infrastructure automation, and performance optimization. In this role, you’ll collaborate closely with development teams, promote SRE best practices, and mentor peers to strengthen reliability culture across the organization.

You will participate in an L3 on‑call rotation, drive rapid recovery during incidents, and champion systemic improvements afterward. You’ll explore and apply emerging technologies, including AI‑driven practices and tooling, to continuously improve reliability, automation, and developer experience. This position emphasizes both hands‑on engineering and coaching / enablement, helping uplift the reliability capabilities of the broader engineering organization.

Responsibilities

  • Own the reliability, scalability, and performance of production services.
  • Define and implement SLOs / SLAs, error budgets, and capacity planning.
  • Design and evolve monitoring, alerting, and observability dashboards with tools such as Prometheus, Grafana, and Datadog.
  • Participate in incident response, blameless postmortems, chaos testing, and systemic remediation.
  • Drive safe release practices, including canary and blue‑green deployments, rollback automation, and CI / CD improvements.
  • Enable performance and load testing tooling to validate scalability and efficiency.
  • Apply cost optimization strategies to improve cloud spend efficiency.
  • Build and manage Infrastructure as Code with Terraform.
  • Operate and scale containerized services with Docker and Kubernetes.
  • Automate workflows and tooling using Python, Go, and Bash.
  • Implement cloud best practices in AWS (EC2, VPC, IAM, S3, Route 53).
  • Promote shift‑left reliability practices through pre‑launch reviews, CI quality gates, and risk identification.
  • Mentor, coach, and embed with engineering teams to share SRE practices and build reliability maturity.

Qualifications

  • 8+ years of SRE, DevOps, or Platform Engineering experience.
  • Proven expertise in designing SLOs, monitoring strategies, and incident response frameworks.
  • Strong proficiency with Terraform, GitLab CI / CD, and cloud infrastructure (AWS).
  • Hands‑on experience with Kubernetes and Docker.
  • Skilled in Python, Go, or Bash for automation and tooling.
  • Experienced with Prometheus, Grafana, Datadog, or Splunk for observability.
  • Deep understanding of networking, security practices, and cloud cost optimization.
  • Strong collaborator with experience in developer enablement, coaching, and knowledge sharing.
  • Excellent communicator who values blamelessness, automation, and continuous improvement.
  • Committed to continuous learning and exploration of emerging technologies, including AI and automation, to drive reliability excellence.
  • This is a remote position. U.S. employees may live in any of the 50 United States, with limited exceptions.

    In addition to a competitive base salary and benefits, this position is also eligible for equity awards based on experience, performance, and location.

    Zillow is reimagining real estate to make it easier to unlock life’s next chapter. As the most‑visited real estate website in the United States, Zillow® and its affiliates help people find and win their home through digital solutions, first‑class partners, and easier buying, selling, financing and renting experiences. Our culture promotes innovation, an inclusive work environment, and a commitment to equity and belonging.

    Zillow Group is an equal opportunity employer committed to fostering an inclusive, innovative environment with the best employees. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, and gender identity. If you have a disability or special need that requires accommodation, please contact your recruiter directly.

    Qualified applicants with arrest or conviction records will be considered for employment in accordance with applicable state and local law.

    #J-18808-Ljbffr

    Crear una alerta de empleo para esta búsqueda

    Site Reliability Engineer • Ciudad de México, Mexico

    Ofertas relacionadas
    • Oferta promocionada
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    SimCorpCiudad de México, Ciudad de México, Mexico
    Senior Site Reliability Engineer.Be among the first 25 applicants.Senior Site Reliability Engineer.Get AI-powered advice on this job and more exclusive features. For over 50 years, we have worked cl...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    Royal Caribbean InternationalCiudad de México, Mexico
    Journey with us! Combine your career goals and sense of adventure by joining our incredible team of employees at.We are proud to offer a competitive compensation and benefits package, and excellent...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    USTCiudad de México, Ciudad de México, Mexico
    Continue with Google Continue with Google.Get AI-powered advice on this job and more exclusive features.Sign in to access AI-powered advices. Continue with Google Continue with Google.Continue with ...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Lead, Site Reliability Engineer

    Lead, Site Reliability Engineer

    Royal Caribbean GroupCiudad de México, Mexico
    Journey with us! Combine your career goals and sense of adventure by joining our incredible team of employees at.We are proud to offer a competitive compensation and benefits package, and excellent...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Collide Capital LLCCiudad de México, Mexico
    Teletrabajo
    The Touring and Connections (TCE) EngOps team drives reliability, scalability, and operational excellence across our engineering platforms. We partner with product, infrastructure, and development t...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Lead, Site Reliability Engineer

    Lead, Site Reliability Engineer

    Royal Caribbean InternationalCiudad de México, Mexico
    Combine your career goals and sense of adventure by joining our incredible team of employees at.We are proud to offer a competitive compensation and benefits package, and excellent career developme...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Next MatterCiudad de México, Mexico
    Teletrabajo
    The FUB+ Infrastructure & Security Team at Zillow Group supports the Follow Up Boss systems, applications, and software engineering teams that power the businesses of tens of thousands of real esta...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    SimNaucalpan de Juárez, Estado de México, Mexico
    Teletrabajo
    Join some of the most innovative thinkers in FinTech as we lead the evolution of financial technology.If you are an innovative, curious, collaborative person who embraces challenges and wants to gr...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Royal Caribbean GroupCiudad de México, Ciudad de México, Mexico
    Senior Site Reliability Engineer.Be among the first 25 applicants.Combine your career goals and sense of adventure by joining our incredible team at. We offer a competitive compensation and benefits...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    Tata Consultancy ServicesCiudad de México, Ciudad de México, Mexico
    We are looking for a Site Reliability Engineer (SRE) to join our team and help us ensure seamless, high-performing, and reliable technology operations. Azure DevOps - Pipelines, repositories, and au...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Site Reliability Engineer (Middle / Senior) ID38916

    Site Reliability Engineer (Middle / Senior) ID38916

    AgileEngineCiudad de México, Mexico
    Fortune 500 brands and trailblazing startups across 17+ industries.We rank among the leaders in areas like application development and AI / ML, and our people-first culture has earned us multiple Bes...Mostrar másÚltima actualización: hace 2 días
    • Oferta promocionada
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    CanonicalEcatepec de Morelos, Estado de México, Mexico
    Teletrabajo
    Senior Site Reliability Engineer.Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used i...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    Publicis SapientCiudad de México, Ciudad de México, Mexico
    Teletrabajo
    Publicis Sapient is a leading digital transformation partner helping organizations reimagine their future in a digitally enabled world. We combine a start-up mindset with modern methods and deep ind...Mostrar másÚltima actualización: hace 2 días
    • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    DelineaCiudad de México, Ciudad de México, Mexico
    Teletrabajo
    Delinea is a pioneer in securing human and machine identities through intelligent, centralized authorization, empowering organizations to govern their interactions across the modern enterprise.The ...Mostrar másÚltima actualización: hace 22 días
    • Oferta promocionada
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    DelineaCiudad de México, Ciudad de México, Mexico
    Teletrabajo
    Delinea is a pioneer in securing human and machine identities through intelligent, centralized authorization, empowering organizations to seamlessly govern their interactions across the modern ente...Mostrar másÚltima actualización: hace 22 días
    • Oferta promocionada
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Royal Caribbean InternationalCiudad de México, Mexico
    Journey with us! Combine your career goals and sense of adventure by joining our incredible team of employees at Royal Caribbean Group. We are proud to offer a competitive compensation and benefits ...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    New Era TechnologyMexico City, Mexico
    Site Reliability Engineering (SRE) Engineer! the SRE Engineer we’re searching for someone who has fresh ideas and a unique viewpoint, and who enjoys collaborating with a cross-functional team to de...Mostrar másÚltima actualización: hace 4 días
    • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    MediumCiudad de México, Mexico
    Teletrabajo
    DEUNA is a rapidly growing startup revolutionizing global commerce with ATHIA, our AI-powered orchestration and payments platform that helps large enterprises boost approval rates, reduce costs, an...Mostrar másÚltima actualización: hace 22 días