Talent.com
Esta oferta de trabajo no está disponible en tu país.
7777 - Site Reliability Engineer Cloud and Infrastructure Mexico Published Today

7777 - Site Reliability Engineer Cloud and Infrastructure Mexico Published Today

UnosquareMexico
Hace más de 30 días
Tipo de contrato
  • Teletrabajo
Descripción del trabajo

Senior Site Reliability Engineer

We are looking for a Senior Site Reliability Engineer to join our Cloud Infrastructure Engineering division. Cloud Infrastructure Engineering ensures the continuous availability of the technologies and systems that are the foundation of Client’s services. We are directly responsible for thousands of servers, petabytes of storage, and handling thousands of web requests per second, all while sustaining growth at a meteoric rate. We enable an operating system for the medical office that abstracts away administrative complexity, leaving doctors free to practice medicine.

But enough about us; let’s talk about you!

You’re a seasoned engineer with a passion for identifying and resolving reliability and scalability challenges. You are a curious team player, someone who loves to explore, learn, and make things better. You are excited to uncover inefficiencies in business processes, creative in finding ways to automate solutions, and relentless in your pursuit of greatness. You’re a nimble learner capable of quickly absorbing complex solutions and an excellent communicator who can help evangelize engineering excellence.

  • Job Responsibilities++

Reliability and Availability :

  • Define, measure, and maintain Service Level Objectives (SLOs) and Service Level Indicators (SLIs) for cloud services and infrastructure components.
  • Lead efforts to continuously improve system availability, fault tolerance, andisaster recovery capabilities.
  • Ensure proactive incident detection, efficient root cause analysis, and timely resolution of production incidents
  • On-Call participation in 24x7 setup (8A-8P)
  • Automation and Infrastructure as Code (IaC) :

  • Drive automation efforts to reduce manual intervention and streamline cloud infrastructure management.
  • Implement Infrastructure as Code (IaC) using tools like Terraform, AWS CloudFormation, and Ansible to provision, manage, and scale cloud resources.
  • Automate deployment, scaling, and monitoring processes to improve efficiency and reduce operational complexity.
  • Monitoring, Observability, and Performance Tuning :

  • Design and implement monitoring, logging, and alerting solutions to track cloud infrastructure health, performance, and security.
  • Use observability tools (e.g., Prometheus, Grafana, Cloud Watch) to ensure continuous visibility into cloud infrastructure performance and capacity.
  • Identify bottlenecks and performance issues, proposing and implementing improvements to ensure optimal resource usage.
  • Security and Compliance :

  • Ensure that cloud infrastructure is built with security best practices in mind and meets all relevant compliance and regulatory requirements.
  • Collaborate with security teams to implement security controls and risk mitigation strategies across cloud environments.
  • Regularly audit and review cloud infrastructure for security vulnerabilities and compliance gaps.
  • Collaboration and Cross-Functional Leadership :

  • Work closely with development, DevOps, and operations teams to ensure cloud infrastructure aligns with application and business requirements.
  • Lead and mentor a team of Site Reliability Engineers, promoting best practices and fostering a culture of operational excellence.
  • Act as a key technical point of contact for cloud-related infrastructure and operations issues.
  • Incident Management and Post-Mortem :

  • Lead the incident response efforts for cloud infrastructure-related issues, ensuring that all incidents are managed effectively.
  • Conduct post-incident reviews (PIRs) to identify root causes and implement preventive measures.
  • Continuously refine incident management processes to reduce downtime and enhance recovery times.
  • Qualifications++
  • 8-10 years of hands-on experience with cloud automation and configuration management tools (e.g., Terraform, AWS CloudFormation, Ansible). On a Hybrid Cloud Set-up.
  • 7+ years of experience in a Site Reliability Engineering (SRE), Infrastructure Engineering, or DevOps role, with at least 3+ years in a technical leadership capacity.
  • Deep knowledge of cloud services and technologies (e.g., EC2, S3, Lambda, Kubernetes, etc.).
  • Proficiency in scripting or programming languages (Python, Go, Bash, etc.).
  • Experience with monitoring, logging, and observability tools (e.g., Prometheus, Grafana, Datadog, ELK stack).
  • Familiarity with Continuous Integration / Continuous Deployment (CI / CD) pipelines and cloud-native development practices.
  • Strong expertise in managing cloud infrastructure (AWS, Google Cloud, Azure) in production environments.
  • Experience with cloud-native architectures, microservices, and containerized environments (Kubernetes, Docker).
  • Proven experience in building and managing highly available, scalable, and fault-tolerant systems in the cloud.
  • Strong understanding of cloud networking, storage, compute services, On-Prem and security best practices.
  • Behaviors & Abilities Required : ++
  • Strong Technical leadership and mentoring abilities, with a track record of developing high-performance engineering teams.
  • Excellent problem-solving, troubleshooting, and diagnostic skills.
  • Ability to work in a cross-functional, collaborative environment.
  • Effective communication skills, with the ability to translate technical concepts to non-technical stakeholders.
  • #J-18808-Ljbffr

    Crear una alerta de empleo para esta búsqueda

    Site Reliability Engineer • Mexico

    Ofertas relacionadas
    • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    UST España & LatamMexico, Mexico
    Born digital, UST transforms lives through the power of technology.We walk alongside our clients and partners, embedding innovation and agility into everything they do. We help them create transform...Mostrar másÚltima actualización: hace 7 días
    • Oferta promocionada
    Cloud Resiliency Architect / Site Reliability Engineer

    Cloud Resiliency Architect / Site Reliability Engineer

    CognizantMexico
    Teletrabajo
    Cloud Resiliency Architect / Site Reliability Engineer – Remote in Mexico.Experience with High Availability Azure Design. Experience with Disaster Recovery Azure Design.Understanding of Azure Archit...Mostrar másÚltima actualización: hace 22 días
    • Oferta promocionada
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Zillow GroupMexico
    Teletrabajo
    About the team The FUB+ Infrastructure & Security Team at Zillow Group supports the Follow Up Boss systems, applications, and software engineering teams that power the businesses of tens of thousan...Mostrar másÚltima actualización: hace 6 días
    • Oferta promocionada
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    DuckDuckGoMexico
    Teletrabajo
    Be among the first 25 applicants.Hi, we're DuckDuckGo, the online protection company and remote-first team of 300+ on a mission to raise the standard of trust online. Founded in 2008 and profitable ...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Sr. Site Reliability Engineer (Remote, Mexico)

    Sr. Site Reliability Engineer (Remote, Mexico)

    NovaMexico
    Teletrabajo
    Site Reliability Engineer (Remote, Mexico).Site Reliability Engineer (Remote, Mexico).Site Reliability Engineer (Remote, Mexico). Be among the first 25 applicants.Site Reliability Engineer (Remote, ...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    Altimetrik MexicoMexico
    Senior Site Reliability Engineer (SRE).English skills (B2 / C1) for a full-time position.We are currently seeking a highly skilled. IT operations and encompassing development.As a crucial figure in th...Mostrar másÚltima actualización: hace 23 días
    • Oferta promocionada
    FBS Site Reliability Engineer

    FBS Site Reliability Engineer

    CapgeminiMexico
    Teletrabajo
    Be among the first 25 applicants.This range is provided by Capgemini.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Our Client is one of the Un...Mostrar másÚltima actualización: hace 9 días
    • Oferta promocionada
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    SimMexico
    Teletrabajo
    For over 50 years, we have worked closely with investment and asset managers to become the world’s leading provider of integrated investment management solutions. We are 3,000+ colleagues with a bro...Mostrar másÚltima actualización: hace 8 días
    • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    EXLMéxico, Mexico, Mexico
    We are seeking a highly motivated and skilled Site Reliability Engineer (SRE) to join our team.The ideal candidate will have a passion for continuous learning, a collaborative mindset to work with ...Mostrar másÚltima actualización: hace 25 días
    • Oferta promocionada
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    PerficientMexico
    We currently have a career opportunity for a.Senior Site Reliability Engineer.Mexico or Colombia (only this locations).As a Senior Technical Consultant you will participate in all aspects of the so...Mostrar másÚltima actualización: hace 23 días
    • Oferta promocionada
    Senior Site Reliability Engineer (SRE)

    Senior Site Reliability Engineer (SRE)

    EPAM SystemsMexico
    Teletrabajo
    Be among the first 25 applicants.EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employ...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    HCLTechMexico, Mexico
    HCLTech is a global technology company, home to more than 223,000 people across 60 countries, delivering industry-leading capabilities centered around digital, engineering, cloud and AI, powered by...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    SimMexico
    Teletrabajo
    Join some of the most innovative thinkers in FinTech as we lead the evolution of financial technology.If you are an innovative, curious, collaborative person who embraces challenges and wants to gr...Mostrar másÚltima actualización: hace 8 días
    • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    New Era TechnologyMexico
    Teletrabajo
    Senior IT Recruiter | IT Recruitment, Talent Acquisition Specialist, Recruitment Team Lead.Join our team as a Site Reliability Engineer (SRE) Engineer. We’re looking for someone who has fresh ideas ...Mostrar másÚltima actualización: hace 24 días
    • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    Altimetrik MéxicoMexico, Mexico
    Senior Site Reliability Engineer (SRE).English skills (B2 / C1) for a full-time position.We are currently seeking a highly skilled. IT operations and encompassing development.As a crucial figure in th...Mostrar másÚltima actualización: hace 24 días
    • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    KI peopleMexico
    Teletrabajo
    Be among the first 25 applicants.Direct message the job poster from KI people.In Search of the Best Global IT & Digital Talent. The SRE Operations specialist focuses on B2B applications support prov...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    ConfidencialMexico, Mexico
    Estamos en búsqueda de un / a Ingeniero / a SRE senior para potencialmente sumarse a un proyecto de consultoría.El rol tendrá como objetivo fortalecer la confiabilidad, estabilidad y resiliencia de los...Mostrar másÚltima actualización: hace 23 días
    • Oferta promocionada
    7772 - Site Reliability Engineer Cloud and Infrastructure Mexico Published Today

    7772 - Site Reliability Engineer Cloud and Infrastructure Mexico Published Today

    UnosquareMexico
    Senior Site Reliability Engineer.We are looking for a Senior Site Reliability Engineer to join our Cloud Infrastructure Engineering division. Cloud Infrastructure Engineering ensures the continuous ...Mostrar másÚltima actualización: hace más de 30 días