Talent.com
▷ [11 / 11 / 2025] Site Reliability Engineer (SRE) Manager...

▷ [11 / 11 / 2025] Site Reliability Engineer (SRE) Manager...

ConcordNuevo León, Mexico, MX
Hace 1 día
Descripción del trabajo

Location : Hybrid in Monterrey, MX. 8 days a month on-site.

Possibility to get a travel or relocation stipend for travel.

Type of Employment : contract to hire. 1-3 month remote contract, and then full-time employment.

Requirement : Must be legally authorized to work for any Mexican employer without sponsorship, now or in the future.

About Us

Concord isn't your typical consulting firm; we're an execution focused company passionate about delivering results. Our mission is to help clients enhance customer experiences, optimize operations, and revolutionize product offerings through seamless integration, optimization, and activation of technology and data.

Our services and solutions include Digital Experience (Salesforce, Headless Commerce, UI / UX), Data and Analytics (Snowflake, Databricks, Martech Analytics), and Engineering and Application Services (Application Modernization, Greenfield Apps, Portal Buildout, etc.).

About the Role

We are seeking a strategic, technically adept, and hands-on SRE Manager to lead the reliability, scalability, and operational excellence of our production systems. This role is ideal for a leader who thrives in high-pressure environments, excels at debugging complex production issues, and is passionate about building and mentoring high-performing teams.

The SRE Manager will be responsible for hiring and managing a team of SREs, driving incident response and postmortem processes, and collaborating with multiple product teams to build and maintain robust CI / CD pipelines and deployment practices. This role demands a strong sense of ownership, a deep understanding of cloud-native infrastructure, and the ability to lead by example.

Business Alignment

The SRE Manager will partner with business stakeholders to ensure reliability goals support customer experience, compliance, and growth targets. This includes aligning SRE initiatives with broader business objectives such as revenue protection, innovation, and regulatory adherence.

Key Responsibilities

  • Build and lead a high-performing Site Reliability Engineering team.
  • Create individualized development plans for SREs, encourage participation in industry conferences, and support certification programs.
  • Debug and resolve complex production issues, ensuring minimal downtime and rapid recovery.
  • Own the incident lifecycle, including coordination, communication, and creation of detailed postmortem documentation.
  • Implement blameless postmortems and maintain a library of runbooks for common incident types.
  • Follow up with product teams to ensure resolution and implementation of long-term fixes.
  • Partner with internal product and engineering teams to understand infrastructure needs and deliver scalable, secure, and reliable solutions.
  • Drive the design, implementation, and automation of cloud infrastructure using Azure, Terraform, and Kubernetes (AKS).
  • Lead the adoption and management of tools such as Argo CD, Argo Workflows, Azure DevOps, and Octopus Deploy.
  • Architect and manage API Gateways, WAFs, Service Mesh, and multi-cloud networking (VNets, private networks).
  • Establish and enforce deployment best practices, including documentation, versioning, rollback strategies, and environment management.
  • Collaborate with product teams to build and maintain CI / CD pipelines, ensuring reliable and repeatable deployments.
  • Foster a culture of ownership, accountability, and continuous improvement across the team.
  • Define and track key performance indicators (KPIs) for system reliability and team effectiveness.
  • Define and manage Service Level Objectives (SLOs) and error budgets for all critical services.
  • Lead the adoption of advanced observability tools for proactive reliability management.
  • Collaborate with security, compliance, and architecture teams through joint reviews, shared dashboards, and audits to ensure infrastructure meets enterprise standards.

Required Qualifications

  • 10+ years of experience in infrastructure, DevOps, or SRE roles, with 3+ years in a technical leadership or management capacity.
  • Proven experience debugging and resolving production issues in large-scale systems.
  • Experience building and scaling cloud-native infrastructure on Azure.
  • Deep expertise in Kubernetes (AKS), CI / CD pipelines, and Infrastructure as Code (Terraform).
  • Strong understanding of networking, VNets, private cloud connectivity, and multi-cloud architectures.
  • Hands-on experience with Argo CD, Argo Workflows, Azure DevOps.
  • Demonstrated ability to hire, mentor, and lead engineering teams.
  • Excellent communication and stakeholder management skills.
  • Strong problem-solving mindset with a bias for action and ownership.
  • Ability to create and maintain detailed deployment documentation and lead by example in operational excellence.
  • Advanced English proficiency (C1 or C2) with proven success collaborating in global, English-speaking environments.
  • Preferred Qualifications

  • Experience supporting internal product teams or platform engineering organizations.
  • Familiarity with FinOps, cost optimization, and cloud governance.
  • Exposure to compliance frameworks (SOC2, ISO, HIPAA).
  • Experience with service mesh technologies (Istio, Linkerd).
  • Knowledge of emerging technologies such as AI / ML ops, edge computing, and sustainability practices.
  • What Success Looks Like

  • A high-performing SRE team that operates with autonomy and accountability.
  • Internal customers view the SRE team as a trusted partner in delivering reliable, scalable systems.
  • Infrastructure is automated, observable, and resilient by design.
  • Incidents are rare, well-managed, and always lead to learning and improvement.
  • CI / CD pipelines are robust, well-documented, and consistently deliver high-quality deployments.
  • Crear una alerta de empleo para esta búsqueda

    Site Reliability Engineer • Nuevo León, Mexico, MX

    Ofertas relacionadas
    • Oferta promocionada
    Service Engineer

    Service Engineer

    LGMGMarín, Nuevo León, Mexico
    LGMG is a privately-owned subsidiary of the Lingong Machinery Group.Lingong Machinery Group was founded in 1972 and is one of the leading construction machinery groups, officially designated as one...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Site Reliability Engineer (SRE) Manager

    Site Reliability Engineer (SRE) Manager

    Concord USAMonterrey, Nuevo León, Mexico
    Teletrabajo
    Location : Hybrid in Monterrey, MX.Possibility to get a travel or relocation stipend for travel.Type of Employment : contract to hire. Initial 6-12 month contract with pay in USD.Concord isn't your ty...Mostrar másÚltima actualización: hace 13 días
    • Oferta promocionada
    Site Reliability Engineer (Middle / Senior) ID38916

    Site Reliability Engineer (Middle / Senior) ID38916

    AgileEngineMonterrey, Nuevo León, Mexico
    Teletrabajo
    Site Reliability Engineer (Middle / Senior) ID38916 — AgileEngine.Site Reliability Engineer (Middle / Senior) ID38916.Get AI-powered advice on this job and more exclusive features.Fortune 500 brands an...Mostrar másÚltima actualización: hace 11 días
    • Oferta promocionada
    Senior Engineering Manager

    Senior Engineering Manager

    GenthermMonterrey, Nuevo León, Mexico
    We’re with you on a cold winter day when you turn on your heated seat and steering wheel or helping manage patient body temperature in the operating room, recovering room or intensive care units.We...Mostrar másÚltima actualización: hace 28 días
    • Oferta promocionada
    Customer Quality Engineer

    Customer Quality Engineer

    YanfengCiénega de Flores, Nuevo León, Mexico
    Yanfeng Seating is the worldwide leader in automotive seating with more than 33,000 employees in 20 countries.At this moment we are seeking : . Visit customers as planned and complete the visit record...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    CanonicalMonterrey, Nuevo León, Mexico
    Teletrabajo
    Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiat...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Site Reliability Engineer (SRE) Manager

    Site Reliability Engineer (SRE) Manager

    ConcordNuevo León, Mexico, Mexico
    Location : Hybrid in Monterrey, MX.Possibility to get a travel or relocation stipend for travel.Type of Employment : contract to hire. Requirement : Must be legally authorized to work for any Mexican e...Mostrar másÚltima actualización: hace 17 días
    • Oferta promocionada
    Recruitment Site Lead

    Recruitment Site Lead

    Confidential CareerGeneral Escobedo, Nuevo León, Mexico
    Plant Recruitment Leader focused on results and quality of the hiring process.This key position will be responsible for directing and executing all recruitment and selection activities for operatio...Mostrar másÚltima actualización: hace 4 días
    • Oferta promocionada
    Maintenance Engineer

    Maintenance Engineer

    YanfengCiénega de Flores, Nuevo León, Mexico
    Yanfeng Seating is the worldwide leader in automotive seating with more than 33,000 employees in 20 countries.At this moment we are seeking : . Bachelor degree, Electronic or Mechatronic.Experience in...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Senior Site Reliability Engineer : (Mexico - Monterrey)

    Senior Site Reliability Engineer : (Mexico - Monterrey)

    GTMnowMonterrey, Nuevo León, Mexico
    Teletrabajo
    Regrello is a 40-person startup reimagining automation in supply chains, in which companies still communicate about $13T of annual shipments almost entirely via email. This is a $220-billion, Amazon...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Senior Site Reliability / Gitops Engineer

    Senior Site Reliability / Gitops Engineer

    CanonicalMonterrey, Nuevo León, Mexico
    Teletrabajo
    Senior Site Reliability / Gitops Engineer.Senior Site Reliability / Gitops Engineer.Canonical is a leading provider of open source software and operating systems to the global enterprise and techno...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Supply Chain Lead

    Supply Chain Lead

    Bobcat CompanySalinas Victoria, Nuevo León, Mexico
    The Supply Chain Lead objective is to oversee inventory levels / accuracy, replenishment signals, parts presentation, and delivery processes in alignment with the overall plant strategy within their ...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Sr Maintenance Engineer-Stiva 3

    Sr Maintenance Engineer-Stiva 3

    Johnson ControlsApodaca, NLE, Mexico
    Perform routine inspections of building systems and equipment to ensure proper operation.Conduct preventative and corrective maintenance on electrical, HVAC, plumbing, and mechanical systems.Diagno...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Mechanical / Sustaining Design Engineer

    Mechanical / Sustaining Design Engineer

    Bobcat CompanySalinas Victoria, Nuevo León, Mexico
    The Mechanical / Sustaining Design Engineer will create and maintain engineered designs of assigned products, systems or components in order to be competitive in function, manufacturability and marke...Mostrar másÚltima actualización: hace 27 días
    • Oferta promocionada
    Supply Chain Manager

    Supply Chain Manager

    Confidential CareerCiénega de Flores, Nuevo León, Mexico
    The Senior Supply Chain Manager oversees and directs supply chain activities with a strategic focus on aligning global and regional operations. This role is responsible for developing and executing ...Mostrar másÚltima actualización: hace 6 días
    • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    KyndrylGuadalupe, Nuevo León, Mexico
    At Kyndryl, we design, build, manage and modernize the mission-critical technology systems that the world depends on every day. We are always moving forward – always pushing ourselves to go further ...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Sr. Maintenance Engineer Stiva 2

    Sr. Maintenance Engineer Stiva 2

    Johnson ControlsApodaca, NLE, Mexico
    Perform routine inspections of building systems and equipment to ensure proper operation.Conduct preventative and corrective maintenance on electrical, HVAC, plumbing, and mechanical systems.Diagno...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Senior Mechanical Design Engineer

    Senior Mechanical Design Engineer

    HussmannCiénega de Flores, Nuevo León, Mexico
    The COD (Customer Oriented Design Engineer will be responsible for exploring, developing and drive efforts to get best and optimized solution for different customer requests with focus in all the c...Mostrar másÚltima actualización: hace más de 30 días