Talent.com
Senior Site Reliability Engineer (SRE)
Senior Site Reliability Engineer (SRE)EPAM Systems • Mexico
Senior Site Reliability Engineer (SRE)

Senior Site Reliability Engineer (SRE)

EPAM Systems • Mexico
Hace más de 30 días
Tipo de contrato
  • Teletrabajo
Descripción del trabajo

2 weeks ago Be among the first 25 applicants

EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.

We are seeking a talented and experienced Senior Site Reliability Engineer (SRE) to join our dynamic team.

As a Senior SRE, you will play a critical role in designing, developing, and maintaining highly reliable systems and processes to ensure optimal performance and scalability of applications and infrastructure across diverse environments.

Responsibilities

  • Build and containerize applications and deploy them using open-source container management tools such as Docker or Podman
  • Design and maintain Kubernetes resource manifests, deploying them into clusters on platforms like AKS or GKE
  • Configure and deploy Prometheus agents to monitor infrastructure and application behaviors, raising alerts when necessary
  • Create and manage continuous deployment pipelines using tools like Helm and ArgoCD
  • Optimize observability by implementing monitoring, logging, and tracing solutions
  • Maintain and manage CI / CD processes within Azure DevOps or similar environments
  • Develop and implement solutions on cloud platforms, leveraging expertise in at least one provider (e.g., Microsoft Azure, GCP, AWS)
  • Troubleshoot infrastructural and application issues by utilizing logs and traces to isolate events effectively

Requirements

  • Minimum 3+ years of programming experience, preferably in GoLang
  • Hands-on experience with at least one scripting language (e.g., Bash or Python)
  • Proficiency with Kubernetes, with at least 3 years of practical expertise
  • Fundamental knowledge of observability tools, with a focus on Prometheus or similar monitoring platforms
  • Skills in configuring and managing CI / CD pipelines using Azure DevOps or tools like Helm and ArgoCD for GitOps-style continuous deployment
  • Background in cloud platforms with competency in at least one provider (e.g., Microsoft Azure, Google Cloud, AWS)
  • Flexibility to use open-source tools like Docker or Podman to containerize applications and manage their runtime environments effectively
  • Nice to have

  • Familiarity with multiple cloud providers, including AWS and GCP alongside Azure
  • Expertise in GitOps packaging and deployment tools like Argo CD and Helm
  • Understanding of service meshes like Istio for Kubernetes-based microservices architectures
  • Competency in infrastructure-as-code tools such as Terraform
  • Background in software development with experience across multiple domains
  • We offer

  • International projects with top brands
  • Work with global teams of highly skilled, diverse peers
  • Employee financial programs
  • Paid time off and sick leave
  • Upskilling, reskilling and certification courses
  • Unlimited access to the LinkedIn Learning library and 22,000+ courses
  • Global career opportunities
  • Volunteer and community involvement opportunities
  • EPAM Employee Groups
  • Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn
  • Seniority level

  • Mid-Senior level
  • Employment type

  • Full-time
  • Job function

  • Engineering, Information Technology, and Business Development
  • Industries

  • Software Development, IT Services and IT Consulting, and Nanotechnology Research
  • We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

    #J-18808-Ljbffr

    Crear una alerta de empleo para esta búsqueda

    Site Reliability Engineer • Mexico

    Ofertas relacionadas
    Site Reliability Engineer

    Site Reliability Engineer

    Pyramid Consulting, Inc • Mexico
    Senior Technical Recruiter specializing in End to End Recruitments.As a Sr Site Reliability Engineer on this team, you’ll be responsible for design, development and implementation of cloud based te...Mostrar más
    Última actualización: hace 24 días • Oferta promocionada
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    DuckDuckGo • Mexico
    Teletrabajo
    Be among the first 25 applicants.Hi, we're DuckDuckGo, the online protection company and remote-first team of 300+ on a mission to raise the standard of trust online. Founded in 2008 and profitable ...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    Baufest • Mexico
    Teletrabajo
    En Baufest, nuestra misión es mejorar la vida con tecnología, generando un impacto positivo en la sociedad.Responsabilidades principales : . Diseñar y adaptar el modelo operativo SRE al contexto de la...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Site Reliability Engineer – Azure DevOps

    Site Reliability Engineer – Azure DevOps

    EPAM Systems • Mexico
    Site Reliability Engineer – Azure DevOps.Be among the first 25 applicants.Get AI-powered advice on this job and more exclusive features. EPAM is a leading global provider of digital platform enginee...Mostrar más
    Última actualización: hace 25 días • Oferta promocionada
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Incode Technologies • Mexico
    Incode is the leading provider of world-class identity solutions that is reinventing the way humans authenticate and verify their identities online to power a world of digital trust.Through our rev...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Sr. Site Reliability Engineer (Remote, Mexico)

    Sr. Site Reliability Engineer (Remote, Mexico)

    Nova • Mexico
    Teletrabajo
    Site Reliability Engineer (Remote, Mexico).Site Reliability Engineer (Remote, Mexico).Site Reliability Engineer (Remote, Mexico). Be among the first 25 applicants.Site Reliability Engineer (Remote, ...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Incode • Mexico
    Incode is the leading provider of world-class identity solutions that is reinventing the way humans authenticate and verify their identities online to power a world of digital trust.Through our rev...Mostrar más
    Última actualización: hace 28 días • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    Canonical Group Ltd • Mexico
    Teletrabajo
    Canonical is a pioneering open source software company best known for publishing Ubuntu.We operate globally with a distributed workforce and few office-based roles. Teams collaborate in person 2–4 t...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    EPAM Systems • Mexico
    Teletrabajo
    Senior Site Reliability Engineer.EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employ...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Principal Site Reliability Engineer (AI-first SRE)

    Principal Site Reliability Engineer (AI-first SRE)

    Groupon • Mexico
    Principal Site Reliability Engineer (AI-first SRE).Principal Site Reliability Engineer (AI-first SRE).Groupon is a marketplace where customers discover new experiences and services everyday and loc...Mostrar más
    Última actualización: hace 2 días • Oferta promocionada
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Perficient • Mexico
    We currently have a career opportunity for a.Senior Site Reliability Engineer.Mexico or Colombia (only this locations).As a Senior Technical Consultant you will participate in all aspects of the so...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    Strike • Mexico
    Teletrabajo
    With Strike, you can buy and sell bitcoin, pay bills, and borrow against your holdings.From individuals to businesses, Strike is purpose-built for every step of the Bitcoin journey.Available in mor...Mostrar más
    Última actualización: hace 3 días • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    Capgemini • Mexico
    Teletrabajo
    This position will focus on infrastructure & code reviews to ensure solutions built and delivered are highly available and minimize unplanned downtime. Expert troubleshooter within IT who has broad ...Mostrar más
    Última actualización: hace 20 días • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    Confidencial • Mexico
    Estamos en búsqueda de un / a Ingeniero / a SRE senior para potencialmente sumarse a un proyecto de consultoría.El rol tendrá como objetivo fortalecer la confiabilidad, estabilidad y resiliencia de los...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    HCLTech • Mexico, Mexico
    Site Reliability Engineer (SRE).Fulltime Permanent Position with HCLTech.Collaborate with development teams to improve.Datadog, New Relic) – Strong Requirement. Python, Bash, or similar) – Required....Mostrar más
    Última actualización: hace 4 días • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    KI people • Mexico
    Teletrabajo
    Be among the first 25 applicants.Direct message the job poster from KI people.In Search of the Best Global IT & Digital Talent. The SRE Operations specialist focuses on B2B applications support prov...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Canonical Group Ltd • Mexico
    Teletrabajo
    We are hiring a Senior Site Reliability Engineer.Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets.Our platform, Ubuntu, ...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Site Reliability Engineer (Azure)

    Site Reliability Engineer (Azure)

    EPAM Systems • Mexico
    Be among the first 25 applicants.Get AI-powered advice on this job and more exclusive features.EPAM is a leading global provider of digital platform engineering and development services.We are comm...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada