Talent.com
Senior Data Platform Operations Engineer

EPAM Systems, Mexico
Posted 6 hours ago
Job description

Overview

EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.

We are seeking a highly skilled Senior Data Platform Operations Engineer to ensure the stability, security, performance, and cost efficiency of our global enterprise data platform.

This role is critical for providing 8x5 operational coverage as part of the follow-the-sun 24x5 support model, ensuring the platform continuously supports business activities worldwide. The ideal candidate will possess expertise in cloud-based data platforms, a strong operational mindset, and a proactive approach to performance optimization, observability, and cost management.

Responsibilities

  • Maintain a stable, secure, and performant enterprise data platform (Snowflake, AWS data stack, dbt, orchestration tools, BI/analytics, etc.)
  • Provide operational coverage within an 8x5 support model and participate in a 24x7 on-call rotation for critical incidents
  • Implement robust monitoring, alerting, and observability solutions to ensure proactive incident detection and resolution
  • Perform platform upgrades, patching, and configuration management in alignment with security and compliance requirements
  • Continuously tune system performance to meet evolving business needs
  • Monitor infrastructure, data pipelines, and platform services using holistic observability frameworks
  • Deliver actionable operational insights through monitoring dashboards and reporting
  • Identify and implement process automation to improve efficiency and reduce manual interventions
  • Suggest and execute continuous improvements to enhance platform resilience, scalability, and cost-effectiveness
  • Contribute to infrastructure-as-code and configuration-as-code practices for consistent, repeatable operations

Requirements

  • Over 3 years of hands-on experience managing cloud-native data platforms (e.g., Snowflake, Databricks, BigQuery, or similar)
  • Proficiency in cloud infrastructure (AWS) with focus on operations, automation, and cost governance
  • Experience with monitoring and observability tools (Datadog, Prometheus, Grafana, ELK, CloudWatch, etc.)
  • Knowledge of Infrastructure as Code (Terraform, Pulumi, Ansible) and configuration management practices
  • Strong understanding of networking, security, and compliance in cloud environments
  • Strong problem-solving skills with a proactive, service-oriented mindset
  • Ability to work in a global operations environment with on-call responsibilities
  • Clear communication and collaboration with engineering, data, and business stakeholders
  • Commitment to continuous improvement and operational excellence
  • English language proficiency at an Upper-Intermediate level (B2) or higher

Nice to have

  • Experience implementing FinOps frameworks and cost optimization practices
  • Prior experience in regulated industries (pharma, healthcare, finance) with compliance-driven environments
  • Familiarity with modern data stack tools (dbt, Dagster/Airflow, ThoughtSpot, Tableau, Power BI)
  • Exposure to SRE (Site Reliability Engineering) principles and practices

We offer

  • International projects with top brands
  • Work with global teams of highly skilled, diverse peers
  • Employee financial programs
  • Paid time off and sick leave
  • Upskilling, reskilling and certification courses
  • Unlimited access to the LinkedIn Learning library and 22,000+ courses
  • Global career opportunities
  • Volunteer and community involvement opportunities
  • EPAM Employee Groups
  • Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn

Seniority level

  • Mid-Senior level

Employment type

  • Full-time

Job function

  • Business Development, Information Technology, and Engineering

Industries

  • Software Development, IT Services and IT Consulting, and Pharmaceutical Manufacturing