Site Reliability Engineering Lead Design & Support Engineer
We Are PepsiCo
Overview
Join PepsiCo and Dare for Better! We are the perfect place for curious people, thinkers and change agents. From leadership to front lines, we're excited about the future and working together to make the world a better place.
Being part of PepsiCo means being part of one of the largest food and beverage companies in the world, with our iconic brands consumed more than a billion times a day in more than 200 countries.
Our product portfolio, which includes 22 of the world's most iconic brands, such as Sabritas, Gamesa, Quaker, Pepsi, Gatorade and Sonrics, has been a part of Mexican homes for more than 116 years.
A career at PepsiCo means working in a culture where all people are welcome. Here, you can dare to be you. No matter who you are, where you're from, or who you love, you can always influence the people around you and make a positive impact in the world.
Learn more: PepsiCoJobs
Join PepsiCo, dare for better.
The Opportunity
We are looking for a self-driven SRE with a software engineering mindset. As SRE Lead Design & Support Engineer, you will:
- Drive new shift-left activities that apply Site Reliability Engineering (SRE) and quality assurance principles within the application design and project roadmap to enable resilient outcomes.
- Apply a pre-emptive approach in production to minimize business impact: use SRE-driven orchestration to connect all components of the ecosystem, diagnose anomalies before users are affected, and remediate through automation.
- Be a critical enabler of high resiliency during operations, continuously improving it through design during the software development lifecycle.
- Act as an integral part of the global team whose main purpose is to provide a delightful experience for users of the global consumer, commercial, supply chain and enablement functions across the PepsiCo digital products application portfolio of 260+ applications, enabling a full SRE practice built on incident prevention and proactive resolution.
- Focus on cloud architecture and full-stack application development for B2B Pepsiconnect, Direct to Customer, and other S&T roadmap applications.
- Ensure that the service performance, reliability and availability of PepsiCo DPA applications meet the expectations of our customers and internal groups.
- Leverage a blend of technical expertise in SRE tools, modern cloud application architecture, IT operations experience, and analytics and influence skills.
Responsibilities
- Ensure ecosystem availability and performance in production environments, proactively preventing P1, P2, and potential P3 incidents.
- Engage and influence product and engineering teams during the design and development phases to embed reliability and operability into new services, defining and enforcing events, logging, monitoring, and observability standards across applications.
- Be accountable for instituting non-functional requirements (NFRs) early, including SLA / SLO / SLI and error budgets, as part of the engineering solution.
- Lead the team in diagnosing anomalies before any user impact, and drive the necessary remediations across the teams involved in the end-to-end availability, performance, and consumption of the cloud-architected application ecosystem, leveraging SRE orchestration solutions.
- Collaborate with engineering and support teams, including participation in escalations and blameless post-mortems.
- Work closely with customer-facing support teams to empower them with SRE insights and tooling.
- Observe, diagnose, and improve the end-to-end ecosystem performance of the modern architected application portfolio, understanding the interactions of a full-stack application alongside peer SRE team members.
- Continuously optimize L2 / support operations through SRE workflow automation.
- Shape the SRE orchestration platform design with input from Production Operations, Business usage, Product and engineering teams.
- Actively engage in and drive AI Ops adoption across teams.
Qualifications
- 6-8 years of work experience progressing toward an SRE role, including 2-4 years of experience continuously improving and transforming IT operations ways of working.
- Bachelor's degree in Computer Science, Information Technology or a related field.
- Proven experience as an SRE designing event diagnostics, performance measures, and alert solutions to meet SLAs / SLOs / SLIs.
- Highly quantitative with strong judgment, able to connect dots across ecosystems and work cross-functionally to ensure SRE orchestration solutions meet customer / end-user expectations.
- Pragmatic incident resolution skills, including systematic root-cause triangulation with internal and external teams.
- Deep expertise in SRE (Site Reliability Engineering) and IT Service Management (ITSM) processes, with a track record of improving service offerings and proactively resolving incidents.
- Hands-on experience in Python, SQL / No-SQL (MySQL, MongoDB, Cassandra, PostgreSQL), AppDynamics, ELK Stack, Grafana, Splunk, Dynatrace, Kafka, and other SRE Ops toolsets.
- Strong understanding of cloud architecture for distributed environments.
- Front-end technologies: HTML, CSS, JavaScript, and frameworks such as React, Angular, or Vue.js.
- Back-end technologies: Java, Spring Boot and related server-side languages and database interaction.
- Infrastructure: Azure / AWS cloud platforms and / or client / server environments.
- Prior experience shaping transformation and developing SRE solutions is a plus.
We encourage you to apply even if you do not meet 100% of the requirements.
What can you expect from us
- Opportunities to learn and develop every day through a wide range of programs.
- Internal digital platforms that promote self-learning.
- Development programs aligned with leadership skills.
- Specialized training for the role.
- Learning experiences with internal and external providers.
- Recognition programs for seniority, behavior, leadership and life moments.
- Financial wellness programs to help you reach your goals at all life stages.
- Flexibility program allowing you to balance personal and work life, adapting your workday to your lifestyle.
- Family benefits such as Wellness Line, agreements and discounts, scholarship programs for children, and aid plans for various life moments.
We are an equal opportunity employer and value diversity at our company. We do not discriminate based on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. We respect and value diversity as a workforce and source of innovation for the organization.