Description
DESCRIPTION
Join EPAM as a Lead Site Reliability Engineer!
In this role, you'll supervise and monitor a variety of projects, set up application Observability and Telemetry, and perform advanced troubleshooting of incidents in mission-critical systems.
If you have strong experience with Dynatrace, Splunk, Grafana, and Kubernetes, we'd love to hear from you.
EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.
Responsibilities
- Assist team in setting up and maintaining SLIs, SLOs, and error budgets for systems and applications
- Set up application Observability and Telemetry and practices to set up observability and system reliability standards
- Analyze Out of Memory and CPU issues and come up with improvement recommendations
- Perform advanced troubleshooting of incidents in mission-critical systems and participate in preventative problem management activities
Requirements
Strong, hands-on experience with Dynatrace, Splunk, and Grafana and ample expertise experience in KubernetesExperience with a leading cloud provider (AWS, Azure, GCP) and a background in JavaAbility to work independently and as part of a team with strong analytical and problem-solving mindsetStrategic thinking, complex problem solving, and analytical capabilities with experience developing and instilling a culture of operational maturityWe Offer
Career plan and real growth opportunitiesUnlimited access to LinkedIn learning solutionsInternational Mobility Plan within 25 countriesConstant training, mentoring, online corporate courses, eLearning and moreEnglish classes with a certified teacherSupport for employees initiatives (Algorithms club, toastmasters, agile club and more)Enjoyable working environment (Gaming room, napping area, amenities, events, sport teams and more)Flexible work schedule and dress codeCollaborate in a multicultural environment and share best practices from around the globeHired directly by EPAM & % under payrollLaw benefits (IMSS, INFONAVIT, 25% vacation bonus)Major medical expenses insurance : Life, Major medical expenses with dental & visual coverage (for the employee and direct family members)13 % employee savings fund, capped to the law limitGrocery coupons30 days December bonusEmployee Stock Purchase Plan12 vacations days plus 4 floating daysOfficial Mexican holidays, plus 5 extra holidays (Maundry Thursday and Friday, November 2nd, December 24th & 31st)Relocation bonus : transportation, 2 weeks of accommodation for you and your family and moreMonthly non-taxable amount for the electricity and internet bills