HCLTech has over 223,000 people across 60 countries and offers industry-leading capabilities centered around digital, engineering, cloud and AI.We work with clients across various sectors, providing solutions for different industries.The Site Reliability Engineer will take ownership of problems or tasks, drive solutions and continuously improve processes.Key responsibilities include establishing end-to-end monitoring and alerting for critical aspects of supported pipelines.Identifying low-hanging fruits, managing and troubleshooting AWS EKS clusters, ensuring reliability and performance, and improving team practices.Ensuring technical solutions meet quality, security and compliance requirements, working directly with key stakeholders and technical teams to ensure solutions have passed required quality checks.Partnering with other SREs on configuration management at scale, and working with software engineers in product development and SREs to define release software steps.Proactively working on toil reduction, efficiency and capacity planning, promoting a culture of shared responsibility.Preferred qualifications include strong focus on innovation, experience working with business partners and vendors, coordination and planning with project management, and strong soft skills.Advanced infrastructure knowledge, security awareness, advanced knowledge of modern DevOps Stack Tools, big picture thinking, network awareness, and practice incident response and blameless postmortems are highly valued.
Solution Architect • Zapopan, Jalisco, México