Job Overview
SRE – Site Reliability Engineer
We are currently seeking a Site Reliability Engineer to join our team in Guadalajara, Jalisco, Mexico. In this role you will perform L1.5 activities including monitoring, deployment, and rollback. You will monitor the efficiency of Azure cloud systems to prevent outages and initiate an Incident Management bridge during outages. You will troubleshoot Azure resources and elevate issues to Level 3 (Software Development Team).
Responsibilities
- Monitor Azure cloud infrastructure and services.
- Perform L1.5 tasks such as deployment and rollback.
- Initiate Incident Management bridge during outages.
- Troubleshoot Azure resources and coordinate with Level 3 team.
Key Qualifications
Azure Fundamentals certification preferred or a degree in Computer Science / Information Systems Management.Experience with PaaS and IaaS services : VMs, Storage, EventHub, Service Fabric Cluster, Azure Kubernetes Service, CosmosDB, SQL Server, IoT Hub, Databricks, KeyVault, Datalake.Knowledge of IoT concepts : telemetry, ingestion, processing, data storage, reporting.Proficiency with deployment and configuration tools : Octopus, Bamboo, Terraform, Azure DevOps, Jenkins, Github, Ansible.Experience with container orchestration platforms such as Kubernetes.Experience scripting with PowerShell and Python.Understanding of NoSQL and SQL databases and their maintenance.Experience with monitoring and logging systems : LogAnalytics, Splunk, ELK, Prometheus, Nagios, Zabbix.Strong independent thinker who identifies proactive solutions.Seniority Level
Entry level
Employment Type
Full‑time
Job Function & Industries
Engineering & IT; IT Services & IT Consulting
#J-18808-Ljbffr