Join to apply for the Cloud Operations Software Engineer role at Peterson Technology Partners .
We are seeking a Cloud Operations Software Engineer with strong programming and automation skills to join our team. This role is ideal for someone who can write reliable Python code, manage infrastructure with Terraform, and continuously improve support processes. You will help enhance system reliability, build self-service solutions, and reduce operational toil through automation and clean engineering practices.
Responsibilities
- Develop Python-based automation scripts and tools to improve operational efficiency, incident response, and environment management.
- Build, manage, and maintain cloud infrastructure using Terraform and best practices for Infrastructure-as-Code.
- Partner with operations and support teams to analyze recurring issues and design automation or tooling to reduce manual intervention.
- Enhance and standardize support processes, including incident management, escalation, and root cause analysis.
- Improve system observability and monitoring by integrating logs, metrics, and alerts into centralized platforms.
- Participate in on-call rotations and drive improvements to reduce noise and increase self-healing capabilities.
- Document processes, runbooks, and technical designs to ensure repeatability and knowledge sharing.
- Collaborate with cross-functional teams to ensure systems are scalable, secure, and cost-efficient.
- Build self-service and reliable systems that engineers can easily consume.
- Provide support to internal customers for CI / CD pipelines and AWS cloud components.
- Identify and implement opportunities to reduce technical debt and streamline operations.
- Recommend sensible defaults and guide teams on clean code practices to improve maintainability.
Qualifications
Required
Strong programming experience with Python (automation, tooling, integrations).Hands-on experience with Terraform for infrastructure provisioning and management.Solid understanding of cloud platforms (AWS preferred, Azure or GCP a plus).Experience with CI / CD pipelines (GitHub Actions, GitLab CI, Jenkins, etc.).Familiarity with observability tools (Datadog, Prometheus, Grafana, ELK).Understanding of incident management and support workflows.Knowledge of networking, IAM, and cloud security best practices.Preferred
Experience integrating automation into support platforms (ServiceNow, PagerDuty, Slack, etc.).Exposure to event-driven architectures (serverless, webhooks, or event bus).Familiarity with containerization and orchestration (Docker, Kubernetes).Knowledge of ITIL or operational frameworks and process optimization.Soft Skills
Problem-solver with a mindset to reduce manual work through automation.Strong communication and collaboration skills across engineering, support, and business teams.Analytical thinker who can spot patterns in incidents and design systemic fixes.Proactive, adaptable, and focused on continuous improvement.Salary / Rate : $20-$25 / hour (depends on experience level). This is a contract to hire position with candidates expected to work 40 hours / week. Full-Time Conversion Salary : MXN 75,000–MXN 80,000 per month.
Seniority level
Mid-Senior levelEmployment type
ContractJob function
Engineering and Information TechnologyIndustries
Staffing and Recruiting#J-18808-Ljbffr