Job Summary :
The IP4G Engineer is a crucial member of the team, specializing in the integration, optimization, and management of the IP4G environment. This role requires expertise in IBM Power architecture, Google Cloud Platform (GCP), and cloud-native technologies to deliver high-performance, scalable, and cost-effective solutions for clients. The Engineer collaborates closely with cross-functional Engineering and Product teams to design, deploy, and support IBM Power workloads on Google Cloud, ensuring seamless integration and optimal performance.
Essential Functions :
- Deploy code and configure infrastructure components in support of IP4G, leveraging deployment automation tools and infrastructure as code (IaC) practices.
- Participate in workshops and presentations to discuss project requirements, progress, and future roadmap plans.
- Collaborate with team members on best practices to ensure the quality, stability, performance, resiliency, and maintainability of your solutions.
- Engage in and improve the whole lifecycle of services, from inception and design through deployment, operation, and refinement.
- Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews.
- Participate in an on-call rotation with other members of the Platform Operations team.
- Maintain services once they are live by measuring and monitoring availability, latency, and overall system health.
- Scale systems sustainably through automation and evolve systems by pushing for changes that improve reliability and velocity.
- Practice sustainable incident response and blameless postmortems.
- Contribute to public and internal documentation to amplify your impact.
- Collect, analyze, and report on IP4G support metrics / KPIs to identify problems or areas of focus.
- Contribute to the development and continuous improvement of the IP4G infrastructure.
- Collaborate with engineering and observability teams to improve telemetry and log collection, add new dashboards, and create alerts.
- Continuously improve our ability to identify and resolve incidents by enhancing the observability and tools available to our team.
- Ensure adherence to security policies and procedures.
- Other duties as assigned.
Required Skills / Abilities / Competencies
- Development experience with Python and / or Go preferred.
- Excellent verbal and written communication skills.
- Ethical and critical thinking.
- Excellent organizational skills and attention to detail.
- Excellent time management skills with a proven ability to meet deadlines.
- Strong analytical and problem-solving skills.
- Strong supervisory and leadership skills.
- Ability to prioritize tasks and to delegate them when appropriate.
- Ability to function well in a fast-paced and at times stressful environment.
- Proficient with Microsoft Office Suite or related software.
- Proficiency in programming languages.
- Strong background knowledge of IBM Power architecture, AIX / Linux operating systems, virtualization technologies (e.g., PowerVM, PowerVC, OpenStack), storage solutions (e.g., IBM FlashSystem, SAN Volume Controller), or networking technologies (e.g., EVPN / VXLAN, BGP, VLANs, routing, firewalls).
- Proficiency in Google Cloud Platform services, including Compute Engine, Kubernetes Engine, Cloud Storage, Networking, and IAM.
Education and Experience :
Bachelor’s degree in Computer Science, Information Technology, or a related field.
Certification in IBM Power Systems (e.g., IBM Certified System Administrator - AIX, IBM Certified Technical Sales Specialist) and Google Cloud Platform (e.g., Google Cloud Certified - Professional Cloud Architect) is highly preferred.
Proven experience designing, deploying, and managing IBM Power workloads in cloud environments, preferably within a managed services or cloud service provider organization.