Overview
Senior Technical Recruiter specializing in End to End Recruitments
Responsibilities
- As a Sr Site Reliability Engineer on this team, you’ll be responsible for design, development and implementation of cloud based technologies. Provide technical expertise on complex projects and advanced troubleshooting of existing Cloud technology for use by department. Such as guidance and support in the development of progress at all system layers, including data, processing, and back-end systems.
Major duties and responsibilities include :
Performs implementation of software solutions to improve reliability and observabilityPerforms technical implementations for our CaaS / PKS platformExperience with migrating workloads from on-prem / off-prem cloudRecommend settings for applications, operating systems, networks, and cloud services to improve performance, security, and reliabilityCollaborate with a growing team of Cloud Engineers and the Cloud Ops team to develop and support the Client’s IT Cloud StrategyResponsible for the implementation of security best practices and initiatives throughout all layers of the Cloud modelActively and consistently supports all cloud efforts to simplify and enhance the customer experience. Analyze, troubleshoot and resolve system, software, network, and storage failures for a globally distributed cloud infrastructureAccountable to help define and drive out best practices for monitoring, security, and platform reliabilityNSX-T Qualifications
Configure and deploy Kubernetes clusters, Tanzu Kubernetes Grid (TKG), and NSX-T components, such as logical switches, routers, firewalls, load balancers, and VPNsAutomate network provisioning and configuration tasks using Ansible playbooks and templates; develop and maintain network documentation, diagrams, and procedures for deployment, operation, and maintenanceMonitor network performance, capacity, availability, and security using tools such as Grafana, Prometheus, Nagios, and NSX-TTroubleshoot network issues, diagnose root causes, and implement solutions using standard protocols and tools such as TCP / IP, DNS, DHCP, SNMPDesign, deploy, and maintain a secure, reliable, and scalable network infrastructure based on Tanzu, Kubernetes, and VMware NSX-TAutomate network provisioning, configuration, and management tasks using Ansible. Ensure high availability, performance, and compliance of network services and applications running on the infrastructureMonitor and troubleshoot network issues, performance bottlenecks, and security threats proactively. Collaborate with other teams, such as DevOps, security, and infrastructure, to integrate and optimize network services and applicationsWhat Our SR Site Reliability Engineers Enjoy Most
Ability to work with cutting edge technologyCollaboration with a growing team of Cloud Engineers to solve and work toward a common goalHave the ability to work with different software packages and to enhance the learning experience and knowledge of the engineerQualifications
Required Qualifications
4+ years of Network experience4+ years of System Administration experience4+ years of Troubleshooting1+ year of Container Services2+ years of ScriptingBachelor’s degree in computer science or related field, or equivalent experienceProven experienced with the VMware suite of productsProven experienced with managing both physical and Virtual infrastructureExperienced with multiple operating systems (e.g. Windows and Linux)Hands-on experience in one or more of cloud computing services (e.g. AWS, Microsoft Azure, Client Cloud Platforms, Client, etc.)Familiar knowledge hands-on experience with a variety of cloud service models (e.g. Private, Public, Multi-Cloud)Familiar with CI / CD experience with Puppet, Ansible, JenkinsExperience managing monitoring and alerting toolsFamiliar with containerized workloads (e.g. Kubernetes, Openshift, TKGI)Experienced with firewalls, routing and load balancingSkilled in troubleshooting methodologiesMust have excellent written and oral communications, including technical documents, and process documentsRequires attention to detail and excellent organizational skillsExperience managing small projectsSelf-starter, ability to manage tasks with little supervisionAbility to read, write, speak and understand EnglishAbility scripting in one or more languages (e.g. Python, Shell, PowerShell, Ansible or Perl)Ability to contribute independently as well as be a team playerPreferred Qualifications
5+ years of VMware System Administration experience2+ years of TKGI Enterprise Pivotal Container Services2+ years of VMware NSX-T3+ years of vROPs, Log Insight, vRNI, vRIL3+ years of Cisco networking3+ years of Firewall configuration management3+ years of Load Balancer configuration management#J-18808-Ljbffr