Roles and Responsibilities
- Support and help manage the whole AWS infrastructure for all production sites for world class uptime and resiliency metrics.
- Help Build, scale, and secure application cloud infrastructure using tools like Terraform, Kubernetes, and Docker.
- Build and maintain robust CI/CD pipelines with code Deploy and Bitbucket pipelines
- Advocate and implement industry best practices for configuration management and build/deployment automation
- Work closely with developers to provide insight into operational, security, and performance considerations
- Work closely with developers during the deployment and testing phases to provide insight into operational, security, and performance considerations
- Participate in an on-call rotation to triage and analyze abnormalities in system operation leveraging instrumentation like ELK
- Perseverance to debug complex problems across the whole stack
- Create tooling that works across cloud providers like AWS, Azure
- Help optimize and define engineering processes.
Desired Candidate Profile
- Advanced Degree in Computer Science or relevant engineering discipline
- 8+ years of experience in DevOps/Systems Administration with 4+ years of experience with cloud-based provisioning, monitoring, and troubleshooting (preferably AWS or Azure) applications.
- Expert Practitioner experience with containerization (docker & Kubernetes), cloud technologies, tools (Jenkins, CodeDeploy) and practices (CI/CD patterns, automated provisioning & release, GitOps, IaC)
- Hands on experience Deploying and managing Highly Available, Scalable and resilient AWS/AZURE cloud application.
- Expertise in Infrastructure automation tools like Terraform, Ansible or CloudFormation
- Strong knowledge in at least one scripting language, preferably Python/Golang
- Strong experience of monitoring solution like Prometheus, Grafana, Kibana, ELK
- Outstanding interpersonal skills, the ability to innovate, inspire, and collaborate with cross group/functional teams with a high degree of independence and success
- Excellent written and verbal communication skills targeting a broad range of audiences from engineers to senior leadership.