Excellent stakeholder management skills and a proven ability to build strong relationships and trust throughout the organization, including with senior leadership
Familiarity with Kanban/SCRUM to prioritize work and set strategic goals
Experience supporting distributed systems within a cloud platform
4+ years of experience working with GCP/AWS Cloud
4+ years writing applications in Java (or equivalent development experience)
2+ years of experience working with Kubernetes or other container-based technology
2+ years working with CI/CD tools and technology; like ArgoCD, Github Actions and Artifactory
2+ years working with Terraform or equivalent IaC
Ability to document and publish recommendations and guidance
Key Responsibilities:
Driving reliability improvements back into applications
Building code to resolve reliability/resiliency issues
Mentor and educate team members to aid in strengthening technical
Collaborate closely with cloud architects to drive cloud solutions
Curating proper SLI/SLOs to accurately measure or assess error budgets
Embed with the development teams to assist with cloud methodologies when developing products to ensure that the deliverable is as reliable as possible
Work with development teams to build and strengthen application security and compliance
Manage high impact situations that involve technically challenging issues across diverse audiences and drive to find the root cause, mitigate, and identify a solution