US Citizen
Green Card
EAD (OPT/CPT/GC/H4)
H1B Work Permit
Corp-Corp
W2-Permanent
W2-Contract
1099-Contract
Contract to Hire
Consulting/Contract
UG :- - Not Required
PG :- - Not Required
No of position :- ( 1 )
Post :- 21st Jul 2023
· Lead a team of Site Reliability Engineers, following agile methodologies
· Provide technical consultation to, and collaborate with product delivery teams
· Manage requirements gathering and task prioritization
· Collaborate with Site Reliability Engineers and Software Delivery teams to define and implement software deployments, monitoring, and infrastructure requirements
· Ensure platforms are highly available, resilient, fault tolerant, performant, and observable
· Promote SRE and DevOps principles, including automation and self-service
· Ensure Service Level Objectives and Service Level Indicators are defined and measured
· Infrastructure provisioning and management
· Lead development of custom software for automation, observability, or other requirements
· Develop methodologies to safely deploy and test network and infrastructure changes, including customized tests and chaos engineering
· Ensure operational documentation, wikis, and readmes are maintained
· Troubleshooting and problem solving
· Participate in, and lead code reviews
· Mentor, and provide feedback to engineers
· Provide support for operations and delivery teams to remediate production issues as appropriate
· Build cloud-agnostic solutions that can be quickly deployed against a wide variety of cloud computing providers
· Manage a 24/7 on-call rotation
Basic Qualifications
· Bachelor’s degree in Computer Science, Information Technology, or a relevant field
· Experience managing DevOps, SRE or software engineers
· Strong written and verbal communication skills
· Track record of running high-performance teams
· Software development experience plus architecting and designing cloud software/platforms
· Good understanding of internet protocols and cybersecurity best practices.
· Experience with CDN delivery providers and concepts
· Demonstrated experience with large scale 24/7 production environments
· Infrastructure as code experience, preferably Terraform, Ansible
· Experience with CI/CD pipelines and tools, preferably Concourse
· Basic networking (Load Balancing, Routing, Security Groups, VPC, Subnetting)
· Linux
· Containerization experience (Docker, Kubernetes, Helm)
· Experience with at least 1 major cloud platforms (AWS/GCP/Azure)
· Experience with monitoring tools, preferably Prometheus/Grafana
· Experience with logging tools, preferably ELK, Splunk, or Loki
Desired Characteristics
· Experience with a digital media direct-to-consumer business highly preferred.
· Certification in AWS, GCP, or Azure a plus
· Exceptional verbal and written communication skills, comfortable communicating with technical and non-technical colleagues and executives
· Ability to understand large complex software systems and their interdependencies