Should have at least 8-10+ years of overall experience.
Implement and maintain observability using Dynatrace to monitor system health and performance, enabling proactive identification and resolution of issues.
Automate deployment, scaling, and management of containerized applications using Kubernetes and similar technologies.
Collaborate with development teams to enhance the scalability and reliability of applications through reviews and implementing changes to the system architecture.
Participate in on-call rotations, providing timely responses to incidents and ensuring rapid resolution.
Develop automation tools for efficient system management and to reduce human intervention in operational tasks.
Utilize strong problem-solving skills to troubleshoot and resolve infrastructure issues.
Continuously improve system performance by analyzing issues and existing solutions, and implementing changes to hardware, software, or network setups.
Document system architecture and operational procedures to ensure clarity and consistency across the team.
Develop, configure, and optimize cloud-based services and infrastructure to ensure high availability and performance.