- Deploy cloud-based enterprise applications into K8S cluster on GCP, AWS and/or BTP.
- Monitor production systems and trigger production issue alert notification.
- Manage critical incident, including escalation, debugging, fix and root cause analysis.
- Help engineer to debug production issues by looking for error in Splunk, application logs.
- Create/maintain knowledge base documentation in the area of DevOps.
- Create monitoring SRE dashboard in Splunk and Dynatrace.
- Designing and deploying robust infrastructure solutions to support SAP products and services.
- Ensure that the infrastructure needs are met and aligned with business objectives.
- Automation deployment of cloud resources to enhance the processes and procedures efficiency.
- Collaborate with engineering and product management teams as well as other service groups to deliver scalable applications that meet the customer demand.
Experience (Role Requirements)
- Familiar with Linux admin task and debugging technics.
- Familiar with shell langrage, such as bash or python.
- Familiar with public cloud infrastructure and operation (GCP, AWS and BTP is preferred).
- Familiar with log analysis, such as Splunk or ELK.
- Familiar with Dynatrace, Akamai, Istio and Apache.
- Familiar with SRE operation and on-call procedure.
- Familiar with K8s, helm charts for packaging, configuring, and deploying applications.
- Familiar with building Jenkins pipeline
- Excellent verbal and written communication skills.
- 3-year DevOps/SRE experience
- 2-year Public cloud experience
Plus:
- SAP BTP experience
- Direct FedRAMP experience.
- Cloud data security
- Java development experience
- Test automation experience.
- Leadership experience in SRE/DevOps
- Ariba Network and/or Ariba Procurement product knowledge
- Customer escalation experience