- Standard RPE and excellent communication skills both written and verbal.
- Strong Linux skills
- Experience w/ Python for task automation
- Good communication skills
- Experience with Incident management processes
- Oncall support is required
- Strong Linux troubleshooting skills
- Task automation experience in any programming language
- Practical experience of at least one pillar of observability (metrics, logs or traces)
Exhibit working knowledge in at least ONE of the following areas
- SQL
- REST services (API)
- Load balancing and networking
- Performance troubleshooting and resolution
- Confident collaboration skills
Desired Skills
- Python development for task automation
- Experience with site reliability engineering practices, like service level objectives (SLOs), error budgets, blameless postmortems, toil reduction
- Prior experience creating operational dashboards (Splunk, Grafana, etc)
- Experience administering and/or supporting ServiceNow