- 9+ years as a Site Reliability Engineering or DevOps Engineer
- Experience solving for scalability, performance, and stability
- Expert knowledge of Linux operating systems and environment and Scripting (Shell and Python preferred)
- Expert at troubleshooting complex system and application stacks
- Operational Experience in Big Data Stacks ( Hadoop ecosystem, Spark is a plus)
- Operational Experience in real-time ,streaming and data pipelines relevant frameworks ( Kafka and NiFi is a plus)
- Operational experience troubleshooting network/server communication
- Experience with performance Tuning of Database Schemas, Databases, SQL, ETL Jobs, and related scripts
Expertise in enterprise metrics/monitoring with frameworks such as Splunk, Druid, Grafana