Us Citizen
Green Card
Corp-Corp
Consulting/Contract
UG :- - Not Required
PG :- - Not Required
No of position :- ( 1 )
Post :- 5th Nov 2021
Summary: The Site Reliability Engineer serves as an infrastructure and operations engineer within the eCommerce department. This role possesses a mix of development, networking, security, and system administration skills, as the Site Reliability Engineer is required to provide developer support, application systems administration, production support, and many other tasks. The Site Reliability Engineer works closely with developers, facilitating the development process by removing obstacles interfering with development, and by satisfying infrastructure requirements or delegating them to the infrastructure group for fulfillment.
GENERAL RESPONSIBILITIES
Support Commerce PKS/Kubernetes environments
· Installing and operating Kubernetes and working with Jenkins X and Istio/Ingress technologies
· Implementing DevOps practices based on a strong understanding of principles
· Working tightly with our developers, understand and capture their requirements
· Monitoring and troubleshooting multiple Kubernetes environments
Support eCommerce environments
· Engineer extensive scripting and automation to enable applications to install and run in all environments with minimal manual intervention
· Evaluate, test, deploy and maintain both custom developed and third-party software upgrades
· Provide developer support on an ongoing basis, frequently embedded in development teams to facilitate collaboration
· Create & maintain application architecture and troubleshooting documentation
· Provide 24x7 production support as part of a team rotation, resolving or escalating issues as appropriate
· Maintain production services to highly demanding SLA's
· Take ownership of production issues, working closely with the infrastructure and development teams on issue resolution
· Support releases on a regularly scheduled basis, as well as emergency releases as needed
· Deploy application and data changes to all environments as needed
Planning, Design and Implementation
· Design and implement new environments, services and application architecture modifications
· Research, evaluate and implement operational improvements, application packages and architectural modifications
· Participate in change control, release planning, and other operational planning
· Remain current on both private and public cloud hosting technologies
· Remain current on modern open-source persistence technologies (Hazelcast, Couchbase, Cassandra, MEMCACHED, etc.)
· Remain current on modern containerization technologies (Docker, Kubernetes)
QUALIFICATIONS
Bachelor's degree in computer science or equivalent experience
5+ years production application support experience in a high uptime environment
5+ years UNIX administration and .Net framework experience including diagnosis of performance issues, package management, load estimation, kernel tuning, networking configuration, etc.
5+ years hosting experience in a large heavy-traffic environment
3+ years of infrastructure as code experience (various languages – Java, Python, GO, C#)
Strong scripting skill and understand or worked with automation tools
Working experience with Jenkins/Jenkins X, Python, Jira, Ansible, Artifactory, and Confluence
Understand or worked with CI/CD environment
Understanding of networking principles, esp. TCP/IP
Excellent troubleshooting and analytic skills
Ability to work independently on large, complex projects with minimal guidance
Working experience of container technologies Docker and Kubernetes prefer VMware PKS
Nice to have some experience or knowledge
Service Discovery systems such as Etcd, Zookeeper, Consul or equivalent
Networking systems like NGINX, Istio, or equivalent
Experience designing and implementing systems to automate platform level features, such as configuration management, network routing, blue/green or canary deployments, etc.
Vault, Packer, Azure, GCP, NGINX