Required Skills

Site reliability Engineer

Work Authorization

  • US Citizen

  • Green Card

Preferred Employment

  • W2-Permanent

  • W2-Contract

Employment Type

  • Consulting/Contract

education qualification

  • UG :- - Not Required

  • PG :- - Not Required

Other Information

  • No of position :- ( 1 )

  • Post :- 15th Feb 2024

JOB DETAIL

EAS RDS family is responsible for providing technical support for GTR and TIW applications that are beyond the development stage and are running in the daily operations of the firm. SRE team, part of EAS RDS, works closely with development teams, infrastructure partners, and internal / external clients to improve operational supportability, resiliency and mean time to restore service through non-functional requirements and improvements to support capabilities.

 

Your Primary Responsibilities:

•    Join all project stakeholders planning and design sessions, sprint zero and stand-ups for all new delivery fully understanding the changes and impact.

•    Attend and present operational readiness with application support (EAS L2) at each project management meeting - raise any operational risks and concerns.

•    Partner with IT Embedded Risk Managers to identify strategic solutions for risk incidents.

•    Metrics and Reporting – demonstrate operational improvements through defined KPIs.

•    Ensure NFRs are raised, properly defined and prioritized as part of delivery. 

•    Review all Controls and Alerting for new delivery and ensure it meets operational standards.

•    Test NFRs in UAT environments to validate effectiveness and completeness of operational capabilities.

•    Partner with ETE to drive resiliency testing scenarios.

•    Evaluate how the application behaves for hardware failures in the middle of processing. 

•    Make design recommendations that will allow the application to recover without cleanup activities or create a recovery runbook for application support team to follow for improved application recovery times.   

•    Ensure avoidance of creation of “control of controls” or “alert of alerts” instead of improving application controls and alerting. 

•    Participate in daily EAS RDS L2 activities to understand what can be improved to make support of RDS applications more efficient and organized.   

 

Talents needed for success:

•    Understanding of SRE Principles/Practices and metrics as well as Traceability

•    Excellent Problem Solving skills and passion for automation

•    Hand-on Experience in SQL/PLSQL, Unix, Linux, Windows

•    Working experience in Shell Scripting, Python, Perl, JavaScript

•    Failure Mode Analysis to - evaluate how the application behaves for hardware failures in the middle of processing

•    Good knowledge in AutoSys, ServiceNow, JIRA

•    Demonstrate knowledge of DevOps toolchains and process

•    Monitoring / Big data tools such as Splunk, Dynatrace

•    Knowledge in maintenance and support of AWS functionalities and services

•    Leadership experience is a plus.

Company Information