Citizen
Full Time
Direct Hire
UG :- - Not Required
PG :- - Not Required
No of position :- ( 1 )
Post :- 11th Nov 2022
The systems reliability engineer will be responsible to incorporate aspects of software engineering and applies them to infrastructure and operations problems. The main goal of a systems reliability engineer is to create scalable and highly reliable systems. A systems reliability engineer (SRE) will spend up to 50% of their time doing operations related work such as supporting issues, on-call, writing documentation, and system management. Since the system that an SRE oversees is expected to be highly automatic and self-healing, the SRE should spend the other 50% of their time on development tasks such as new features, scaling, and automation.
Primary Responsibilities
Bring the SRE mindset for Availability, Reliability, Scalability, Disaster Recovery, Problem/Incident Management, and Performance of production services.
Manage JAMF Casper Suite and infrastructure, including deployment, administration and integration with AD and AzureAD.
Manage SCCM and Intune, including deployment, administration, and integration with AD and AzureAD
Provide advance support on all client related systems and serve as an escalation point for Global EUS
Evaluate, select and integrate 3rd party products in the support of new or existing services.
Provide technical leadership on large/complex systems and platform projects.
Build tooling to support the automation, management, and reliability of applicable systems.
Build and/or support release pipelines for applicable systems.
Works as part of a team to continuously evaluate, troubleshoot, and improve existing systems.
Manage the system lifecycle from design and implementation, to turn-down and decommissioning.
Write documentation for peers and stakeholders supporting applicable systems.
Work with business partners to define SLOs and SLIs and build robust monitoring solutions supporting agreed upon metrics.
Lead communications efforts regarding both system issues/activities as well as blameless post-mortems with all stakeholders.
Provide mentorship to junior team members.
Be part of the on-call rotations to provide 24/7 support.
Responsible for upholding F5 s Business Code of Ethics and for promptly reporting violations of the Code or other company policies.
Knowledge, Skills and Abilities
Advanced understanding of Windows and Apple Clients
Admin level knowledge of SCCM, JAMF, Intune, Autopilot, Device Trust
Good knowledge of AD, Azure, VDI, VMware
Knowledge of client apps for Security and backups Code42, Cylance, Win Defender, Malware Bytes, etc..
Develop and maintain automated tasks using scripting
Strong ability to effectively multitask and prioritize work, juggling daily support responsibilities with multiple project/product driven activities.
Strong troubleshooting and problem-solving skills
Strong ability to work with customers and business partners such as Customer Service and Product Management to turn business requirements into technical implementation.
Scripting and automation experience
Solid ability to work independently or as part of a team to deliver features on agreed upon timelines.
Qualifications
Bachelor s Degree in computer science or related field and 3+ years of experience or equivalent combination of education and experience
Minimum 2 years in an enterprise-level system engineering or reliability engineering role
Strong base knowledge of operating systems, networking basics, and security best-practices.
Working knowledge of Agile delivery and Devops principles
Working knowledge of at least one Public Cloud provider (Azure, AWS, GCP, etc.) and/or applicable SaaS platform (Salesforce, etc.)
Proficiency in one or more programing or programing/scripting language (PowerShell, Python, Bash, and/or Java preferred)
Strong understanding of DevOps tools likes Jenkins or Azure DevOps
Familiarity with IT governance methodologies (ITIL, COBIT, MOF, etc.)
Working knowledge of security best practices
Technical certification(s) desired