Roles and Responsibilities
Minimum Knowledge and Skills:
- 6-8 years of total IT experience, with a degree in Computer Science, Information Technology, or equivalent
- At least 2 years of experience deploying on and operating in AWS
- At least 1-2 years in an AWS infrastructure support role: resource provisioning, monitoring, analytics, and logging (EC2, CloudWatch, CloudTrail, S3, storage, Route 53, VPC, etc.)
- At least 2 years of hands-on experience in Windows Server troubleshooting, Active Directory, and SQL (basic queries, jobs, backup/restore, etc.)
- Strong experience managing Windows Server infrastructure, with the ability to troubleshoot performance issues, analyze bottlenecks, and recommend improvements to solution performance
- At least 1-2 years of experience in SQL database backup, restore, ODBC, maintenance, and troubleshooting
- At least 1 year of automation experience, with an automation mindset and a drive to reduce toil in everything we do, using at least one of the following:
  - PowerShell or Python
  - AWS Systems Manager, Chef, Puppet, Ansible, or any other automation tool
  - Terraform or CloudFormation
- At least 1 year of working knowledge of monitoring tools such as SolarWinds, Sumo Logic, Nagios, Splunk/ELK/EFK, Grafana, etc.
- Basic experience with migrations or upgrades of infrastructure OS and applications such as Windows, SQL Server, IIS, etc.
- Knowledge of disk space/storage solutions, Task Manager, PerfMon, and Sysmon
- Knowledge of printers, drivers, spooler, etc.
- Knowledge of thin vs. fat client systems, general server OS functionality, Terminal Services, and profile types
- Basic knowledge of networking, firewall management, SMTP, and latency troubleshooting
- Excellent communication and reasoning skills, team chemistry, and soft skills
- Logical thinker with good analytical and problem-solving skills
- Knowledge of Active Directory and VMware/virtualization
- Working knowledge of ticketing tools such as Salesforce, ServiceNow, etc.
Job Responsibilities
- Drive a culture within the operations team to adopt DevOps and improve operational excellence
- Collaborate on the Site Reliability Engineering framework, including integrated, automated monitoring and self-healing
- Ensure high uptime and reliability of public-facing production infrastructure and applications
- Execute infrastructure tasks within SLAs to improve reliability, quality, security, and performance
- Work in complex, high-availability, 24x7x365 SaaS environments
- Provide one-stop support as part of our dedicated hosting support team and technicians
- Provide 24/7 monitoring and support the data recovery solution
- Participate in on-call responsibilities when needed (P1/P2 issues)
- Simplify implementations and upgrades
- Collaborate with various R&D teams and client services teams across products
- Reduce toil through automation
- Manage assigned work requests, incidents, and tickets
Education & Experience
- Required:
  - Degree in Computer Science/Technology
  - 6-8 years of experience in the Information Systems industry
- Preferred:
  - Microsoft certifications (MCP, MCSA)
  - AWS Certified Cloud Practitioner (CCP) or any other AWS certification
  - Software deployment automation (2-5 years)
  - DevOps