Required Skills

Site Reliability Engineer

Work Authorization

  • US Citizen

  • Green Card

  • EAD (OPT/CPT/GC/H4)

  • H1B Work Permit

Preferred Employment

  • Corp-Corp

  • W2-Permanent

  • W2-Contract

  • Contract to Hire

Employment Type

  • Consulting/Contract

education qualification

  • UG :- - Not Required

  • PG :- - Not Required

Other Information

  • No of position :- ( 1 )

  • Post :- 18th Aug 2023

JOB DETAIL

Job Description:  

  • Develop and maintain comprehensive monitoring solutions for cloud-based services and applications.
  • Configure monitoring tools and systems to collect relevant metrics, logs, and traces.
  • Create custom monitoring dashboards and reports using DataDog or other tools, to provide real-time insights into system performance and health.
  • Continuously monitor the cloud infrastructure's performance and capacity, anticipating and addressing potential scalability issues.
  • Proactively suggest and implement improvements to enhance the system's reliability, resilience, and fault tolerance.
  • Work on automating tasks to streamline operational processes and reduce manual intervention.
  • Collaborate with cross-functional teams to investigate and resolve critical incidents, ensuring minimal impact on end-users.
  • Work with Problem Management team to complete post-mortem analysis of incidents to identify root causes and implement preventive measures.

 

Ideal Qualifications:

 

  • 3+ years’ experience working with cloud platforms and services (AWS, Azure, GCP, etc.) in a production environment.
  • Solid understanding of monitoring and logging tools, such as Prometheus, Grafana, ELK stack, Splunk, etc.
  • Experience with infrastructure as code (IaC) tools, like Terraform, CloudFormation, or Ansible.
  • Strong scripting and automation skills (e.g., Python, Bash) to facilitate operational tasks.
  • Knowledge of containerization technologies (Docker, Kubernetes) and microservices architecture.
  • Familiarity with DevOps practices and Agile methodologies.

 

Company Information