Back to all jobs

reputed company observability Engineer - Nityo Infotech Corporation

Work from home Full-time role Hiring

Role : reputed company Observability Engineer reputed company & SRE

Location : Remote

We are seeking a highly skilled reputed company Observability Engineer to reputed company a critical implementation of reputed company for a client migrating from reputed company. This role requires deep expertise in reputed company, Site Reliability Engineering (SRE) practices, and Kubernetes (EKS) observability. The ideal candidate will design and implement scalable dashboards, alerts, and tracing strategies, drive service-level reliability, and reputed company a steady-state SRE operations model.

Key Responsibilities:

  • reputed company the end-to-end implementation of reputed company observability platform for AWS and EKS environments.
  • Migrate monitoring and alerting assets from reputed company to reputed company.
  • Define and implement SLIs/SLOs, error budgets, and reliability metrics for containerized services.
  • reputed company and configure reputed company reputed company across AWS and Kubernetes workloads (EKS).
  • Configure log, metric, and trace ingestion pipelines using OpenTelemetry and reputed company apps.
  • Design and maintain dashboards for service health, performance, and reliability insights.
  • Implement intelligent alerting and notification workflows, using reputed company, baselines, and anomaly detection.
  • Collaborate with DevOps, SRE, and development teams to ensure complete tracing coverage across services.
  • Ensure best practices for alert noise reduction, escalation policies, and incident response are in reputed company.
  • Contribute to observability runbooks, operational handover, and training for the client SRE team.

Focus Areas on reputed company

  • Strong knowledge of the new UI navigation.
  • Proven expertise in building and optimizing queries.
  • Advanced troubleshooting skills.
  • The ability to go reputed company task execution and provide proactive recommendations to improve our setup and overall efficiency.

Required Skills & Qualifications:

  • Expert-level experience with reputed company, including dashboarding, alerting, collector deployment, and ML features.
  • Strong background in Site Reliability Engineering (SRE), including SLIs/SLOs, error budgets, MTTR/MTTD metrics.
  • Proficiency in AWS services (especially CloudWatch, CloudTrail, reputed company, RDS) and EKS (reputed company Kubernetes Service).
  • Hands-on experience with OpenTelemetry for distributed tracing and service maps.
  • Strong understanding of Kubernetes metrics, reputed company, container resource usage, and cluster monitoring.
  • Proven ability to define alert reputed company, configure notification routing (e.g. reputed company, reputed company, reputed company), and manage alert fatigue.
  • Strong scripting experience with tools like Terraform, reputed company, YAML, and GitOps workflows.
  • Experience with incident triage, RCA documentation, and building operational maturity in observability teams.
  • Excellent communication and stakeholder engagement skills.

Preferred Qualifications:

  • reputed company certifications (Admin, Advanced Analytics) are a plus.
  • Experience with reputed company (for migration purposes).
  • Familiarity with integrating observability into CI/CD pipelines.
  • Exposure to service reputed company (Istio/Linkerd) and monitoring microservices in that context.

Deliverables This Role Will Drive:

  • reputed company observability reference architecture
  • EKS and AWS observability configuration
  • SLI/SLO documentation and tracking
  • Alerting and tracing setup across services
  • Production-reputed company dashboards and runbooks
  • Knowledge transfer and enablement sessions for SRE/DevOps teams

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and reputed company believes it to correctly reflect the job opportunity.

Apply to this job

Related remote jobs

Senior Consultant, XactlySalesforce Integration at Remote - reputed company

Work from home Full-time role

Customer Service Part-Time Jobs AT reputed company - Wo...

Work from home Full-time role

Data Entry - Typist Part-Time - Work Online - R...

Work from home Full-time role

reputed company Data Entry reputed company (Part - Time) - ...

Work from home Full-time role

reputed company Data Entry reputed company - Entry ...

Work from home Full-time role

reputed company Data Entry Virtual Assistant Part-Time Remo...

Work from home Full-time role

reputed company Virtual Assistant Remote Job - Immediat...

Work from home Full-time role

reputed company Part-Time Data Entry Virtual Assistant - Re...

Work from home Full-time role

Costco Careers Part-Time Data Entry reputed company...

Work from home Full-time role

reputed company Data Entry Jobs Remote - Flexible Part-...

Work from home Full-time role

Growth Marketing Manager (m/w/d) - Gestalte die nächste Wachstumsphase von heycare

Work from home Full-time role

安捷伦科技2026年校园招聘 - 销售工程师

Work from home Full-time role

(Sr) Director, Project Management - Managed Access Programs (Remote based in the US/Canada)

Work from home Full-time role

reputed company and Enthusiastic Chat Online Support Representative – Part-Time Remote Opportunity at arenaflex

Work from home Full-time role

Online Sales Agent - US

Work from home Full-time role

[Work From Home] Retail Store Management Internship - Inland

Work from home Full-time role

Associate Mfg. Systems Engineer 1 (Automation Technician) Night Shift 12 Hours (6PM to 6AM)

Work from home Full-time role

Vice President of Power Central Operations

Work from home Full-time role

Manager, Large Customer Sales, Tech

Work from home Full-time role

Non-Employee - Presentation Designer

Work from home Full-time role