Apply for this opportunity

This job application is on an outside website. Be sure to review the job posting there to verify it's the same.

Apply Offsite

Lead SRE

Job

ARK Infotech Spectrum

Remote

Full-Time

Posted 2 weeks ago (Updated 2 weeks ago) • Actively hiring

Expires 6/20/2026

See Job Scorecard

Review key factors to help you decide if the role fits your goals.

How is this calculated?

Pay Growth

out of 5

Not enough data

Not enough info to score pay or growth

Job Security

out of 5

Not enough data

Calculating job security score...

Total Score

out of 100

Average of individual scores

Were these scores useful?

Skill Insights

Compare your current skills to what this opportunity needs—we'll show you what you already have and what could strengthen your application.

Job Description

Job Title:

Lead Integration & Observability Specialist (SRE Lead)

Location:

McKiney, TX (Hybrid role)

Client:

NTT DATA

Globe Life Insurance Job Summary:

We are seeking a Lead Integration & Observability Specialist to design, implement, and lead enterprise observability and reliability solutions , while supporting cloud-based integration platforms on AWS/Azure . The role focuses on monitoring, automation, and operational readiness of applications, APIs, data pipelines, and messaging systems. This is a hands-on technical leadership role with mentoring and solution ownership responsibilities. Key Responsibilities Lead the implementation of enterprise observability for applications, APIs, services, batch jobs, and data pipelines. Design and standardize monitoring, alerting, logging, metrics, and health checks across distributed systems. Integrate observability platforms with incident management and automation tools to support proactive issue detection and remediation. Support reliability and availability of integration platforms built on AWS/Azure Perform advanced troubleshooting using logs, metrics, and traces to resolve production issues. Define operational readiness standards and non-functional requirements. Mentor engineers on observability best practices and platform usage. Collaborate with product, support, and operations teams to improve service stability and delivery. Required Skills (Mandatory) 15+ years of overall IT experience 7+ years of relevant experience in Observability / Monitoring / Reliability Engineering Strong hands-on experience with enterprise observability tools , such as: Instana, Dynatrace, AppDynamics, Prometheus, Grafana Expertise in: Monitoring and alerting design Log management and analysis Metrics and distributed tracing Health checks and SLO/SLI concepts Experience monitoring AWS/Azure workloads Strong troubleshooting and incident analysis skills Experience defining operational and non-functional requirements Technical leadership and mentoring experience Automation and ITSM integration (ServiceNow workflows, incident automation) CI/CD and release management exposure Cloud integration and messaging exposure Automation and ITSM integration (ServiceNow workflows, incident automation)