Senior Site Reliability Engineer

Job

Fieldguide

[Unknown City], CA (In Person)

Full-Time

Posted 2 days ago (Updated 7 hours ago) • Actively hiring

Expires 6/10/2026

Apply for this opportunity

This job application is on an outside website. Be sure to review the job posting there to verify it's the same.

See Job Scorecard

Review key factors to help you decide if the role fits your goals.

How is this calculated?

Pay Growth

out of 5

Not enough data

Not enough info to score pay or growth

Job Security

out of 5

Not enough data

Calculating job security score...

Total Score

100

out of 100

Average of individual scores

Were these scores useful?

Skill Insights

Compare your current skills to what this opportunity needs—we'll show you what you already have and what could strengthen your application.

Job Description

Job description Who you are 5+ years of experience in site reliability engineering, infrastructure, or a related software engineering discipline Strong experience operating and scaling distributed systems in cloud environments, with AWS preferred Hands-on experience building and managing observability platforms (e.g., Datadog, Prometheus, Grafana, CloudWatch) Experience defining SLOs/SLIs and leveraging them to inform and drive engineering priorities Proficiency with Infrastructure as Code tooling, particularly Terraform or equivalent Deep understanding of system performance, reliability patterns, and distributed system failure modes Experience supporting production systems through on-call rotations and incident response Proficiency in at least one programming or scripting language used for automation and tooling Strong communication and collaboration skills, with the ability to work effectively across engineering and product teams Experience implementing distributed tracing systems, such as OpenTelemetry or similar frameworks Experience with capacity planning and performance benchmarking at scale Familiarity with database performance tuning and observability across high-traffic systems Exposure to regulated or compliance-heavy engineering environments (e.g., SOC 2, FedRAMP, or equivalent frameworks) Experience applying chaos engineering practices to proactively test and strengthen system resilience What the job involves As a Senior Site Reliability Engineer (SRE) at Fieldguide, you will be responsible for ensuring the reliability, scalability, and observability of our production systems. You will apply software engineering principles to infrastructure and operations, designing systems that are resilient, highly available, and capable of scaling with rapid growth You'll work closely with product and platform engineering teams to define and implement reliability standards, improve system performance, and build robust observability practices. This role is central to maintaining a high level of trust in our systems by proactively identifying risks, reducing toil through automation, and driving operational excellence Design and operate highly scalable, fault-tolerant systems that support production workloads across a distributed cloud environment Define and implement Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets to guide reliability decisions Build and improve observability systems (metrics, logs, tracing) to provide deep visibility into system behavior and performance Lead efforts to improve system reliability and performance, including capacity planning, load testing, and performance tuning Automate operational processes to reduce manual toil and improve system consistency and resilience Partner with engineering teams to design systems with reliability and scalability built in from the start Participate in and improve incident response, on-call practices, and post-incident reviews, focusing on root cause analysis and systemic improvements Drive continuous improvement of system resilience, including disaster recovery and chaos testing Establish best practices for monitoring, alerting, and incident management to ensure rapid detection and resolution of issues Advocate for reliability-focused engineering culture, including blameless postmortems and operational excellence Benefits Health Dental PTO Keywords site-reliability-engineering-sre distributed-computing amazon-web-services observability datadog prometheus grafana amazon-cloudwatch infrastructure-as-code-iac terraform incident-response scripting opentelemetry planning-and-forecasting electrical-engineering-and-planning planning-and-design capacity-planning benchmarking freight-rate-analysis-benchmarking vehicle-modification-tuning compliance service-organization-controls-soc security-operations-center-soc system-on-a-chip-soc soc-2 soc-ii-compliance federal-risk-and-authorization-management-program-fedramp chaos-engineering policies-and-practices sensors-test-measurement operational-excellence visual-art-design product-development-and-design environment-health-and-safety-hsse ecology-environment objectives-and-key-results errors-omissions-e-o error-budget testing-and-analysis performance-testing load-testing scalability built-in root-cause-analysis-rca continuous-improvement-process-cip disaster-recovery repair-and-recovery incident-and-problem-management incident-breach-management paid-time-off ashby

Similar remote jobs

Job
Freelance In Person Event Specialist
V
Visit.org
Los Angeles, CA
Posted2 days ago
Updated7 hours ago
Job
Entry Level | Customer Care Coordinator | Online
A
Aisles & Abroad Careers
New York, NY
Posted2 days ago
Updated7 hours ago
Job
Staff Product Designer - Performance Reviews
L
Lattice
Posted2 days ago
Updated7 hours ago
Job
Clinical Pharmacist
ST
Spectraforce Technologies Inc
Posted2 days ago
Updated7 hours ago
Job
Construction Project Manager
HF
Harrison Family Builders
Amarillo, TX
Posted2 days ago
Updated7 hours ago

Similar jobs in [Unknown City], CA

Job
Product Manager
H
Handshake
California
Posted2 days ago
Updated7 hours ago
Job
Clinical Extern (Nursing)
MH
Mackenzie Health
California
Posted2 days ago
Updated7 hours ago
Job
Teacher, Special Education, Home Hospital @ Special Education (ESY Summer School) - SSSACC-31 ***In District Only***
SC
Sacramento City Unified School District
California
Posted2 days ago
Updated7 hours ago
Job
Senior Operations Analyst - Operational Test & Evaluation
SC
Spectrum Comm Inc
California
Posted2 days ago
Updated7 hours ago
Job
Office Technician (Summer School) at Suy:u SSCLE-22 ***IN DISTRICT ONLY***
SC
Sacramento City Unified School District
California
Posted2 days ago
Updated7 hours ago

Similar jobs in California

Job
Coordinator, Literacy & Language
OC
Orange County Department of Education
Costa Mesa, CA
Posted1 day ago
Updated7 hours ago
Job
Freelance In Person Event Specialist
V
Visit.org
Los Angeles, CA
Posted2 days ago
Updated7 hours ago
Job
Barista Cashier
BN
Bird's Nest Cafe
Los Angeles, CA
Posted2 days ago
Updated7 hours ago
Job
Private Wealth Management, Wealth Management Associate (CFP preferred)
MS
Morgan Stanley
San Francisco, CA
Posted2 days ago
Updated7 hours ago
Job
Manager Nurse Practitioner Opportunity in Los Angeles County making upward to
OG
Optigy Group
Cerritos, CA
Posted2 days ago
Updated7 hours ago

Senior Site Reliability Engineer

Fieldguide

See Job Scorecard

Skill Insights

Job Description

Similar remote jobs

Freelance In Person Event Specialist

Entry Level | Customer Care Coordinator | Online

Staff Product Designer - Performance Reviews

Clinical Pharmacist

Construction Project Manager

Similar jobs in [Unknown City], CA

Product Manager

Clinical Extern (Nursing)

Teacher, Special Education, Home Hospital @ Special Education (ESY Summer School) - SSSACC-31 In District Only

Senior Operations Analyst - Operational Test & Evaluation

Office Technician (Summer School) at Suy:u SSCLE-22 IN DISTRICT ONLY

Similar jobs in California

Coordinator, Literacy & Language

Freelance In Person Event Specialist

Barista Cashier

Private Wealth Management, Wealth Management Associate (CFP preferred)

Manager Nurse Practitioner Opportunity in Los Angeles County making upward to