Forward Deployed Site Reliability Engineer (TS/SCI Required)
Twenty
Arlington, VA (In Person)
Full-Time
Skill Insights
Compare your current skills to what this opportunity needs—we'll show you what you already have and what could strengthen your application.
Job Description
Identify and eliminate toil:
build automation for repetitive operational tasks within the constraints of the secure environment. Conduct post-incident reviews, own root cause analysis, and drive durable fixes in partnership with the engineering team. Observability & Incident Response Own the observability posture for the on-site deployment — dashboards, alerting thresholds, and log pipelines using the LGTM stack (Grafana, Loki, Tempo, Mimir).Lead incident response on-site:
triage, containment, coordination with Arlington, and customer communication. Maintain and continuously improve runbooks for operational procedures and emergency response protocols. Serve as the on-call anchor for the customer environment, with clear escalation paths to the engineering team. Deployment & Infrastructure Operations Work with the customer deployment team to get Twenty's platform stood up and updated within the restricted environment. Manage containerized services (Docker, Docker Compose) across deployment lifecycle — configuration, updates, rollbacks. Apply and validate Terraform-based infrastructure changes within the enclave, in coordination with the DSO engineer who owns IaC policy and guardrails. Perform capacity planning and flag scaling requirements to the Arlington team before they become incidents. Customer Liaison & Engineering Feedback Serve as the primary technical interface between the government customer and Twenty's engineering team — translating operational requirements, constraints, and issues in both directions. Represent the operational environment accurately in engineering discussions: what the team in Arlington can't see, you make visible. Partner with the DevSecOps engineer on compliance, logging, and audit requirements specific to the customer environment. Provide technical guidance and support to customer stakeholders on system behavior and troubleshooting procedures. Must Have 5+ years of professional experience in site reliability engineering, production operations, or a closely related infrastructure role. Proven experience defining and tracking SLIs, SLOs, and error budgets in a production environment. Hands-on experience with Docker, Docker Compose, and AWS (EC2, ECS, RDS, VPCs, security groups) in production deployments. Solid Linux/Unix systems administration skills; productive in constrained environments where GUI tooling may be limited or unavailable. Experience with Terraform for infrastructure provisioning and configuration, working within DSO-provided policy guardrails. Experience with the LGTM observability stack or equivalent (Grafana, Loki, Prometheus/Mimir, distributed tracing).Strong incident response experience:
you've led responses, written post-mortems and runbooks, and shipped the preventive fix. Scripting proficiency in Python or Bash for operational automation, with familiarity in Go a plus; experience with PagerDuty or equivalent on-call tooling. Experience working in or directly supporting government or defense environments, including air-gapped or enclave deployments. Nice To Have Experience with NATS or similar pub/sub messaging systems in production. Background in cyber operations, intelligence systems, or signals environments. AWS certifications (Solutions Architect, SysOps, or DevOps Engineer). Security Requirements Must possess and be able to maintain a TS/SCI security clearance with appropriate polygraph U.S. citizenship required Willingness to travel occasionally for customer engagements and operational support If this role sounds like you, apply and share with us your interest. Some positions may require eligibility to obtain a U.S. Government security clearance. Any clearance requirement will be listed in the role description. Twenty is an equal opportunity employer. We consider all qualified applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, veteran status, disability, or any other protected status. If you need a reasonable accommodation during the hiring process, let us know and we will work with you. Apply now See more open positions at TwentySimilar remote jobs
LifeStance Health
New Hyde Park, NY
Posted2 days ago
Updated5 hours ago
Albemarle County Public Schools
Charlottesville, VA
Posted2 days ago
Updated5 hours ago
Intermountain Health
Frankfort, KY
Posted2 days ago
Updated5 hours ago
Similar jobs in Arlington, VA
UHS Physician Careers
Arlington, VA
Posted2 days ago
Updated5 hours ago
Amazon
Arlington, VA
Posted2 days ago
Updated5 hours ago
Guidehouse
Arlington, VA
Posted2 days ago
Updated5 hours ago
Amazon
Arlington, VA
Posted2 days ago
Updated5 hours ago
Hancock County ESC
Arlington, VA
Posted2 days ago
Updated5 hours ago
Similar jobs in Virginia
Rappahannock Community College
Virginia
Posted2 days ago
Updated9 hours ago
The Coca-Cola Company
Dinwiddie, VA
Posted2 days ago
Updated9 hours ago
Albemarle County Public Schools
Charlottesville, VA
Posted2 days ago
Updated5 hours ago