Principal Software Engineer-SRE

Job

PTC

Full-Time

Posted 03/10/2026 (Updated 8 weeks ago) • Actively hiring

Expires 5/27/2026

Job Description

Own Reliability at Scale
• Lead design, implementation, and evolution of reliability, availability, and resiliency strategies for large‑scale distributed systems written primarily in Java
• Apply deep experience operating complex, distributed systems to guide architectural decisions, reliability strategies, and long‑term system evolution
• Identify systemic risks in application architecture, data flows, and infrastructure, and drive architectural improvements that measurably improve availability, performance, and scalability
• Set and evolve reliability standards, best practices, and operational principles across R&D
• Apply advanced software engineering practices to eliminate manual work, reduce operational load, and improve system observability
• Design and build internal platforms, automation, and tooling that support Java‑based services and their operational needs
• Contribute to longer‑term reliability and infrastructure strategy aligned with business growth

• US citizens or permanent residents only, due to ITAR requirements
• Ability to work East Coast (EST) hours and be available for an on-call rotation once every 10 weeks

• 10+ years of experience in software engineering, site reliability engineering, or systems engineering roles
• Extremely strong proficiency with the Java programming language and its ecosystem, including building, debugging, and operating production Java services
• Deep experience operating complex, distributed systems in production environments
• Strong software engineering background, with a track record of delivering high‑quality, maintainable code
• Ability to reason about failure modes across application, data, and infrastructure layers
• Demonstrated ability to lead complex initiatives that span teams and organizational boundaries
• Comfortable making high‑impact technical decisions in ambiguous environments
• Strong communicator who can influence design and operational decisions across a wide range of stakeholders
• Experience operating or supporting systems using technologies such as MongoDB, ZooKeeper, and RabbitMQ
• Background in performance tuning and scalability optimization of Java services
• Experience setting or influencing engineering standards at the organization level
• Prior involvement in evolving SRE or platform practices in a growing engineering organization
• Experience designing, operating, or scaling systems in cloud environments such as AWS (preferred), including familiarity with core services, networking models, and reliability features