Software Engineer RL Environments — AfterQuery

Job

David Joseph & Company

San Francisco, CA (In Person)

$200,000 Salary, Full-Time

Posted 1 day ago (Updated 1 hour ago) • Actively hiring

Expires 6/20/2026

Job Description

Software Engineer - RL Environments, AfterQuery
David Joseph & Company
San Francisco, CA

Job Details
  • Full-time
  • $180,000 - $220,000 a year

Qualifications
  • Experimental design
  • Software development

Full Job Description

Location: San Francisco, CA (Onsite)
Compensation: $180,000 - $220,000 base | ~$500,000 total cash + equity

About AfterQuery

AfterQuery is an AI infrastructure company building training data and evaluation systems for frontier AI labs. They work directly with leading labs to improve model performance through datasets and experimentation. $30M raised at ~$300M valuation. Founding team from Jane Street, Citadel, Google, Goldman Sachs, and Stanford AI Lab.

About the Role

This is a high-impact engineering role focused on building the datasets, evaluation systems, and reward frameworks that directly influence how frontier AI models are trained. You will operate at the intersection of software engineering, data pipelines, and reinforcement learning environments. Your output will directly impact model capability, alignment, and performance across real-world domains.

What You'll Own
  • Design data slices that expose meaningful model failure modes across domains including finance, code, and enterprise workflows
  • Build and refine evaluation rubrics and reward signals for RLHF and RLVR pipelines
  • Run experiments to analyze model behavior and improve capabilities
  • Develop frameworks to measure dataset quality, diversity, and downstream impact
  • Build and manage real-world and synthetic data pipelines
  • Work directly with research teams at leading AI labs, translating training objectives into concrete data and evaluation systems

Requirements
  • 1-4 years of software engineering experience with strong technical depth
  • Strong interest in how data structure and quality influence model behavior
  • Ability to design experiments and extract insights from imperfect data
  • Experience building and shipping production systems
  • Comfort working across domains including finance, engineering, and policy

Nice to Have
  • Experience with RL environment companies, AI safety, or benchmarking organizations
  • Experience building data pipelines and working with ML infrastructure
  • Familiarity with RLHF or RLVR training pipelines
  • Startup or early engineer experience

This Role Is NOT For
  • Pure research profiles without engineering output
  • Those who prefer traditional product engineering work
  • Candidates unable to operate in ambiguous environments

Logistics
  • Role is fully onsite in San Francisco; please only apply if you can commit to this
  • Multiple headcount with active hiring demand
  • Shortlisted candidates will be contacted by David Joseph & Co., the recruiting partner managing this search on behalf of AfterQuery.
