
Reinforcement Learning Engineer ($400k - $800k salary)

Job

Baton Corporation

New York, NY (In Person)

Full-Time

Posted 3 days ago (Updated 23 hours ago) • Actively hiring

Expires 6/14/2026

Job Description

Who We Are

Baton Corporation is the development company that builds and operates the entire technology stack behind pump.fun, the largest memecoin launchpad in production today. The systems are low latency, high throughput, live under constant load, and break if you get them wrong.

What You'll Do

As our Reinforcement Learning Engineer, you will own a production trading system that directly deploys real capital. This is not a research role - it's about building learning systems that are robust, measurable, and safe under real-world constraints.

- Own and ship an RL-driven trading agent using real capital to increase trading volume and user participation in a memecoin ecosystem
- Design reward functions and policies aligned with product goals while enforcing strict downside risk constraints
- Build evaluation and validation frameworks (simulation, offline analysis) to minimize reliance on live sequential testing
- Safely transition an existing heuristic-based production system toward learning-based approaches
- Take end-to-end ownership and technical leadership as the sole RL expert, from data and modeling through deployment, monitoring, and safeguards

Who You Are

- You have previously put an autonomous learning system into production that directly controlled capital, pricing, traffic, or resources, and you can explain what broke and how you fixed it.
- You have personally designed and enforced hard risk limits (capital caps, loss bounds, circuit breakers) in a live system, not just talked about "risk-aware objectives."
- You have built a policy evaluation loop from scratch (simulators, replay, counterfactuals, shadow deployments) before trusting live rollout.
- You can make and defend uncomfortable tradeoffs (e.g. heuristic > RL, bandit > deep RL) based on empirical results instead of ideology.
- You have operated as the single owner of a complex ML system in a small team, with no safety net of research orgs, infra teams, or "ML platforms."

What it's like to work here

- We work in person
- Hours can be long and unconventional
- The pace is intense
- Expectations are high, and impact is immediate
- Working at Baton is not for everyone

Why Join Us?

- Unmatched ownership and autonomy
- Exposure to systems operating at the edge of crypto scale
- The ability to ship fast and see real-world impact immediately

If you're motivated by responsibility, speed, and building products used by massive audiences, you'll feel at home here.
