ML Engineer - Inference [33157]

Job

Stealth Startup

Campbell, CA (In Person)

Full-Time

Posted 4 days ago (Updated 1 day ago) • Actively hiring

Expires 6/23/2026

Apply for this opportunity

This job application is on an outside website. Be sure to review the job posting there to verify it's the same.

See Job Scorecard

Review key factors to help you decide if the role fits your goals.

How is this calculated?

Pay Growth

out of 5

Not enough data

Not enough info to score pay or growth

Job Security

out of 5

Not enough data

Calculating job security score...

Total Score

100

out of 100

Average of individual scores

Were these scores useful?

Skill Insights

Compare your current skills to what this opportunity needs—we'll show you what you already have and what could strengthen your application.

Job Description

ML Engineer - Inference [33157] at Stealth Startup ML Engineer - Inference [33157] at Stealth Startup in Campbell, California Posted in 1 day ago.

Type:

full-time

Job Description:

The role: As our first ML Engineer specializing in inference and optimization, you'll bridge the gap between cutting-edge research models and production systems. Your expertise will transform PyTorch research code into highly optimized, low-latency inference solutions that power our user-facing applications. You'll work closely with our GenAI researchers, vision ML engineers, and backend team to deliver exceptional performance. What you'll do: Deploy and integrate researcher-trained model checkpoints into our cloud infrastructure and production pipelines. Conduct thorough performance profiling and benchmarking to identify and eliminate computational bottlenecks. Implement neural network optimization techniques including quantization, pruning, and architectural refinements while preserving model accuracy. Develop efficient training and fine-tuning strategies with optimal precision trade-offs and parallelism. Build and maintain scalable multi-GPU inference solutions with sophisticated model parallelism and serving architectures. Collaborate with the research team to ensure optimization integrate smoothly with model development workflows. You may be a strong fit if you: Have experience deploying and optimizing deep learning models for production environments, particularly with multi-GPU inference and large-scale model serving. Are well-versed in cutting-edge techniques for optimizing both inference and training workloads. Possess strong knowledge of efficient attention mechanisms and algorithms. Have hands-on experience implementing model quantization and working with inference frameworks. Can write production-quality code and successfully integrate ML models into robust inference pipelines. Are familiar with various cloud platforms, storage solutions, and modern training frameworks.

Logistics:

This role is based in San Jose, where we work in person. We believe the best ideas come from being in the same room. We sponsor visas. We are committed to working through the process together for the right candidates. If you're currently outside the US, we're also committed to helping you relocate to the US throughout this process. We offer generous health, dental, and vision coverage, unlimited PTO, paid parental leave, and relocation support as needed. Don't meet every single qualification? That's okay - we care more about your trajectory than checking every box. If the role excites you and the mission resonates, we'd love to hear from you.

Note:

In the event your application is successful and an offer of employment is made to you, any offer of employment will be conditional on the results of a background check, performed by a third party acting on our behalf.

Similar jobs in Campbell, CA

Job
Barback/Server
PC
Pruneyard Cinemas
Campbell, CA
Posted1 day ago
Updated4 hours ago
Job
Sales Representative
KH
Kalos Health
Campbell, CA
Posted2 days ago
Updated1 day ago
Job
Auto Body Technician
CH
Caliber Holdings LLC
Campbell, CA
Posted2 days ago
Updated4 hours ago
Job
Administrative/Operational Assistant
LG
Los Gatos Therapy Center
Campbell, CA
Posted2 days ago
Updated4 hours ago
Job
Senior Simulation Software Engineer
A
Apera
Campbell, CA
Posted3 days ago
Updated1 day ago

Similar jobs in California

Job
Construction Scheduler
BS
Buildezs Solutions
Los Angeles, CA
Posted1 day ago
Updated4 hours ago
Job
MCQUEEN Store Manager, South Coast Plaza
BV
BOTTEGA VENETA
Costa Mesa, CA
Posted1 day ago
Updated4 hours ago
Job
Accounts Payable/ Receivable Specialist
8T
804 Technology
Irvine, CA
Posted1 day ago
Updated4 hours ago
Job
Production Assistant and Field Coordinator
FS
Five Star Bath Solutions of Los Angeles
Monrovia, CA
Posted1 day ago
Updated4 hours ago
Job
Senior Scientist, Stem Cell Product Attribute Sciences (Allogeneic Cell Therapy)
G
Genentech
South San Francisco, CA
Posted1 day ago
Updated4 hours ago