Multimodal LLM Researcher

Job

DEEPREC.AI

Remote

$350,000 Salary, Full-Time

Posted 3 days ago (Updated 15 hours ago) • Actively hiring

Expires 7/8/2026

Job Description

Multimodal LLM Researcher
$300,000 - $400,000
Remote, Palo Alto
Full-time / Permanent

DeepRec has partnered with a high-growth generative AI company (Series B, $130M raised).

They're building multimodal, multi-agent systems that combine language, vision, audio, and video. If you've been looking for a role where your research reaches production and shapes how millions interact with creative AI, this is worth a closer look.

You'll help define the next generation of multimodal AI systems. Your work will span research, experimentation, and deployment, with a focus on real-time performance, multimodal reasoning, and agent-based workflows. You'll have the freedom to explore ambitious ideas while working alongside engineers who can bring them into production.

What You'll Do

Lead research across LLMs, VLMs, and Audio Language Models
Design novel multimodal model architectures and training approaches
Improve real-time inference across text, image, audio, and video
Train and fine-tune autoregressive and diffusion models
Build and curate high-quality multimodal datasets
Collaborate with engineering teams to deploy research outcomes
Publish findings at leading AI conferences and journals

What You'll Bring

Essential
Strong research track record in multimodal AI or foundation models
First-author publications at recognised ML, vision, or audio conferences
Deep expertise in LLMs, VLMs, Audio LMs, or related fields
Strong Python and deep learning experience using modern frameworks

Desirable
Experience with diffusion models or world models
Background in real-time AI systems and model serving
Experience building large-scale multimodal datasets

We encourage you to apply even if you don't meet every requirement. The right mindset matters as much as the right CV.

What's In It For You

USD 300,000-400,000 salary
Fully remote working arrangement
Ownership of research that shapes production systems
Opportunity to publish and contribute to the field
Direct collaboration with product and engineering leadership

This role offers the chance to work on multimodal AI problems that sit at the intersection of research and real-world deployment.

If you're excited by advancing the field while seeing your work reach users, we'd love to hear from you.
