Apply for this opportunity

This job application is on an outside website. Be sure to review the job posting there to verify it's the same.

Apply Offsite

Sr. Software Engineer (AI, ML, Python, LLM, Langchain) - Locals Only - Job#3619664

Job

Pave Talent

Belmont, CA (In Person)

$206,960 Salary, Full-Time

Posted 5 days ago (Updated 2 days ago) • Actively hiring

Expires 7/22/2026

See Job Scorecard

Review key factors to help you decide if the role fits your goals.

How is this calculated?

Pay Growth

out of 5

Not enough data

Not enough info to score pay or growth

Job Security

out of 5

Not enough data

Calculating job security score...

Total Score

100

out of 100

Average of individual scores

Were these scores useful?

Skill Insights

Compare your current skills to what this opportunity needs—we'll show you what you already have and what could strengthen your application.

Job Description

Sr. Software Engineer (AI, ML, Python, LLM, Langchain)

Locals Only
Job#3619664 at Pave Talent Sr. Software Engineer (AI, ML, Python, LLM, Langchain)
Locals Only
Job#3619664 at Pave Talent in Belmont, California Posted in about 14 hours ago.

Type:

full-time

Job Description:

????????????'???????? ???????????????????? ???????? ???????????????????????????? ???????????????? ???????????????? ???????? ????????????????????????????????????????. ???????????? ???????????????????? ???????????? ???????????????? ???????????????????? ???????????? ????????????????????. Pave Talent is recruiting on behalf of a commercial-stage autonomous mobility company making the leap from R D into live service. The AI systems you build here won't be demos or internal tools. They'll power a fleet in the real world, interacting with real customers, in real time. This is a contract role paying $97 to $102 per hour, on-site five days a week at Foster City CA through June 2026, with strong conversion potential as the company scales. ???????????? ???????????????????????????? ????????????????

You've shipped AI agents or retrieval-augmented generation (RAG) systems into production and have the scars to prove it
You think in systems: embeddings, vector stores, prompt chains, model evaluation, latency trade-offs
You're energized by ambiguity. This company is scaling fast and the roadmap evolves quickly
You want to be in the room where architecture decisions get made, not handed a spec to implement
You're comfortable owning reliability and performance, not just handing off to DevOps This role is NOT a good fit if you prefer a fully defined backlog, a slow enterprise release cycle, or want to stay in research without shipping.

???????????? ???????????????????????????????????????????? You'll join a cross-functional engineering team at the exact moment the company transitions from building to operating. The AI work here spans the full stack of what modern applied AI looks like: autonomous decision-making agents, conversational interfaces for customer service, RAG pipelines that pull from live operational data, and large language model (LLM) deployments balanced for cost, latency, and accuracy in a real fleet environment. ???????????????? ????????????'???????? ????????????????????

Design and deploy AI agents and autonomous systems capable of multi-step task execution and real-time decision-making
Build conversational AI solutions including chatbots and voice-based customer service systems integrated into fleet operations
Develop and optimize RAG systems: vector databases, embedding strategies, retrieval pipelines, and prompt engineering
Implement and tune machine learning models using PyTorch, balancing accuracy, latency, and cost for production use
Evaluate and deploy LLMs from providers including OpenAI, Anthropic, Google, and Meta based on application-specific requirements
Build AI-powered integrations across services using REST APIs, gRPC, or event-driven architectures (Kafka)
Architect production-grade AI systems with reliability, observability, and scalability built in from the start
Collaborate with product, data, and platform teams to translate commercial requirements into technical solutions ???????????????????????????????????????????????????????? ????????????????????????????????:
6+ years of Python development with a focus on AI or machine learning applications
Hands-on production experience with

PyTorch:

model training, fine-tuning, or deployment

Direct experience building or operating RAG systems: vector databases, embeddings, retrieval strategies, and prompt engineering
Familiarity with AI agent frameworks (LangChain, LlamaIndex, AutoGen, or similar)
Working knowledge of transformer architecture and attention mechanisms
Experience integrating AI capabilities into applications via REST APIs, gRPC, or Kafka
Ability to work on-site in San Diego five days per week ???????????????????? ????????????????????????:
Proficiency in Kotlin and full-stack development experience
Cloud deployment experience on AWS, GCP, or Azure, especially microservices
Containerization and orchestration with Docker and Kubernetes
CI/CD pipeline experience and DevOps practices in an AI systems context
Prior experience in autonomous vehicles, robotics, or mobility tech ?

??????????????????????????????????????????????? ???????????? ???????????????????????????????? ???????????????????????? ????????????????: $97 to $102 per hour ????????????????????????????????: Monday through Friday, standard business hours ????????????????????????????????????: On-site, Foster City CA ???????????????????????????????????????? ????????????????????????: Contract through June 30, 2026 ????????????????????????????????????????: Not guaranteed per this listing, but the company is in active growth mode Apply via and we'll reach out to schedule a conversation. Confidential search; your application is fully private. ???????????????? ???????????????????????? | ???????????????????????? ????????????????????????????????????????