Skip to main content
Tallo logoTallo logo

AI Architect

Job

TalentOla

Dallas, TX (In Person)

Full-Time

Posted 3 days ago (Updated 1 day ago) • Actively hiring

Expires 7/1/2026

Apply for this opportunity

This job application is on an outside website. Be sure to review the job posting there to verify it's the same.

Review key factors to help you decide if the role fits your goals.
Pay Growth
?
out of 5
Not enough data
Not enough info to score pay or growth
Job Security
?
out of 5
Not enough data
Calculating job security score...
Total Score
100
out of 100
Average of individual scores

Were these scores useful?

Skill Insights

Compare your current skills to what this opportunity needs—we'll show you what you already have and what could strengthen your application.

Job Description

AI Architect at TalentOla AI Architect at TalentOla in Dallas, Texas Posted in about 15 hours ago.
Type:
full-time
Job Description:
Strong experience in designing and implementing high-performance, large-scale distributed systems Proven experience in implementing and deploying AI/ML platforms at scale Expertise in building agent-based architectures, evaluation frameworks, and prompt/context engineering Knowledge of MCP (Model Context Protocol) servers Hands-on experience in LLM inference optimization, including batching and caching strategies Strong experience with Kubernetes and cloud infrastructure (AWS/Azure/GCP) Proficiency in at least one programming language (Python, Java, Go, etc.) Expertise in designing agent data stacks & retrieval systems, including: Vector databases Hybrid search Data freshness strategies Memory systems Graph reasoning BM25 and advanced retrieval techniques Key Responsibilities Architect and deliver scalable, high-performance distributed systems Design and deploy AI/ML and GenAI platforms at enterprise scale Build and manage agent-based architectures, including: Prompt and context engineering MCP servers Evaluation frameworks Optimize LLM inference pipelines for latency, throughput, and efficiency Design and implement agent data & retrieval systems (vector DBs, hybrid search, memory, graph-based reasoning) Lead Kubernetes-based, cloud-native deployments Provide technical leadership, architecture governance, and hands-on mentoring to engineering teams