AI Lead Engineer Generative AI & LLM Applications Jobs in Plano,TX,US
Job
W3global
Plano, TX (In Person)
Full-Time
Review key factors to help you decide if the role fits your goals.
Pay Growth
?
out of 5
Not enough data
Not enough info to score pay or growth
Job Security
?
out of 5
Not enough data
Calculating job security score...
Total Score
100
out of 100
Average of individual scores
Skill Insights
Compare your current skills to what this opportunity needs—we'll show you what you already have and what could strengthen your application.
Job Description
Role:
AI Lead Engineer - GenerativeAI & LLM
Applications Location - Plano, TX (onsite)Full Time Experience Required:
8-15 years in AI/ML development, with 3+ years specialized in Generative AI and LLM applications. Role Overview The AI Lead Engineer will design, build, and operate production-grade Generative AI solutions for complex enterprise scenarios. The role focuses on scalable LLM-powered applications, robust RAG pipelines, and multi-agent systems with MCP deployed across major cloud AI platforms. Key Responsibilities Technical Leadership & Development Design and implement enterprise-grade GenAI solutions using LLMs (GPT, Claude, Llama and similar families). Build and optimize production-ready RAG pipelines including chunking, embeddings, retrieval tuning, query rewriting, and prompt optimization. Develop single- and multi-agent systems using LangChain, LangGraph, LlamaIndex and similar orchestration frameworks. Design agentic systems with robust tool calling, memory management, and reasoning patterns. Author MCP (Model Context Protocol) servers, tools, and resources, and integrate them with Cursor, Claude, Codex, Copilot, and internal enterprise systems. Build plugins and extensions for Claude, Codex, Cursor and GitHub Copilot ecosystems. Building AI Agents and Sub-Agents, Agent Skills for tools like Claude Code, Codex, and GitHub Copilot. Build scalable Python + FastAPI/Flask or MCP microservices for AI-powered applications, including integration with enterprise APIs. Implement model evaluation frameworks using RAGAS, DeepEval, or custom metrics aligned to business KPIs. Implement agent-based memory management using Mem0, LangMem or similar libraries. Fine-tune and evaluate LLMs for specific domains and business use cases. Deploy and manage AI solutions on Azure (Azure OpenAI, Azure AI Studio, Copilot Studio), AWS (Bedrock, SageMaker, Comprehend, Lex), and GCP (Vertex AI, Generative AI Studio). Implement observability, logging, and telemetry for AI systems to ensure traceability and performance monitoring. Ensure scalability, reliability, security, and cost-efficiency of production AI applications. Deep understanding of RAG architectures, hybrid retrieval, and context engineering patterns. Translate business requirements into robust technical designs, architectures, and implementation roadmaps. Drive innovation by evaluating new LLMs, orchestration frameworks, and cloud AI capabilities (including Copilot Studio for copilots and workflow automation).Required Skills & Experience Core Technical Programming:
Expert-level Python with production-quality code, testing, and performance tuning.GenAI Frameworks:
Strong hands-on experience with LangChain, LangGraph, LlamaIndex, agentic orchestration libraries.LLM Integration:
Practical experience integrating OpenAI, Anthropic Claude, Azure OpenAI, AWS Bedrock, and Vertex AI models via APIs/SDKs.RAG & Search:
Deep experience designing and operating RAG workflows (document ingestion, embeddings, retrieval optimization, query rewriting).Vector Databases:
Production experience with at least two of OpenSearch, Pinecone, Qdrant, Weaviate, pgvector, FAISS.Cloud & AI Services Azure:
Azure OpenAI, Azure AI Studio, Copilot Studio, Azure Cognitive Search.AWS:
Bedrock, SageMaker endpoints, AWS Nova, AWS Transform etc.GCP:
Vertex AI (models, endpoints), Agentspace, Agent Builder. Preferred Qualifications Master's degree in Computer Science, AI/ML, Data Science, or related field. Experience with multi-agent systems, Agent-to-Agent (A2A) communication, and MCP-based ecosystems. Familiarity with LLMOps / observability platforms such as LangSmith, Opik, Azure AI Foundry. Experience integrating graph databases and knowledge graphs to enhance retrieval and reasoning.Similar remote jobs
The Advocates for Human Rights
Minneapolis, MN
Posted12 hours ago
Updated1 hour ago
Goldberg Miller & Rubin
Albany, NY
Posted1 day ago
Updated1 hour ago
Similar jobs in Plano, TX
Weiss Services LLC
Plano, TX
Posted1 day ago
Updated1 hour ago
Similar jobs in Texas
SPECS FAMILY PARTNERS LTD
Austin, TX
Posted1 day ago
Updated1 hour ago
The University of Texas Rio Grande Valley
Alamo, TX
Posted1 day ago
Updated1 hour ago