Job Description
Position#1
Job Title:
Senior AWS AI / Data Engineer Location:
Detroit, MI Hire Type:
Long-term contract Experience:
7+ years | Detroit, MI (mandatory) - Remote up to 50% travel | Agentic AI LLMs Python AWS Native Data Pipelines Structured + Unstructured Data ABOUT THE ROLE
As a Senior AWS AI/Data Engineer at DataFactZ you will architect and deliver enterprise-grade AI and data pipeline solutions for large-scale client engagements. You will lead the design of agentic AI systems, LLM-powered applications, and high-throughput data pipelines on AWS - translating complex business problems into production-ready solutions while mentoring junior engineers. KEY RESPONSIBILITIES
Design and build end-to-end data pipelines for ingesting, transforming, and serving structured (SQL, Redshift, Parquet) and unstructured (PDFs, emails, documents, images) data on AWS Architect agentic AI systems using LLMs with tool use, memory, and multi-step reasoning via Amazon Bedrock, OpenAI, or Anthropic Claude Build multi-agent orchestration workflows using LangChain, LlamaIndex, CrewAI, or AutoGen for enterprise automation Design RAG pipelines connecting structured and unstructured data sources to LLMs via vector databases (Pinecone, OpenSearch, pgvector) Lead AWS data architecture across S3, Glue, Lambda, EMR, Athena, Step Functions, and Redshift Develop prompt engineering strategies and fine-tuning approaches for domain-specific LLM customization Mentor junior engineers, lead code reviews, and drive engineering best practices Engage client stakeholders to scope AI/data use cases, define success metrics, and deliver on commitments REQUIRED SKILLS
Python:
Advanced proficiency for data engineering, pipeline orchestration, and AI integrations AWS services: Deep hands-on experience with S3, Glue, Lambda, EMR, Athena, Step Functions, Redshift, and Bedrock LLMs & Agentic AI:
Production experience building LLM-powered agents, tool-calling workflows, and multi-agent systems Data pipelines: Batch and real-time ETL/ELT for large-scale structured and unstructured datasets RAG & vector search: Building retrieval-augmented generation systems with embedding pipelines and semantic search System design: Architecting scalable, secure, cost-efficient cloud-native data and AI systems Leadership:
Proven ability to lead technical workstreams and communicate designs to senior stakeholders PREFERRED AWS
certifications : Solutions Architect, Data Analytics, or Machine Learning Specialty Document intelligence: AWS Textract or custom document parsing pipelines Multi-modal AI:
Experience with vision or document-aware models