Location:
Anywhere in Country At EY, we're all in to shape your future with confidence. We'll help you succeed in a globally connected powerhouse of diverse teams and take your career wherever you want it to go. Join EY and help to build a better working world. Technology
- Data and Decision Science
- AI Native Engineering Manager, AI/ML Engineer
- Memory Layer & Knowledge Graph Master's preferred (CS or related technical field)
- Hybrid
- AI Native Engineering The opportunity Our Artificial Intelligence and Data team helps apply cutting-edge technology and techniques to bring solutions to our clients.
As part of that, you'll sit side-by-side with clients and diverse teams from EY to create a well-rounded approach to advising and solving challenging problems, some of which have not been solved before. No two days will be the same, and with constant research and development, you'll find yourself building knowledge that can be applied across a wide range of projects now, and in the future. You'll need to have a passion for continuous learning, stay ahead of the trends, and influence new ways of working so you can position solutions in the most relevant and innovative way for our clients. You can expect heavy client interaction in a fast-paced environment and the opportunity to develop your own career path for your unique skills and ambitions. EY is investing significantly in our agentic AI platform, and the memory layer is the engineering foundation of that platform. Without memory, agents are amnesiacs they cannot accumulate context, learn from prior runs, or ground decisions in enterprise knowledge. We are hiring a Manager, AI/ML Engineer to build the memory layer end-to-end: the graph databases, ontologies, vector stores, retrieval services, and grounding pipelines that power our cognitive harness and the agents that run on top of it. This is a hands-on engineering leadership role. You will architect and build the infrastructure
- knowledge graphs, vector indices, hybrid retrieval, ontology services, memory APIs, and the integration glue that ties memory into the harness, agent runtimes, and downstream client solutions.
You will work closely with our data-science leadership on memory representation and retrieval quality, and you will own the engineering delivery and production-readiness of the memory layer itself. Your key responsibilities As a Manager in AI Native Engineering, you will play a pivotal role in delivering the memory infrastructure that underpins EY's agentic AI offerings. You will work with a wide variety of clients to deliver the latest data science and big data technologies. Your teams will design and build scalable solutions that unify, enrich, and derive insights from varied data sources across a broad technology landscape. You will help our clients navigate the complex world of modern data science, analytics, and software engineering. We'll look to you to provide technical guidance and perform technical development tasks to ensure data science solutions are properly engineered and maintained to support the ongoing business needs of our clients
- and to build the memory layer that makes those solutions possible.
Architect and build the memory layer of EY's cognitive harness end-to-end graph databases, vector stores, hybrid retrieval services, ontology services, memory APIs, and the integration into agent runtimes. Design and implement knowledge graphs and ontologies that support agent grounding vocabulary, schema rules, instances, provenance, and cross-domain mappings (sector and functional ontologies for Finance, Risk, Tax, Supply Chain, HR). Build and operate graph database infrastructure (Neo4j, Spanner Graph, Neptune, TigerGraph, or similar) schema design, ingestion, query optimization, and integration with the broader data platform. Engineer the vector and hybrid retrieval stack embeddings pipelines, vector indices (Vertex AI Vector Search, OpenSearch, Pinecone, Weaviate, pgvector), reranking, and lexical-plus-dense retrieval services. Build memory services for working, episodic, semantic, and procedural memory
- including TTL, retention, consolidation, forgetting, and audit-grade provenance for regulated workloads (SOX, HIPAA, GDPR).
Implement grounding pipelines that connect agent runtimes to the memory layer with low latency, citation tracking, and hallucination guardrails. Lead a team of AI/ML engineers and data engineers set technical standards, run code and design reviews, and mentor on production engineering rigor. Partner with data-science leadership on memory representation, retrieval quality, and evaluation, translating science prototypes into hardened production services. Stay abreast of AI and data trends new graph paradigms, embedding models, retrieval techniques, agent frameworks (Google ADK, Bedrock AgentCore, LangGraph) and recommend tools and patterns that fit our clients' existing ecosystems. Apply combined business and technical knowledge to develop and execute target memory architectures that enable implementation, monitoring, and ongoing evolution of agentic AI at scale. Skills and attributes for success This role will work to deliver tech at speed, innovate at scale, and put humans at the center. You will provide technical guidance and share knowledge with team members with diverse skills and backgrounds. You will consistently deliver quality client services, focusing on more complex, judgmental, and specialized issues surrounding agentic AI, memory infrastructure, and emerging foundation-model technology. You will demonstrate deep technical capabilities and lead through building. Strong AI/ML engineer who builds comfortable owning a memory service from schema to API to deploy, and writing the code that gets it there. Deep, hands-on knowledge of graph databases (Neo4j, Spanner Graph, Neptune, TigerGraph, or Stardog)
- schema design, query languages (Cypher, GQL, SPARQL, Gremlin), and operating graph infrastructure in production.
Strong grasp of ontologies and knowledge representation vocabulary, schema rules, instances, axioms, provenance and how ontologies support grounding and reasoning for AI agents. Solid working knowledge of cognitive harness / agent-runtime architectures memory, tools, policies, evaluation, observability and how memory infrastructure plugs into them. Hands-on experience with vector databases and hybrid retrieval
- embeddings, ANN indexes, reranking, query rewriting, and semantic caching.
Strong software engineering fundamentals Python (and ideally one of Java / Go / TypeScript), API design, testing, CI/CD, containerization, and observability. Experience designing and operating production AI systems on at least one major cloud (GCP, AWS, Azure, Databricks) including IAM, network controls, encryption, and responsible-AI guardrails. Track record of leading engineering teams setting technical direction, mentoring, and delivering production systems on time. Excellent communication skills able to explain memory and graph concepts to clients, and able to defend engineering decisions to architects, data scientists, and partners. To qualify for the role you must have Master's preferred in Computer Science, Software Engineering, Data Engineering, or a closely related technical field. Bachelor's with strong applied experience also considered. 6+ years of applied AI/ML or data engineering experience, with at least 2 years leading engineering teams or major workstreams. Demonstrable production experience with graph databases (Neo4j, Spanner Graph, Neptune, TigerGraph, Stardog, or similar), including schema design and query tuning. Hands-on experience designing and implementing ontologies and knowledge graphs for real systems
Production experience building retrieval / RAG / memory systems vector indices, hybrid retrieval, embedding pipelines, reranking,