Data Engineer III
New Era Technology
Menlo Park, CA (In Person)
Full-Time
Job Description
Skills Required
Strong software engineering fundamentals: Python, data structures, and concurrency/async programming.
Advanced SQL and data pipeline expertise: complex queries, query optimization, and pipeline orchestration frameworks (Airflow, Dataswarm, or equivalent).
Experience integrating ML models into data pipelines: calling inference endpoints, managing model versions, batching requests, and handling inference failures at scale.
Proficiency with AI-assisted coding agents (e.g., Copilot, Cursor, Codex). You are expected to leverage AI tools as a force multiplier for writing, debugging, and reviewing code, building pipelines faster, and accelerating day-to-day engineering workflows.
Strong verbal and written communication skills, problem-solving ability, and cross-functional collaboration.

Preferred Skills
Working knowledge of embeddings and vector representations: generating, storing, indexing, and querying embeddings (FAISS, Milvus, or equivalent).
Familiarity with content-understanding models such as image classifiers, object detection, OCR, NSFW detection, and aesthetic scoring.
Experience with LLMs for data tasks, such as prompt engineering for annotation, data cleaning, or evaluation using LLM APIs.
Knowledge of generative AI: diffusion models, image generation, and evaluation metrics (FID, CLIP score, etc.).