Machine Learning Engineer — LLM Evaluation & Automation
Job
TEKsystems
Remote
$135,200 Salary, Full-Time
Review key factors to help you decide if the role fits your goals.
Pay Growth
?
out of 5
Not enough data
Not enough info to score pay or growth
Job Security
?
out of 5
Not enough data
Calculating job security score...
Total Score
100
out of 100
Average of individual scores
Skill Insights
Compare your current skills to what this opportunity needs—we'll show you what you already have and what could strengthen your application.
Job Description
Overview:
We are seeking a Machine Learning Engineer to join a high-impact team focused on advancing LLM evaluation, NLP, and AI-driven automation. This role centers on designing scalable evaluation frameworks, optimizing prompt strategies, and building systems that ensure high-quality, consistent model outputs across product domains. You will partner closely with product, engineering, and research teams to drive measurable improvements in AI performance. This is a hands-on role with a strong emphasis on LLM evaluation systems, prompt engineering, and data-driven model optimization.Job Details:
Location:
Culver City, CA (Hybrid with 3 days a week onsite)Pay Rate:
$60-70 hr/w2Job Type:
Contract Contract Length:
6 monthsExperience Level:
Mid-level toSenior Key Responsibilities:
Design and build LLM-based evaluation frameworks, including automated scoring pipelines and rubric-based grading systems Build and maintain data pipelines for evaluation datasets using Python, SQL, and scalable processing tools Translate complex evaluation results into clear, actionable insights for technical and non-technical stakeholders Implement automation workflows and agentic evaluation systems to improve efficiency and reduce manual efforts Develop prompt engineering strategies to evaluate output quality, accuracy, and consistency Create and maintain metrics, KPIs, and dashboards to track and communicate model performance Conduct error analysis, root-cause investigations, and quality deep dives to guide model improvements Partner cross-functionally to define evaluation methodologies and integrate them into production workflowsMust-Have Qualifications:
5+ years of experience in ML engineering, NLP, or AI/ML automation Strong programming skills in Python and SQL Deep understanding of machine learning concepts with a focus on NLP and advanced LLM capabilities (e.g., Chain-of-Thought, agentic workflows) Experience working with large-scale datasets and data pipelines Strong experience with LLM evaluation, prompt engineering, or auto grading systems Experience developing metrics and KPIs to measure model output quality and consistencyNice-to-Have:
Experience with LLM-as-judge systems or human + model evaluation frameworks Background in inter-rater reliability, evaluation calibration, or judged systems design Experience with PySpark or distributed data processing tools Exposure to building dashboards or visualization tools for model performance trackingTechnical Skills Python, SQL, NLP, LLM Evaluation, Prompt Engineering, Machine Learning, Data Pipelines, Automation Systems NOTE:
This posting is for an existing vacancy. Job Type & Location This is a Contract position based out of Culver City, CA. Pay and Benefits The pay range for this position is $60.00 - $70.00/hr. Eligibility requirements apply to some benefits and may depend on your job classification and length of employment. Benefits are subject to change and may be subject to specific elections, plan, or program terms. If eligible, the benefits available for this temporary role may include the following:- Medical, dental & vision
- Critical Illness, Accident, and Hospital
- 401(k) Retirement Plan - Pre-tax and Roth post-tax contributions available
- Life Insurance (Voluntary Life & AD&D for the employee and dependents)
- Short and long-term disability
- Health Spending Account (HSA)
- Transportation benefits
- Employee Assistance Program
- Time Off/Leave (PTO, Vacation or Sick Leave) Workplace Type This is a fully remote position.
San Francisco Fair Chance Ordinance:
Pursuant to the San Francisco Fair Chance Ordinance, for all positions located in the city and county of San Francisco, we will consider for employment qualified applicants with arrest and conviction records.Massachusetts Lie Detector:
It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability. Use of Artificial Intelligence (AI): We may use Artificial Intelligence (AI) to support parts of our hiring process, including sourcing, screening, and evaluating candidates. AI helps assess applications and qualifications, but final decisions are made by our hiring team. By applying, you acknowledge and agree that your application may be reviewed using AI tools.Similar jobs in Culver City, CA
Amazon.com, Inc.
Culver City, CA
Posted1 day ago
Updated8 hours ago
Similar jobs in California
Stanford Health Care
Palo Alto, CA
Posted1 day ago
Updated8 hours ago
Na Ali'i Consulting & Sales, LLC.
San Diego, CA
Posted1 day ago
Updated8 hours ago