Apply for this opportunity

This job application is on an outside website. Be sure to review the job posting there to verify it's the same.

Apply Offsite

GenAI / Machine Learning Engineer-68886

Job

PRIMUS Global Services Inc.

Sunrise, FL (In Person)

Full-Time

Posted 3 days ago (Updated 14 hours ago) • Actively hiring

Expires 7/11/2026

See Job Scorecard

Review key factors to help you decide if the role fits your goals.

How is this calculated?

Pay Growth

out of 5

Not enough data

Not enough info to score pay or growth

Job Security

out of 5

Not enough data

Calculating job security score...

Total Score

100

out of 100

Average of individual scores

Were these scores useful?

Skill Insights

Compare your current skills to what this opportunity needs—we'll show you what you already have and what could strengthen your application.

Job Description

=== POSTING == GenAI /

Machine Learning Engineer- Sunrise, FL Onsite Job Description:

We are seeking a highly skilled GenAI / Machine Learning Engineer to design, develop, and deploy AI-powered solutions leveraging Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), and modern machine learning frameworks. The ideal candidate will have hands-on experience building scalable AI applications, developing ML models, and integrating Generative AI solutions into enterprise environments.

Key Responsibilities:

Design, develop, validate, and deploy machine learning models using supervised and unsupervised learning techniques. Build and optimize Generative AI applications utilizing Large Language Models (LLMs) such as GPT and Ollama. Develop Retrieval-Augmented Generation (RAG) pipelines for enterprise AI solutions. Create and maintain AI workflows using LangChain and LangGraph frameworks. Implement prompt engineering strategies, chain-of-thought reasoning, and model optimization techniques. Configure and tune LLM parameters including Temperature, Top-K, and Context Length for optimal performance. Develop and integrate MCP tools using FastMCP. Build scalable backend APIs and AI services using FastAPI and Uvicorn. Implement observability, monitoring, and tracing solutions using LangSmith and LangFuse. Design and manage vector databases and embedding solutions using PGVector and Ollama Embeddings. Collaborate with cross-functional teams to deploy AI/ML solutions into production environments. Evaluate model performance and continuously improve accuracy, reliability, and scalability. Required Skills & Experience Strong experience in Machine Learning model development, validation, and deployment. Expertise in supervised and unsupervised learning algorithms, including: Regression Classification Clustering Experience with feature engineering and model evaluation techniques. Hands-on experience with Large Language Models (LLMs), including GPT and Ollama.

Strong understanding of:

Temperature Top-K Sampling Context Length Management Experience with LangChain and LangGraph. Expertise in RAG (Retrieval-Augmented Generation) development. Strong backend development experience with FastAPI and Uvicorn. Experience developing MCP tools using FastMCP. Proficiency in Prompt Engineering and Chain-of-Thought techniques. Experience with observability tools such as LangSmith and LangFuse. Experience with vector databases and embeddings, including PGVector and Ollama Embeddings. Strong problem-solving and analytical skills. Preferred Skills Experience deploying AI/ML applications in cloud environments. Knowledge of MLOps and model lifecycle management. Experience with Python-based AI/ML ecosystems. Familiarity with enterprise-scale AI application development and deployment. Top of Form Bottom of Form•ALL successful candidates for this position are required to work directly for PRIMUS. No agencies please only W2•For immediate consideration, please contact: