
On-prem Cloud Engineer


MASE Insights

Charlotte, NC (In Person)

$93,600 Salary, Full-Time

Posted 3 days ago (Updated 10 hours ago) • Actively hiring

Expires 6/11/2026



Job Description

Job Details: Contract, from $45 an hour (posted 21 hours ago)

Qualifications
  • GPU programming
  • IT system monitoring
  • Model deployment
  • Red Hat OpenShift
  • Benchmarking
  • AI
  • Batch data processing
  • MLOps
  • Generative AI
  • System performance monitoring

Full Job Description

Job Duties
  • Build, configure, and operate on‑prem Kubernetes/OpenShift AI platforms for deploying and serving GenAI models and LLM inference workloads.
  • Design and optimize high‑performance inference stacks using vLLM, TensorRT‑LLM, Triton Inference Server, SGLang, and advanced techniques (continuous batching, speculative decoding, KV caching).
  • Manage GPU orchestration and capacity using Run:ai, MIG, CUDA/NCCL, and tensor parallelism to maximize utilization and throughput.
  • Deploy and operate Kubernetes ML serving frameworks (KServe, Helm, Operators) for scalable, reliable model serving.
  • Drive inference optimization and benchmarking, leveraging FP8, AWQ, GPTQ, and performance tools such as GuideLLM and Locust.
  • Implement observability and ML monitoring using Prometheus, Grafana, and Arize AI, ensuring SLA/SLO compliance for GenAI services.
  • Collaborate with ML and research teams to onboard new models, tune inference performance, and productionize.

Tech Skills Needed
  • vLLM
  • TensorRT‑LLM
  • Triton Inference Server
  • SGLang
  • Inference Optimization
  • Continuous Batching
  • Speculative Decoding
  • KV Cache / Prefix Caching
  • FP8 / AWQ / GPTQ
  • Tensor Parallelism
  • Kubernetes ML Serving
  • KServe
  • OpenShift AI
  • Helm / Operators
  • GPU Orchestration
  • Run:ai
  • Performance Benchmarking
  • CUDA / NCCL / MIG
  • Prometheus / Grafana
  • ML Observability
  • GuideLLM, Locust

Pay:
From $45.00 per hour
Work Location:
In person
