Apply for this opportunity

This job application is on an outside website. Be sure to review the job posting there to verify it's the same.

Apply Offsite

Sr Full Stack Engineer Generative AI & Python

Job

InfoVision, Inc.

Irving, TX (In Person)

Full-Time

Posted 3 days ago (Updated 10 hours ago) • Actively hiring

Expires 7/4/2026

See Job Scorecard

Review key factors to help you decide if the role fits your goals.

How is this calculated?

Pay Growth

out of 5

Not enough data

Not enough info to score pay or growth

Job Security

out of 5

Not enough data

Calculating job security score...

Total Score

100

out of 100

Average of individual scores

Were these scores useful?

Skill Insights

Compare your current skills to what this opportunity needs—we'll show you what you already have and what could strengthen your application.

Job Description

Job title:

Full Stack Developer Location:

Irving, TX Duration:

Long-term

ROLE SUMMARY

We are seeking a Full Stack Developer - AI & Cloud to design, build, and deploy scalable enterprise applications at the intersection of Java/Python server-side development, AWS cloud services, and AI/LLM edge deployments.

KEY RESPONSIBILITIES

Design and develop robust server-side applications and RESTful microservices using Java (Spring Boot) and Python, ensuring scalability, security, and high availability across distributed systems. Architect and deploy cloud-native solutions on AWS leveraging services including Lambda, ECS, API Gateway, SageMaker, S3, and EventBridge. Fine-tune open-weight LLM models (e.g., LLaMA, Mistral, Phi) using frameworks such as Hugging Face PEFT and LoRA for domain-specific enterprise use cases. Deploy and manage AI/LLM inference runtimes on edge devices including laptops, on-premise servers, and network routers using tools such as Ollama, llama.cpp, or TensorRT-LLM. Build and maintain CI/CD pipelines for containerized microservices and edge AI model deployments using Docker, Kubernetes, and AWS DevOps tooling. Conduct code reviews, contribute to architectural decisions, and mentor junior engineers on AI-integrated full stack development practices.

REQUIRED QUALIFICATIONS

10+ years of full stack development experience with strong server-side proficiency in Java (Spring Boot) and Python. Telecom Industry experience is a must. Hands-on experience building and deploying microservices on AWS, including services such as Lambda, ECS, API Gateway, and SageMaker. Demonstrated experience fine-tuning LLM models using Hugging Face Transformers, PEFT, or LoRA. Proven ability to deploy and optimize LLM inference on edge devices (CPU/edge GPU) using runtimes such as Ollama, llama.cpp, or ExecuTorch. Proficiency with containerization and orchestration tools including Docker and Kubernetes. Strong understanding of RESTful API design, event-driven architectures, and distributed microservices patterns.