Skip to main content
Tallo logoTallo logo

Lead Senior Systems Engineer - PAE Fires IT Support Services

Job

Technology, Automation, and Management, Inc.

Huntsville, AL (In Person)

Full-Time

Posted 2 days ago (Updated 6 hours ago) • Actively hiring

Expires 6/23/2026

Apply for this opportunity

This job application is on an outside website. Be sure to review the job posting there to verify it's the same.

Review key factors to help you decide if the role fits your goals.
Pay Growth
?
out of 5
Not enough data
Not enough info to score pay or growth
Job Security
?
out of 5
Not enough data
Calculating job security score...
Total Score
44
out of 100
Average of individual scores

Were these scores useful?

Skill Insights

Compare your current skills to what this opportunity needs—we'll show you what you already have and what could strengthen your application.

Job Description

Lead Senior Systems Engineer - PAE Fires IT Support Services Technology, Automation, and Management, Inc. Huntsville, AL Job Details 9 hours ago Qualifications Improving data center operations Data center services Knowledge management Data center experience Production design Linux support Ansible Procedural guides Technical documentation Infrastructure as Code (IaC) Improving system uptime or availability Resource planning methods IT system monitoring High availability architecture System design Data recovery Configuration management Server backup and recovery Application deployment Team leader (DevOps) Scalable systems Windows Maintenance schedule management Server virtualization deployment Team development Compliance monitoring Maintenance task scheduling Mentoring Systems engineering Operating system security Full Job Description
PENDING CONTRACT AWARD
Mission Objectives - PAE Fires operates multiple separate networks consisting of ESXi Hosts, VMs/appliances, RedHat OpenShift, and VDI desktops requiring highly secure and available systems on a 24/7 basis. The Lead Senior Systems Engineer directs all server/virtualization/data center operations, backup and recovery, disaster recovery, and configuration management activities.
Position Responsibility Summary:
Own the availability and performance of all server, virtualization, and storage infrastructure that approximately 3,500 users and mission-critical applications depend on daily; when systems go down, lead the recovery and take accountability for restoring service Architect and evolve the VMware
VCF/SDDC
environment to meet growing demand; make capacity planning decisions that balance cost, performance, and future scalability without over-provisioning Drive the adoption and maturation of containerized workloads on Red Hat OpenShift; establish deployment standards, manage the container lifecycle, and ensure the platform remains stable as development teams push new applications into production Design and validate backup and recovery strategies that actually work under pressure; regularly test restores, validate RPO/RTO compliance, and ensure the team can reconstitute the full IT environment from the COOP site if required Maintain deep technical mastery across both Windows Server and Red Hat Enterprise Linux; troubleshoot complex cross-platform issues that intermediate administrators cannot resolve and serve as the final escalation point for the team Build and maintain infrastructure-as-code practices using Ansible and Chef to ensure consistent, repeatable, and auditable system configurations across hundreds of VMs and servers Think like a defender: harden all systems to
DISA STIG
standards not because a checklist says so, but because you understand the threat landscape and know that a misconfigured server is an open door Develop your team; grow intermediate engineers into senior-level performers through mentoring, knowledge sharing, and progressively challenging assignments; create internal documentation and runbooks that reduce single points of knowledge failure Coordinate closely with the Network and Cybersecurity leads to ensure infrastructure changes are planned holistically; a server migration impacts network routing, firewall rules, and security monitoring simultaneously Plan and execute complex maintenance windows with zero or minimal mission impact; communicate clearly to stakeholders about what is happening, why, and what the fallback plan is if something goes wrong

Similar jobs in Huntsville, AL

Similar jobs in Alabama