Lead Machine Learning Infrastructure Engineer – Infrastructure & Data Position Available In Montgomery, Pennsylvania
Tallo's Job Summary: This job listing in Montgomery - PA has been recently added. Tallo will add a summary here for this job shortly.
Job Description
Job Description:
Upwork ($UPWK) is the world’s work marketplace. We serveeveryone from one-person startups to large, Fortune 100 enterpriseswith a powerful, trust-driven platform that enables companies andtalent to work together in new ways that unlock their potential.
Last year, more than $3.8 billion of work was done through Upworkby skilled professionals who are gaining more control by findingwork they are passionate about and innovating their careers. TheMachine Learning Infrastructure & Data team is responsible forarchitecting and building the foundational systems and tools thatenable efficient development, deployment, and management of machinelearning models at scale. As a Lead Machine Learning InfrastructureEngineer, you will be pivotal in designing, developing, andmaintaining robust and scalable infrastructure components tosupport Upwork’s machine learning initiatives. You will workclosely with cross-functional teams—including machine learningresearchers, data scientists, and software engineers—to buildstate-of-the-art platforms and tools that accelerate thedevelopment and deployment of machine learning models.
Responsibilities :
- Design, implement, and optimize distributedsystems and infrastructure components to support large-scalemachine learning workflows, including data ingestion, featureengineering, model training, and serving.
- Develop and maintainframeworks, libraries, and tools that streamline the end-to-endmachine learning lifecycle, from data preparation andexperimentation to model deployment and monitoring.
- Architect andimplement highly available, fault-tolerant, and secure systems thatmeet the performance and scalability requirements of productionmachine learning workloads.
- Collaborate with machine learningresearchers and data scientists to understand their requirementsand translate them into scalable and efficient software solutions.
- Stay current with advancements in machine learninginfrastructure, distributed computing, and cloud technologies,integrating them into our platform to drive innovation.
- Mentorjunior engineers, conduct code reviews, and uphold engineering bestpractices to ensure the delivery of high-quality softwaresolutions. What it takes to catch our eye:
- Strong technicalexpertise in designing and building scalable ML infrastructure.
- Experience with distributed systems and cloud-based ML platforms.
- Proficiency in programming languages such as Python, Java, orScala.
- Deep understanding of ML workflows, including datapipelines, model training, and deployment.
- Passion for innovationand eagerness to implement the latest advancements in MLinfrastructure.
- Strong problem-solving skills and ability tooptimize complex systems for performance and reliability.
- Collaborative mindset with excellent communication skills to workacross teams.
- Ability to thrive in a fast-paced, dynamicenvironment with evolving technical challenges.