Technical Program Manager, AI Network Infra
Summary:
This position will play a critical role in driving end-to-end AI product introductions and AI operations initiatives supporting Meta's growing AI/HPC infrastructure for our Family of Apps . They will be responsible for overseeing the entire program lifecycle, from concept to planning to execution to monitoring, ensuring successful delivery and implementation. This includes collaborating with cross-functional Engineering teams to define scope, goals, and timelines, as well as leading the cross functional teams in delivering the business outcomes. They will help solve some of the most challenging networking problems in the industry, drive innovative, creative and ground-breaking solutions and technologies. As such, they need to understand the problem space and domain in depth, create roadmaps, prioritize based on impact and drive product development from concept to production. They will operate in a multi-organization landscape.
Required Skills:
Technical Program Manager, AI Network Infra Responsibilities:
Lead technical program management of next-generation Artificial Intelligence/Machine Learning (AI/ML) platform(s) for Meta's Network Infrastructure in a matrix organization covering a range of areas (Data Center, Network, Hardware Systems, Infrastructure Engineering, Software Engineering, Capacity Management) and across multiple physical locations
Collaborate with Engineering and business owners to define program requirements, set priorities, and establish scope which includes defining the roadmap and long-term strategy of the teams that you are partnering with
Manage cross functional dependencies, risks, and changes effectively by optimizing scope, schedule, and resources accordingly
Develop and own communication plans to effectively and proactively communicate program status, issues, and risks to stakeholders
Partner with cross functional teams to drive technical analysis, design, development, testing, implementation, and post implementation phases
Define and track key metrics and key quality and performance indicators and drive cross functional execution of program deliverables
Proactively identify and analyze complex, long-term, critical infrastructure problems with engineering leaders and stakeholders
Drive internal and external process improvements across multiple teams and functions including reducing the manual efforts through automation
Build aligned program teams to efficiently deliver on shared goals
The ideal candidate will have experience in AI/HPC product development and operations, demonstrated experience in the Network communications stack for AI solutions, fundamental knowledge of the hardware components , proven track record of communication and leadership and program management
Minimum Qualifications:
Minimum Qualifications:
B.S. in Computer Science or a related technical discipline, or equivalent experience
12+ years of software engineering, systems engineering, hardware engineering, or technical product/program management experience
8+ years experience in delivering Network solutions/Programs for Data Center applications
Experience delivering tech programs or products from inception to delivery
Experience operating autonomously across multiple teams, demonstrated critical thinking, and thought leadership
Communication experience and experience working with technical management teams to develop systems, solutions, and products
Analytical and problem-solving experience with large-scale systems
Experience establishing work relationships across multi-disciplinary teams and multiple partners in different time zones
Understanding of the Network communication stack, Network Hardware (NICs, Optics & Switches)
Experience Developing & Delivering AI Cluster Solutions for training & inference use cases
Preferred Qualifications:
Preferred Qualifications:
Experience working with ODMs and silicon vendors.
Experience in Network protocols (RoCE, InfiniBand, Ethernet).
Experience working with large scale distributed systems.
Experience with data center architecture & Deployment.
Experience with AI training and inference model deployments to physical infrastructure.
Public Compensation:
$168,000/year to $234,000/year + bonus + equity + benefits
Industry: Internet
Equal Opportunity:
Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. Meta participates in the E-Verify program in certain locations, as required by law. Please note that Meta may leverage artificial intelligence and machine learning technologies in connection with applications for employment.
Meta is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance or accommodations due to a disability, please let us know at [email protected].
Recommended Jobs
Medical Assistant
We are a busy and growing dermatology practice, offering medical, cosmetic and surgical dermatology services, looking for a motivated and positive Medical Assistant and Front Office Receptionist to…
Home Health Speech-Language Pathologist (SLP)
Home Health Visits with Growth Potential Job Summary: We are looking for a compassionate Speech-Language Pathologist (SLP) to provide home health and outpatient therapy services. This role offe…
Board-Certified Behavior Analyst
Qualifications : Required : Bachelor's degree or higher Applied Behavior Analysis (ABA) (1+ years) Valid Board-Certified Behavior Analyst (BCBA) in the state of Michigan (MI) Overvi…
Materials Coordinator
: Do you want to work at an organization that is people focused, service minded and results oriented, that offers their customers creative problem solving, progressive solutions, and improved outcome…
Pilates Teacher Trainee - Club Pilates Okemos
Pay: From $30.00 per hour Club Pilates - Pilates Teacher Trainee Club Pilates Okemos! Are you ready to turn your passion for movement into a fulfilling and rewarding career? Club Pilates invite…
Assistant Director, Impact, Research, & Evaluation
ASSISTANT DIRECTOR, IMPACT, RESEARCH, & EVALUATION Position Description: The Assistant Director of Impact, Research & Evaluation (IRE) plays a central role in helping Commonpoint use data, learni…
Territory Manager
This sales position will provide various types of industrial hardware directly to customers within a defined geographic territory through cold-calling and prospecting activities. Must reside within …
Civil Engineer/Project Manager
Company Boss Engineering Company, a leading civil engineering / surveying consulting firm headquartered in Howell, MI, seeks motivated, team oriented, professionals for our corporate office. Job Des…