Infrastructure engineer
The Site Operations team is responsible for the delivery of data center compute and storage at Meta, enabling our family of apps and services to support a growing global community. We are seeking a forward-thinking individual skilled across multiple disciplines to lead global initiatives on this team. The mission of this role is to identify and tackle the biggest technical and operational challenges and opportunities before SiteOps. The Infrastructure Engineer is expected to personally advance our highest impact initiatives, and to work with others to closure through the right working groups and delegates. The scope of the role is Infra-wide; the DC Infra Engineer is expected to work with the data center teams, Core Systems, CEA, PE, and hardware engineering to architect and implement adaptable solutions that transform our infrastructure in dimensions including performance, efficiency, quality, and resiliency. Areas of emphasis include next gen platforms, tools, and technologies; the interplay between our platforms and data centers; and the underlying architecture of our infrastructure including physical vs logical layer trade-offs.
Global Infrastructure Engineer Responsibilities:- Represent Site Operations in leading work to define and architect new solutions on global initiatives, working with stakeholders across Infra Data Centers & Infrastructure teams
- Assemble and lead teams to address complex engineering challenges, requiring technical expertise as well as a broad understanding of Meta’s overall infrastructure
- Address issues that can be ambiguous and global in nature, requiring leadership and collaboration across time zones, teams, and technical domains
- Act as key SME and mentor in the design, operation, and troubleshooting of tools, technologies, and processes utilized within Site Operations
- Understand and assess risks and challenges associated with emerging new hardware, data center and software technologies, and define & implement effective mitigations for these
- Employ a holistic understanding of the full infrastructure stack to lead solutions that appropriately balance physical and logical layer
- Act as a global communication and advisory point of contact for the design, implementation and delivery of projects that affect our global data center and server fleet and facilitate resolution of issues drawing on local expertise and global support partners
- Leverage data-driven methodologies to understand a problem at the onset, define a plan, and measure progress throughout a project
- Provide data supplied narratives and ensure a focus on continuous improvement
- Build and support, trusted, cross-functional connections with teams across the globe and serve as an advocate for the Site Operations Team with key stakeholders, influencing policies and procedures to improve global data center operations
- Approximately 20% - 30% travel
- Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
- Knowledge of the full stack of infrastructure, with experience building or operating logical infrastructure on top of a complex, distributed physical infrastructure
- Proven communication skills and experience working in a highly distributed environment, across teams/department boundaries
- 10+ years of technical experience, in a large-scale data center or IT Infrastructure environment, or equivalent experience building platforms and systems for large scale compute
- Experience building globally scalable solutions and translating global strategic initiatives into local executable projects
- Knowledge of the interdependencies of data center functions and technologies including electrical, cooling, structured cabling, security, network, server and storage systems
- Experience building, operating, and scaling with Linux or Unix Operating systems
- Experience communicating the results of analysis and insights to cross functional teams and influencing the strategy of these teams
- Experience with Data Center Design and Expansion
- Extensive knowledge of storage and AI/ML related services and the hardware that supports them
- Coding or scripting experience such as Bash, PHP, Python, SQL, or Perl
- Experience in providing technical guidance to external vendors and partners. Knowledge and experience with virtualization, containerization, distributed systems, fault tolerance, and incident management
- Experience with high level data center design, operations, basic electrical/mechanical infrastructure, and scaling physical infrastructure
Recommended Jobs
Research Scientist
Job Description Job Description Research Scientist, preferably with a PhD in molecular biology or lesser degree with comparable experience. Must meet legal requirement to work in the USA. The …
Machinist
Lincoln Electric is the world leader in the engineering, design, and manufacturing of advanced arc welding solutions, automated joining, assembly and cutting systems, plasma and oxy-fuel cutting equi…
Cost Control Manager
About Aegis The Aegis Companies provide expert project control services to the construction industry's most respected contractors, owners, and operators. Headquartered in Silver Spring, MD, we emp…
Specialist - Child Welfare Services (Foster Care)
Description Working in Child Welfare can be rewarding and very heart wrenching at times. It is a joy helping others because it absolutely requires one to give of themselves to help another. You w…
Au Pair
Get hired for Jim's aupair Job in Grandville, MI. Test. Find aupair care work in Grandville.
Class A Dry Bulk Pneumatic Tanker Driver Job
Class A Dry Bulk Pneumatic Tanker Driver Job Brink Transfer Services is a local, family-owned trucking company who knows you by name and not a number. We have more work than drivers! We are looking t…
Leasing Specialist
Job Description Job Description Description As a Leasing Specialist, you are the first step in creating a sense of community for current and prospective residents. You are responsible for prov…
SAP Order to Cash (SD) Consultant, Manager Save for Later Remove job
A career in our SAP Customer team, within our SAP consulting practice, will provide you with the opportunity to lead our clients in their customer transformation journey by reimagining exceptional …
Registered Nurse
Job Description Job Description We are seeking a Registered Nurse to join our team! You will be responsible for the assessment and treatment of assigned patients. Responsibilities: Administ…
Area Sales Director- Modernization (Midwest)
What We Expect The first 3 letters in workplace safety are Y-O-U! TK Elevator is currently seeking an Area Sales Director- Modernization for the Midwest . The Area Sales Director- Moderni…