Senior/Principal Infrastructure Engineer, Data Centers

Utilidata
Ann Arbor, MI
Utilidata is a fast-growing NVIDIA-backed edge AI company enabling greater visibility and control of power utilization in energy-intensive infrastructure, like the electric grid and data centers. Karman, the company’s distributed AI platform powered by a custom NVIDIA module, is transforming the way utility companies operate the grid edge and will enable data centers to unlock more compute for the same provisioned power.

The Infrastructure Engineer is responsible for deploying, configuring, and supporting Karman systems within high-density data center environments, ensuring optimal performance, reliability, and scalability of infrastructure operations. This role provides Tier 3 infrastructure support across Utilidata’s self-hosted data center and partner colocation environments. The engineer will operate across networking, power, Linux systems, and rack infrastructure to ensure high availability and rapid issue resolution in environments where monitoring and visibility are evolving.

In addition to operational ownership, this role will contribute to building the policies, tooling, and processes required to deliver robust Tier 3 support. This includes defining incident response practices, escalation paths, root cause analysis standards, and reliability targets (SLAs/SLOs), while improving logging, monitoring, and observability. The engineer will not only resolve complex cross-layer issues, but also drive the systematic improvements needed to reduce recurrence and strengthen long-term reliability across both environments.

This position is based onsite at our company headquarters in Ann Arbor, Michigan, with flexibility for occasional remote work. Candidates will be expected to collaborate cross-functionally with remote teams based across the country.

Responsibilities
  • Deploy and configure Karman systems in high-density data center environments, ensuring adherence to best practices and organizational standards
  • Monitor, troubleshoot, and resolve technical issues related to Karman applications, networking, and infrastructure components
  • Manage and maintain B300 or equivalent rack systems, including configuration and optimization of power distribution units (PDUs) and power supply units (PSUs)
  • Perform Linux system administration tasks including installation, configuration, patch management, and performance tuning
  • Design and implement network configurations to support Karman deployments, including routing, switching, and connectivity optimization
  • Develop a deep understanding of how compute, power, and networking resources are being consumed by internal teams, and proactively identify constraints, risks, and scaling bottlenecks
  • Collaborate with cross-functional teams to plan capacity requirements and scale infrastructure to meet growing demands
  • Document deployment procedures, configuration standards, and troubleshooting guides for knowledge sharing and operational continuity
  • Provide technical support and training to internal teams on Karman system operations and best practices
  • Conduct regular system health checks, performance monitoring, and proactive maintenance to prevent downtime
  • Participate in on-call rotation to ensure 24/7 system availability and rapid incident response
Minimum Qualifications
  • 8+ years of experience with Linux system administration (RHEL, Ubuntu, CentOS, or similar distributions)
  • Proven experience deploying and managing applications in high-density data center environments
  • Strong understanding of data center infrastructure including rack systems (B300 or equivalent), PDUs, PSUs, cooling systems, and power management
  • Hands-on experience with enterprise networking concepts including TCP/IP, DNS, DHCP, VLANs, firewall concepts, routing protocols, switching, and network troubleshooting
  • Demonstrated ability to operate effectively in environments with evolving processes and incomplete tooling, using first-principles debugging and cross-domain reasoning
  • Strong ownership mindset with a bias toward proactive improvement rather than reactive ticket resolution
  • Willingness to travel up to 20% of the time, including international travel
Enhanced Qualifications (Nice to Have)
  • Proficiency with configuration management and automation tools (Ansible, Puppet, Chef, or similar)
  • Familiarity with Tailscale/WireGuard
  • Experience with monitoring and observability tools (Prometheus, Grafana, ELK stack, or similar)
  • Knowledge of containerization and orchestration technologies (Docker, Kubernetes)
Salary Range: $160,000 to $195,000 base compensation, depending on experience, plus stock options. Salary will be commensurate with an individual's skills, training, and years of experience, and in line with internal compensation bands.

Location: This position is based onsite at our company headquarters in Ann Arbor, Michigan, with flexibility for occasional remote work.

Our Commitments:
Utilidata values the diversity of our team. We provide equal employment opportunities without regard to race, color, religion, creed, sex, gender, sexual orientation, gender identity or expression, national origin, age, physical disability, mental disability, medical condition, pregnancy or childbirth, genetics, genetic information, marital status, status as a covered veteran, or any other basis protected by applicable federal, state, and local laws.

We are committed to:
  • Creating a diverse and inclusive workplace that is welcoming, supportive, affirming and respectful
  • Empowering employees to solve problems and work together to make a difference
  • Providing mentorship and growth opportunities as part of a collaborative team
  • A flexible work environment with flexible paid time off
  • Competitive compensation and benefits, including health, dental, vision, and employer-match 401k

Posted 2026-03-02
