Principal Operations Engineer – Mechanical, Data Center Operations

🕒 5 days ago

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of FluidStack

FluidStack

11 - 50 employees

🤖 Artificial Intelligence

Artificial Intelligence • Cloud Computing

FluidStack is a company that provides GPU supercomputing infrastructure for AI labs. It offers on-demand access to thousands of Nvidia GPUs, enabling large-scale AI training and inference. The company specializes in deploying and managing large GPU clusters with support for technologies like Kubernetes and Slurm, ensuring high availability and excellent support. FluidStack provides a fully managed cloud infrastructure, helping AI companies to focus on developing models without worrying about the underlying hardware. They emphasize performance and cost-efficiency, offering services that scale to thousands of GPUs with high uptime and rapid response times.

📋 Description

• Serve as the principal operational technical authority for mechanical infrastructure across the fleet, including chillers, cooling towers, CRACs/CRAHs, CDUs (direct-to-chip and immersion), dry coolers, economizers, pumping systems, water treatment, and associated piping infrastructure. • Lead technical and operational site audits across active and pre-activation sites; produce operational health assessments with prioritized findings and own the remediation roadmap through closure with site leadership. • Own mechanical operational readiness for new sites coming online — assess team capability, validate procedures, and personally sign off on operational handover from commissioning to steady-state operations. • Review operational designs for new builds and capacity upgrades; represent the operational point of view in design forums and ensure operability, maintainability, and reliability concerns are surfaced and addressed before they are built in. • Feed structured operational learnings back into the design and manufacturing organization as we shift toward repeatable, productized data center builds; reinforce patterns that work and drive out patterns that have not held up in operations. • Author and approve high-risk MOPs, EOPs, and AOPs; serve as the final technical approver for high-consequence mechanical work across the fleet. • Lead root cause analysis for significant thermal or mechanical events; drive corrective actions through to closure and ensure learnings propagate across all sites. • Contribute to and uphold the mechanical safety program in partnership with EHS, with explicit accountability for refrigerant handling, confined space, LOTO, and high-pressure systems discipline. • Partner with QA/QC during construction to provide an operational perspective on workmanship, installation quality, and pre-energization readiness. • Build and deliver technical training to Field Engineers and operational teams; own the technical curriculum for mechanical content in the campus rotation and training model. • Mentor Field Engineers and rising operational leaders; act as the senior technical voice in operational reviews, incident reviews, and design reviews.

🎯 Requirements

• 10+ years of hands-on experience in mission-critical mechanical and cooling systems, with at least 5 years as the senior technical voice on a site, campus, or fleet. • Data center operations experience strongly preferred; central plant, industrial cooling, pharmaceutical, or semiconductor mission-critical experience considered. • Deep working command of chilled water plants, condenser water systems, CDUs and liquid cooling, air handling, refrigeration cycles, and pumping and piping systems — earned in the field, not from a textbook. • Practical command of psychrometric charts, refrigeration cycles, and ASHRAE thermal guidelines for data center environments. • Demonstrated ability to author, approve, and execute high-risk MOPs and EOPs in live critical environments. • A track record of leading root cause analysis on significant thermal or mechanical events and driving corrective actions to closure. • A track record of holding OEMs, service vendors, and contractors accountable — you know how to enforce a standard without burning the relationship. • Strong written communication: operational health assessments, RCAs, procedure reviews, and design review feedback are second nature. • Comfort operating as the senior technical voice across operations, design, construction, hardware, and EHS. • Willingness to travel extensively across the fleet.

🏖️ Benefits

• Competitive total compensation package (salary + equity). • Retirement or pension plan, in line with local norms. • Health, dental, and vision insurance. • Generous PTO policy, in line with local norms.

Apply Now

Similar Jobs

🕒 5 days ago

Fullsteam

1001 - 5000

💳 Fintech

☁️ SaaS

🤝 B2B

VP of Operations at Fullsteam leading multiple appliance business units for growth and profitability. Overseeing operations, customer satisfaction, and multidisciplinary teams in a dynamic technology environment.

🕒 5 days ago

Biogen

5001 - 10000

🧬 Biotechnology

⚕️ Healthcare Insurance

💊 Pharmaceuticals

Field Director leading Medical Science Liaisons in Neuropsychiatry at Biogen. Providing strategic direction, operational excellence, and ensuring effective medical strategy execution for patient outcomes.

🕒 5 days ago

Sciens Building Solutions

1001 - 5000

🔐 Security

🤝 B2B

National Director of Service Operations at Sciens Building Solutions, optimizing service execution nationally. Enforcing service standards and advancing operational excellence across all regions.

🕒 5 days ago

LXT

501 - 1000

🤖 Artificial Intelligence

☁️ SaaS

🏢 Enterprise

VP of Operations leading strategic initiatives and operational excellence across LXT’s global delivery teams. Partnering with executive leadership to drive growth and profitability.

🕒 5 days ago

Health Catalyst

1001 - 5000

⚕️ Healthcare Insurance

🤖 Artificial Intelligence

☁️ SaaS

Director of Support Operations at Health Catalyst managing systems and insights for customer support. Focusing on compliance and quality in a healthcare environment with leadership responsibilities.