Director, Data Center Operations


October 31


Lambda

Artificial Intelligence • SaaS • Hardware

Lambda provides cloud-based solutions and hardware for AI development. It offers on-demand GPU clusters for multi-node training and fine-tuning, as well as inference endpoints and APIs. Its products include the Lambda GPU Cloud, which features NVIDIA's latest generation of infrastructure for enterprise AI, and customizable GPU workstations and desktops designed for AI and deep learning. Lambda also offers a one-line installation and managed upgrade path for machine learning tools like PyTorch, TensorFlow, and NVIDIA CUDA. Focused on enabling AI developers, Lambda provides both public and private cloud services with access to NVIDIA Tensor Core GPUs.

51 - 200 employees


💰 $39.7M Venture Round on 2022-11

📋 Description

• Develop and execute the North American data center operations strategy aligned with AI infrastructure goals and organizational growth.
• Drive continuous improvement across facility operations, emphasizing sustainability, efficiency, and resilience.
• Partner with Engineering, Capacity Planning, and Infrastructure teams to forecast and support future AI and GPU-based compute requirements, and provide operational feedback on designs and system improvements.
• Oversee expansion projects, retrofits, and site selection in collaboration with Data Center Infrastructure Engineering and HPC Architecture teams.
• Lead a multi-site operations team ensuring 24/7/365 reliability, availability, and SLA response across all facilities.
• Establish standardized procedures, metrics, and best practices for preventive maintenance, incident management, and service delivery.
• Monitor operational KPIs including uptime, PUE, safety, and compliance with corporate and regulatory standards.
• Implement automation and AI-driven monitoring solutions to optimize system performance and predictive maintenance.
• Coordinate and communicate data center provider maintenance windows with customers and impacted teams.
• Build, mentor, and scale a high-performing team of operations managers, technicians, and engineers across multiple regions.
• Routinely visit all sites to maintain standards, develop relationships, and identify opportunities for efficiency.
• Foster a culture of safety, accountability, and continuous learning that drives the data center operations team to take on more responsibility and work up the stack.
• Assist in the build-out of new data center whitespace and deployment of AI infrastructure.
• Develop and manage operating budgets, capital expenditures, and cost-optimization initiatives.
• Oversee strategic vendor partnerships with numerous data center providers for power, cooling, maintenance, and critical infrastructure components.
• Ensure compliance with environmental, safety, and industry regulations (e.g., NFPA, OSHA, ISO standards).
• Lead incident response and root cause analysis to drive preventive improvements for incidents related to data center operations or infrastructure.
• Act as the primary point of contact for compliance audits related to data center operations, such as SOC 2 and ISO.

🎯 Requirements

• 10+ years of experience in data center operations, with at least 7 years in a leadership role managing multi-site or hyperscale facilities.
• Proven experience supporting AI, HPC, or cloud infrastructure at scale.
• Deep understanding of power and cooling systems, networking, capacity planning, and facility automation tools (DCIM, BMS, etc.).
• Strong track record of improving operational efficiency and managing relationships with data center providers.
• Bachelor’s degree in Engineering, Computer Science, or a related field preferred; a Master’s degree is a bonus.
• Exceptional communication, cross-functional collaboration, and stakeholder management skills, with the ability to build relationships, consensus, and a positive team culture.
• Willingness to travel (up to 50%) to data center sites across North America, including sites under construction.

🏖️ Benefits

• Health, dental, and vision coverage for you and your dependents
• Wellness and commuter stipends for select roles
• 401k plan with 2% company match (USA employees)
• Flexible paid time off plan that we all actually use


