Senior Data Center Performance Engineer – Benchmarking, Optimization

7 hours ago

NVIDIA

Artificial Intelligence • Gaming • Automotive

NVIDIA is a leading technology company specializing in accelerated computing and artificial intelligence. NVIDIA pioneers advancements in graphics processing units (GPUs), cloud computing, data centers, and virtual reality, with a focus on the gaming, automotive, healthcare, and robotics industries. The company's innovations, such as NVIDIA Omniverse, transform traditional digital processes by enabling high-fidelity simulation and rendering. Its applications span many industries, from autonomous vehicles using NVIDIA DRIVE to healthcare solutions with NVIDIA Clara, as well as AI-driven analytics and workflows.

10,000+ employees

Founded 1993

📋 Description

• Design and execute comprehensive performance benchmarking strategies for our data center platforms and products
• Characterize real-world AI training, inference, and HPC workloads at scale
• Define, track, and report key performance indicators (throughput, latency, efficiency, scaling)
• Build automation tools and frameworks for performance monitoring and analysis
• Identify and analyze performance bottlenecks across compute, memory, network and storage subsystems
• Work closely with architecture, hardware, software, networking, storage and customer teams to resolve performance issues
• Drive performance improvements through system tuning, configuration optimization, and architectural recommendations for future generation systems

🎯 Requirements

• M.S. or Ph.D. in Computer Science, Electrical Engineering or related field (or equivalent experience)
• 8+ years of experience in performance engineering or system architecture
• Deep understanding of computer architecture, hardware-software interaction and computing at scale
• Strong proficiency in performance profiling tools (Linux perf, NVIDIA Nsight Systems)
• Familiarity with GPU computing and parallel programming (CUDA)
• Background with HPC networking technologies (InfiniBand, RoCE, NVLink)
• Programming skills in Python, C++, and shell scripting
• Excellent analytical and problem-solving abilities
• Adaptability and passion to learn new technologies
• Ability to communicate effectively and work with cross-functional global teams

🏖️ Benefits

• Equity
• Benefits
