Senior DL Algorithms Engineer – Inference Performance

🕒 February 19

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of NVIDIA

NVIDIA

10,000+ employees

Founded 1993

🤖 Artificial Intelligence

🎮 Gaming

Artificial Intelligence • Gaming • Automotive

NVIDIA is a leading technology company specializing in accelerated computing and artificial intelligence. NVIDIA pioneers advancements in graphical processing units (GPUs), cloud computing, data centers, and virtual reality, with a focus on gaming, automotive, healthcare, and robotics industries. The company's innovations, such as NVIDIA Omniverse, transform traditional digital processes by enabling high-fidelity simulations and rendering tasks. Their applications span various industries, from autonomous vehicles using NVIDIA DRIVE to healthcare solutions with NVIDIA Clara, and AI-driven analytics and workflows.

📋 Description

• Implement language and multimodal model inference as part of NVIDIA Inference Microservices (NIMs). • Contribute new features, fix bugs and deliver production code to TRT-LLM, NVIDIA’s open-source inference serving library. • Profile and analyze bottlenecks across the full inference stack to push the boundaries of inference performance. • Benchmark state-of-the-art offerings in various DL models inference and perform competitive analysis for NVIDIA SW/HW stack. • Collaborate heavily with other SW/HW co-design teams to enable the creation of the next generation of AI-powered services.

🎯 Requirements

• PhD in CS, EE or CSEE or equivalent experience. • 5+ years of experience. • Strong background in deep learning and neural networks, in particular inference. • Experience with performance profiling, analysis and optimization, especially for GPU-based applications. • Proficient in C++, PyTorch or equivalent frameworks. • Deep understanding of computer architecture, and familiarity with the fundamentals of GPU architecture. • Proven experience with processor and system-level performance optimization. • Deep understanding of modern LLM architectures. • Strong fundamentals in algorithms. • GPU programming experience (CUDA or OpenCL) is a plus

🏖️ Benefits

• equity • benefits

Apply Now

Similar Jobs

🕒 February 19

Stantec

10,000+ employees

⚡ Energy

🚗 Transport

🏛️ Government

Mining HVAC Engineer ensuring a constant supply of clean air for underground workers. Evaluating ventilation systems and collaborating with clients in the mining industry.

🕒 February 17

DEME Group

5001 - 10000

Project Engineer involved in marine infrastructure and civil projects. Collaborating with on-shore teams and managing project execution within deadlines.

🗣️🇳🇱 Dutch Required

🕒 February 17

Eos Energy Enterprises, Inc.

201 - 500

⚡ Energy

Engineer responsible for diagnostics and resolution of complex issues in high voltage battery systems. Ensuring safety and performance for energy storage technologies across various projects.

🕒 February 17

South Jersey Industries

1001 - 5000

⚡ Energy

Renewable Project Engineer V developing and managing renewable natural gas facilities for SJI. Leading document control processes and supporting project engineering in a remote work setting.

🕒 February 17

Hewlett Packard Enterprise

10,000+ employees

🏢 Enterprise

🔧 Hardware

☁️ SaaS

Senior Network Performance Engineer at HPE designing and implementing simulation motifs. Collaborating with network architects and engineers to evaluate performance and scalability aspects.

Python