Deep Learning Software Engineer, TensorRT Performance

🕒 April 3

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of NVIDIA

NVIDIA

10,000+ employees

Founded 1993

🤖 Artificial Intelligence

🎮 Gaming

Artificial Intelligence • Gaming • Automotive

NVIDIA is a leading technology company specializing in accelerated computing and artificial intelligence. NVIDIA pioneers advancements in graphical processing units (GPUs), cloud computing, data centers, and virtual reality, with a focus on gaming, automotive, healthcare, and robotics industries. The company's innovations, such as NVIDIA Omniverse, transform traditional digital processes by enabling high-fidelity simulations and rendering tasks. Their applications span various industries, from autonomous vehicles using NVIDIA DRIVE to healthcare solutions with NVIDIA Clara, and AI-driven analytics and workflows.

📋 Description

• Establish groundbreaking performance benchmarking methodologies and analysis workflows and identify performance issues and opportunities for NVIDIA’s inference ecosystem (e.g. TensorRT/TensorRT-EdgeLLM/Torch-TensorRT) • Contribute features and code to NVIDIA/OSS inference frameworks including but not limited to TensorRT/TensorRT-EdgeLLM/Torch-TensorRT. • Develop new model pipelines for NVIDIA’s inference ecosystem with optimized performance including but not limited to areas like quantization, scheduling, memory management, and distributed inference to set the gold standard for Gen AI performance. • Work with cross-collaborative teams inside and outside of NVIDIA across generative AI, automotive, robotics, image understanding, and speech understanding to set directions and develop innovative inference solutions. • Scale performance of deep learning models across different architectures and types of NVIDIA accelerators.

🎯 Requirements

• Bachelors, Masters, PhD, or equivalent experience in relevant fields (Computer Science, Computer Engineering, EECS, AI). • 2 years of relevant software development experience. • Strong C++, Python programming and software engineering skills • Experience with DL frameworks (e.g. PyTorch, JAX, TensorFlow, ONNX) and inference libraries (e.g. TensorRT, TensorRT-LLM, vLLM, SGLang, FlashInfer). • Experience with performance analysis and performance optimization

🏖️ Benefits

• equity • benefits

Apply Now

Similar Jobs

🕒 April 3

Blue Tiger

1 - 10

🤝 B2B

🏛️ Government

🌍 Social Impact

Full-Stack Software Engineer role at Blue Tiger, supporting public health through technology. Collaborate with cross-functional teams to deliver complex software solutions in a remote environment.

Angular

Apache

AWS

Azure

Cloud

DynamoDB

JavaScript

Kafka

Node.js

NoSQL

Python

React

Redux

Ruby

Vue.js

🕒 April 3

SHI International Corp.

5001 - 10000

🤝 B2B

🔧 Hardware

☁️ SaaS

Software Architect at SHI leading eCommerce platform architecture and driving technical excellence in a remote role.

JavaScript

React

SQL

🕒 April 3

Nextdoor

501 - 1000

👥 B2C

📱 Media

iOS Software Engineer at Nextdoor, developing and improving app features and infrastructure. Collaborating with cross-functional teams to drive innovations in local community technology.

iOS

Swift

🕒 April 3

Glacis

11 - 50

🤖 Artificial Intelligence

☁️ SaaS

🤝 B2B

Founding Software Engineer defining the technical strategy for AI systems at Glacis. Collaborating with enterprise customers and delivering high-impact features in a fast-growing startup.

AWS

Azure

Cloud

Django

ERP

Google Cloud Platform

JavaScript

Postgres

Python

🕒 April 3

Westlight AI

1 - 10

🔌 API

🤖 Artificial Intelligence

🏢 Enterprise

Windows Software Engineer developing kernel-mode security software for operating systems at Westlight. Collaborating with a team of experts in cybersecurity and software engineering.

TCP/IP