Senior Math Libraries Engineer, CPU and GPU Optimization

September 27

NVIDIA

Artificial Intelligence • Gaming • Automotive

NVIDIA is a technology company specializing in accelerated computing and artificial intelligence. It develops graphics processing units (GPUs) and platforms for cloud computing, data centers, and virtual reality, with a focus on the gaming, automotive, healthcare, and robotics industries. Products such as NVIDIA Omniverse support high-fidelity simulation and rendering. Its applications span several industries, from autonomous vehicles using NVIDIA DRIVE to healthcare solutions with NVIDIA Clara and AI-driven analytics and workflows.

📋 Description

• Help deliver CUDA-X libraries across NVIDIA's CPU and GPU ecosystem
• Design modern, flexible, and easy to use APIs and kernels for math libraries and lead design reviews with collaborators
• Work closely with internal teams (Engineering, Product Management) and external researchers to understand use cases and requirements
• Work with internal and external customers to deliver timely math libraries releases
• Design, develop, and optimize math libraries for high-performance computing and AI applications
• Continuously survey current trends in software systems to become a domain expert

🎯 Requirements

• PhD or MSc degree in Computer Science, Applied Math, or related field preferred (or equivalent experience)
• 12+ years of experience designing and developing software for high-performance computing and/or AI applications
• Advanced C++ skills, including modern design paradigms (e.g., template meta-programming, RAII)
• Parallel programming experience with CUDA, OpenCL or vector programming on CPU (AVX, NEON or similar)
• Experience with ARM, RISC-V and/or x86_64 CPU architectures
• Strong collaboration, communication, and documentation habits
• Strong background in numerical methods (e.g., FFT, numerical linear algebra) (preferred)
• Programming skills with Python and experience with build automation (cmake) and testing (CI/CD, sanitizers) (preferred)
• Background with cross-compilation and CPU/GPU/accelerator cross-compilation toolchains (preferred)
• Experience with CCCL, OpenMP, OpenACC, multi-threading, MPI, PGAS (preferred)
• Experience with scientific and deep learning libraries and frameworks such as PyTorch, JAX, MKL, MAGMA, PETSc, Kokkos (preferred)

🏖️ Benefits

• Competitive salaries
• Generous benefits package
