Search Remote Jobs

Senior Deep Learning Framework Engineer

đź•’ January 23

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of NVIDIA

NVIDIA

10,000+ employees

Founded 1993

🤖 Artificial Intelligence

🎮 Gaming

Artificial Intelligence • Gaming • Automotive

NVIDIA is a leading technology company specializing in accelerated computing and artificial intelligence. NVIDIA pioneers advancements in graphical processing units (GPUs), cloud computing, data centers, and virtual reality, with a focus on gaming, automotive, healthcare, and robotics industries. The company's innovations, such as NVIDIA Omniverse, transform traditional digital processes by enabling high-fidelity simulations and rendering tasks. Their applications span various industries, from autonomous vehicles using NVIDIA DRIVE to healthcare solutions with NVIDIA Clara, and AI-driven analytics and workflows.

đź“‹ Description

• Integrate new communication libraries features in AI frameworks: from PoC to performance analysis to production • Perform deep analysis of AI workloads and frameworks to identify multi-GPU communication requirements and opportunities. • Collaborate hands-on with teams working on the latest AI models. • Improve AI compilers to hide communications or perform automatic fusion. • Conduct in-depth AI workload performance characterization on multi-GPU clusters. • Design fault-tolerant and elastic solutions for large-scale or dynamic AI workloads. • Author custom communication or fused compute-communication kernels to showcase ultimate performance on NV platforms. • Influence the roadmap of communication libraries - NCCL & NVSHMEM. • Collaborate with a very dynamic team across multiple time zones.

🎯 Requirements

• B.S, M.S. or PHD in Computer Science, or related field (or equivalent experience) with 5+ software engineering and HPC/AI experience • Development or integration experience with Deep Learning Frameworks such PyTorch, JAX, and Inference Engines such as TRT-LLM, vLLM, SGLang • Rapid prototyping and development with Python, C++, CUDA or related DSLs (Triton, cuTe) • Solid grasp of AI models, parallelisms, and/or compiler technologies (e.g. torch.compile) • Experience conducting performance benchmarking on AI clusters. • Familiarity with at least one performance profiler toolchain (PyTorch profiler, NVIDIA Nsight Systems) • Understanding of HPC/AI communication concepts (1-sided v 2-sided communication, elasticity, resiliency, topology discovery, etc) • Adaptability and passion to learn new areas and tools • Flexibility to work and communicate effectively across different teams and timezones

🏖️ Benefits

• equity • benefits

Apply Now

Similar Jobs

đź•’ January 22

RESPEC

201 - 500

Lead Strategic Communications Specialist managing OCI communications to ensure clarity on federal EHR transformations. Develop executive-level strategies and oversee multimedia communication products.

đź•’ January 21

Vanguard Attorneys, LLC

1 - 10

🤝 B2B

👥 B2C

Telecommunications Patent Agent specializing in IP placements at Vanguard-IP. Focusing on patent preparation, prosecution, and client relations in a nationwide network.

đź•’ January 21

Vanguard Attorneys, LLC

1 - 10

🤝 B2B

👥 B2C

Patent Agent specializing in technology and wireless communications in a team-oriented environment. Opportunity to leverage expertise in patent preparation and prosecution within a reputable firm.

đź•’ January 21

Vanguard Attorneys, LLC

1 - 10

🤝 B2B

👥 B2C

Telecommunications Patent Agent specializing in the placement of IP/Patent professionals nationwide. Understanding clients’ technical needs and offering trusted career advice with a focus on wireless communications.

đź•’ January 21

Vanguard Attorneys, LLC

1 - 10

🤝 B2B

👥 B2C

Patent Agent focusing on technology and wireless communications at Vanguard Intellectual Partners. Requires experience in patent preparation and prosecution with Bachelor's degree in electrical engineering.