Principal Platform Software Engineer – RAS

🕒 January 26

NVIDIA

10,000+ employees

Founded 1993

Artificial Intelligence • Gaming • Automotive

NVIDIA is a leading technology company specializing in accelerated computing and artificial intelligence. NVIDIA pioneers advancements in graphics processing units (GPUs), cloud computing, data centers, and virtual reality, with a focus on the gaming, automotive, healthcare, and robotics industries. The company's innovations, such as NVIDIA Omniverse, transform traditional digital processes by enabling high-fidelity simulation and rendering. Its applications span many industries, from autonomous vehicles using NVIDIA DRIVE to healthcare solutions with NVIDIA Clara, as well as AI-driven analytics and workflows.

📋 Description

• Drive next-generation fleet management solutions for scaling AI infrastructure using GPUs and Grace solutions from NVIDIA
• Work with customers, product management, and other architects to narrow down requirements for implementation
• Bring clarity to the architecture for fleet health monitoring and fault-remediation solutions at scale
• Work with customers and other architects to understand their health monitoring requirements
• Develop detailed architecture and build POCs to validate it
• Educate customers about the product architecture and gather their feedback
• Write architecture specs and design documents, and own end-to-end delivery of the product
• Review the code produced from the architecture specs
• Ensure the product is properly tested by working with the development team
• Drive product life cycles with QA teams to productize the code, and act as product owner
• Articulate requirements in Jira and bug management tools, and work out an end-to-end execution plan
• Contribute to all phases of product development, from product definition, architecture, and design through implementation, debugging, testing, and early customer support

🎯 Requirements

• BS, MS, or PhD in EE/CS or a related field (or equivalent experience)
• 15+ years of hands-on coding experience
• Strong knowledge of time series databases such as InfluxDB and Prometheus
• Strong knowledge of building and consuming REST APIs (Redfish is a big plus)
• Strong knowledge of telemetry visualization solutions such as Grafana and Influx
• Strong knowledge of firmware architecture, including optimizing firmware for low-latency APIs
• Strong knowledge of analyzing algorithms for time and space complexity and projecting system resource requirements
• Proven record of delivering scalable solutions
• Strong and demonstrable skill in C/C++ and Python
• Programming and debugging experience on server platforms
• Experience with SCM tools (e.g., Git, Perforce) and project management tools such as Jira

🏖️ Benefits

• Equity
• Benefits
