Senior Solutions Architect - Continuous Bringup and Optimization

May 13

🗣️🇯🇵 Japanese Required

Apply Now
Logo of NVIDIA

NVIDIA

Artificial Intelligence • Gaming • Automotive

NVIDIA is a leading technology company specializing in accelerated computing and artificial intelligence. NVIDIA pioneers advancements in graphical processing units (GPUs), cloud computing, data centers, and virtual reality, with a focus on gaming, automotive, healthcare, and robotics industries. The company's innovations, such as NVIDIA Omniverse, transform traditional digital processes by enabling high-fidelity simulations and rendering tasks. Their applications span various industries, from autonomous vehicles using NVIDIA DRIVE to healthcare solutions with NVIDIA Clara, and AI-driven analytics and workflows.

10,000+ employees

Founded 1993

🤖 Artificial Intelligence

🎮 Gaming

📋 Description

• Lead the hands-on analysis, optimization, and performance tuning of complex GPU-accelerated systems and AI workloads, ensuring high availability and efficiency across customer data centers. • Engage with NVIDIA strategic customers to drive AI infrastructure initiatives, support deployment success, and influence long-term platform adoption. • Serve as a senior technical authority on NVIDIA GPU, DPU, and networking technologies, contributing to architecture reviews and guiding infrastructure decisions at scale. • Collaborate with internal Engineering, Product, and Sales teams to align customer deployments with NVIDIA’s technology roadmap and business objectives. • Establish and refine monitoring and optimization methodologies using analytics, telemetry, and automation to detect bottlenecks and improve infrastructure resiliency. • Participate in post-deployment reviews, incident retrospectives, and strategic planning sessions to shape the customer experience and feed insights into NVIDIA’s infrastructure strategy. • Complete and lead complex technical projects from initial design through implementation and continuous improvement, ensuring alignment to SLAs and mitigation of technical risks. • Support business growth by identifying AI infrastructure opportunities in cloud and enterprise environments and driving technical initiatives that showcase NVIDIA’s leadership in this space.

🎯 Requirements

• 10+ years of experience in large-scale data center service operations with a focus on infrastructure performance, backed by a Bachelor’s, Master’s, or PhD in Computer Science, Engineering, or a related field. • Strong analytical, solving problems, and decision-making skills, capable of identifying root causes, driving continuous improvement, and delivering resilient technical solutions. • Strong communication, time management, and organizational skills, with the ability to lead complex projects, guide technical teams, and meet important metrics. • Preferred certifications in data center, server, or networking technologies, and a willingness to travel up to 25% for customer engagements and team collaboration. • Proficiency in system-level aspects, encompassing Operating Systems, Linux kernel drivers, GPUs, NICs, and hardware architecture. • Demonstrated expertise in cloud orchestration software and job schedulers, including platforms like Kubernetes, Docker Swarm, and HPC-specific schedulers such as Slurm. • Familiarity with cloud-native technologies and their integration with traditional infrastructure is crucial. • Proficiency in both Japanese and English, with the ability to communicate complex technical topics clearly across multicultural teams and with customers.

Apply Now

Similar Jobs

May 5

Join Snowflake as a Solutions Architect, deploying cloud products for customers and migrating data platforms.

AWS

Azure

Cloud

ETL

Greenplum

Hadoop

HBase

Java

MapReduce

OpenStack

Perl

Python

Ruby

SQL

Tableau

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com