Senior Solutions Architect, Cloud Infrastructure – DevOps

🔥 0 minutes ago

🇸🇦 Saudi Arabia – Remote

⏰ Full Time

🟠 Senior

💻 Solutions Engineer

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of NVIDIA

NVIDIA

10,000+ employees

Founded 1993

🤖 Artificial Intelligence

🎮 Gaming

Artificial Intelligence • Gaming • Automotive

NVIDIA is a leading technology company specializing in accelerated computing and artificial intelligence. NVIDIA pioneers advancements in graphical processing units (GPUs), cloud computing, data centers, and virtual reality, with a focus on gaming, automotive, healthcare, and robotics industries. The company's innovations, such as NVIDIA Omniverse, transform traditional digital processes by enabling high-fidelity simulations and rendering tasks. Their applications span various industries, from autonomous vehicles using NVIDIA DRIVE to healthcare solutions with NVIDIA Clara, and AI-driven analytics and workflows.

📋 Description

• Advise on and help maintain large-scale computational and AI infrastructure, including monitoring, logging, and workload orchestration (Kubernetes and Linux job schedulers). • Provide consultative guidance and perform hands-on solving across the full stack—from bare metal and operating system, through the software stack, container platform, networking, and storage. • Assess customer environments and recommend optimized, production-ready Kubernetes-based container platforms integrated with enterprise-grade networking and storage solutions. • Serve as a key technical resource: develop, refine, and document standard methodologies and operational guidelines to be shared with internal teams and customer partners. • Support Research & Development activities and engage in POCs/POVs to validate new features, architectures, and upgrade approaches. • Create and deliver high-quality documentation, including runbooks, onboarding materials, and best-practice guides for customers and internal teams. • Act as the technical leader for assigned customer accounts, providing strategic guidance on DevOps and platform architecture and influencing long-term infrastructure and operations decisions.

🎯 Requirements

• BS/MS/PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, or related fields (or equivalent experience) with 8+ years of professional experience in leading scalable cloud environments and automation engineering roles. • Shown understanding of networking fundamentals, data center architectures, and hands-on experience leading HPC/AI clusters, including deployment, optimization, and solving. • Validated hands-on experience deploying, configuring, and optimizing NVIDIA GPU-accelerated infrastructure, including driver management, CUDA toolkit integration, and GPU workload profiling. • Extensive experience with Kubernetes for container orchestration, resource scheduling, scaling, and integration with GPU-accelerated and HPC environments. • Strong familiarity with HPC and AI technologies (CPUs, GPUs, high-speed interconnects) and supporting software stacks. • Deep knowledge of Linux (RedHat, Ubuntu), OS-level security, and protocols. • Experience with storage solutions such as Lustre, GPFS, ZFS, XFS, and emerging Kubernetes storage technologies. • Proficiency in Python and Bash scripting, configuration management, and Infrastructure-as-Code tools (e.g., Ansible, Terraform). • Experience with observability stacks (Grafana, Loki, Prometheus) for monitoring, logging, and building fault-tolerant systems. • Strong background in crafting scalable solutions and providing consultative support to customers, including leading architectural reviews and speaking publicly to executive partners.

Apply Now

Similar Jobs

🔥 5 hours ago

Ziff Davis

1001 - 5000

📱 Media

Customer Solutions Manager optimizing client networking solutions at Ookla while leveraging AI and technical expertise. Delivering high-quality solutions and building strategic relationships with customers.

🇸🇦 Saudi Arabia – Remote

💵 ر.س360k - ر.س504k / year

💰 $650M Post-IPO Debt on 2017-06

⏰ Full Time

🟡 Mid-level

🟠 Senior

💻 Solutions Engineer

SQL

Tableau

🕒 May 26

Trellix

1001 - 5000

🔒 Cybersecurity

🤖 Artificial Intelligence

🏢 Enterprise

Solutions Engineer working with enterprise account managers as technical sales expert for cybersecurity. Overseeing solution sales activities to gain technical win and customer trust.

🇸🇦 Saudi Arabia – Remote

💰 $35M Venture Round on 2000-04

⏰ Full Time

🟠 Senior

💻 Solutions Engineer

🗣️🇸🇦 Arabic Required

Cloud

🕒 April 5

CME

501 - 1000

🤝 B2B

🤖 Artificial Intelligence

☁️ SaaS

Senior Data Integration Engineer designing and developing ETL/ELT solutions at CME. Focusing on complex data integration and optimization across platforms and systems.

🇸🇦 Saudi Arabia – Remote

⏰ Full Time

🟠 Senior

💻 Solutions Engineer

Airflow

AWS

Azure

Cloud

ERP

ETL

Informatica

Kafka

Oracle

Postgres

SOAP

SQL

🕒 March 28

harrison.ai

51 - 200

🤖 Artificial Intelligence

☁️ SaaS

🤝 B2B

🇸🇦 Saudi Arabia – Remote

💰 Series B on 2021-12

⏰ Full Time

🟠 Senior

💻 Solutions Engineer

AWS

Cloud

DNS

Docker

Kubernetes

Linux

MySQL

Postgres

SQL

TCP/IP

VMware

🕒 February 20

CoverGo | Insurtech

51 - 200

☁️ SaaS

Solutions Architect delivering innovative cloud-based solutions for the insurance industry. Collaborating with clients, Product Managers, and Delivery Managers to ensure successful solution deployment.

🇸🇦 Saudi Arabia – Remote

💰 $15M Series A on 2022-05

⏰ Full Time

🟡 Mid-level

🟠 Senior

💻 Solutions Engineer

Cloud

NoSQL