Software Engineer - GPU Inference

🕒 June 18, 2025

🏢🏡 San Francisco – Hybrid

💵 $380k / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

🧑‍💻 Full-stack Engineer

🦅 H1B Visa Sponsor

info
Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of OpenAI

OpenAI

WebsiteLinkedIn

201 - 500 employees

Founded 2015

🤖 Artificial Intelligence

☁️ SaaS

🏢 Enterprise

Artificial Intelligence • SaaS • Enterprise

OpenAI is a leading research organization and company dedicated to creating advanced artificial intelligence technology, with a strong emphasis on safety and ethical considerations. OpenAI's mission is to ensure that artificial general intelligence (AGI) benefits all of humanity. The company develops AI products like ChatGPT, which can assist users with tasks ranging from everyday requests to complex enterprise solutions. OpenAI also provides an API platform that integrates its AI models into various applications. The company is focused on innovation in AI and improving data analysis capabilities, while emphasizing safety and ethical governance of their systems.

📋 Description

• Perform engineering efforts focused on improving model serving, inference performance, and system efficiency • Drive optimizations from a kernel and data movement perspective to improve system throughput and reliability • Partner closely with research and product teams to ensure our models perform effectively at scale • Design, build, and improve critical serving infrastructure to support Sora’s growth and reliability needs

🎯 Requirements

• Have deep expertise in model performance optimization, particularly at the inference layer • Have a strong background in kernel-level systems, data movement, and low-level performance tuning • Are excited about scaling high-performing AI systems that serve real-world, multimodal workloads • Can navigate ambiguity, set technical direction, and drive complex initiatives to completion

🏖️ Benefits

• Medical, dental, and vision insurance for you and your family • Mental health and wellness support • 401(k) plan with 50% matching • Generous time off, many company holidays, and multiple coordinated company office closures throughout the year for focus and recharge. • Paid parental leave (24 weeks paid birth-parent leave & 20-week paid parental leave) and family-planning support • Annual learning & development stipend ($1,500 per year)

Apply Now

Similar Jobs

🕒 June 18, 2025

Abridge

11 - 50

⚕️ Healthcare Insurance

🤖 Artificial Intelligence

☁️ SaaS

WebsiteLinkedIn

Join Abridge to enhance healthcare through AI-driven web applications and support systems.

🕒 June 18, 2025

Nash

11 - 50

🚗 Transport

🛍️ eCommerce

☁️ SaaS

WebsiteLinkedIn

Join Nash as a Full Stack Engineer to design and develop logistics solutions for major retailers.

🕒 June 18, 2025

Candid Health

11 - 50

⚕️ Healthcare Insurance

💸 Finance

☁️ SaaS

WebsiteLinkedIn

Join engineering teams to revolutionize healthcare billing products and drive customer outcomes.

🏢🏡 San Francisco – Hybrid

💵 $195k - $250k / year

💰 $150k Seed Round on 2020-03

⏰ Full Time

🟠 Senior

🧑‍💻 Full-stack Engineer

🕒 June 18, 2025

Stytch

51 - 200

☁️ SaaS

🔒 Cybersecurity

🔌 API

WebsiteLinkedIn

Become an Experienced Software Engineer at Stytch. Collaborate on their powerful identity platform.