Post a Job Affiliates

Search Remote Jobs

Sciforium

Website LinkedIn

11 - 50 employees

Founded 2024

🤖 Artificial Intelligence

🔌 API

🔧 Hardware

🔥 Funding within the last year

💰 $12M Seed Round - Sciforium on 2025-10

Artificial Intelligence • API • Hardware

Sciforium is a serverless AI infrastructure platform that provides production-ready, multimodal AI services via a unified, OpenAI-compatible API. The company runs vertically integrated AMD GPU hardware and offers model hosting, a model library, real-time evaluation pipelines, and managed agent deployments to help teams build, evaluate, and ship text, image, video, and audio AI applications with lower cost, stronger privacy, and predictable performance.

Lead Software Engineer, Model Serving Platform

🕒 April 12

🏢🏡 San Francisco – Hybrid

💵 $230k - $300k / year

⏰ Full Time

🟠 Senior

🧑‍💻 Full-stack Engineer

Kubernetes

Python

Ray

Apply Now

Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Sciforium

Website LinkedIn

11 - 50 employees

Founded 2024

🤖 Artificial Intelligence

🔌 API

🔧 Hardware

🔥 Funding within the last year

💰 $12M Seed Round - Sciforium on 2025-10

Artificial Intelligence • API • Hardware

📋 Description

• Lead the technical direction of the model serving platform, owning architecture decisions and guiding engineering execution. • Build core serving components including execution runtimes, batching, scheduling, and distributed inference systems. • Develop high-performance C++ and CUDA/HIP modules, including custom GPU kernels and memory-optimized runtimes. • Collaborate with ML researchers to productionize new multimodal models and ensure low-latency, scalable inference. • Build Python APIs and services that expose model capabilities to downstream applications. • Mentor and support other engineers through code reviews, design discussions, and hands-on technical guidance. • Drive performance profiling, benchmarking, and observability across the inference stack. • Ensure high reliability and maintainability through testing, monitoring, and engineering best practices. • Troubleshoot and resolve complex issues across GPU, runtime, and service layers.

🎯 Requirements

• Bachelor’s degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent practical experience • 5+ years of experience designing and building scalable, reliable backend systems or distributed infrastructure • Strong understanding of LLM inference mechanics (prefill vs decode, batching, KV cache) • Experience with Kubernetes/Ray, Containerization • Strong proficiency in C++, Python • Strong debugging, profiling, and performance optimization skills at the system level • Ability to collaborate closely with ML researchers and translate model or runtime requirements into production-grade systems • Effective communication skills and the ability to lead technical discussions, mentor engineers, and drive engineering quality • Comfortable working from the office and contributing to a fast-moving, high-ownership team culture.

🏖️ Benefits

• Medical, dental, and vision insurance • 401k plan • Daily lunch, snacks, and beverages • Flexible time off • Competitive salary and equity

Apply Now

Similar Jobs

Software Engineer – Product

🕒 April 12

Koah

1 - 10

🤖 Artificial Intelligence

☁️ SaaS

🤝 B2B

Website LinkedIn

Software Engineer at Koah Labs, shaping AI-native product development and engineering organization with a focus on cross-functional collaboration.

🏢🏡 San Francisco – Hybrid

💵 $180k - $250k / year

🔥 Funding within the last year

💰 $5M Seed on 2025-10

⏰ Full Time

🟡 Mid-level

🟠 Senior

🧑‍💻 Full-stack Engineer

Apply

View Job

Software Engineer – Infrastructure

🕒 April 12

Koah

1 - 10

🤖 Artificial Intelligence

☁️ SaaS

🤝 B2B

Website LinkedIn

Software Engineer building and maintaining infrastructure for adtech platform at Koah Labs. Collaborate with a tight-knit team to ensure system performance and reliability.

🏢🏡 San Francisco – Hybrid

💵 $180k - $250k / year

🔥 Funding within the last year

💰 $5M Seed on 2025-10

⏰ Full Time

🟡 Mid-level

🟠 Senior

🧑‍💻 Full-stack Engineer

Apply

View Job

Software Engineer – Special Projects

🕒 April 12

HeyMilo AI

11 - 50

🤖 Artificial Intelligence

👥 HR Tech

☁️ SaaS

Website LinkedIn

Cracked Software Engineer building AI interviewer solutions for hiring at HeyMilo. Focused on solving real problems fast and delivering production-ready systems.

🏢🏡 San Francisco – Hybrid

⏰ Full Time

🟡 Mid-level

🟠 Senior

🧑‍💻 Full-stack Engineer

Apply

View Job

Software Engineer – Infrastructure, Analytics Platform

🕒 April 11

OpenAI

201 - 500

🤖 Artificial Intelligence

☁️ SaaS

🏢 Enterprise

Website LinkedIn

Staff-level Software Engineer at OpenAI focusing on backend infrastructure and systems. Enhancing performance-sensitive infrastructure in Rust or C++ with a hybrid work model.

🏢🏡 San Francisco – Hybrid

💵 $230k - $385k / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

🧑‍💻 Full-stack Engineer

🦅 H1B Visa Sponsor