Software Engineer – BIS, Baseten Inference Stack

🕒 June 2

🏢🏡 San Francisco – Hybrid

💵 $180k - $360k / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

🧑‍💻 Full-stack Engineer

🦅 H1B Visa Sponsor

info
Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Baseten

Baseten

WebsiteLinkedIn

11 - 50 employees

Founded 2020

🤖 Artificial Intelligence

☁️ SaaS

🏢 Enterprise

💰 $8M Seed Round on 2022-04

Artificial Intelligence • SaaS • Enterprise

Baseten is a company that provides fast, scalable model inference services, designed for performance, security, and a delightful developer experience. They offer tools to streamline the entire development process, enabling high-throughput inference and fast deployment times. Baseten caters to enterprise companies by delivering robust, secure, and scalable model serving solutions, particularly useful for machine learning and AI model deployment. Their solutions allow organizations to efficiently manage model infrastructure while focusing on creating domain-specific models. Baseten supports open-source model packaging and offers autoscaling features to handle varying demand efficiently.

📋 Description

• Develop infrastructure and orchestration systems for deploying and managing large-scale distributed LLM inference • Work across the stack, from customer-facing features to low-level infrastructure components • Build platform capabilities related to routing, autoscaling, scheduling, observability, and runtime management • Improve the reliability, scalability, and usability of our inference stack • Collaborate closely with Model Performance engineers to make new inference optimizations broadly available to customers and easy to configure • Help define best practices around testing, release automation, benchmarking, and operational excellence • Debug complex production systems spanning Kubernetes, distributed runtimes, networking, and GPU workloads • Make thoughtful engineering tradeoffs balancing performance, reliability, operational simplicity, and developer experience • Own projects end-to-end: from architecture and implementation through deployment, monitoring, and iteration based on customer feedback

🎯 Requirements

• Bachelor's, Master's, or Ph.D. in Computer Science, Engineering, or a related field • Strong background in distributed systems, backend infrastructure, or platform engineering • Experience building and operating production systems where reliability, latency, and scale are first-class concerns • Strong sense of developer experience: you think about how systems are used, not just how they work • Motivated and willing to learn new languages, frameworks, and systems as needed • Ability to debug complex systems across multiple layers of the stack • Genuine interest in inference engineering. You don’t need to have hands on experience but are willing to learn • Excellent communication and collaboration skills.

🏖️ Benefits

• Competitive compensation, including meaningful equity. • 100% coverage of medical, dental, and vision insurance for employee and dependents • Flexible PTO policy including company wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!) • Paid parental leave • Fertility and family-building stipend through Carrot • Company-facilitated 401(k) • Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.

Apply Now

Similar Jobs

🕒 June 1

Substack

51 - 200

📱 Media

☁️ SaaS

WebsiteLinkedIn

Full Stack Developer joining Substack's Enterprise team to enhance user activation and conversion. Collaborate with diverse teams and directly interact with customers to drive growth.

🏢🏡 San Francisco – Hybrid

💵 $140k - $260k / year

💰 $65M Series B on 2021-03

⏰ Full Time

🟢 Junior

🟡 Mid-level

🧑‍💻 Full-stack Engineer

🦅 H1B Visa Sponsor

info

🕒 May 31

Adobe

10,000+ employees

WebsiteLinkedIn

Senior Software Engineer working on the video node ecosystem of Adobe's Project Graph. Building and maintaining video generation, editing, and processing nodes for creative workflows.

🕒 May 30

Salesforce

10,000+ employees

☁️ SaaS

🤝 B2B

🤖 Artificial Intelligence

WebsiteLinkedIn

Senior Software Engineer developing core visualization components for Tableau's next-gen product. Collaborating with designers and engineers to optimize data rendering and visual representation.

🕒 May 30

Drata

201 - 500

🔒 Cybersecurity

📋 Compliance

☁️ SaaS

WebsiteLinkedIn

Senior AI Product Engineer at Drata developing AI features for compliance software. Responsible for full-stack development and enhancing user experience with AI capabilities.

🏢🏡 San Francisco – Hybrid

💵 $192k - $259.8k / year

💰 $100M Series B on 2021-11

⏰ Full Time

🟠 Senior

🧑‍💻 Full-stack Engineer

🕒 May 30

Drata

201 - 500

🔒 Cybersecurity

📋 Compliance

☁️ SaaS

WebsiteLinkedIn

Senior AI Product Engineer building end-to-end AI features for Drata's compliance platform. Collaborating with AI Engineers and Product teams to deliver seamless compliance experiences.

🏢🏡 San Francisco – Hybrid

💵 $192k - $259.8k / year

💰 $100M Series B on 2021-11

⏰ Full Time

🟠 Senior

🧑‍💻 Full-stack Engineer