Senior Software Engineer, AI Inference

🕒 March 11

🏢🏡 Boston – Hybrid

💵 $133.7k - $220.7k / year

⏰ Full Time

🟠 Senior

🧑‍💻 Full-stack Engineer

🦅 H1B Visa Sponsor

Red Hat

10,000+ employees

Founded 1993

🏢 Enterprise

💰 Corporate Round on 1999-03

Enterprise • Cloud

Red Hat is a leading provider of enterprise open source software, helping companies worldwide build and deploy applications across hybrid cloud infrastructures. With a strong focus on secure, stable, and innovative technologies, Red Hat offers a broad portfolio that includes Red Hat Enterprise Linux, Red Hat OpenShift, and Red Hat Ansible Automation Platform, which support IT services on any infrastructure. Trusted by more than 90% of the U.S. Fortune 500, Red Hat helps organizations modernize their IT environments, drawing on open source communities to drive technological advancement.

📋 Description

• Build and release vLLM wheels across multiple hardware backends and CPU architectures, managing complex native dependency chains including PyTorch, Triton, and other accelerator-specific libraries
• Design and maintain CI/CD pipelines spanning multiple platforms including GitHub Actions, GitLab CI, and Buildkite for build, test, and release workflows
• Manage and scale multi-cloud GPU infrastructure using Terraform and Ansible, including both bare-metal and Kubernetes-based compute runners
• Own the model validation pipeline, orchestrating accuracy evaluation, performance benchmarking, tool-calling validation, and smoke testing across dozens of LLMs on both bare metal and OpenShift
• Develop and maintain the Python tooling and automation that powers the build, packaging, validation, and release processes
• Drive adoption of agentic AI and intelligent automation to streamline engineering workflows, accelerate debugging, and reduce toil across the team

🎯 Requirements

• 5+ years of software engineering experience with significant depth in build systems, release engineering, or infrastructure
• Strong Python development skills with experience building well-tested, maintainable tooling and automation
• Hands-on experience building and packaging Python projects with native compiled extensions, including familiarity with C++ and CUDA build toolchains, wheel packaging, and multi-architecture builds
• Deep familiarity with container ecosystems, including Dockerfiles and Containerfiles, image registries, and container build pipelines
• Understanding of LLM evaluation methodology, including accuracy benchmarks such as MMLU, GSM8K, and HellaSwag, as well as inference performance metrics like throughput and latency
• Experience with CI/CD platforms such as GitHub Actions, GitLab CI, Tekton, or Buildkite
• Solid understanding of release engineering practices including reproducible builds, artifact management, dependency pinning, and security scanning
• Experience with infrastructure-as-code tools such as Terraform and Ansible, and managing cloud resources at scale
• Working knowledge of Kubernetes and/or OpenShift for deploying and testing workloads
• Enthusiasm for applying LLM-based agents and AI-assisted tools to automate engineering workflows, with a track record of identifying repetitive processes and replacing them with intelligent automation
• Excellent communication skills, capable of interacting effectively with both technical and non-technical team members

🏖️ Benefits

• Comprehensive medical, dental, and vision coverage
• Flexible Spending Account - healthcare and dependent care
• Health Savings Account - high deductible medical plan
• Retirement 401(k) with employer match
• Paid time off and holidays
• Paid parental leave plans for all new parents
• Leave benefits including disability, paid family medical leave, and paid military leave
• Additional benefits including employee stock purchase plan, family planning reimbursement, tuition reimbursement, transportation expense account, and employee assistance program
