Senior Software Engineer – AI Eval & Safety

10 hours ago

Apply Now
Logo of Red Hat

Red Hat

Enterprise • Cloud

Red Hat is a leading provider of enterprise open source software solutions, helping companies worldwide to build and deploy applications across hybrid cloud infrastructures. With a strong focus on developing secure, stable, and innovative technologies, Red Hat offers a broad portfolio including products like Red Hat Enterprise Linux, Red Hat OpenShift, and Red Hat Ansible Automation Platform. These products support IT services on any infrastructure efficiently. Trusted by more than 90% of the U. S. Fortune 500, Red Hat empowers organizations to modernize their IT environments, leveraging open source communities to drive technological advancement.

10,000+ employees

Founded 1993

🏢 Enterprise

💰 Corporate Round on 1999-03

📋 Description

• Lead the architecture and implementation of MLOps/LLMOps systems within OpenShift AI, establishing best practices for scalability, reliability, and maintainability while actively contributing to relevant open source communities • Design and develop robust, production-grade features focused on AI trustworthiness, including model monitoring, bias detection, and explainability frameworks that integrate seamlessly with OpenShift AI • Drive technical decision-making around system architecture, technology selection, and implementation strategies for key MLOps components, with a focus on open source technologies like KServe and TrustyAI • Define and implement technical standards for model deployment, monitoring, and validation pipelines, while mentoring team members on MLOps best practices and engineering excellence • Collaborate with product management to translate customer requirements into technical specifications, architect solutions that address scalability and performance challenges, and provide technical leadership in customer-facing discussions • Lead code reviews, architectural reviews, and technical documentation efforts to ensure high code quality and maintainable systems across distributed engineering teams • Identify and resolve complex technical challenges in production environments, particularly around model serving, scaling, and reliability in enterprise Kubernetes deployments • Partner with cross-functional teams to establish technical roadmaps, evaluate build-vs-buy decisions, and ensure alignment between engineering capabilities and product vision • Provide technical mentorship to team members, including code review feedback, architecture guidance, and career development support while fostering a culture of engineering excellence

🎯 Requirements

• 5+ years of software engineering experience, with at least 4 years focusing on ML/AI systems in production environments • Strong expertise in Python, with demonstrated experience building and deploying production ML systems • Deep understanding of Kubernetes and container orchestration, particularly in ML workload contexts • Extensive experience with MLOps tools and frameworks (e.g., KServe, Kubeflow, MLflow, or similar) • Track record of technical leadership in open source projects, including significant contributions and community engagement • Proven experience architecting and implementing large-scale distributed systems • Strong background in software engineering best practices, including CI/CD, testing, and monitoring • Experience mentoring engineers and driving technical decisions in a team environment

🏖️ Benefits

• Comprehensive medical, dental, and vision coverage • Flexible Spending Account - healthcare and dependent care • Health Savings Account - high deductible medical plan • Retirement 401(k) with employer match • Paid time off and holidays • Paid parental leave plans for all new parents • Leave benefits including disability, paid family medical leave, and paid military leave • Additional benefits including employee stock purchase plan, family planning reimbursement, tuition reimbursement, transportation expense account, employee assistance program, and more!

Apply Now

Similar Jobs

11 hours ago

Senior Software Engineer designing scalable data infrastructure and backend systems for healthcare-oriented public benefit goals. Collaborating with AI and engineering teams to support transparent analytics and knowledge initiatives.

AWS

Azure

Cloud

Python

TypeScript

Go

11 hours ago

Full Stack Engineer designing and supporting core product features at Calendly. Contributing to platform growth and collaborating across teams to enhance product quality.

Microservices

Node.js

Ruby on Rails

TypeScript

11 hours ago

Full Stack Engineer developing and scaling features at Calendly. Collaborating with cross-functional teams to enhance product and customer experience.

JavaScript

Microservices

Node.js

React

TypeScript

12 hours ago

Native iOS Tech Lead overseeing the development of cutting-edge iOS applications. Guiding developers in architectural decisions and ensuring high-quality, maintainable solutions.

iOS

Objective-C

Swift

13 hours ago

Product Engineer building fullstack applications for partners in blockchain and fintech. Collaborating with design partners to create customer applications and integrations.

Rust

Solidity

TypeScript

Go

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com