Staff Software Engineer, AI Runtime

Job not on LinkedIn

October 22

Apply Now
Logo of Apollo GraphQL

Apollo GraphQL

API • Enterprise • SaaS

Apollo GraphQL is a company that offers a comprehensive platform for GraphQL development, enabling developers to create powerful, context-rich APIs. The platform, known as Apollo GraphOS, facilitates seamless app development by unifying APIs into a declarative query-based system, enhancing developer efficiency and offering real-time data access. Apollo's solutions support enterprise-level implementations, providing tools for backend teams to create a composable and self-service data graph that powers diverse applications with consistent and performant user experiences. With a focus on scalability, security, and rapid AI-driven iteration, Apollo GraphQL is trusted by industry leaders such as Netflix, PayPal, and Warner Brothers. The company is committed to improving collaboration across development teams and streamlining the delivery of feature improvements.

📋 Description

• Architect and scale an enterprise AI/MCP Server and Gateway that powers multi-agent workflows across Apollo, including routing, orchestration, and integration boundaries. • Design and implement robust server infrastructure to ensure reliability, performance, and security at scale. • Build and maintain tools for agent discovery, communication, and coordination. • Define deployment strategies and runtime optimizations to maximize efficiency and minimize operational overhead. • Develop frameworks and patterns that enable seamless multi-agent collaboration and AI-driven orchestration. • Integrate observability, logging, and monitoring for full visibility into server and agent behavior. • Explore and implement AI-enhanced developer workflows to optimize orchestration and agent interactions. • Collaborate with teams across Apollo to ensure the MCP Server meets evolving product and developer needs.

🎯 Requirements

• Expertise in agent-to-tool orchestration, routing, and coordination in scalable, fault-tolerant systems • Deep expertise in Rust programming language • Strong background in distributed systems, server architecture, and high-performance backend development • Proven experience with protocol design, message routing, and server-side orchestration frameworks • Experience building and maintaining robust runtime infrastructure that supports AI-driven workflows and enables reliable agent-to-tool interactions • Proven experience with protocol design, message routing, and building server-side frameworks that enable scalable, reliable multi-tool agent workflows • Hands-on experience with observability, monitoring, and debugging frameworks for complex systems • Passion for clean, maintainable code, high system reliability, and scalable architecture • Experience in strategic system design, making architectural trade-offs, and planning for long-term scalability and maintainability • Strong technical leadership and mentorship, including guiding junior engineers and driving engineering best practices across teams • Ability to influence cross-team architecture decisions and align engineering efforts with product and business objectives • Production ownership experience: leading incident response, debugging, and performance optimization in high-impact backend systems

🏖️ Benefits

• 3 Anthem Blue Cross medical plans • 2 Kaiser medical plans (California residents) • Dental benefits provided by Sun Life Financial • Vision benefits provided by Sun Life Financial

Apply Now

Similar Jobs

October 22

Staff Engineer for HungerRush developing cloud POS systems. Collaborating on AI technology and team mentoring in software development.

Angular

Azure

Bootstrap

Cloud

Distributed Systems

JavaScript

Redis

SQL

Vue.js

October 22

Lead architecture, design, and implementation of data processing engine at Datapelago utilizing accelerated computing. Collaborate across teams to ensure high-performance and reliability in production environments.

Apache

Linux

Rust

Spark

October 22

Full stack software engineer for logistics at Courtyard.io, helping scale operations and improve tooling. Collaborating with the engineering team to enhance logistics and supply chain processes.

Vault

October 21

Principal Engineer leading CRM & MarTech integrations for Stitch Fix. Designing solutions to enhance client conversion, engagement, and retention.

AWS

EC2

October 21

PPE Specialist at NFPA contributing to safety standards development for emergency services. Focusing on personal protective equipment and guiding organizational initiatives.

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com