Staff Software Engineer

October 17

Apply Now
Logo of DataRobot

DataRobot

Artificial Intelligence • Enterprise • SaaS

DataRobot is a company that provides an AI platform and applications designed to integrate into core business processes. The company offers Enterprise AI Suite, AI Apps, and AI Platform services, which include Generative AI, Predictive AI, AI Governance, and AI Observability. DataRobot aims to help businesses develop, deliver, and govern AI solutions at scale, supporting industries such as energy, financial services, healthcare, manufacturing, and the public sector. With a focus on maximizing business impact and minimizing risk, DataRobot provides solutions that expedite deployment and secure numerous predictions each day.

📋 Description

• Architect, build, and lead backend services that scale to handle large workloads, high concurrency, and low latency requirements. • Design and implement autoscaling strategies (horizontal/vertical), dynamic resource allocation, and load balancing to ensure responsive, cost-efficient service. • Improve end-to-end request pipelines, optimizing for latency, throughput, reliability, and correctness. • Instrument, monitor, and profile systems in production; identify bottlenecks, troubleshoot performance issues, and proactively tune services. • Collaborate with ML/AI teams to ensure models’ serving pipelines uphold accuracy, consistency, and performance under load. • Drive best practices in systems reliability, observability, error handling, capacity planning, resilience, and failover. • Mentor and coach other engineers; provide technical leadership and influence across teams. • Contribute to defining architecture, coding standards, performance benchmarks, and technical roadmap items related to scalability and performance.

🎯 Requirements

• 7+ years of backend engineering experience building scalable, high-performance distributed systems / services. • Strong experience with performance optimization: e.g. profiling, latency tuning, concurrency, caching strategies. • Deep experience with autoscaling, resource management, load balancing, throughput/latency SLAs. • Solid programming skills in one or more backend languages (e.g. Python, Java, Go, C++, or equivalent). • Strong understanding of observability and monitoring: metrics, tracing, logging; and instrumentation of services. • Ability to solve ambiguous challenges and influence technical direction across teams, balancing performance, accuracy, and cost. • Experience operating across multiple cloud providers (AWS, GCP, Azure) and/or hybrid environments. • Nice to Have: Experience with AI/ML model deployment, serving, inference, and production integration. • Experience with Gen AI / serving LLMs, embeddings, etc. • Exposure to on-prem delivery models or regulated environments. • Experience with Docker and building containerized applications.

🏖️ Benefits

• Medical, Dental & Vision Insurance • Flexible Time Off Program • Paid Holidays • Paid Parental Leave • Global Employee Assistance Program (EAP) and more!

Apply Now

Similar Jobs

October 17

Principal Engineer leading technical architecture to transform raw security data into structured information. Engaging in real-time systems and mentoring engineering talent for cybersecurity at Expel.

AWS

BigQuery

Cloud

Kafka

Python

Go

October 17

Principal Engineer leading technical vision and architecture of Expel's Data Ingest Platform. Owning the development of tools and frameworks for integration engineers' success.

Airflow

Amazon Redshift

AWS

BigQuery

Cloud

Distributed Systems

Postgres

Python

Redis

Go

October 17

Staff Software Engineer providing technical leadership in Ads & Insights at Toast. Developing high-quality software solutions and mentoring engineers while working remotely.

GraphQL

Java

Kotlin

October 17

Cloud Software Architect designing and deploying self-healing AWS services for customers. Collaborating with teams to automate processes and drive cloud architecture excellence.

AWS

Cloud

DynamoDB

Java

JavaScript

Microservices

Node.js

Python

Ruby

Swift

Go

October 17

Generalist Full-Stack Staff Product Engineer developing a new privacy-centric AI assistant at MZLA Technologies Corporation. Collaborating with a small, distributed team focused on innovative, user-centric solutions.

React

TypeScript

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com