AI Engineer

51 - 200 employees

👥 B2C

☁️ SaaS

⚡ Productivity

In Tandem is a global technology platform that builds digital tools and apps to support families through key stages and daily life. The company develops and operates consumer-focused family solutions—including co-parenting, family organization, communication and parenting-schedule apps—that aim to improve connection, coordination, and peace of mind for modern families. In Tandem’s products are designed to simplify routines, support co-parenting and family communication, and provide resources during challenging times.

AI Engineer

🕒 June 16

❄️ Minnesota – Remote

💵 $100k - $135k / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

🤖 AI Engineer

AWS

Docker

Python

In Tandem

📋 Description

Run and optimize our self-hosted inference stack

• Run the inference serving layer on our own GPU hardware: choose and tune the serving stack (vLLM, SGLang, TensorRT-LLM) for high throughput and low latency.
• Optimize aggressively: tensor parallelism, quantization (FP8, AWQ, GPTQ), KV-cache and prefix caching, continuous batching, speculative decoding, concurrency tuning.
• Serve multiple models and features off shared hardware: multi-LoRA, routing, and request scheduling that balances internal workloads against latency-sensitive product traffic.

Keep our AI fast, efficient, and observable

• Make our AI workloads efficient: improve latency, throughput, and GPU utilization so we get the most out of what we run.
• Build the visibility: instrument performance and usage across our AI surfaces so there's clear data on how everything is running.
• Surface the technical tradeoffs (performance, latency, efficiency) so the people making the calls have what they need to make them.

Build AI features and proactive agents

• Ship the in-app agent layer that helps families coordinate: proactive nudges, smart suggestions, agents that summarize, draft, schedule, and act for busy parents.
• Build the substrate underneath: tools, memory, orchestration, guardrails, and evaluation harnesses, integrated cleanly with production APIs alongside our architecture team.
• Work in nimble pairs with feature owners, standing up whatever's needed to test an idea, including a vibe-coded UI when that's the fastest path to a real customer. Ship rough, learn fast, harden what works.

🎯 Requirements

• 5+ years shipping production software, including meaningful applied AI or ML work.
• Demonstrated experience running and optimizing self-hosted LLMs on dedicated multi-GPU hardware: a serving stack (vLLM, SGLang, or TensorRT-LLM) and the optimization that comes with it (tensor parallelism, quantization, batching, KV cache).
• A track record of optimizing inference performance and efficiency (latency, throughput, GPU utilization).
• Strong Python and engineering fundamentals, with the full-stack range to stand up a quick UI, and a genuine desire to work on app-layer features and not only infra.
• Hands-on with agent frameworks (Claude Agent SDK, LangGraph, or similar), LLM APIs, embeddings, and RAG.
• Comfortable with AWS and the DevOps this role owns: Docker, CI/CD, monitoring, and observability.
• Experience building internal tooling or platforms others depend on. Bonus for Slack apps, MCP, or agent orchestration at team scale.

🏖️ Benefits

• Medical: In Tandem pays 100% of the premium for employees and 99% for all additional family members
• 401k: Up to a 4% match with immediate vesting
• Paid leave for all new parents
• Learning & Development stipend for employees
• Paid Time Off: 11 Holidays + Winter Break (3 Days) + Volunteer Time Off (1 Day) + Floating Holiday (1 Day)
• Personal Time Off: 15 days for 0-1 years of employment, 20 days for 1-3 years of employment
• Supportive and flexible working environment – work from anywhere!
