AI Engineer

đŸ”„ 12 minutes ago

In Tandem

51 - 200 employees

đŸ‘„ B2C

☁ SaaS

⚡ Productivity

B2C ‱ SaaS ‱ Productivity

In Tandem is a global technology platform that builds digital tools and apps to support families through key stages and daily life. The company develops and operates consumer-focused family solutions, including co-parenting, family organization, communication, and parenting-schedule apps, that aim to improve connection, coordination, and peace of mind for modern families. In Tandem's products are designed to simplify routines, support co-parenting and family communication, and provide resources during challenging times.

📋 Description

‱ Run and optimize our self-hosted inference stack
‱ Run the inference serving layer on our own GPU hardware: choose and tune the serving stack (vLLM, SGLang, TensorRT-LLM) for high throughput and low latency.
‱ Optimize aggressively: tensor parallelism, quantization (FP8, AWQ, GPTQ), KV-cache and prefix caching, continuous batching, speculative decoding, concurrency tuning.
‱ Serve multiple models and features off shared hardware: multi-LoRA, routing, and request scheduling that balances internal workloads against latency-sensitive product traffic.
‱ Make our AI workloads efficient: improve latency, throughput, and GPU utilization so we get the most out of what we run.
‱ Build the visibility: instrument performance and usage across our AI surfaces so there's clear data on how everything is running.
‱ Surface the technical tradeoffs (performance, latency, efficiency) so the people making the calls have what they need to make them.
‱ Ship the in-app agent layer that helps families coordinate: proactive nudges, smart suggestions, agents that summarize, draft, schedule, and act for busy parents.
‱ Build the substrate underneath: tools, memory, orchestration, guardrails, and evaluation harnesses, integrated cleanly with production APIs alongside our architecture team.
‱ Work in nimble pairs with feature owners, standing up whatever's needed to test an idea, including a vibe-coded UI when that's the fastest path to a real customer. Ship rough, learn fast, harden what works.

🎯 Requirements

‱ 5+ years shipping production software, including meaningful applied AI or ML work.
‱ Demonstrated experience running and optimizing self-hosted LLMs on dedicated multi-GPU hardware: a serving stack (vLLM, SGLang, or TensorRT-LLM) and the optimization that comes with it (tensor parallelism, quantization, batching, KV cache).
‱ A track record of optimizing inference performance and efficiency (latency, throughput, GPU utilization).
‱ Strong Python and engineering fundamentals, with the full-stack range to stand up a quick UI and a genuine desire to work on app-layer features, not only infra.
‱ Hands-on with agent frameworks (Claude Agent SDK, LangGraph, or similar), LLM APIs, embeddings, and RAG.
‱ Comfortable with AWS and the devops this role owns: Docker, CI/CD, monitoring, and observability.
‱ Experience building internal tooling or platforms others depend on. Bonus for Slack apps, MCP, or agent orchestration at team scale.

đŸ–ïž Benefits

‱ Medical: In Tandem pays 100% of the premium for employees AND 99% for all additional family members
‱ 401k: Up to a 4% match with immediate vesting
‱ Paid leave for all new parents
‱ Learning & Development stipend for employees
‱ Paid Time Off: 11 Holidays + Winter Break (3 Days) + Volunteer Time Off (1 Day) + Floating Holiday (1 Day)
‱ Personal Time Off: 15 days for 0-1 years of employment, 20 days for 1-3 years of employment
‱ Supportive and flexible working environment – work from anywhere!
