Senior AI Platform Engineer, Core Cloud Engineering

🕒 April 29

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Vultr

Vultr

201 - 500 employees

Founded 2014

🤖 Artificial Intelligence

🤝 B2B

🔧 Hardware

🔥 Funding within the last year

💰 $329M Debt Financing - Vultr on 2025-06

Artificial Intelligence • B2B • Hardware

Vultr is a global cloud infrastructure provider offering on-demand virtual machines, bare-metal servers, GPU-accelerated instances, managed databases, object and block storage, Kubernetes, and networking services. The platform emphasizes AI and HPC workloads with a broad selection of AMD and NVIDIA GPUs, fast networking, and 32+ data center regions, plus a marketplace of deployable apps and developer-friendly APIs. Vultr targets developers and businesses seeking affordable, scalable, and compliant cloud compute and storage alternatives to hyperscalers.

📋 Description

• Evaluate and curate open-source models — Llama, Mistral, Qwen, DeepSeek, Kimi, and others — for fit across engineering use cases including code generation, review, test writing, and summarization. • Build and maintain MCP (Model Context Protocol) servers that expose internal context — codebases, runbooks, incident history, architecture docs, development environments, and testing suites — to AI assistants and coding agents. • Integrate AI capabilities directly into GitLab CI/CD pipelines: automated code review, test generation, changelog drafting, PR summarization, and anomaly detection in build output. • Own the model lifecycle: versioning, A/B routing, quantization tradeoffs, and performance benchmarking under real engineering workloads. • Drive AI adoption across the software engineering organization — identify high-leverage workflows, instrument usage, and iterate based on real data on time-savings and quality impact. • Build and configure IDE tooling integrations — Cursor, Continue, and Copilot alternatives — backed by internal inference endpoints, keeping code off third-party APIs wherever possible. • Produce documentation, internal workshops, and working examples that help engineers go from AI-curious to AI-reliant — including a shared library of prompts, system instructions, and RAG pipelines tuned for Vultr’s stack. • Collaborate closely with Software Engineers, SREs, and Network Engineers to ensure the AI platform layer serves all teams without becoming a bottleneck or single point of failure.

🎯 Requirements

• Hands-on experience deploying and operating LLM inference systems — vLLM, SGLang, TGI, or comparable — at non-trivial scale. • Strong Docker and container skills; comfortable owning the full container lifecycle from image build to production. • Deep familiarity with GitLab CI/CD — pipeline authoring, custom runners, artifact management, and integrating external tooling. • Working knowledge of MCP or similar context-injection patterns for grounding LLMs against private or internal data. • Demonstrated ability to evaluate open-source models for specific task fit — not just benchmarks, but real use-case performance against internal workloads. • Strong software engineering fundamentals — this role writes real code, not just configuration. • Experience with RAG pipelines — vector databases, chunking strategies, retrieval evaluation — especially over code or technical documentation. • GPU infrastructure familiarity — CUDA basics, multi-GPU serving, memory management under inference load. • Ability to communicate technical tradeoffs clearly to engineers, managers, and leadership; track record of moving organizations toward new practices.

🏖️ Benefits

• 100% company-paid insurance premiums for employee medical, dental and vision plans. • 401(k) plan that matches 100% up to 4%, with immediate vesting • Professional Development Reimbursement of $2,500 each year • 11 Holidays + Paid Time Off Accrual + Rollover Plan • Commitment matters to Vultr! Increased PTO at 3 year and 10 year anniversary + 1 month paid sabbatical every 5 years + Anniversary Bonus each year • $500 stipend for remote office setup in first year + $400 each following year • Internet reimbursement up to $75 per month • Gym membership reimbursement up to $50 per month • Company paid Wellable subscription

Apply Now

Similar Jobs

🕒 April 29

Veeam Software

1001 - 5000

☁️ SaaS

🔒 Cybersecurity

🏢 Enterprise

Senior Product Advisor leading cloud integrations with AWS and Azure at Veeam. Collaborating with teams to refine product development and enhance data management solutions.

🕒 April 29

Gartner

10,000+ employees

🏢 Enterprise

Senior Director Analyst developing thought-provoking insights in cloud-native security for Gartner. Engaging with clients to address complex challenges and drive strategies forward.

🕒 April 29

Presidio

1001 - 5000

🤝 B2B

🤖 Artificial Intelligence

🔒 Cybersecurity

Cloud Infrastructure Architect at Presidio designing cloud solutions and driving business innovation through cloud technologies. Collaborating with clients to enhance efficiency and competitiveness in the digital landscape.

🕒 April 29

Presidio

1001 - 5000

🤖 Artificial Intelligence

🔒 Cybersecurity

🏢 Enterprise

Cloud Infrastructure Architect designing scalable cloud architectures at Presidio. Collaborating with stakeholders and cross-functional teams to optimize cloud efficiency and strategy implementation.

🕒 April 29

Niobium Microsystems

11 - 50

🔒 Cybersecurity

🔧 Hardware

🤖 Artificial Intelligence

Frontend Engineer responsible for building and managing the Niobium Cloud Console for encrypted data computation. Working on a new cloud team to create intuitive web applications for sensitive data without compromising privacy.