
1 - 10 employees
Founded 2024
🤖 Artificial Intelligence
☁️ SaaS
Artificial Intelligence • Cloud Computing • SaaS
Yotta Labs is building the DeOS for AI optimization and orchestration at planet scale. The company provides a high-performance framework that aggregates geo-distributed GPUs and delivers high throughput on heterogeneous compute resources. Yotta Labs focuses on making AI training and inference affordable and accessible across a spectrum of GPUs, from commodity to high-end. Its platform supports large language models (LLMs) and lets users fine-tune AI applications and deploy secure AI agents in the cloud with optimized API endpoints.
🔥 0 minutes ago

• Design and implement high-performance kernels for Attention, MoE, GEMM, collective communication, and quantization.
• Optimize kernels for NVIDIA, AMD, and AWS Trainium.
• Develop custom operators and graph optimizations using Neuron SDK, PyTorch/XLA, Torch Dynamo, and Neuron Compiler.
• Improve performance of vLLM, SGLang, TensorRT-LLM, and custom inference runtimes.
• Design scalable distributed training and inference solutions across thousands of accelerators.
• Contribute to open-source projects, publish technical findings and engage with the developer community.
• Proficiency in AI programming languages such as Python and C++
• Deep understanding of GPU architecture and performance optimization
• Experience with CUDA, Triton, ROCm/HIP, or AWS Neuron
• Strong understanding of AI frameworks (e.g., PyTorch, Dynamo, LMCache), model architectures and profiling tools (e.g. Nsight, ROCm Profiler, or Neuron Profiler)
• Strong problem-solving skills and the ability to work in a collaborative, remote environment
• A background in computer science, engineering, or a related field is preferred
• Competitive compensation with equity
• Flexible, remote work environment that values innovation and autonomy
🔥 13 hours ago
AI Backend Engineer building backend systems that serve AI-powered insurance workflows. Collaborate in a global engineering team working remotely to enhance production systems.
🕒 Yesterday
Exposure Intelligence Analyst managing infrastructure security challenges at Allstate. Identifying and validating exposure risks across servers, containers, and OS platforms with a focus on mitigation.
🇺🇸 United States – Remote
💵 $100k - $170.5k / year
💰 Post-IPO Equity on 2014-01
⏰ Full Time
🟡 Mid-level
🟠 Senior
⚙️ Systems Engineer
🦅 H1B Visa Sponsor
🕒 Yesterday
Business Systems Engineer translating business requirements into technical specifications. Supporting e-commerce platform and engaging with IT and business partners for effective solutions.
🇺🇸 United States – Remote
💵 $70k - $90k / year
⏰ Full Time
🟡 Mid-level
🟠 Senior
⚙️ Systems Engineer
🦅 H1B Visa Sponsor
🕒 Yesterday
IT Systems Implementation Engineer serving as technical delivery owner on integration and AI projects. Leading discovery sessions, managing implementation, and collaborating with Subscriber IT teams.
🕒 Yesterday
People Systems Analyst managing HR technology operations at Chainlink Labs. Supporting systems, documentation, and continuous improvement within a decentralized finance environment.
Web3