Lead Member of Technical Staff, Inference Infrastructure

🕒 April 28

🏄 California – Remote

⏰ Full-time

🟠 Senior

🖥 Software Engineer

🦅 Sponsors H1B Visa

🗣️🇺🇸🇬🇧 English required

Cohere

11 - 50 employees

🤖 Artificial Intelligence

🏢 Enterprise

☁️ SaaS

Cohere is a leading AI platform that provides enterprises with advanced language models and an integrated workspace designed for efficiency and security. With a family of high-performance generative and retrieval models, Cohere enables organizations to streamline workflows, improve data security, and uncover insights across industries through multilingual capabilities. Its focus on customized AI solutions ensures the protection of critical data while enabling seamless integration into existing organizational processes.

Description

• Join the Model Serving team at Cohere and provide technical leadership across multiple teams
• Drive the architecture and strategy for deploying optimized NLP models to production in low latency, high throughput, and high availability environments
• Serve as a key point of contact for customers, leading the design of customized deployments to meet their specific needs
• Mentor engineers to raise the technical bar across the team

🎯 Requirements

• 8+ years of engineering experience running production infrastructure at a large scale, with a track record of technical leadership
• Demonstrated experience leading the architecture and design of large, highly available distributed systems with Kubernetes and GPU workloads on those clusters
• Deep expertise with Kubernetes dev and production coding and support, including setting team-wide standards and best practices
• Extensive experience across GCP, Azure, AWS, OCI, and multi-cloud on-prem / hybrid serving environments, with the ability to guide strategic infrastructure decisions
• Proven ability to lead the design, deployment, support, and troubleshooting of complex Linux-based computing environments at scale
• Experience owning compute/storage/network resource and cost management at an organisational level, including optimisation strategies
• Exceptional collaboration and communication skills, with experience mentoring engineers and leading cross-functional initiatives to build mission-critical systems
• The grit and adaptability to both solve and guide others through complex technical challenges that evolve day to day
• Strong expertise in the computational characteristics of accelerators (GPUs, TPUs, and/or custom accelerators), and how to leverage them to drive latency and throughput improvements at scale
• Deep knowledge of distributed systems, with experience establishing patterns and practices across engineering teams
• Proficiency in Golang, C++ or other languages designed for high-performance scalable servers, with the ability to set coding standards and conduct senior-level technical reviews

🏖️ Benefits

• An open and inclusive culture and work environment
• Work closely with a team on the cutting edge of AI research
• Weekly lunch stipend, in-office lunches & snacks
• Full health and dental benefits, including a separate budget to take care of your mental health
• 100% Parental Leave top-up for up to 6 months
• Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
• Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
• 6 weeks of vacation (30 working days!)

Similar Jobs

🕒 April 28

Husch Blackwell

1001 - 5000

🤝 B2B

📋 Compliance

🏢 Enterprise

Lead Developer serving as a technical lead responsible for enterprise application features at Husch Blackwell. Overseeing teamwork, mentoring developers, and maintaining high-performing enterprise systems.

🇺🇸 United States – Remote (USA)

💵 $105,000 - $237,000 / year

⏰ Full-time

🟠 Senior

🖥 Software Engineer

🦅 Sponsors H1B Visa

🗣️🇺🇸🇬🇧 English required

🕒 April 28

Lupus Consulting

201 - 500

☁️ SaaS

🏢 Enterprise

SAP Developer working on complex developments in SAP and S/4HANA environment. Collaborating with agile teams to design, develop, and deliver technical solutions following best practices.

🇺🇸 United States – Remote (USA)

⏰ Full-time

🟡 Mid-level

🟠 Senior

🖥 Software Engineer

🗣️🇺🇸🇬🇧 English required

🗣️🇩🇪 German required

🕒 April 27

Kodex

11 - 50

📋 Compliance

🔒 Cybersecurity

💳 Fintech

Engineering Manager leading Core Portal team at Kodex, enhancing secure data exchange processes for organizations. Drives AI proficiency and team development in a remote startup environment.

🇺🇸 United States – Remote (USA)

💵 $170,000 - $220,000 / year

💰 Venture Round in 2022-10

⏰ Full-time

🟡 Mid-level

🟠 Senior

🖥 Software Engineer

🗣️🇺🇸🇬🇧 English required

🕒 April 25

PeerIslands

11 - 50

🤖 Artificial Intelligence

☁️ SaaS

🏢 Enterprise

Backend Developer maintaining backend systems and collaborating on AI technology projects in a remote setting. Requires 5-10 years experience in software development with multiple programming languages.

🇺🇸 United States – Remote (USA)

⏰ Full-time

🟡 Mid-level

🟠 Senior

🖥 Software Engineer

🗣️🇺🇸🇬🇧 English required

🕒 April 25

PeerIslands

11 - 50

🤖 Artificial Intelligence

☁️ SaaS

🏢 Enterprise

Polyglot Developer working with Python, RAG architectures and LLM integrations. Focused on document ingestion and real-time responses in a remote setting.

🇺🇸 United States – Remote (USA)

⏰ Full-time

🟡 Mid-level

🟠 Senior

🖥 Software Engineer

🗣️🇺🇸🇬🇧 English required