Lead Member of Technical Staff, Inference Infrastructure

🕒 vor 1 Monat

🏄 California – Remote

info

⏰ Vollzeit

🟠 Senior

🖥 Softwareentwickler

🦅 H1B-Visum-Sponsor

info

🗣️🇺🇸🇬🇧 Englisch erforderlich

Jetzt Bewerben
Ähnliche Remote-Jobs finden

📊 Überprüfen Sie Ihre Lebenslauf-Bewertung für diese Stelle

Verbessern Sie Ihre Chancen auf ein Vorstellungsgespräch, indem Sie Ihre Lebenslauf-Bewertung vor der Bewerbung überprüfen.

Logo of Cohere

Cohere

11 - 50 Mitarbeiter

🤖 Künstliche Intelligenz

🏢 Unternehmen

☁️ SaaS

Artificial Intelligence • Enterprise • SaaS

Cohere ist eine führende KI-Plattform, die Unternehmen fortschrittliche Sprachmodelle und einen integrierten Arbeitsbereich bietet, der auf Effizienz und Sicherheit ausgelegt ist. Mit einer Reihe von leistungsstarken generativen und Retrieval-Modellen ermöglicht Cohere Organisationen die Optimierung von Arbeitsabläufen, die Verbesserung der Datensicherheit und das Erschließen von Erkenntnissen über verschiedene Branchen hinweg durch mehrsprachige Fähigkeiten. Ihr Fokus auf maßgeschneiderte KI-Lösungen gewährleistet den Schutz kritischer Daten und erleichtert die nahtlose Integration in bestehende organisatorische Prozesse.

Beschreibung

• Join the Model Serving team at Cohere and provide technical leadership across multiple teams • Drive the architecture and strategy for deploying optimized NLP models to production in low latency, high throughput, and high availability environments • Serve as a key point of contact for customers, leading the design of customized deployments to meet their specific needs • Mentor engineers to raise the technical bar across the team

🎯 Anforderungen

• 8+ years of engineering experience running production infrastructure at a large scale, with a track record of technical leadership • Demonstrated experience leading the architecture and design of large, highly available distributed systems with Kubernetes and GPU workloads on those clusters • Deep expertise with Kubernetes dev and production coding and support, including setting team-wide standards and best practices • Extensive experience across GCP, Azure, AWS, OCI, and multi-cloud on-prem / hybrid serving environments, with the ability to guide strategic infrastructure decisions • Proven ability to lead the design, deployment, support, and troubleshooting of complex Linux-based computing environments at scale • Experience owning compute/storage/network resource and cost management at an organisational level, including optimisation strategies • Exceptional collaboration and communication skills, with experience mentoring engineers and leading cross-functional initiatives to build mission-critical systems • The grit and adaptability to both solve and guide others through complex technical challenges that evolve day to day • Strong expertise in the computational characteristics of accelerators (GPUs, TPUs, and/or custom accelerators), and how to leverage them to drive latency and throughput improvements at scale • Deep knowledge of distributed systems, with experience establishing patterns and practices across engineering teams • Proficiency in Golang, C++ or other languages designed for high-performance scalable servers, with the ability to set coding standards and conduct senior-level technical reviews

🏖️ Vorteile

• An open and inclusive culture and work environment • Work closely with a team on the cutting edge of AI research • Weekly lunch stipend, in-office lunches & snacks • Full health and dental benefits, including a separate budget to take care of your mental health • 100% Parental Leave top-up for up to 6 months • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend • 6 weeks of vacation (30 working days!)

Jetzt Bewerben

Ähnliche Jobs

🕒 vor 1 Monat

Husch Blackwell

1001 - 5000

🤝 B2B

📋 Compliance

🏢 Unternehmen

Lead Developer serving as a technical lead responsible for enterprise application features at Husch Blackwell. Overseeing teamwork, mentoring developers, and maintaining high-performing enterprise systems.

🗣️🇺🇸🇬🇧 Englisch erforderlich

🕒 vor 1 Monat

Kodex

11 - 50

📋 Compliance

🔒 Cybersecurity

💳 Fintech

Engineering Manager leading Core Portal team at Kodex, enhancing secure data exchange processes for organizations. Drives AI proficiency and team development in a remote startup environment.

🇺🇸 Vereinigte Staaten – Remote

💵 $170.000 - $220.000 / Jahr

💰 Venture Round im 2022-10

⏰ Vollzeit

🟡 Mittelstufe

🟠 Senior

🖥 Softwareentwickler

🗣️🇺🇸🇬🇧 Englisch erforderlich

🕒 vor 1 Monat

PeerIslands

11 - 50

🤖 Künstliche Intelligenz

☁️ SaaS

🏢 Unternehmen

Polyglot Developer working with Python, RAG architectures and LLM integrations. Focused on document ingestion and real-time responses in a remote setting.

🇺🇸 Vereinigte Staaten – Remote

⏰ Vollzeit

🟡 Mittelstufe

🟠 Senior

🖥 Softwareentwickler

🗣️🇺🇸🇬🇧 Englisch erforderlich

🕒 vor 1 Monat

PeerIslands

11 - 50

🤖 Künstliche Intelligenz

☁️ SaaS

🏢 Unternehmen

Backend Developer maintaining backend systems and collaborating on AI technology projects in a remote setting. Requires 5-10 years experience in software development with multiple programming languages.

🇺🇸 Vereinigte Staaten – Remote

⏰ Vollzeit

🟡 Mittelstufe

🟠 Senior

🖥 Softwareentwickler

🗣️🇺🇸🇬🇧 Englisch erforderlich

🕒 vor 1 Monat

Vytwo Technologies Inc

201 - 500

🤝 B2B

🏢 Unternehmen

🎯 Rekrutierung

Alteryx Workflow Developer developing and maintaining complex workflows using IBM Unica software for M&R Workflow Programming Team. Collaborating with business partners in campaign communications design.

🇺🇸 Vereinigte Staaten – Remote

⏰ Vollzeit

🟡 Mittelstufe

🟠 Senior

🖥 Softwareentwickler

🗣️🇺🇸🇬🇧 Englisch erforderlich