Cerebras Systems

201 - 500 employees

Founded 2016

🤖 Artificial Intelligence

🔧 Hardware

⚕️ Healthcare Insurance

Cerebras Systems develops advanced AI hardware, specifically the Cerebras Wafer Scale Engine, which the company says outperforms traditional GPU setups in AI inference. Its technology enables organizations like Mayo Clinic and AlphaSense to run state-of-the-art AI models with high speed and efficiency. Cerebras offers flexible deployment options, including cloud and on-premises solutions, for teams across various industries.

Senior Deployment Engineer, AI Inference

🕒 October 14, 2025

🇨🇦 Canada – Remote

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

AWS

Docker

Grafana

Kubernetes

Linux

Prometheus

Python

📋 Description

• Deploy AI inference replicas and cluster software across multiple datacenters.
• Operate across heterogeneous datacenter environments undergoing rapid 10x growth.
• Maximize capacity allocation and optimize replica placement using constraint-solver algorithms.
• Operate bare-metal inference infrastructure while supporting transition to K8S-based platform.
• Develop and extend telemetry, observability and alerting solutions to ensure deployment reliability at scale.
• Develop and extend a fully automated deployment pipeline to support fast software updates and capacity reallocation at scale.
• Translate technical and customer needs into actionable requirements for the Dev Infra, Cluster, Platform and Core teams.
• Stay up to date with the latest advancements in AI compute infrastructure and related technologies.

🎯 Requirements

• 5-7 years of experience in operating on-prem compute infrastructure (ideally in Machine Learning or High-Performance Compute) or developing and managing complex AWS plane infrastructure for hybrid deployments.
• Strong proficiency in Python for automation, orchestration, and deployment tooling.
• Solid understanding of Linux-based systems and command-line tools.
• Extensive knowledge of Docker containers and container orchestration platforms like K8S.
• Familiarity with spine-leaf (Clos) networking architecture.
• Proficiency with telemetry and observability stacks such as Prometheus, InfluxDB and Grafana.
• Strong ownership mindset and accountability for complex deployments.
• Ability to work effectively in a fast-paced environment.

🏖️ Benefits

• Build a breakthrough AI platform beyond the constraints of the GPU.
• Publish and open-source cutting-edge AI research.
• Work on one of the fastest AI supercomputers in the world.
• Enjoy job stability with startup vitality.
• Work in a simple, non-corporate culture that respects individual beliefs.

Similar Jobs

Senior Deployment Engineer – CAD

🕒 October 7, 2025

Atolio

11 - 50

🤖 Artificial Intelligence

🏢 Enterprise

☁️ SaaS

Deployment Engineer working with engineering and client success teams at Atolio. Ensure efficient deployment of enterprise search platform in various environments.

🇨🇦 Canada – Remote

💵 CA$150k - CA$200k / year

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

AWS

Azure

Cloud

Google Cloud Platform

Grafana

Kubernetes

Python

ServiceNow

Splunk

Terraform

DevOps Engineer

🕒 September 19, 2025

Veeva Systems

1001 - 5000

☁️ SaaS

⚕️ Healthcare Insurance

💊 Pharmaceuticals

DevOps Engineer building scalable cloud and CI/CD infrastructure for Veeva Systems' life sciences SaaS. Focus on IaC, automation, Kubernetes, Terraform, and reliability.

🇨🇦 Canada – Remote

💵 CA$85k - CA$225k / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

Ansible

AWS

Cloud

Distributed Systems

Docker

Java

Jenkins

Kubernetes

OpenShift

Python

Scala

Terraform

DevOps Engineer

🕒 September 16, 2025

Veeva Systems

1001 - 5000

☁️ SaaS

⚕️ Healthcare Insurance

💊 Pharmaceuticals

DevOps Engineer building scalable AWS infrastructure, CI/CD, and containerized deployments for Veeva's life sciences cloud; focuses on automation, reliability, and mentorship.

🇨🇦 Canada – Remote

💵 $85k - $225k / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

Ansible

AWS

Cloud

Distributed Systems

Docker

Java

Jenkins

Kubernetes

OpenShift

Python

Scala

SQL

Terraform

DevOps Engineer

🕒 September 10, 2025

Veeva Systems

1001 - 5000

☁️ SaaS

⚕️ Healthcare Insurance

💊 Pharmaceuticals

DevOps Engineer building scalable cloud infrastructure at Veeva Systems. Ensuring reliable, automated delivery of SaaS products for life sciences customers.

🇨🇦 Canada – Remote

💵 $85k - $225k / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

Ansible

AWS

Cloud

Distributed Systems

Docker

Java