
201 - 500 employees
Founded 2016
Artificial Intelligence • Hardware • Healthcare Insurance
Cerebras Systems is a pioneering company that focuses on developing advanced AI hardware, specifically the Cerebras Wafer Scale Engine, which delivers unparalleled performance in AI inference, outperforming traditional GPU setups. Their cutting-edge technology enables organizations like Mayo Clinic and AlphaSense to run state-of-the-art AI models with remarkable speed and efficiency. With flexible deployment options including cloud and on-premises solutions, Cerebras is transforming the landscape of AI capabilities for innovative teams across various industries.
October 14, 2025
• Deploy AI inference replicas and cluster software across multiple datacenters.
• Operate across heterogeneous datacenter environments undergoing rapid 10x growth.
• Maximize capacity allocation and optimize replica placement using constraint-solver algorithms.
• Operate bare-metal inference infrastructure while supporting transition to K8S-based platform.
• Develop and extend telemetry, observability and alerting solutions to ensure deployment reliability at scale.
• Develop and extend a fully automated deployment pipeline to support fast software updates and capacity reallocation at scale.
• Translate technical and customer needs into actionable requirements for the Dev Infra, Cluster, Platform and Core teams.
• Stay up to date with the latest advancements in AI compute infrastructure and related technologies.
• 5-7 years of experience in operating on-prem compute infrastructure (ideally in Machine Learning or High-Performance Compute) or developing and managing complex AWS plane infrastructure for hybrid deployments.
• Strong proficiency in Python for automation, orchestration, and deployment tooling.
• Solid understanding of Linux-based systems and command-line tools.
• Extensive knowledge of Docker containers and container orchestration platforms like K8S.
• Familiarity with spine-leaf (Clos) networking architecture.
• Proficiency with telemetry and observability stacks such as Prometheus, InfluxDB and Grafana.
• Strong ownership mindset and accountability for complex deployments.
• Ability to work effectively in a fast-paced environment.
• Build a breakthrough AI platform beyond the constraints of the GPU.
• Publish and open-source cutting-edge AI research.
• Work on one of the fastest AI supercomputers in the world.
• Enjoy job stability with startup vitality.
• Work in a simple, non-corporate culture that respects individual beliefs.
October 7, 2025
Deployment Engineer working with engineering and client success teams at Atolio. Ensure efficient deployment of the enterprise search platform in various environments.
Canada – Remote
CA$150k - CA$200k / year
Full Time
Senior
DevOps & Site Reliability Engineer (SRE)
AWS
Azure
Cloud
Google Cloud Platform
Grafana
Kubernetes
Python
ServiceNow
Splunk
Terraform
Go
September 19, 2025
DevOps Engineer building scalable cloud and CI/CD infrastructure for Veeva Systems' life sciences SaaS. Focus on IaC, automation, Kubernetes, Terraform, and reliability.
Canada – Remote
CA$85k - CA$225k / year
Full Time
Mid-level
Senior
DevOps & Site Reliability Engineer (SRE)
Ansible
AWS
Cloud
Distributed Systems
Docker
Java
Jenkins
Kubernetes
OpenShift
Python
Scala
Terraform
Go
September 16, 2025
DevOps Engineer building scalable AWS infrastructure, CI/CD, and containerized deployments for Veeva's life sciences cloud; focuses on automation, reliability, and mentorship.
Canada – Remote
$85k - $225k / year
Full Time
Mid-level
Senior
DevOps & Site Reliability Engineer (SRE)
Ansible
AWS
Cloud
Distributed Systems
Docker
Java
Jenkins
Kubernetes
OpenShift
Python
Scala
SQL
Terraform
Go
September 10, 2025
DevOps Engineer building scalable cloud infrastructure at Veeva Systems. Ensuring reliable, automated delivery of SaaS products for life sciences customers.
Canada – Remote
$85k - $225k / year
Full Time
Mid-level
Senior
DevOps & Site Reliability Engineer (SRE)
Ansible
AWS
Cloud
Distributed Systems
Docker
Java
Jenkins
Kubernetes
OpenShift
Python
Scala
SQL
Terraform
Go
September 9, 2025
Senior SRE owning infrastructure, reliability, and CI/CD for TextNow, a provider of free phone service.
Canada – Remote
$113.4k - $162k / year
Full Time
Senior
DevOps & Site Reliability Engineer (SRE)
Android
Ansible
AWS
Cloud
Docker
iOS
Kubernetes
Linux
MariaDB
NoSQL
Puppet
Python
Redis
Ruby
Terraform
Go