Senior Site Reliability Engineer

5001 - 10000 employees

🔒 Cybersecurity

💰 Post-IPO Equity on 2001-07

Cloud Computing • Cybersecurity • Content Delivery

Akamai Technologies is a leading cloud services provider that specializes in delivering security, cloud computing, and content delivery solutions. It offers a range of services such as API security, DDoS protection, and performance optimization for web applications, ensuring secure and reliable user experiences. With a robust global infrastructure, Akamai empowers businesses to streamline their digital presence while safeguarding against various cyber threats and enhancing application performance.

Senior Site Reliability Engineer

🕒 June 11

🇨🇦 Canada – Remote

💵 $120.4k - $216.6k / year

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

Ansible

Distributed Systems

Kubernetes

Linux

Python

SaltStack

Terraform

Apply Now

Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Akamai Technologies

5001 - 10000 employees

🔒 Cybersecurity

💰 Post-IPO Equity on 2001-07

Cloud Computing • Cybersecurity • Content Delivery

📋 Description

• Owning the SRE infrastructure lifecycle from design reviews and pre-rollout readiness assessments through production sign-off and ongoing reliability management • Designing and implementing frameworks that reflect customer experience for load balancing services and driving action when error budgets are at risk • Building and maintaining observability pipelines from load-balancing components and system-level sources to dashboards that enable rapid incident triage • Leading technical incident response for complex NB/NLB failures, acting as the technical commander and driving root cause analysis and preventive follow-through • Developing and automating safe deployment workflows for phased releases, including bake-period monitoring, feature flag management, and validation across global datacenter rollouts • Reviewing design documents, product-requirement documents and producing actionable SRE input on operational risks, capacity implications, Day-2 concerns, and product strategy gaps • Building automation and tooling using Python or Go that reduces operational toil and improves team-wide operational capability

🎯 Requirements

• 8+ years of experience in SRE, infrastructure engineering, or platform engineering, working with large-scale distributed systems • Demonstrate deep expertise with Linux networking fundamentals and diagnosing at the packet level using tcpdump, netstat, and similar tools • Have hands-on experience with L4/L7 load balancing technologies covering configuration, health checking, high availability, and failure modes at scale • Show a track record of defining SLO/SLI frameworks, building observability platforms from scratch, and running incident management processes at scale • Demonstrate expertise in Kubernetes and containerization at scale including workload scheduling, networking, resource management, and operating stateful or network-intensive workloads in a cluster environment • Build automation and tooling using Python or Go, with infrastructure-as-code experience (SaltStack, Ansible, or Terraform) and deployment safety instincts.

🏖️ Benefits

• healthcare • RRSP • company holidays • vacation (in the form of PTO) • sick time • family friendly benefits including employee assistance program including a focus on mental and financial wellness

Apply Now

Similar Jobs

Senior DevOps Engineer

🕒 June 11

ScaleUP Week

11 - 50

💼 Consulting

👥 B2C

📚 Education

🇨🇦 Canada – Remote

💵 CA$132k - CA$160k / year

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

Amazon Redshift

AWS

Cloud

Cyber Security

Docker

ETL

Grafana

Python

Ruby

Ruby on Rails

SDLC

Senior DevOps Engineer

🕒 June 9

Borrowell

51 - 200

💼 Consulting

🛡️ Insurance

💳 Fintech

Senior DevOps Engineer designing and managing cloud infrastructure at Borrowell, a company helping Canadians with their finances. Collaborating with development, security, and QA teams to enhance service delivery.

🇨🇦 Canada – Remote

💵 $100k - $150k / year

💰 Series C on 2021-02

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

Azure

Cloud

DNS

Docker

Kubernetes

Linux

Microservices

Terraform

.NET

Mainframe DevOps Migration Consultant

🕒 June 6

Minor Hotels Europe and Americas

10,000+ employees

👥 B2C

🏨 Hospitality

✈️ Travel

Software Change Management Consultant supporting application migration projects using IBM’s DBB/Git/IDD Solutions. Guiding clients through the conversion process and providing migration expertise and training.

🇨🇦 Canada – Remote

💵 $62.9k - $147.5k / year

💰 Post-IPO Equity on 2018-05

⏰ Full Time

🟠 Senior

🔴 Lead

⛑ DevOps & Site Reliability Engineer (SRE)

Groovy

DevOps

🕒 June 5

Clic Santé

11 - 50

🏥 Healthcare

💼 Consulting

📦 Logistics

DevOps/DevSecOps managing cloud-native infrastructure on GCP, optimizing CI/CD and automation for a healthcare startup. Prioritizing security, performance, and resilience in a scalable environment.

🇨🇦 Canada – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🗣️🇫🇷 French Required

Cloud

Kubernetes

Terraform

Full Stack Developer – DevOps, Cloud Systems

🕒 June 2

BrightOrder Inc.

51 - 200

📦 Logistics

💼 Consulting

🚗 Transport

Full Stack Developer responsible for creating and scaling BrightOrder’s cloud-based platform. Collaborating with teams and automating processes for efficient system performance.

🇨🇦 Canada – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

AWS

Cloud

Docker

EC2

Grafana

GraphQL

IoT

JavaScript

Kubernetes

Linux

Microservices

MongoDB

MS SQL Server

Oracle

Postgres

Prometheus

Python

RabbitMQ

React

Redis

SQL

Terraform

TypeScript