Senior Site Reliability Engineer

November 21

Apply Now
Logo of Zensurance

Zensurance

Finance • Insurance • B2B

Zensurance is a leading small business insurance brokerage in Canada. The company offers a wide range of low-cost commercial insurance options tailored for various professions and industries, which includes everything from builder's risk and commercial auto insurance to E-commerce and non-profit insurance. Zensurance is dedicated to serving over 300,000 Canadian small business owners by providing instant quotes and significant savings on insurance policies. By leveraging technology, Zensurance can offer competitive rates and comprehensive coverages by working with over 50 insurance providers. Additionally, they provide expert advice, dedicated claims service, and ease of switching insurance providers. Zensurance prides itself on its customer service and has received multiple industry awards for its innovative approach to insurance. The company focuses on empowering small businesses with peace of mind through secure and strategic insurance solutions.

51 - 200 employees

💸 Finance

🤝 B2B

💰 Series B on 2020-08

📋 Description

• Write code and tools to automate repetitive, manual operational tasks to free up engineering time. • Participate in on-call rotations to rapidly detect, triage, and resolve system outages and emergencies. • Implement comprehensive observability (logging, tracing, metrics) and configure intelligent alerts to monitor system health in real-time. • Define and track Service Level Indicators (SLIs) and Service Level Objectives (SLOs) to measure and manage the performance and availability of services. • Partner with development teams to ensure new services are designed for scalability, resilience, and reliability from the start. • Develop and test robust Disaster Recovery (DR) and failover procedures to ensure business continuity. • Perform other duties as assigned.

🎯 Requirements

• University degree or college diploma in a recognized technical, vocational or academic program (preferably in Engineering or Computer Science) or equivalent work experience • 5+ years of experience as a Site Reliability Engineer • Proven experience with Terraform for provisioning and managing cloud infrastructure • Experience with Kubernetes • Experience with AWS as a cloud service provider • Demonstrated experience maintaining and improving an Incident Management process • Experience with a major observability platform (e.g., Prometheus, Grafana, Datadog, ELK Stack, Splunk, or New Relic) • Experience with distributed systems to ensure that services meet scalability, reliability and uptime goals by implementing strategies like redundancy, failover solutions, and monitoring • Experience with GitHub Actions as tool for Continuous Integration/Continuous Delivery (CI/CD) • Experience in Backup and Recovery Scenarios • Ability to communicate efficiently and work in a collaborative style • A commitment to continuous improvement, continuous learning and knowledge sharing

🏖️ Benefits

• Remote-first setup for added flexibility • Home office allowance to create a comfortable workspace • Top-tier tech: "Office in a box" with all necessary tech equipment • Half days before public holidays: Enjoy half days before long weekends • Flexible health and dental plans for families, including mental health support • Health & personal spending accounts to invest in wellness your way • Parental leave top-up, because family comes first • Education assistance reimbursement for courses, conferences, books, and memberships • Opportunities to learn from industry experts and grow your career • Weekly Friday huddles to share updates and connect across teams • Virtual & in-person team-building events to strengthen our culture

Apply Now

Similar Jobs

November 20

Akinox

51 - 200

Team Lead DevOps guiding and evolving the Infrastructure team for a health tech company. Improving deployment automation and collaborating across teams.

🗣️🇫🇷 French Required

Azure

Cloud

Kubernetes

November 17

Deployment Engineer at Versaterm collaborating with law enforcement for technical data interfaces. Ensuring smooth deployment processes for public safety technology with strong customer focus.

Cloud

ETL

SOAP

SQL

November 17

Schema App

11 - 50

DevOps Engineer responsible for AWS infrastructure and automation at Schema App, enhancing search visibility. Collaborating with developers for high availability and system security.

AWS

Cloud

Docker

Kubernetes

Python

Terraform

November 17

DevOps Engineer managing cloud infrastructure on AWS for a fast-growing SaaS platform. Collaborating with teams to implement best practices and optimize system reliability.

Ansible

AWS

Chef

Cloud

Linux

Oracle

Postgres

Puppet

Python

Terraform

November 17

DevOps Engineer at HappyCo shaping reliable systems for property management software. Collaborating with developers and maintaining world-class infrastructure to boost performance and security.

Google Cloud Platform

Linux

Terraform

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com