Staff Site Reliability Engineer

February 1

Matillion

Matillion makes the world’s data useful with an easy-to-use, cloud-native data integration and transformation platform.

Business Intelligence • BI • SaaS • Reporting • Dashboards

501 - 1000 employees

Description

• Lead the design of major software components, systems, and features to improve the availability, scalability, latency, and efficiency of Matillion’s SaaS services
• Drive the design, implementation, and management for expanding the observability infrastructure, keep up to date with new tools and technologies, and be a recognised member of the broader observability community
• Lead sustainable incident response, blameless postmortems, and production improvements that result in direct business opportunities for Matillion
• Define and document best practices across all pillars of SRE
• Provide guidance and mentorship to other team members on managing the end-to-end availability and performance of critical services, and on design techniques and coding standards, to cultivate innovation and collaboration across the business
• Balance competing priorities as you manage a range of individual projects, deadlines, and deliverables

Requirements

• A passion for everything performance, observability, availability, scalability, and security, with experience owning and delivering projects using Agile methodologies
• Extensive experience with Kubernetes and its surrounding ecosystem, including tools such as Linkerd, Traefik, and ArgoCD (a must)
• Previous experience of large-scale web operations in a public cloud environment
• Competence in Ruby, Go, Java, Python, or an equivalent programming language
• Experience with some of the following key technologies: Prometheus, Grafana, Elasticsearch, Logstash, Kibana, OpenTelemetry, Micrometer, New Relic, Datadog
• Ability to manage and provision infrastructure using code with Terraform or CloudFormation

Benefits

• Work remotely from Ireland, with occasional in-person meetings in either Manchester or Denver
• Make a dent in the universe bigger than ourselves
• Utilize experience across all pillars of Site Reliability Engineering
• Play a pivotal role in modernizing the technology stack
• Implement a wide range of new tools around logging, monitoring, metrics, and alerting
