Site Reliability Engineer

March 19

Tyk

Open Source API gateway & API management platform. We're on a mission to connect every system in the world.

API Management • API Gateways • Authentication Provider • API Consultancy • Open Source

51 - 200

Description

• At Tyk, we are obsessed with building software that solves problems. We are seeking an experienced SRE to optimize, automate, and improve performance using insights from massive-scale data in real time.
• We count on our SREs to give users a rich feature set, high availability, and stellar performance.
• Ensuring the production Cloud environment operates within SLAs through vigilant monitoring and proactive issue resolution.
• Collaborating with Senior SRE to enhance system reliability through alerting and monitoring.
• Contributing to defining key performance metrics for Cloud services, enabling performance improvements and success measurement.
• Proposing and developing solutions to maintain and enhance key performance indicators (KPIs) across Cloud infrastructure.
• Gathering and analyzing metrics from operating systems and applications to optimize system performance and expedite fault resolution.
• Driving innovation by anticipating customer needs, addressing scaling demands, and optimizing system and infrastructure performance.
• Working closely with commercial functions to optimize platform scalability and meet growing customer demands.
• Analyzing and ensuring automation, scalability, and efficient management of Cloud infrastructure.
• Executing automation for cloud operations tasks and creating new automation solutions to streamline processes.
• Designing, writing, and delivering software and automation solutions to enhance the availability, scalability, latency, and efficiency of PaaS services.
• Participating in blame-free root cause analysis meetings for continuous system improvement.
• Creating and contributing to policies and runbooks for well-documented operational processes.
• Providing on-call support to keep Cloud services available 24/7 by promptly responding to alerts, meeting SLAs, and automating root cause analysis.
• Planning and executing software upgrades, including Kubernetes version upgrades, and managing and communicating migrations from Classic Cloud to the new Cloud platform.

Requirements

• Strong collaboration skills
• Launching and operating production Kubernetes clusters
• Designing and operating infrastructure on AWS and other providers
• Operating MongoDB (or other document database) clusters
• Operating Redis (or other key-value storage) clusters
• Administering Linux servers
• Maintaining distributed software
• Operating Prometheus and Grafana
• Operating logging collection and analysis system
• Proficient in Kubernetes and containers
• Advanced in Go and/or Python
• Proficient in AWS and Linux
• Proficient in Terraform and IaC in general
• Familiar with Helm
• Experience with MongoDB (or similar) and Redis (or similar)
• Knowledge of monitoring & logging, networking concepts, and common networking protocols

Benefits

• Everyone has unlimited paid holidays.
• Total flexibility in hours, allowing creativity to flow better.
• Employee share scheme
• Generous maternity and paternity leave
• Volunteering Days
• Company retreats
• Employee Wellbeing platform
