Senior Site Reliability Engineer

October 24

Kraken

Energy • SaaS • Enterprise

Kraken is an innovative technology platform designed for the utility sector, providing an end-to-end solution that automates and enhances the energy supply chain. It manages over 60 million customer accounts and works with various energy sources, including offshore wind and grid-scale batteries. Kraken helps utilities improve operational efficiency, customer service, and innovative product development while contributing to the transition to a decentralized and decarbonized energy system.

201 - 500 employees

📋 Description

• Teach and support product teams on best practices for reliability, implementation patterns and effective usage of our existing platforms
• Support product teams in improving the performance and availability of their systems
• Be hands-on in code and infrastructure to help product teams with reliability improvements
• Provide comprehensive feedback to the wider Platform group on improvements to be made to core infrastructure based on observations and first-hand experience in the code base
• Support the build-out of proof-of-concept requirements in product teams as needed to evolve application deployment architecture to align with business growth as well as enhance scalability and system resilience
• Collaborate with product teams to support the release of new features and services, ensuring adherence to reliability and performance standards
• Guide product teams in designing systems for resilience and graceful failure under heavy load
• Assist application teams with post-incident tasks and follow-ups, and contribute to the creation and review of post-mortem documentation
• Analyse incident metrics to identify trends and potential improvements, communicating these insights to the product teams
• Help solve interesting and difficult problems. There’s a great opportunity for disruption in the global energy market.

🎯 Requirements

• Previous experience as a Site Reliability Engineer
• Experience working on SaaS platforms, including engaging product teams to ensure up-skilling and knowledge sharing across teams
• Experience managing and supporting a large-scale, internet-facing service
• Experience in responding to incidents and outages, writing technical incident reports and organising incident retrospectives
• Experience working with very large relational databases
• Experience in using service level objectives to improve application performance
• A proactive, innovative mindset

🧰 Skills and experience

• Great communication skills, working effectively with developers, product managers and other business stakeholders to understand, design and deliver impactful projects and reliability improvements
• Proficient using AWS; we use a lot of different AWS services and not just the standard few
• Strong Python skills, particularly with Django, the Django ORM and Celery
• Good expertise in several of the following areas:
  • PostgreSQL, or a similar RDBMS, particularly in Amazon RDS at scale
  • Docker and Kubernetes; we use Amazon EKS in production
  • Datadog, or a similar logging/monitoring tool
  • Message queues, event-driven async processing or similar technologies; we use RabbitMQ
  • Terraform, or a similar infrastructure-as-code tool
• Experience with a Linux distribution
• Previous experience working in small, highly-autonomous teams
