
Energy • SaaS • Enterprise
Kraken is an innovative technology platform designed for the utility sector, providing an end-to-end solution that automates and enhances the energy supply chain. It manages over 60 million customer accounts and works with various energy sources, including offshore wind and grid-scale batteries. Kraken helps utilities improve operational efficiency, customer service, and innovative product development while contributing to the transition to a decentralized and decarbonized energy system.
201 - 500 employees
⚡ Energy
☁️ SaaS
🏢 Enterprise
October 24
🇬🇧 United Kingdom – Remote
⏰ Full Time
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)
🇬🇧 UK Skilled Worker Visa Sponsor

Energy • SaaS • Enterprise
Kraken is an innovative technology platform designed for the utility sector, providing an end-to-end solution that automates and enhances the energy supply chain. It manages over 60 million customer accounts and works with various energy sources, including offshore wind and grid-scale batteries. Kraken helps utilities improve operational efficiency, customer service, and innovative product development while contributing to the transition to a decentralized and decarbonized energy system.
201 - 500 employees
⚡ Energy
☁️ SaaS
🏢 Enterprise
• Teach and support product teams on best practices for reliability, implementation patterns and effective usage of our existing platforms • Support product teams in improving the performance and availability of their systems • Be hands-on in code and infrastructure to help product teams with reliability improvements • Provide comprehensive feedback to the wider Platform group on improvements to be made to core infrastructure based on observations and first-hand experience in the code base • Support the build-out of proof-of-concept requirements in product teams as needed to evolve application deployment architecture to align with business growth as well as enhance scalability and system resilience • Collaborate with product teams to support the release of new features and services, ensuring adherence to reliability and performance standards • Guide product teams in designing systems for resilience and graceful failure under heavy load • Assist application teams with post-incident tasks and follow-ups, and contribute to the creation and review of post-mortem documentation • Analyse incident metrics to identify trends and potential improvements, communicating these insights to the product teams • Help solve interesting and difficult problems. There’s a great opportunity for disruption in the global energy market
• Previous experience as a Site Reliability Engineer • Experience working on SaaS platforms, including engaging product teams to ensure up-skilling and knowledge sharing across teams • Experience managing and supporting a large scale internet facing service • Experience in responding to incidents and outages, writing technical incident reports and organising incident retrospectives • Experience working with very large relational databases • Experience in using service level objectives to improve application performance • A proactive, innovative mindset
• Great communication skills, working effectively with developers, product managers and other business stakeholders to understand, design and deliver impactful projects and reliability improvements • Proficient using AWS; we use a lot of different AWS services and not just the standard few • Strong Python skills; particularly with Django, the Django ORM and Celery • Good expertise in multiple of the following areas: • PostgreSQL, or a similar RDBMS, particularly in Amazon RDS at scale • Docker and Kubernetes; we use Amazon EKS in production • Datadog, or a similar logging/monitoring tool • Messaging queues, event-driven async processing or similar technologies - we use RabbitMQ • Terraform, or a similar infrastructure-as-code tool • Experience with a Linux distribution • Previous experience working in small, highly-autonomous teams
Apply NowOctober 24
Site Reliability Engineer at Circle designing and operating blockchain infrastructure. Collaborating with teams to enhance system reliability and performance for a fast-growing platform.
October 23
Site Reliability Engineer ensuring system reliability and performance for open-source blockchain projects at IOHK. Involves service operations, engineering principles, and collaborative project engagement.
October 23
DevOps Team Lead overseeing engineers managing impactful projects at a leading cloud communications provider. Fostering teamwork and technical excellence to ensure efficient CI/CD and automation processes.
🇬🇧 United Kingdom – Remote
💵 £60k - £70k / year
💰 Venture Round on 2017-02
⏰ Full Time
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)
October 22
Senior DevOps Engineer for a technology-driven customer acquisition company. Elevating infrastructure and automation efforts by managing CI/CD and Infrastructure as Code.
🇬🇧 United Kingdom – Remote
💵 £85k - £100k / year
⏰ Full Time
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)
October 17
SRE responsible for architecting resilient cloud infrastructure at Oscilar's AI Risk Decisioning platform. Leading initiatives to optimize availability and performance while mentoring engineers in best practices.