Site Reliability Engineer – Senior

October 30

Apply Now
Logo of HighLevel

HighLevel

SaaS • Marketing • B2B

HighLevel is an all-in-one marketing and sales platform designed to help businesses grow and succeed. The platform consolidates various marketing tools into a single solution, providing features such as lead capture through landing pages, surveys, forms, and calendars, as well as tools for nurturing leads via automated messaging across multiple channels including phone, SMS, email, and social media. HighLevel offers customizable solutions like online appointment scheduling, multi-channel follow-up campaigns, and pipeline management. Additionally, businesses can build websites, funnels, and landing pages using the intuitive page builder. HighLevel supports integrating with existing systems via API, and offers a membership platform for community building and course management. The platform is targeted towards marketers and offers white-labeling options for businesses to brand the software as their own. With a community-driven development approach and award-winning support, HighLevel is focused on empowering businesses to streamline their operations and enhance their marketing efficiencies.

201 - 500 employees

Founded 2018

☁️ SaaS

🤝 B2B

💰 Series A on 2021-11

📋 Description

• Develop and improve observability using monitoring, logging, tracing, and alerting tools (Prometheus, Grafana, ELK, OpenTelemetry, etc.). • Optimize system performance, troubleshoot incidents, and conduct post-mortems/RCA to prevent future issues. • Collaborate with developers to enhance application reliability, scalability, and performance. • Drive cost optimisation efforts in cloud environments. • Experience with multiple databases Mongo, Redis, ES, Queue based etc

🎯 Requirements

• Experience: 4+ years in Site Reliability Engineering, DevOps, or Cloud Infrastructure roles. • Cloud Expertise: Hands-on experience with GCP and AWS. • Infrastructure as Code (IaC): Terraform, Helm, or equivalent tools. • Containerization & Orchestration: Docker, Kubernetes (GKE). • Observability: Experience with Prometheus, Grafana, ELK, OpenTelemetry, or similar monitoring/logging tools. • Programming/Scripting: Proficiency in Python, Bash, or Shell scripting. Basic understanding of API parsing and JSON manipulation. • CI/CD Pipelines: Hands-on experience with Jenkins, GitHub Actions, ArgoCD, or similar tools. • Incident Management: Experience with on-call rotations, SLOs, SLIs, SLAs, Escalation Policies, and incident resolution. • Databases: Experience in monitoring Mongo, Redis, ES, Queue based etc

Apply Now

Similar Jobs

October 30

Arize AI

51 - 200

🤖 Artificial Intelligence

☁️ SaaS

🏢 Enterprise

Senior DevOps Engineer responsible for scalable AI infrastructure at Arize AI. Working hands-on with distributed services and customer environments.

October 30

Exavalu

201 - 500

🤝 B2B

🏦 Banking

⚕️ Healthcare Insurance

DevOps Engineer responsible for CI/CD pipeline management and infrastructure automation for Exavalu's IT services. Enable Agile delivery through AWS solutions and DevOps best practices.

🇮🇳 India – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

October 27

ST Engineering iDirect

501 - 1000

📡 Telecommunications

🔒 Cybersecurity

🏛️ Government

DevOps Engineer working at ST Engineering iDirect, enhancing productivity and reliability through automation and collaboration. Focus on enabling self-service infrastructure and quality delivery.

🇮🇳 India – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

October 27

Akamai Technologies

5001 - 10000

🔒 Cybersecurity

Senior Site Reliability Engineer ensuring optimal performance and uptime of Akamai's portal services. Involves analyzing system performance, troubleshooting, and developing monitoring systems.

🇮🇳 India – Remote

💰 Post-IPO Equity on 2001-07

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

October 27

Akamai Technologies

5001 - 10000

🔒 Cybersecurity

Site Reliability Engineer collaborating across teams to solve complex problems with Akamai's Compute products. Monitoring and improving system reliability through tooling and software.

🇮🇳 India – Remote

💰 Post-IPO Equity on 2001-07

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

Developed by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com