Site Reliability Engineer – Level 3

501 - 1000 employees

Founded 1999

🏛️ Government

☁️ SaaS

📋 Compliance

Government • SaaS • Compliance

Granicus is a company focused on transforming the way governments interact with their constituents through digital services and technology solutions. It provides the Government Experience Cloud to improve service delivery, community engagement, and operational efficiency across local, state, and federal governments. Granicus offers tools for agenda and meeting management, digital communication and engagement, public records management, and more, all designed to enhance customer experience and foster transparent and equitable interactions between governments and the people they serve.

Site Reliability Engineer – Level 3

Job not on LinkedIn

🕒 May 16

🇺🇸 United States – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🦅 H1B Visa Sponsor

Ansible

AWS

Azure

Chef

Cloud

Grafana

Java

Linux

Prometheus

Puppet

Python

Ruby

Splunk

Unix

Apply Now

Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Granicus

501 - 1000 employees

Founded 1999

🏛️ Government

☁️ SaaS

📋 Compliance

Government • SaaS • Compliance

📋 Description

• Provide production support on a shift according to the team on-call roster • Work on the customer and internal engineering/implementation team raised tickets while not on-call for production support • Monitor and Maintain Systems: Continuously monitor the health and performance of our services, systems, and infrastructure • Automate Processes: Develop and maintain automation scripts and tools to streamline operations and reduce manual intervention • Incident Management: Assist in troubleshooting and resolving incidents, performing root cause analysis, and implementing long-term fixes to prevent recurrence • System Improvements: Participate in designing and implementing system improvements to enhance reliability, scalability, and performance • Collaboration: Work closely with software engineers to understand application requirements, provide feedback on design and architecture, and support deployment and release processes • Documentation: Create and maintain documentation for processes, procedures, and troubleshooting guides to ensure knowledge sharing within the team • Capacity Planning: Assist in capacity planning activities to anticipate future needs and ensure that our infrastructure can handle growth • Security: Implement and adhere to security best practices to protect our systems and data

🎯 Requirements

• 5+ years of experience in site reliability engineering, system administration, or a similar role • Good understanding of Linux/Unix systems, networking, and cloud services (AWS, Azure, or Google Cloud) • Experience with scripting languages such as Python, Bash, or Ruby • Bachelor's or postgraduate degree in computer science, Information Technology, or a related field, or equivalent practical experience • Familiarity with AI/ML operations, including model lifecycle management, vector databases, and inference performance tuning • Expertise in Linux/Unix systems, networking, and cloud services (AWS, Azure, or Google Cloud) • Proficiency in scripting languages (Python, Bash, Ruby) and programming languages (Go, Java, C++) • Advanced knowledge of monitoring and logging tools like Elastic (Prometheus, Grafana, Splunk), configuration management (Ansible, Chef, Puppet), and CI/CD pipelines • Strong analytical and problem-solving skills with the ability to diagnose and resolve complex issues efficiently • Excellent verbal and written communication skills, with the ability to convey complex technical concepts to non-technical stakeholders • Demonstrated ability to lead and mentor a team, drive projects to completion, and manage cross-functional initiatives • Relevant certifications such as AWS Certified DevOps Engineer, AWS Certified Machine Learning – Specialty, Google Cloud Professional DevOps Engineer, or similar are a plus.

🏖️ Benefits

• Health insurance • 401(k) matching • Flexible work hours • Paid time off • Remote work options

Apply Now

Similar Jobs

Senior Azure DevOps Engineer

🕒 May 15

ImagineX

201 - 500

🤖 Artificial Intelligence

🔒 Cybersecurity

🏢 Enterprise

Senior Azure DevOps Engineer at ImagineX deploying Azure infrastructure and CI/CD pipelines. Collaborating with teams for secure and scalable solutions in a remote environment.

🇺🇸 United States – Remote

💰 Private equity on 2023-11

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

Azure

Cloud

Docker

Firewalls

Kubernetes

Python

SQL

Terraform

Site Reliability Engineer

🕒 May 14

AceHack 4.0

11 - 50

⚡ Productivity

☁️ SaaS

Site Reliability Engineer at Orkes solving distributed systems challenges and managing cloud infrastructure. Engaging in incident management and improving system reliability through observability tools.

🇺🇸 United States – Remote

💵 $180k - $250k / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

AWS

Azure

Cloud

Distributed Systems

Google Cloud Platform

Grafana

Kubernetes

Microservices

Prometheus

Python

Terraform

Senior Network Reliability Engineer – DGX Cloud

🕒 May 14

NVIDIA

10,000+ employees

🤖 Artificial Intelligence

🎮 Gaming

Senior Network Reliability Engineer maintaining NVIDIA's cloud and datacenter networks. Engaging in global support and driving operational improvements across teams.

🇺🇸 United States – Remote

💵 $136k - $264.5k / year

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🦅 H1B Visa Sponsor

AWS

Azure

Cloud

DNS

Google Cloud Platform

TCP/IP

Site Reliability Engineer – Azure, DevSecOps, IaC, Governance, Observability

🕒 May 14

Avaya

5001 - 10000

🤝 B2B

Site Reliability Engineer at Avaya driving stability and performance across Azure and GCP platforms. Collaborating with DevOps and Security teams to manage incidents and optimize operations.

🇺🇸 United States – Remote

💵 $129k - $143k / year

💰 Post-IPO Debt on 2022-06

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🦅 H1B Visa Sponsor

Ansible

Azure

Cloud

Google Cloud Platform

Terraform

Senior Site Reliability Engineer, Infrastructure Foundations

🕒 May 13

Wikimedia Foundation

501 - 1000

🤝 Non-profit

📚 Education

📱 Media