Site Reliability Engineer

🔥 0 minutes ago

🇺🇸 United States – Remote

💵 $125k - $145k / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🦅 H1B Visa Sponsor

info
Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Accela

Accela

201 - 500 employees

Founded 2000

🏛️ Government

☁️ SaaS

🏢 Enterprise

Government • SaaS • Enterprise

Accela is a leader in providing cloud-based solutions designed to modernize government services. Their unified suite of innovative applications focuses on building more connected communities through enhanced efficiency and secure data management. Accela empowers local and state governments by streamlining processes such as building permits, licensing, cannabis regulation, and environmental health. With a focus on civic solutions, Accela aims to improve public sector operations by eliminating data silos and facilitating better interactions with residents. By leveraging their SaaS platform, governments can enhance service delivery, increase transparency, and reduce operational costs, leading to significant time and cost savings.

📋 Description

• Contribute to the operation, maintenance, and continuous improvement of Accela's production cloud environments. • Support platform modernization initiatives, including containerization, cloud-native technologies, and automation efforts. • Monitor platform health, availability, performance, and capacity using modern observability and monitoring tools. • Participate in incident response activities, troubleshooting production issues and contributing to Root Cause Analysis efforts. • Develop and maintain automation, tooling, and scripts that improve reliability, scalability, deployment efficiency, and operational effectiveness. • Support the implementation and monitoring of service level objectives (SLOs), service level agreements (SLAs), and operational metrics. • Partner with Development, DevOps, Database Engineering, and Security teams to identify and resolve reliability, performance, and scalability challenges. • Assist with platform deployments, operational readiness reviews, and change management activities. • Contribute to observability initiatives through monitoring, logging, metrics collection, and distributed tracing. • Support compliance-related operational activities associated with SOC 2, HIPAA, FedRAMP, StateRAMP, and PCI-DSS environments. • Participate in post-incident reviews and contribute to corrective and preventive actions that improve platform stability.

🎯 Requirements

• 4+ years of experience in Site Reliability Engineering, Cloud Operations, Systems Engineering, DevOps, Software Engineering, or a related technical discipline. • Experience supporting cloud-based SaaS environments, preferably within Microsoft Azure. • Experience with Kubernetes and containerized application environments. • Working knowledge of scripting and automation using Python, PowerShell, Bash, or similar languages. • Experience troubleshooting distributed systems across application, infrastructure, networking, and operating system layers. • Familiarity with monitoring, logging, metrics, and observability platforms. • Strong analytical and problem-solving skills with a structured approach to troubleshooting and Root Cause Analysis. • Experience working within Incident, Problem, and Change Management processes. • Strong written and verbal communication skills and the ability to work effectively with cross-functional teams. • Experience using Git and GitHub-based workflows.

🏖️ Benefits

• flexible time off • comprehensive medical, dental, and vision plans • family planning benefits • 401(k) retirement savings plan with company match • health savings account with company contributions • flexible spending account • life, accident, and disability coverage • business travel insurance • employee assistance programs • other well-being benefits

Apply Now

Similar Jobs

🔥 7 hours ago

Parity Healthcare Analytics

1 - 10

⚕️ Healthcare Insurance

☁️ SaaS

🏢 Enterprise

Engineering Manager leading a team of SRE engineers in blockchain infrastructure context. Responsible for setting direction and ensuring reliability standards.

🔥 7 hours ago

High 5 Games

51 - 200

🎮 Gaming

🎲 Gambling

🤝 B2B

DevOps Engineer focusing on ML and Data infrastructure for gaming company. Scaling AI models, automating workflows, and collaborating with data scientists and engineers.

🔥 9 hours ago

Ad Hoc LLC

501 - 1000

🏛️ Government

🤖 Artificial Intelligence

🔌 API

Senior DevOps Engineer developing cloud infrastructure and DevOps strategy for impactful digital services. Mentoring teams and improving software engineering processes for government agencies.

🔥 9 hours ago

Ad Hoc LLC

501 - 1000

🏛️ Government

🤖 Artificial Intelligence

🔌 API

Senior DevOps Engineer contributing to federal technology projects using modern agile methods. Collaborating to improve software engineering processes and support critical government services.

🔥 14 hours ago

Clover Health

501 - 1000

Senior Manager of Site Reliability Engineering at Counterpart Health overseeing a team of engineers. Focused on reliability outcomes and proactive collaboration with product engineering pillars.