Site Reliability Engineer II

🕒 May 27

🇺🇸 United States – Remote

⏰ Full Time

🟢 Junior

🟡 Mid-level

⛑ DevOps & Site Reliability Engineer (SRE)

🦅 H1B Visa Sponsor

info
Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Backblaze

Backblaze

201 - 500 employees

Founded 2007

🛍️ eCommerce

🏢 Enterprise

💰 $5M Series A on 2012-07

Cloud Storage • eCommerce • Enterprise

Backblaze is a cloud storage company that provides scalable and secure data backup solutions for both businesses and individuals. Their B2 Cloud Storage service offers S3 compatible object storage, allowing users to easily protect and manage their data with transparent pricing. Backblaze specializes in automatic and unlimited backup services for computer systems, ensuring data protection and recovery options for users, while also supporting integration with applications for enhanced functionality.

📋 Description

• Support the availability and durability of critical services across production environments. • Monitor service health using SLIs, SLOs, and error budgets, and escalate issues when thresholds are at risk. • Participate in on-call rotations, incident response, and post-incident reviews to drive service improvements. • Follow established ITIL/OSS processes (incident, change, problem, and capacity management). • Develop automation for common operational tasks, reducing manual intervention and toil. • Contribute to monitoring, logging, and alerting frameworks (e.g., Prometheus, Grafana, Catchpoint,ELK). • Work with CI/CD pipelines, configuration management, and infrastructure as code tools (Terraform, Ansible, Jenkins). • Write scripts (Bash, Python, Go, etc.) to improve system reliability and efficiency. • Partner with engineering, product, and operations teams to support resilient system design and operations. • Assist in capacity planning and disaster recovery exercises. • Work with vendors and service providers to troubleshoot service issues and track SLA performance. • Document systems, share learnings, and help grow a reliability-minded engineering culture. • Contribute to playbooks, runbooks, and operational documentation. • Identify recurring issues and propose long-term improvements. • Promote reliability-focused practices within development and operations teams.

🎯 Requirements

• Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent experience). • 2–4 years of experience in site reliability, systems engineering, or operations. • Exposure to large-scale, production-grade systems. • Solid Linux systems administration and troubleshooting skills. • Familiarity with service reliability concepts - monitoring, alerting, incident response, and root cause analysis. • Proficiency in at least one scripting language (Python, Bash, or Go). • Understanding of containers (Kubernetes, Docker) and microservices concepts. • Knowledge of incident response and operational best practices.

🏖️ Benefits

• Flexible working hours • Professional development opportunities • Remote work options

Apply Now

Similar Jobs

🕒 May 27

OXIO

51 - 200

📡 Telecommunications

☁️ SaaS

💳 Fintech

Site Reliability Engineer designing and implementing cloud platform for OXIO's Telecom services while maintaining production infrastructure.

🇺🇸 United States – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🕒 May 27

Cority

201 - 500

☁️ SaaS

📋 Compliance

Intermediate Site Reliability Engineer supporting reliability, performance, and scalability of cloud-hosted services. Collaborate with engineering teams and contribute to incident response processes.

🇺🇸 United States – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🕒 May 27

General Motors

10,000+ employees

🚗 Transport

⚡ Energy

🏢 Enterprise

Design Release Engineer focusing on semiconductor product development and engineering processes at GM. Involves collaboration with teams to uphold strategic vision and core values of GM.

🇺🇸 United States – Remote

💵 $124.7k - $161.1k / year

💰 $500M Grant on 2024-07

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🦅 H1B Visa Sponsor

info

🕒 May 26

VetsEZ

201 - 500

🤝 B2B

☁️ SaaS

🏛️ Government

DevSecOps Engineer supporting secure software delivery and cloud infrastructure operations for federal government healthcare projects. Collaborating with teams to improve deployment reliability and efficiency.

🇺🇸 United States – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🕒 May 26

VetsEZ

201 - 500

🤝 B2B

☁️ SaaS

🏛️ Government

DevSecOps Engineer for federal healthcare technology initiative, collaborating on secure software delivery and automation. Focusing on CI/CD, cloud infrastructure, and deployment efficiency.

🇺🇸 United States – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)