Senior Site Reliability Engineer – GCP

🕒 March 31

Filevine

201 - 500 employees

☁️ SaaS

🤖 Artificial Intelligence

💰 $108M Series D (April 2022)

SaaS • Legal • Artificial Intelligence

Filevine is a comprehensive legal technology platform that offers a wide range of services tailored to law firms and legal practitioners. The platform provides solutions for case management, document management, and contract management, as well as lead management and business analytics. Utilizing advanced AI technologies, Filevine enhances the legal workflow with tools such as DemandsAI, ImmigrationAI, and FilevineAI to automate tasks and improve productivity. It integrates seamlessly with popular tools like QuickBooks and Gmail, ensuring a complete legal tech stack. Filevine also offers eSignature capabilities, time and billing features, and a client portal to facilitate communication. The platform is utilized by various types of law practices, including personal injury, family law, mass torts, and more, and is recognized for its robust security standards and compliance certifications such as SOC 2 Type II and HIPAA.

📋 Description

• Provide strong leadership, mentoring, and sound judgment as the Reliability Engineering lead on your team.
• Design and maintain autonomous systems for building, deploying, testing, and operating all Filevine products.
• Act as the authoritative voice of reliability across the full software development lifecycle (SDLC).
• Monitor, aggregate, dashboard, and alert on software/infrastructure events to ensure visibility and fast response.
• Continuously enhance CI/CD pipelines, automation scripts, playbooks, and tools to streamline processes and reduce resolution time.
• Proactively identify and resolve gaps in system availability, performance, and security while defending overall security posture.
• Document processes, architecture, procedures, and best practices; research, adopt, or build reliable tools to boost engineer productivity.
• Collaborate within your team (or independently), mentor junior engineers, participate in 24/7 on-call rotation for production support and emergency response, and communicate clearly with technical and management stakeholders.

🎯 Requirements

• 8+ years of hands-on technical experience in software engineering, infrastructure, or operations roles, including a minimum of 4 years dedicated to Site Reliability Engineering (SRE).
• Demonstrated curiosity, self-motivation, continuous learning mindset, passion for improvement, and proactive enthusiasm to enhance systems and processes daily without needing direction.
• Strong proficiency in Python, Bash, PowerShell, and other common SRE tooling and scripting technologies.
• Expert-level experience designing, building, and maintaining autonomous systems that handle software build, deployment, testing, monitoring, and operations with minimal human intervention.
• Deep proficiency with Google Cloud Platform (GCP) and its core SRE services, including Compute Engine, Kubernetes Engine/GKE, Cloud Monitoring, Cloud Logging, and IAM. Experience with AWS is a strong plus (e.g., EC2, EKS, CloudWatch, S3).
• Proficiency in all core skills expected of an SRE II, including monitoring/alerting, incident response, capacity planning, performance optimization, CI/CD pipeline enhancement, and reliability engineering best practices.
• Bachelor’s degree in Computer Science, Information Systems, or a related field; equivalent certifications (e.g., Google Cloud Professional certifications, AWS certifications); or substantial comparable direct work experience.
• Proven track record of independently driving reliability improvements, reducing toil through automation, and contributing to high-availability, scalable production systems in a fast-paced environment.

🏖️ Benefits

• A dynamic, rapidly growing company, focused on helping organizations thrive
• Medical, Dental, & Vision Insurance (for full-time employees)
• Competitive & Fair Pay
• Maternity & paternity leave (for full-time employees)
• Short & long-term disability
• Opportunity to learn from a dedicated leadership team
• Top-of-the-line company swag
