Site Reliability Engineer

Job not on LinkedIn

September 28

Apply Now
Logo of Flex Dental Solutions

Flex Dental Solutions

Flex is a collection of smart and easy-to-use tools that pair perfectly with Open Dental to supercharge and simplify your practice workflow. No more complicated interfaces and half-baked integrations — it's time for a solution that really works.

11 - 50 employees

📋 Description

• Be available to respond to critical service incidents outside of business hours on a rotating on-call schedule. • Proactively monitor application health and performance across cloud infrastructure (AWS). • Troubleshoot and prevent service interruptions in real-time, working closely with development teams to resolve incidents efficiently. • Lead and participate in disaster recovery drills and security incident simulations. • Implement Infrastructure as Code (IaC) and maintain scalable deployments using AWS-native tools and services. • Collaborate with development teams to ensure smooth CI/CD workflows using Git and containerized deployments (Docker). • Work closely with stakeholders and product teams to ensure technical reliability aligns with business needs. • Support and improve observability tools, alerting mechanisms, and logging infrastructure to promote transparency and response agility. • Champion best practices in security, availability, performance, and incident response.

🎯 Requirements

• 3+ years of experience in a Site Reliability, DevOps, or related engineering role. • Proven track record managing and scaling applications in a production AWS environment. • Strong proficiency in Amazon Web Services (AWS) with knowledge of services like EC2, ECS, RDS, CloudWatch, and IAM. • Proficiency in Node.js and scripting for automation and tooling. • Experience with Docker for container-based deployment pipelines. • Familiarity with React and Ember.js to understand performance implications at the frontend level. • Understanding of NestJS and scalable Node-based services. • Proficient in MySQL and performance monitoring of relational databases. • Proficiency with Git for collaborative code management and DevOps workflow integration. • Experience with container orchestration (e.g., ECS or Kubernetes is a plus). • Be available to respond to critical service incidents outside of business hours on a rotating on-call schedule. • Commitment to uptime, performance, and security in fast-moving SaaS environments.

🏖️ Benefits

• Fostering a great workplace culture for our team. • Remote (United States)

Apply Now

Similar Jobs

September 26

DevOps Engineer building CI/CD pipelines and managing cloud/on-prem infrastructure at Dresden Partners. Supporting container orchestration, monitoring, and security practices for global clients.

🗣️🇪🇸 Spanish Required

Ansible

AWS

Chef

Docker

Java

Jenkins

Kubernetes

Linux

Puppet

Python

Ruby

Unix

September 26

DevOps Engineer building and operating AWS CI/CD pipelines and IaC for eSimplicity's federal digital services. Ensuring secure, highly available cloud infrastructure and operational support.

AWS

Cloud

Docker

Kubernetes

Redis

Terraform

September 26

Lead cloud and CI infrastructure using Terraform for The Helper Bees' in-home care SaaS. Mentor engineers, improve observability, and optimize deployments.

Ansible

Azure

Cloud

Docker

Google Cloud Platform

Terraform

Go

September 26

ONE

201 - 500

💳 Fintech

Site Reliability Engineer building observability, automation, and cloud infrastructure for OnePay's fintech products. Ensure scalability, security, and incident management for millions of US customers.

AWS

Cloud

Distributed Systems

Grafana

Java

JavaScript

Kubernetes

Node.js

Prometheus

Python

Terraform

TypeScript

Go

September 25

DevOps Engineer at SOFTGIC S.A.S.; manage AWS EKS, Terraform, CI/CD and observability to ensure scalable, reliable cloud services.

AWS

Docker

EC2

Kubernetes

Linux

Terraform

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com