FedRAMP Site Reliability Engineer

November 22

Apply Now
Logo of Confluent

Confluent

Artificial Intelligence ‱ SaaS ‱ Cloud Computing

Confluent is a company that specializes in data streaming platforms which turn real-time data events into actionable outcomes. Their solutions enable the development of intelligent, real-time applications, empowering teams and systems to respond to data instantly. Confluent builds a new data category that impacts the real world by providing the infrastructure for real-time data streaming, which is recognized and partnered with major tech companies like Google Cloud and Microsoft. The company maintains a remote-first culture, hiring talent from over 25 countries, and values diversity and inclusivity in their workplace.

1001 - 5000 employees

Founded 2014

đŸ€– Artificial Intelligence

☁ SaaS

💰 Secondary Market on 2021-06

📋 Description

‱ Understand and participate in the changing FedRAMP space by quickly ramping up with the 20x controls and building upon these to maintain federal compliance ‱ Own and champion high operational standards of Confluent Cloud systems leveraged by federal agencies ‱ Deploy production changes to Confluent Cloud systems and infrastructure through established change management processes ‱ Assist with process improvements and adoption of change management ‱ Own monitoring and incident handling of complex distributed systems, engaging engineering teams when needed through an escort model system. ‱ Act as a core member of Confluents Business Continuity Plan and Disaster Recovery team with efforts across 3 large verticals ‱ Innovate and design solutions to reduce toil, bolster operational maturity, and make day-to-day worklife easier. ‱ Participate in a 24/7 on-call rotation to maintain the integrity of Confluent Cloud for Government systems

🎯 Requirements

‱ 3-5 years of relevant experience ‱ Expertise in Cloud Native technologies with experience operating production services in the cloud ‱ Strong fundamentals of Distributed Systems and their design ‱ Deep knowledge of Kubernetes and containerization ‱ Strong infrastructure as code knowledge (Terraform preferred) ‱ Experience with telemetry tooling to monitor production systems (DataDog, Grafana, Prometheus) ‱ Experience with BCP/DR and high availability exercises ‱ Ability to quickly problem-solve and troubleshoot critical services ‱ Proficiency with scripting and automation (e.g Go, Java, Python, Bash) ‱ Exceptional teamwork, collaboration skills, and the ability to act critically with minimal supervision at times in a remote first environment ‱ Experience with a rotating on-call schedule to provide 24/7 support ‱ BS Degree in Computer Science, Engineering, or equivalent experience.

đŸ–ïž Benefits

‱ Health insurance ‱ Flexible work arrangements ‱ Professional development opportunities

Apply Now

Similar Jobs

November 21

Senior Site Reliability Engineer responsible for maintaining production systems' reliability at Zensurance. Collaborate with engineering teams, automate tasks and improve incident management processes.

AWS

Cloud

Distributed Systems

Grafana

Kubernetes

Prometheus

Splunk

Terraform

November 20

Akinox

51 - 200

Team Lead DevOps guiding and evolving the Infrastructure team for a health tech company. Improving deployment automation and collaborating across teams.

đŸ—ŁïžđŸ‡«đŸ‡· French Required

Azure

Cloud

Kubernetes

November 17

Deployment Engineer at Versaterm collaborating with law enforcement for technical data interfaces. Ensuring smooth deployment processes for public safety technology with strong customer focus.

Cloud

ETL

SOAP

SQL

November 17

Schema App

11 - 50

DevOps Engineer responsible for AWS infrastructure and automation at Schema App, enhancing search visibility. Collaborating with developers for high availability and system security.

AWS

Cloud

Docker

Kubernetes

Python

Terraform

November 17

DevOps Engineer managing cloud infrastructure on AWS for a fast-growing SaaS platform. Collaborating with teams to implement best practices and optimize system reliability.

Ansible

AWS

Chef

Cloud

Linux

Oracle

Postgres

Puppet

Python

Terraform

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com