Site Reliability Engineer

Job not on LinkedIn

🕒 April 1

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Fortyx

Fortyx

1 - 10 employees

Fortyx is a company that is currently in a pre-launch phase, inviting users to subscribe for updates. The company has not provided specific details about its products or services yet. It emphasizes the use of cookies to improve website user experience and analyze traffic. The launch is anticipated in 2024.

📋 Description

• Collaborate with software engineering and operations teams to design, build, and maintain cloud-based infrastructure using AWS and Terraform • Implement and enhance infrastructure-as-code (IaC) practices using Terraform to ensure reproducibility and scalability of infrastructure components • Develop and maintain monitoring solutions to proactively identify performance bottlenecks, system outages, and other potential issues • Participate in incident response and root cause analysis efforts to drive continuous improvement and prevent future incidents • Optimise system performance, reliability, and cost efficiency through continuous monitoring, performance tuning, and capacity planning • Identify opportunities to automate manual processes and improve system resilience • Utilise Python or Bash scripting to create and maintain automation tools for various operational tasks and deployments • Implement and improve continuous integration and continuous deployment (CI/CD) pipelines • Collaborate with security teams to implement best practices for securing cloud infrastructure and services • Ensure compliance with relevant industry standards and regulations • Support CI/CD pipelines for application deployments and updates • Contribute to the design and implementation of deployment strategies that promote zero-downtime releases • Maintain clear and up-to-date documentation for infrastructure configurations, processes, and incident resolution procedures • Participate in knowledge sharing with team members to enhance overall expertise and skill sets

🎯 Requirements

• Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent practical experience) • Proven experience as a Site Reliability Engineer or similar role • Extensive experience with Amazon Web Services (AWS) and its core services (EC2, S3, RDS, IAM, etc.) • Strong proficiency in infrastructure-as-code (IaC) tools, with a focus on Terraform • Proficient in scripting with Python or Bash for automation and operational tasks • Solid understanding of networking principles and protocols • Knowledge of CI/CD pipelines and related tools

🏖️ Benefits

• equity-only position • opportunity to gain a stake in a rapidly growing company • contribute directly to its success

Apply Now

Similar Jobs

🕒 March 31

RemoteStar

11 - 50

🤝 B2B

🎯 Recruiter

☁️ SaaS

Senior Site Reliability Engineer Manager ensuring infrastructure and service reliability. Leading SRE team and driving operational excellence in a B2B diamond marketplace.

AWS

Azure

Cloud

Google Cloud Platform

Grafana

Prometheus

Python

Go

🕒 March 31

Keywords Studios

10,000+ employees

🎮 Gaming

📱 Media

🤖 Artificial Intelligence

Azure DevOps Engineer supporting Azure services for Keywords Group in the global Video Game Industry. Managing cloud solutions and leading projects in a remote environment.

AWS

Azure

Cloud

SQL

🕒 March 31

Whitespace Software

51 - 200

🔌 API

💸 Finance

Senior DevOps Engineer at WhiteSpace Technology managing cloud provisioning and high availability. Collaborating with developers and implementing CI/CD while ensuring system hardening and security.

Ansible

Cloud

Grafana

Prometheus

Python

🕒 March 31

Whitespace Software

51 - 200

🔌 API

💸 Finance

DevOps Engineer at WhiteSpace managing cloud provisioning and high availability systems. Collaborating with development team on user stories and ensuring environment security.

Ansible

Cloud

Grafana

Prometheus

Python

🕒 March 31

Tartan Social

1 - 10

🤝 B2B

🛍️ eCommerce

DevOps Engineer responsible for building and maintaining PlaidCloud on Kubernetes with automation processes and deployment strategies. Ensures high availability and efficient resource usage for customer deployments.

Apache

Cloud

Firewalls

Greenplum

Jenkins

Kubernetes

Linux

Python

RabbitMQ

Redis

Unix