Senior Engineer, Production Operations

Job not on LinkedIn

August 25

Apply Now
Logo of Greenlight

Greenlight

Fintech • Education • B2C

Greenlight is a financial technology company dedicated to providing financial literacy tools and experiences for families, particularly focusing on children and teens. It offers a debit card specifically designed for kids, supported by an app that allows parents to monitor spending, automate allowances, and set savings goals. Greenlight facilitates a comprehensive learning experience by incorporating chores management, investing opportunities, and a financial education game that teaches real-world money skills. It also includes safety features such as location sharing and driving alerts to keep families connected and secure. Through partnerships with banks, the app provides various customizable plans that encourage smart financial habits from an early age.

201 - 500 employees

Founded 2014

💳 Fintech

📚 Education

👥 B2C

💰 $260M Series D on 2021-04

📋 Description

• Contribute to the design, implementation, and maintenance of Greenlight's core cloud infrastructure and Site Reliability Engineering (SRE) practices to ensure high availability, scalability, and performance • Develop, maintain, and optimize our cloud infrastructure using Infrastructure as Code (primarily Terraform) and other automation tools • Collaborate closely with development and security teams to embed SRE principles into the software development lifecycle, promoting secure and reliable coding practices • Design and implement robust monitoring, logging, and alerting solutions to provide comprehensive visibility into system health • Actively participate in and support incident response, performing deep-dive root cause analysis, and contributing to actionable blameless postmortems to prevent recurrence • Identify and implement architectural improvements to enhance system reliability, resilience, and operational efficiency • Automate operational tasks and processes to reduce toil and improve efficiency • Research, evaluate, and advocate for new technologies and tools that can improve our operational posture and efficiency • Enhance existing services and applications to increase availability, reliability, and scalability in a microservices environment • Build and improve engineering tooling, processes, and standards to enable faster, more consistent, more reliable, and highly repeatable application delivery

🎯 Requirements

• 5+ years of experience in a Site Reliability Engineering, Production Operations, or similar role • Proven experience architecting, building, and maintaining highly available, secure, and scalable systems in a public cloud environment (AWS strongly preferred) • Strong proficiency with IaC tools, particularly Terraform • Demonstrated experience in automating operational tasks using scripting languages (e.g., Python, Go, Bash) and automation platforms • Expertise in designing and implementing comprehensive monitoring, logging, and alerting solutions (e.g., Datadog, Prometheus, Grafana, ELK stack) • Solid understanding of incident response best practices, with experience in troubleshooting and resolving complex production issues • Strong understanding of distributed systems, microservices architectures, and containerization technologies (Docker, Kubernetes/EKS) • Exceptional analytical and problem-solving skills, with a track record of debugging complex issues in production environments • Excellent communication, collaboration, and interpersonal skills • A passion for identifying and implementing improvements in system reliability, performance, and operational efficiency

🏖️ Benefits

• Medical, dental, vision, and HSA match • Paid life insurance, AD&D, and disability benefits • Traditional 401k with company match • Unlimited PTO • Paid company holidays and pop-up bonus holidays • Professional development stipends • Mental health resources • 1:1 financial planners • Fertility healthcare • 100% paid parental and caregiving leave, plus cleaning service and meals during your leave • Flexible WFH, both remote and in-office opportunities • Fully stocked kitchen, catered lunches, and occasional in-office happy hours • Employee resource groups

Apply Now

Similar Jobs

August 25

Zeely – AI Admaker

1 - 10

🛍️ eCommerce

☁️ SaaS

Lead Flutter development and team for Zeely AI — build AI-driven UGC ad and campaign tools.

🇺🇸 United States – Remote

⏰ Full Time

🟠 Senior

🧑‍💻 Full-stack Engineer

August 25

QuotaPath

51 - 200

☁️ SaaS

🏢 Enterprise

Senior engineer building QuotaPath's commission-tracking product; full-stack React/TypeScript and Python.

🇺🇸 United States – Remote

💵 $130k - $180k / year

💰 $41M Series B on 2022-04

⏰ Full Time

🟠 Senior

🧑‍💻 Full-stack Engineer

August 25

Embrace Software Inc

201 - 500

💸 Finance

📚 Education

Full-Stack Engineer building AI-powered prototypes for Embrace Software, delivering production-ready AI features across backend and frontend.

🇺🇸 United States – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

🧑‍💻 Full-stack Engineer

August 22

TetraScience

51 - 200

🤖 Artificial Intelligence

🧬 Biotechnology

☁️ SaaS

Senior .NET engineer building scalable backend and TypeScript frontends for TetraScience’s scientific data and AI cloud. Responsible for testing, GitHub Actions CI/CD, and AWS debugging.

🇺🇸 United States – Remote

⏰ Full Time

🟠 Senior

🧑‍💻 Full-stack Engineer

🦅 H1B Visa Sponsor

August 22

CrowdStrike

5001 - 10000

🔒 Cybersecurity

☁️ SaaS

🤖 Artificial Intelligence

Senior cloud backend engineer building large-scale risk ingestion services for CrowdStrike. Owns GoLang microservices, cloud integration, and data-driven security automation.

🇺🇸 United States – Remote

💵 $110k - $180k / year

⏰ Full Time

🟠 Senior

🔴 Lead

🧑‍💻 Full-stack Engineer

🦅 H1B Visa Sponsor

Developed by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com