
Fintech • B2B • SaaS
Lumin Digital is a company that specializes in providing next-generation digital banking solutions for credit unions and banks. Their platform offers a wide array of services, including retail and commercial banking solutions, digital account opening, and tools to enhance user engagement and operational efficiency. With a focus on innovation and cutting-edge technology, Lumin Digital leverages artificial intelligence and robust security features to offer seamless, cloud-native services with near-perfect uptime. They are known for delivering business growth and cost savings for financial institutions, adapting to new technologies, and offering an enhanced user experience.
August 15
🇺🇸 United States – Remote
💵 $170k - $200k / year
⏰ Full Time
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)

Fintech • B2B • SaaS
Lumin Digital is a company that specializes in providing next-generation digital banking solutions for credit unions and banks. Their platform offers a wide array of services, including retail and commercial banking solutions, digital account opening, and tools to enhance user engagement and operational efficiency. With a focus on innovation and cutting-edge technology, Lumin Digital leverages artificial intelligence and robust security features to offer seamless, cloud-native services with near-perfect uptime. They are known for delivering business growth and cost savings for financial institutions, adapting to new technologies, and offering an enhanced user experience.
• Basic Function • The Senior Site Reliability Engineer (SRE) is a developer with a strong operations mindset, responsible for ensuring the reliability, availability, and scalability of Lumin Digital’s applications. This role focuses on eliminating manual tasks through automation, maintaining Service Level Objectives (SLO), and closely collaborating with Software Engineers (SWE) to implement and maintain best practices for large-scale systems. The ideal candidate thrives in solving complex problems, automating processes, and creating resilient systems. • Essential Functions, Responsibilities, Experience: • Design, implement, and manage CI/CD pipelines to improve deployment efficiency. • Monitor and resolve issues in all environments, ensuring SLO and uptime targets are consistently met. • Collaborate with Software Engineers to address SRE concerns during feature design and deployment. • Participate in capacity planning and demand forecasting to proactively address performance bottlenecks and scalability needs. • Perform change management to maintain system stability and minimize disruptions. • Generate uptime and SLO reports for internal review and leadership visibility. • Engage in SRE scrum team activities to drive agile development processes. • Ensure security best practices are followed, safeguarding data integrity and system resilience. • Perform other duties as assigned • Where the Role Will Grow: • 30 Days: Understand the architecture, monitoring tools, and CI/CD pipelines currently in use. Begin participating in SRE team activities and resolving basic operational issues. • 90 Days: Take ownership of monitoring and alerting systems, improve incident response processes, and contribute to SLO reporting. • 1 Year: Deliver measurable improvements in system reliability, scaling capabilities, and automation processes. Take a leadership role in SRE best practices and mentor junior team members. • Knowledge, Skills, & Abilities: • Exceptional full-stack troubleshooting skills, with a focus on resolving system-level issues. • Expertise in at least one configuration management system (e.g., Chef, Ansible, Puppet). • Strong understanding of networking protocols and components such as HTTP, DNS, TCP/IP, and Load Balancing. • Experience with cloud hosting platforms, with AWS preferred (Google Cloud and Azure also valued). • Hands-on experience with Terraform, Kubernetes, and containerization technologies like Docker. • Solid understanding of CI/CD workflows and the ability to architect robust pipelines. • Familiarity with monitoring and alerting strategies, including self-healing and escalation processes. • Commitment to improving on-call experiences by creating resilient and automated systems. • Strong problem-solving skills with a focus on automation and operational efficiency. • Security mindset with a focus on protecting data integrity and resilience. • Excellent written and verbal communication skills. • Proven ability to work within an agile scrum team. • Ability to participate in a 24x7 on-call rotation. • 2+ years of experience as a software engineer, with C#, Angular, or JavaScript preferred. • AWS certifications such as SysOps or Solutions Architect (preferred but not essential). • Experience with Amazon RDS, EKS, and CloudWatch. • Education: Bachelor’s degree or higher in Computer Science, or equivalent experience required. • Travel: Minimal, generally 12 days or less per year, ~2X team get togethers a year
• Exceptional full-stack troubleshooting skills, with a focus on resolving system-level issues. • Expertise in at least one configuration management system (e.g., Chef, Ansible, Puppet). • Strong understanding of networking protocols and components such as HTTP, DNS, TCP/IP, and Load Balancing. • Experience with cloud hosting platforms, with AWS preferred (Google Cloud and Azure also valued). • Hands-on experience with Terraform, Kubernetes, and containerization technologies like Docker. • Solid understanding of CI/CD workflows and the ability to architect robust pipelines. • Familiarity with monitoring and alerting strategies, including self-healing and escalation processes. • Commitment to improving on-call experiences by creating resilient and automated systems. • Strong problem-solving skills with a focus on automation and operational efficiency. • Security mindset with a focus on protecting data integrity and resilience. • Excellent written and verbal communication skills. • Proven ability to work within an agile scrum team. • Ability to participate in a 24x7 on-call rotation. • 2+ years of experience as a software engineer, with C#, Angular, or JavaScript preferred. • AWS certifications such as SysOps or Solutions Architect (preferred but not essential). • Experience with Amazon RDS, EKS, and CloudWatch. • Education: Bachelor’s degree or higher in Computer Science, or equivalent experience required.
Apply NowAugust 9
Join Promptfoo as a Deployment Engineer, bridging customers and product team for AI security.
Docker
JavaScript
Python
August 9
Forge builds AI-native DevOps workspace; seeking engineers who thrive on challenges.
Cloud
Grafana
Kubernetes
Linux
Prometheus
Python
Terraform
Go
August 9
Design and manage infrastructure tools and services using Rust at a progressive technology company.
AWS
Cloud
Docker
Google Cloud Platform
Kubernetes
Prometheus
Rust
August 9
Join Turnkey as a DevOps engineer, managing infrastructure for secure crypto applications.
🇺🇸 United States – Remote
💵 $150k - $250k / year
⏰ Full Time
🟡 Mid-level
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)
🦅 H1B Visa Sponsor
AWS
Cloud
Google Cloud Platform
Linux
Web3
August 1
Site Reliability Engineer (.NET) at Virtuous, enhancing product reliability for nonprofits.
🇺🇸 United States – Remote
⏰ Full Time
🟡 Mid-level
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)
🦅 H1B Visa Sponsor
ASP.NET
Azure
Redis
SQL
.NET