
11 - 50 employees
🤖 Artificial Intelligence
🔌 API
⚡ Energy
💰 $100M Series A on 2021-06
Artificial Intelligence • API • Energy
Climavision is a pioneering weather intelligence company that revolutionizes weather forecasting through its advanced suite of AI-powered products and services. With a focus on precision and customization, Climavision offers hyper-accurate weather insights for various industries, including agriculture, energy, and transportation. Their innovative Horizon AI models integrate diverse data sources, enabling businesses and government agencies to make informed decisions by anticipating weather-related risks and optimizing operations.
🔥 3 minutes ago
🇺🇸 United States – Remote
💵 $135k - $170k / year
⏰ Full Time
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)
Improve your chances of getting an interview by checking your resume score before you apply.

11 - 50 employees
🤖 Artificial Intelligence
🔌 API
⚡ Energy
💰 $100M Series A on 2021-06
Artificial Intelligence • API • Energy
Climavision is a pioneering weather intelligence company that revolutionizes weather forecasting through its advanced suite of AI-powered products and services. With a focus on precision and customization, Climavision offers hyper-accurate weather insights for various industries, including agriculture, energy, and transportation. Their innovative Horizon AI models integrate diverse data sources, enabling businesses and government agencies to make informed decisions by anticipating weather-related risks and optimizing operations.
• Own production reliability for Climavision’s customer-facing platform and radar-derived weather data services across Azure, colocation, and edge Kubernetes environments. • Contribute to the definition and improvement of SLIs, SLOs, alerting standards, and operational metrics used to measure platform reliability. • Support and coordinate production incident response efforts, including troubleshooting, mitigation, communication, and postmortem analysis. • Diagnose and resolve complex production issues across application services, Kubernetes infrastructure, storage, and distributed systems. • Drive multi-replica and multi-cluster high availability across Climavision’s .NET services. • Improve reliability and operational maturity of production platform services, including observability, autoscaling, ingress, and distributed storage. • Partner with software engineering teams to improve production readiness, resiliency patterns, deployment safety, and operational visibility before services reach production. • Support and evolve Climavision’s observability platform, including metrics, logging, distributed tracing, dashboarding, and alerting.
• A bachelor’s degree in computer science, software engineering, or a related field; equivalent professional experience considered. • Minimum of 7 years of experience in Site Reliability Engineering, DevOps, Production Engineering, Platform Engineering, or a related infrastructure-focused role, with at least 4 years in a role formally titled Site Reliability Engineer or carrying explicit SLO / error-budget accountability. • Strong, hands-on software engineering experience with a minimum of 3 years of experience supporting and modifying C# / .NET applications in production environments. • Demonstrated experience refactoring production application code (preferably C# / .NET) to make services horizontally scalable across multiple replicas. • Experience designing or operating multi-cluster high-availability architectures, including failover behavior, traffic routing, and cross-cluster service deployment. • Strong hands-on experience operating production workloads in self-managed or highly customized Kubernetes environments. • Experience diagnosing and resolving production incidents across application, platform and Kubernetes infrastructure layers, including workload scheduling, storage, ingress, and cluster-level failures. • Strong written and verbal communication skills, including incident documentation and postmortem authoring.
• Competitive compensation • Comprehensive benefits package • 401(k) Savings Plan • Medical/Dental/Vision Benefits • Health Savings Account (HSA) and Flexible Spending Account (FSA) • Unlimited Paid Time-off • 11 Paid Holidays • Paid Parental Leave • Company Paid Short-term Disability (STD) • Company Paid Long-term Disability (LTD) • Company Paid Life Insurance
Apply Now🔥 31 minutes ago
DevOps Engineer developing best-in-class health care reporting service using AWS solutions and CI/CD practices. Collaborating within a team to optimize cloud infrastructure and deployment processes.
🇺🇸 United States – Remote
💵 $108.5k - $184.4k / year
💰 $30M Grant on 2021-03
⏰ Full Time
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)
🦅 H1B Visa Sponsor
🔥 41 minutes ago
Lead DevSecOps Engineer responsible for embedding security in software delivery across multi-cloud environments. Mentoring team members and establishing practices for cloud platform engineering.
🔥 5 hours ago
Stack AV Site Reliability Engineer managing large-scale autonomous systems development and infrastructure performance. Collaborating across teams to enhance reliability, scalability, and automation of compute platforms.
🔥 13 hours ago
Site Reliability Engineer ensuring health, performance, and delivery of infrastructure systems at Mobile Wave Solutions. Working collaboratively with engineers to automate processes and improve operational reliability.
🕒 2 days ago
Technical Engineering Manager leading high-performing cloud and DevOps teams. Guiding architecture and delivery of scalable, reliable, and secure cloud solutions for clients.