DevOps Reliability Engineer

Advanced Solutions International, Inc.

201 - 500 employees

Founded 1991

💰 Venture Round on 2022-01

B2B • Non-profit • Software

Advanced Solutions International, Inc. is a company dedicated to providing innovative solutions tailored for non-profit organizations. Their flagship product, iMIS, is recognized as a top association management software, facilitating effective member engagement and organizational performance. ASI also offers various platforms like Clowder and TopClass, aimed at enhancing mobile engagement and learning management for associations.

📋 Description

• Monitor and improve the health, availability, performance, and cost efficiency of Azure-based production systems.
• Use application, database, and infrastructure telemetry to identify performance issues, bottlenecks, and reliability risks.
• Tune Azure services and platform configurations to maximize performance, resilience, and resource efficiency.
• Partner with engineering teams to recommend and implement practical, data-driven improvements to reliability, scalability, and operational effectiveness.
• Create and maintain operational documentation, runbooks, and troubleshooting guides to support consistent incident response and ongoing operations.
• Support Tech Support and Sustained Engineering by executing approved SQL queries and completing database backups and restores for troubleshooting purposes.
• Analyze how partner integrations and customer usage patterns impact system performance and cloud spend.
• Investigate complex production issues, perform root cause analysis, and drive resolution of reliability and performance problems.
• Contribute to continuous improvement across deployment processes, system stability, and operational readiness.
• Perform other job-related duties and responsibilities as assigned.

🎯 Requirements

• Bachelor's degree in Computer Science, Information Technology, or a related field, or equivalent relevant experience.
• 8+ years of experience in DevOps, Site Reliability Engineering, Cloud Engineering, or similar roles.
• Strong hands-on experience with Microsoft Azure, especially Azure SQL, Azure Functions, Azure App Services, and Azure Containers (AKS, Container Apps, or similar).
• Ability to read and interpret telemetry, logs, metrics, and resource usage data, and to explain what's wrong and how to fix it.
• Experience working with production systems that require high availability and reliability.
• Comfort owning work end-to-end, from identifying issues to executing improvements.
• Experience adjusting pipelines, hosting configurations, and deployment processes.
• Solid understanding of cloud cost drivers and usage optimization.
• Strong problem-solving skills and the ability to work collaboratively across engineering and support teams.
• Ability to read and interpret application code to support troubleshooting, root cause analysis, and identification of performance improvement opportunities.

🏖️ Benefits

• Wellness Benefits
• Opportunities for Professional Growth and Development
• Flexible Remote Work
• Volunteer Time Off
• Study Leave
• Employee Assistance Program
