Lead Business Analyst – Alert Management, Observability Standards

Job not on LinkedIn

🕒 May 26

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Astreya

Astreya

1001 - 5000 employees

Founded 2001

🔒 Cybersecurity

🏢 Enterprise

☁️ SaaS

Cybersecurity • Enterprise • SaaS

Astreya is a leading global provider of IT Managed Services and Technology Solutions, known for its innovative approach to digital engineering and IT logistics. The company focuses on empowering businesses to excel in today's dynamic digital landscape by maximizing productivity and fostering innovation. Astreya offers a range of services including Data Center & Network Management, Digital Workplace Services, Next-Gen Digital Engineering, and Cybersecurity Services. With a commitment to excellence and a focus on operational frameworks, Astreya aims to transform technology into a valuable strategic asset for organizations worldwide.

📋 Description

• Provide solutions that help attain business outcomes. • Responsible for rationalizing and governing all system alerts to ensure they align with department priorities, operational coverage models, and service reliability goals. • Define alerting standards, review and approve alerts before they are routed to the 24x7 Eyes-on-Glass Operations team, and establish a scalable approach to cataloging alert response instructions (runbooks/playbooks). • Operate at the intersection of the IT Operations Command Center (OCC), engineering/application teams, platform/monitoring tool owners, and service owners, ensuring alerts are actionable, prioritized, and paired with clear response guidance. • Establish and maintain a department-wide alert rationalization framework that evaluates alerts for: business/service criticality and operational priority, actionability, signal-to-noise, and ownership. • Perform regular alert reviews to ensure alert quality, correct routing, and alignment with operational coverage. • Lead continuous improvement efforts to reduce alert fatigue while preserving detection of true incidents and high-impact degradation. • Define and enforce alerting standards.

🎯 Requirements

• 5+ years in IT Operations, SRE, Observability, Monitoring Engineering, or Incident Management • Demonstrated success reducing noise and improving actionability across enterprise alerting ecosystems • Experience with common monitoring/observability tools (e.g., Splunk, AppDynamics, Dynatrace, Datadog, Prometheus/Grafana, Azure Monitor, CloudWatch, ServiceNow Event Mgmt or similar) • Strong understanding of: Incident response workflows and operational coverage models (24x7 vs. business hours) • CMDB/service ownership concepts and dependency mapping • Standard operating procedures/runbooks and knowledge management • Excellent stakeholder management and ability to drive standards across teams.

🏖️ Benefits

• Medical provided through UHC (PPO, HSA, Surest options) / Medical provided through Kaiser (HMO option only) for California employees only • Dental provided through UHC Nationwide • Vision provided by UHC • Flexible Spending Account for Health & Dependent Care • Pre-Tax Account for Commuter Benefit/Parking & Transit (location-specific) • Continuing Education and Professional Development via various integrated platforms, e.g. Udemy and Coursera • Corporate Wellness Program provided by Goomi Group • Employee Assistance Program • Wellness Days • 401k Plan • Basic and Supplemental Life Insurance • Short Term & Long Term Disability • Critical Illness, Critical Hospital, and Voluntary Accident Insurance • Tuition Reimbursement (available 6 months after start date, capped) • Paid Time Off (accrued and prorated, maximum of 120 hours annually) • Paid Holidays • Any other statutory leaves, paid time, or other ancillary benefits required under state and federal law

Apply Now

Similar Jobs

🕒 May 24

Amplify

1001 - 5000

📚 Education

BPM Analyst at Amplify leading end-to-end process documentation and improvement initiatives. Collaborating to analyze workflows and identifying inefficiencies for strategic business objectives.

🕒 May 23

Mobile Mentor

51 - 200

🏢 Enterprise

🔒 Cybersecurity

Senior Business Analyst driving AI-driven solutions with a focus on Microsoft technologies for client engagements. Leading business strategies and analysis initiatives across various projects.

🕒 May 23

CACI International Inc

10,000+ employees

🔒 Cybersecurity

Business Analyst supporting Agile cross-functional teams in user story creation and backlog refinement. Collaborating with stakeholders to align business objectives with execution in software development.

🕒 May 23

Cengage Group

5001 - 10000

📚 Education

🛍️ eCommerce

☁️ SaaS

Legal Solutions Analyst responsible for advancing legal technology and efficiency at Cengage. Collaborating with teams to integrate data insights and AI solutions for enhanced workflows.

🇺🇸 United States – Remote

💵 $67k - $87.1k / year

💰 Private Equity Round on 2023-06

⏰ Full Time

🟡 Mid-level

🟠 Senior

🧐 Business Analyst

🕒 May 23

Solenis

10,000+ employees

⚡ Energy

🔬 Science

Senior Business Analyst running programs to eliminate process friction at Solenis. Engaging with cross-functional teams to execute projects and drive business adoption while leading governance processes.