Senior Site Reliability Engineer, Messaging Services

🕒 April 30

NBA

11 - 50 employees

Founded 2007

🏠 Real Estate

🤝 B2B

NBA is an award-winning architecture and interior design studio based in the GCC, founded in 2007 by architect Nadia Bakhurji. The firm delivers human-centered architecture and interiors for residential, office, commercial/retail, mixed-use, master-planning, and public projects, emphasizing modern design methodologies, high-quality craftsmanship, and a client-first collaborative process. NBA has an established track record of regional and international awards and works with both private developers and public-sector clients.

📋 Description

• Own reliability and operational excellence for messaging and collaboration platforms.
• Serve as a senior escalation point for complex issues across Proofpoint, Exchange, Outlook, SMTP, and various collaboration services.
• Lead incident response, service restoration, and root cause analysis in high-pressure, real-time scenarios.
• Design and operate highly available, secure, and compliant messaging architectures.
• Automate operations using PowerShell and Microsoft Graph.
• Administer and support Slack and Microsoft Teams, including tenant/workspace configuration and escalations.
• Provide white-glove, high-touch support for the NBA Executive Leadership Team, requiring professionalism, discretion, and urgency.
• Partner with Security, Legal, and Audit teams on compliance, retention, and eDiscovery.
• Participate in a formal rotating on-call schedule. Provide after-hours, weekend, and holiday support.

🎯 Requirements

• A bachelor's degree in computer science or related technical field
• 10+ years of experience in enterprise messaging, infrastructure, or reliability engineering
• Deep expertise in Microsoft Exchange Online / M365 and hybrid messaging environments
• Strong experience with email security (DMARC, DKIM, SPF, ARC; Proofpoint a plus)
• Advanced PowerShell automation skills
• Experience supporting Slack enterprise environments
• Proven ability to support executives and business-critical operations with urgency and precision
• Calm, decisive, and effective during live incidents and high-visibility NBA events

🏖️ Benefits

• medical
• dental
• vision
• life/AD&D insurance
• short- and long-term disability
• fertility and family-forming assistance
• wellbeing allowance
• educational assistance
• mental health coaching/therapy
• tax-advantaged accounts such as HSA and healthcare/dependent care FSAs
• a 401(k) retirement plan
• time-off benefits that include vacation, sick time, and personal days
