Platform Site Reliability Engineer

Job not on LinkedIn

July 17

Apply Now
Logo of Nexthink

Nexthink

SaaS • Enterprise

Nexthink is a Digital Employee Experience (DEX) platform that empowers IT teams to see, diagnose, and fix digital workplace issues. It leverages AI-powered solutions for real-time alerting, intelligent diagnostics, and automated remediation, ensuring optimization of workplace applications, collaboration tools like Teams and Zoom, and overall employee engagement. Nexthink helps organizations enhance IT efficiency, manage digital transformation, and maintain cost-effective digital work environments with measurable impact and operational excellence. The platform supports over 15 million endpoints globally, providing unparalleled visibility and automation for proactive IT management and service desk efficiency.

501 - 1000 employees

Founded 2011

☁️ SaaS

🏢 Enterprise

💰 Series D on 2021-02

📋 Description

• Nexthink is looking for a strong Platform Engineer with SRE operations experience to strengthen our infrastructure and accelerate our ability to deploy, monitor, and scale systems effectively. • This role needs to be located in West or Mountain Time Zone. • Join Nexthink's vibrant team where cutting-edge technology meets innovation. • Be a part of Nexthink's Digital Employee Experience technological revolution, ensuring our global customers enjoy a seamless user experience.

🎯 Requirements

• Minimum BS in Computer Science/Engineering • 5+ years in an SRE/platform engineering role supporting SaaS platforms. • Strong hands-on experience with public cloud services (AWS, GCP, Azure). • Proficiency with Kubernetes , container-based deployment and related ecosystems (Helm...), and containerized microservices. • Strong programming or scripting skills (Python, Go, Bash...). • Experience with CI/CD pipelines (e.g., GitHub Actions, GitLab CI, ArgoCD). • Experience with observability stacks (Prometheus, ELK/EFK, Datadog, etc.). • Comfort with being part of a rotating on-call schedule , including handling critical incidents and conducting post-incident reviews. • Strong system-level troubleshooting skills and a proactive mindset toward incident prevention. • Deep understanding of Linux systems , networking, and common troubleshooting practices. • Experience supporting multi-tenant microservices architectures . • Familiarity with service mesh , e.g., Istio. • Knowledge of zero-downtime deployment strategies , blue/green and canary releases. • Exposure to compliance standards such as SOC 2, ISO 27001, or HIPAA. • FedRAMP experience is a big plus. • Experience with chaos engineering or resilience testing practices.

🏖️ Benefits

• Flexible Hours and unlimited vacation (employees have unlimited paid time off on top of the 15 days of holidays we offer) • 11 company-paid holidays, and 3 extra days for volunteering. • Hybrid work model that balances office and remote work, with structured onboarding to foster connections and team integration. • Free access to professional training platforms to explore your interests and enhance your skills. • Up to 16 weeks of paid leave for birthing parents/primary caregivers, 6 weeks for secondary caregivers. • Plan for the future with a 401(k) plan featuring up to 4% company matching contributions, vesting immediately, to grow your retirement savings. • Bonuses for referring successful hires after three months of continuous employment. • 100% covered company benefits that consist of health, dental, vision as well as access to life insurance, long-term disability, and accidental death/personal loss coverage.

Apply Now

Similar Jobs

July 15

Edlio

51 - 200

📚 Education

☁️ SaaS

🤝 B2B

Join Edlio to design and maintain secure cloud infrastructure while leading DevOps initiatives.

🇺🇸 United States – Remote

💰 Private Equity Round on 2018-10

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

July 11

FluidStack

11 - 50

🤖 Artificial Intelligence

SREs at Fluidstack ensure reliability of GPU cloud infrastructure, scaling with AI workloads.

🇺🇸 United States – Remote

⏰ Full Time

🟢 Junior

🟡 Mid-level

⛑ DevOps & Site Reliability Engineer (SRE)

🚫👨‍🎓 No degree required

July 11

Prompt Therapy Solutions Inc

11 - 50

⚕️ Healthcare Insurance

⚡ Productivity

☁️ SaaS

Revolutionize healthcare as a Senior DevOps Engineer at Prompt, managing infrastructure and deployment.

🇺🇸 United States – Remote

💵 $200k - $225k / year

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

July 11

TetraScience

51 - 200

🤖 Artificial Intelligence

🧬 Biotechnology

☁️ SaaS

Lead product lifecycle processes within TetraScience's AI and cloud solutions.

🇺🇸 United States – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🦅 H1B Visa Sponsor

July 11

Swish Analytics

11 - 50

🎲 Gambling

🎮 Gaming

⚽ Sports

Swish Analytics seeks a DevOps Engineer. Role involves managing Kubernetes for predictive sports analytics workloads.

🇺🇸 United States – Remote

💵 $120k - $190k / year

💰 $6.9M Series B on 2019-05

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🦅 H1B Visa Sponsor

Developed by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com