Principal Software Engineer – SRE

🕒 March 11

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of PTC

PTC

5001 - 10000 employees

Founded 1985

🏢 Enterprise

Enterprise • Manufacturing

PTC is a leading software solutions provider that focuses on transforming the way products are designed, manufactured, and serviced. The company offers digital solutions that improve product development, reduce costs, and enhance product quality by allowing collaboration across teams. PTC is renowned for its expertise in enterprise PLM and SLM, making it a preferred partner for manufacturers and service teams looking to optimize processes and innovate. Its tools are utilized by numerous Fortune 500 companies involved in discrete manufacturing, enabling significant improvements in manufacturing efficiency and service operations.

📋 Description

• Own Reliability at Scale • Lead design, implementation, and evolution of reliability, availability, and resiliency strategies for large-scale distributed systems • Identify systemic risks in application architecture, data flows, and infrastructure • Drive operational excellence by preventing, detecting, and mitigating incidents • Apply advanced software engineering practices to eliminate manual work and improve system observability • Partner with product engineers and engineering leadership to ensure reliability in system design • Contribute to longer-term reliability and infrastructure strategy aligned with business growth

🎯 Requirements

• US Citizenship or Permanent Residents only due to ITAR requirements • Ability to work east coast (EST) hours • 10+ years of experience in software engineering, site reliability engineering, or systems engineering roles • Extremely strong proficiency with the Java programming language and its ecosystem • Deep experience operating complex, distributed systems in production environments • Strong software engineering background, with a track record of delivering high-quality, maintainable code • Expert understanding of incident management, service reliability, and performance engineering • Strong hands-on experience with observability (metrics, logs, traces), capacity planning, and SLO-driven reliability • Deep familiarity with modern cloud-based infrastructure, CI/CD pipelines, and infrastructure-as-code practices • Comfortable making high-impact technical decisions in ambiguous environments

🏖️ Benefits

• Medical, dental and vision insurance • Paid time off and sick leave • Tuition reimbursement • 401(k) contributions and employer match • Flexible spending accounts • Life insurance • Disability coverage • Commuter subsidy

Apply Now

Similar Jobs

🕒 March 9

Dave

201 - 500

Lead Site Reliability Engineering across GCP infrastructure at Dave, a fintech innovator enabling accessible financial services. Shape the reliability and performance strategies for a growing platform.

Cloud

DNS

Google Cloud Platform

JavaScript

Kubernetes

MySQL

Python

Redis

SQL

Terraform

TypeScript

Go

🕒 March 7

Inetum

10,000+ employees

🤝 B2B

🏢 Enterprise

☁️ SaaS

Expert DevOps / DevSecOps supporting Generative AI initiatives at Inetum for digital transformation in the United States. Designing high-value GenAI use cases and integrating new tools and practices.

🗣️🇫🇷 French Required

Cloud

Open Source

🕒 March 3

Kapitus

201 - 500

💸 Finance

💳 Fintech

🤝 B2B

Cloud DevSecOps Engineer III enhancing security for Kapitus through AWS solutions. Responsibilities include monitoring, programming, testing, and collaboration with developers.

AWS

Azure

Cloud

Distributed Systems

DynamoDB

🕒 February 27

Fuze Health

1001 - 5000

☁️ SaaS

🤝 B2B

💊 Pharmaceuticals

Staff DevSecOps Engineer shaping security architecture in complex healthcare systems. Joining Fuze Health's Engineering organization to enhance security posture across platforms.

AWS

Cloud

Google Cloud Platform

Jenkins

Kubernetes

Python

Ruby

Terraform

Go

🕒 February 26

Twilio

5001 - 10000

Reliability Architect at Twilio defining and leading solutions for reliable products. Collaborating with teams to ensure operational excellence and scalability in high-scale systems design.

AWS

Cloud

Distributed Systems

Grafana

Java

Kubernetes

Microservices

Prometheus

Python

Terraform

Go