DevOps, Platform Lead

Job not on LinkedIn

🕒 6 days ago

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Nsight

Nsight

501 - 1000 employees

Founded 1910

📡 Telecommunications

💰 Undisclosed on 2019-02

Telecommunications

Nsight is a Wisconsin- and Upper Michigan-focused telecommunications holding company and service provider that owns brands including Cellcom, Nsight Telservices, and Nsight Tower. The company emphasizes customer-focused operations, employee development, community involvement and investor relations, offering telecom services, network infrastructure and related support across its operating sites.

📋 Description

• Design, build, and own CI/CD pipelines for all new products and greenfield builds; provide support on existing pipeline infrastructure as needed. • Define pipeline standards that support parallel release streams and implement merge-blocking gates that prevent new high-severity security findings from reaching production. • Design deployments for forward progression: feature flags, canary releases, and automated validation gates that make every release safe to ship without relying on rollback. • Manage GitOps workflows using ArgoCD or equivalent with declarative deployment configuration, sync policies, and environment promotion gates. • Enforce source control standards across all repositories: branching strategy, signed commits, branch protection rules, and CODEOWNERS configuration. • Automate vulnerability detection and remediation workflows at the pipeline level: static analysis, dependency scanning, and container scanning integrated as blocking gates. • Generate and maintain SBOMs for all containerized workloads; container registry scanning integrated into the pipeline as a blocking gate. • Maintain continuous audit evidence trails across all products, enabling rapid response to HiTrust R2 and SOC 2 review requests without a fire drill. • Enforce secrets management, access controls, and HIPAA-compliant infrastructure configurations: KMS, Secrets Manager, IAM policies, and GuardDuty alerting owned and maintained here. • Architect and maintain the infrastructure that supports AI-assisted development: how AI-generated output enters the pipeline, how it gets validated, and how it ships safely to production. • Enforce BAA-compliant AI tooling standards at the infrastructure level, with documented usage boundaries for non-PHI environments. • Build audit trail infrastructure and automated review gates for AI-generated code: every AI contribution entering production must be traceable, attributable, and compliant with the engineering quality bar. • Provide and manage all cloud infrastructure through IaC: Terraform required; no manual console changes to production. • Execute container orchestration on EKS or ECS: configuration, scaling, and environment consistency across all products. • Own disaster recovery planning, availability architecture, and uptime accountability across all production systems. • Build and maintain the full observability stack: Prometheus, Grafana, Loki, Tempo, and OpenTelemetry (or equivalent) with alerting that surfaces real signal, not noise. • Own VPC architecture, DNS, NACLs, and routing across environments: network configuration is infrastructure code, not tribal knowledge.

🎯 Requirements

• 5+ years in DevOps or platform engineering, with at least 2 years in a healthcare or regulated industry environment • Direct, hands on HIPAA compliant deployment experience, not just theory • Hands on AWS at production depth: EKS or ECS with working command of ECR, IAM, VPC, KMS, Secrets Manager, CloudWatch, and GuardDuty; built and operated in this stack • IaC at production scale: Terraform required; all environment configuration is code, reviewed, and version controlled • GitOps practice: ArgoCD or equivalent; declarative deployment, sync policies, and gating promotions across environments safely • Demonstrated GitHub Actions experience at scale; pipelines that engineering teams rely on in production, not sandbox demos • Observability stack ownership: Prometheus, Grafana, Loki, Tempo, OpenTelemetry, or Datadog; built or owned a real observability setup with alerting that drives action, not noise • Container fundamentals: image lifecycle management, ECR, SBOM generation, and container scanning integrated into the pipeline as a gate • Scripting fluency in Python and Bash; network fundamentals including VPC design, DNS, NACLs, and routing • Demonstrated experience integrating SAST and SCA tooling (Snyk, SentinelOne, or equivalents) into CI/CD with merge blocking enforcement • Working knowledge of HiTrust R2 or SOC 2 controls, including audit evidence requirements and how infrastructure decisions create or close compliance gaps • Daily, demonstrated use of Claude Code, GitHub Copilot, or equivalent AI assisted development tools. This is a hard requirement. You cannot build AI native infrastructure if you have never operated inside the model. • Strong track record of platform reliability ownership; on call accountability for production systems.

🏖️ Benefits

• PTO • Medical, Dental, Vision, and supplemental insurance options • 401(k) Plan with 3.5% Company Match • Company-provided equipment

Apply Now

Similar Jobs

🕒 6 days ago

Emergent Software

51 - 200

☁️ SaaS

🤝 B2B

DevOps Architect leading the technical direction of our DevOps practice. Joining the cloud infrastructure team at Emergent Software to mentor engineers and guide client architecture discussions.

Azure

Cloud

Terraform

🕒 6 days ago

Replika

51 - 200

🤖 Artificial Intelligence

👥 B2C

Senior DevOps Engineer improving developer experience at Replika. Collaborating on deployments, CI/CD, and efficient development processes.

AWS

Cloud

Docker

Google Cloud Platform

Kubernetes

Python

🕒 6 days ago

Availity

1001 - 5000

⚕️ Healthcare Insurance

☁️ SaaS

🔌 API

DevOps Engineer V at Availity focusing on cloud-native infrastructure and Kubernetes operational excellence. Leading design and management with a focus on automation and reliability across platforms.

AWS

Cloud

Docker

Grafana

Jenkins

Kubernetes

Prometheus

Python

Splunk

Terraform

🕒 6 days ago

Leidos

10,000+ employees

🔒 Cybersecurity

🔬 Science

OCI DevOps Engineer managing cloud infrastructure and CI/CD pipelines for Leidos' defense healthcare solutions, ensuring integration and security compliance.

Ansible

AWS

Azure

Cloud

Docker

Google Cloud Platform

Jenkins

Kubernetes

Linux

Python

Ruby

Terraform

🕒 6 days ago

Guild Mortgage

1001 - 5000

💸 Finance

🏠 Real Estate

Senior Site Reliability Engineer at Guild Mortgage managing reliability and scalability of software systems. Collaborating across teams to ensure performance and system availability with strategic planning.

Azure

Cloud

Linux

MariaDB

MySQL

SQL

Unix