Senior DevOps Engineer – Infrastructure, MLOps

🕒 April 24

🇺🇸 United States – Remote

💵 $180k - $200k / year

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Prompt Therapy Solutions Inc

Prompt Therapy Solutions Inc

11 - 50 employees

⚕️ Healthcare Insurance

⚡ Productivity

☁️ SaaS

Healthcare Insurance • Productivity • SaaS

Prompt Therapy Solutions Inc. is a company that provides therapy-focused software solutions designed to enhance the efficiency and profitability of clinics. They offer a range of features including AI-powered scheduling, patient management, billing, and reporting tools. Their platform caters to various practice sizes, from startup clinics to large enterprises, aiming to optimize clinic operations and improve patient satisfaction. The company's AI solutions help streamline workflows for roles such as front desk staff, therapists, billers, and clinic owners. Through advanced analytics and automated processes, Prompt Therapy Solutions seeks to modernize practice management and drive growth for therapy providers.

📋 Description

• Design, implement, and manage highly available infrastructure for our cloud-based platforms (AWS). • Work with our internal engineering teams to architect and support AI/ML infrastructure, specifically managing AWS Lambda infrastructure, and some legacy SageMaker environments for model training, hosting, and inference. • Create and automate robust deployment pipelines using CI/CD tools (GitLab / GitHub Actions) for both web applications and machine learning models. • Build, maintain, and scale containerized applications with Docker and ECS/Fargate. • Implement MLOps best practices to streamline the transition of models from development to production. • Ensure system scalability and reliability through proactive monitoring, logging, and automated alerting. • Collaborate with both Product Engineers and Data Scientists to optimize performance, security, and infrastructure costs. • Manage and evolve our Infrastructure as Code (IaC) footprint.

🎯 Requirements

• 5+ years of experience in a DevOps or infrastructure role. • Expert knowledge of cloud platforms such as AWS, GCP and Azure • Strong experience with containerization technologies (Docker, ECS / Kubernetes). • Proven track record of designing and managing complex CI/CD pipelines. • Experience with MLOps workflows (model versioning, retraining pipelines, or feature stores). • Hands-on experience with monitoring and logging tools (Datadog, Prometheus, Grafana, MLflow). • Expertise in scripting languages (Python is a must, along with Bash, Go, etc.). • Proficiency with infrastructure automation tools (Terraform, Ansible, or CloudFormation). • Excellent communication skills and the ability to bridge the gap between traditional DevOps and Data Science teams.

🏖️ Benefits

• Competitive salaries • Remote/hybrid environment • Potential equity compensation for outstanding performance • Flexible PTO • Company-wide sponsored lunches • Company paid disability and life insurance benefits • Company paid family and medical leave • Medical, dental, and vision insurance benefits • Discounted pet insurance • FSA/DCA and commuter benefits • 401k • Complimentary subscription to digital fitness classes and wellness content • Recovery suite at HQ – includes a cold plunge, sauna, and shower

Apply Now

Similar Jobs

🕒 April 24

Driver

11 - 50

☁️ SaaS

🔌 API

⚡ Productivity

DevOps Engineer coding and optimizing infrastructure at AI startup Driver, focused on AI-assisted development technology with a dynamic team.

🇺🇸 United States – Remote

💵 $150k - $250k / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🕒 April 23

Speechify

51 - 200

☁️ SaaS

Lead technical onboarding for enterprise customers with Speechify's AI/ML platform, ensuring successful integration and collaboration across teams. This role shapes customer outcomes and informs product direction.

🕒 April 23

Shyft6

201 - 500

👥 HR Tech

🎯 Recruiter

🤝 B2B

DevSecOps Engineer supporting large-scale Facets migration project in healthcare payer environment. Focused on bridging development and operations while embedding security best practices throughout the software development lifecycle.

🇺🇸 United States – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🕒 April 23

DataRobot

501 - 1000

🤖 Artificial Intelligence

🏢 Enterprise

☁️ SaaS

DevOps Engineer II at DataRobot, architecting scalable software systems and collaborating with cross-functional teams to optimize AI processes. Requires expertise in Kubernetes, Python, and cloud platforms.

🕒 April 21

Forum Ventures

51 - 200

🤖 Artificial Intelligence

🤝 B2B

☁️ SaaS

Founders needed to leverage expertise for AI Native services in manufacturing and create transformational companies. Own the developer and runtime workflow for PLC logic.

🇺🇸 United States – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)