Senior MLOps Engineer – SRE, DevOps

PrideLogic

11 - 50 employees

Detailed company information for PrideLogic is not currently available because its website is under construction. More details may be provided once the site is updated.

📋 Description

• Build and operate model and inference serving infrastructure
• Own the ML deployment lifecycle
• Operate agentic and LLM workloads in production
• Build reproducible, automated ML pipelines
• Extend infrastructure-as-code to ML systems
• Operate GitOps for ML workloads
• Run ML and AI workloads on multi-tenant Kubernetes
• Own ML reliability and observability
• Drive ML cost efficiency
• Use agentic coding tools for infrastructure and pipeline work

🎯 Requirements

• 5+ years in platform engineering, SRE, MLOps, or infrastructure
• Hands-on experience deploying and operating ML or AI workloads in production
• Strong SRE/DevOps foundation
• Deep IaC expertise
• Strong GitOps background
• Deep Kubernetes knowledge
• Strong AWS background
• Hands-on experience building and operating CI/CD pipelines
• Automation-first thinking at a senior level
• Active user of agentic coding tools
• Strong communicator

🏖️ Benefits

• Paid time off
• Flexible work arrangements
