Lead Cloud, DevOps Engineer

🕒 May 22

🏰 Missouri – Remote

info

💵 $65 - $75 / hour

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🦅 H1B Visa Sponsor

info
Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Blend360

Blend360

501 - 1000 employees

🤖 Artificial Intelligence

🏢 Enterprise

💰 $100M Private Equity Round on 2022-08

Artificial Intelligence • Enterprise • Consulting

Blend360 is a professional services company specializing in AI, data analytics, and data-driven solutions. They work with Fortune 1000 and large enterprise brands to tackle significant challenges by integrating people and artificial intelligence. Blend360 focuses on several domains including business intelligence, data engineering, data science, MLOps, and data governance. Their industries of expertise encompass financial services, energy, healthcare and life sciences, retail, technology, media & telecom, and travel & hospitality. Blend360 is recognized for their AI and data solutions, having earned accolades such as "AI-Enabling Solution of the Year" and being listed among the "Top Generative AI Service Providers 2024.

📋 Description

• Design and implement AWS cloud infrastructure and deployment patterns for the data platform, including multi-account AWS Organizations strategy, IAM design, networking, naming conventions, and tagging standards. • Build and maintain CI/CD pipelines to support repeatable, controlled releases across Development, Test, and Production environments. • Provision and configure AWS infrastructure as code (Terraform), including services such as AWS Glue, Amazon S3, Amazon Redshift, VPC networking, VPN/Direct Connect connectivity, Route 53, security groups, and firewall controls to connect on-premises source systems. • Configure Git-based integration and deployment workflows for platforms such as Databricks or Snowflake to enforce version-controlled deployments. • Support deployment of backend services, orchestration components, data services, APIs, and front-end applications. • Enable monitoring, logging, alerting, and telemetry using services such as Amazon CloudWatch, AWS CloudTrail, AWS Config, and observability platforms like Datadog. • Define and implement operational controls for reliability, performance, scalability, backup/recovery, and incident response. • Implement and enforce secure access patterns using AWS IAM, IAM Identity Center (AWS SSO), AWS Secrets Manager, AWS KMS, and policy-driven access controls, including row-level and column-level security requirements where applicable. • Ensure the solution aligns with architecture, security, governance, and service transition requirements. • Support non-functional testing, release readiness, and path-to-production activities. • Produce comprehensive operational runbooks, platform documentation, and a full IaC handover package enabling the client’s internal IT team to take ownership of platform operations at programme close. • Support cost management, network performance tuning, and security hardening of the AWS platform; contribute to FinOps reporting and disaster recovery planning.

🎯 Requirements

• Strong hands-on experience with CI/CD tooling and release automation. • Experience with infrastructure-as-code using Terraform or similar tools. • Hands-on experience deploying and operating cloud-native workloads in AWS, including services such as AWS Glue, Amazon S3, Amazon Redshift, Amazon ECS/EKS, AWS Lambda, IAM, and VPC networking. • Experience with Databricks and/or Snowflake deployments in AWS environments. • Strong understanding of containerisation, serverless architectures, managed compute services, and environment promotion strategies. • Experience with observability tooling covering logging, monitoring, alerting, and service health. • Knowledge of security best practices including IAM, RBAC, secrets management, encryption, and policy-driven access control. • Experience supporting production-grade data platforms in enterprise environments, ideally in regulated sectors with compliance requirements such as PIPEDA or equivalent. • Familiarity with Git-based workflows and collaborative engineering practices. • Strong troubleshooting, communication, and stakeholder management skills.

🏖️ Benefits

• Health insurance • 401(k) plan • Paid time off • Paid holidays • Commuter benefits • Flexible spending accounts • Life and disability insurance • Employee assistance programs

Apply Now

Similar Jobs

🕒 May 22

Concept Plus, LLC

51 - 200

🏛️ Government

Platform / DevSecOps Lead delivering technical leadership for Air Force IT systems. Overseeing platform architecture and DevSecOps strategy while ensuring secure, automated system delivery.

🇺🇸 United States – Remote

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🕒 May 21

EnCharge AI

11 - 50

🤖 Artificial Intelligence

🔧 Hardware

🤝 B2B

LLM Inference Deployment Engineer deploying and scaling large language models on energy efficient AI accelerators. Working with AI frameworks and model optimization at EnCharge AI.

🇺🇸 United States – Remote

💵 $180k - $240k / year

💰 $100M Series B - EnCharge AI on 2025-02

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🦅 H1B Visa Sponsor

info

🕒 May 21

SS&C Technologies

10,000+ employees

🏦 Banking

💳 Fintech

Site Reliability Engineer optimizing infrastructure environments at SS&C Technologies. Collaborate with teams to enhance application reliability and drive technology improvements.

🇺🇸 United States – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🕒 May 21

Button

51 - 200

☁️ SaaS

🛍️ eCommerce

🤝 B2B

Senior DevOps Engineer responsible for platform infrastructure management in a commerce-powered internet company. Collaborating with teams on scalable, stable, and operable solutions for business-critical systems.

🕒 May 20

High 5 Games

51 - 200

🎮 Gaming

🎲 Gambling

🤝 B2B

DevOps Engineer responsible for building and optimizing cloud infrastructure for machine learning operations in gaming. Collaborating with data scientists and ML engineers to ensure reliability and performance.