Post a Job Affiliates

Search Remote Jobs

Maneva

Website LinkedIn All Job Openings

Artificial Intelligence • Manufacturing • SaaS

Maneva is a technology company that specializes in automating manufacturing processes using advanced AI solutions. Their product offerings include AI-powered quality control systems for defect detection, automated monitoring of machinery, and intelligent workforce management. Maneva aims to enhance operational efficiency and reduce labor costs for manufacturers through seamless integration of its digital worker solutions into existing operations.

2 - 10 employees

Founded 2022

🤖 Artificial Intelligence

☁️ SaaS

Site Reliability Engineer

8 hours ago

🇨🇦 Canada – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

DNS

Docker

Grafana

IoT

Linux

Prometheus

Python

TCP/IP

Apply Now

Maneva

Website LinkedIn All Job Openings

Artificial Intelligence • Manufacturing • SaaS

2 - 10 employees

Founded 2022

🤖 Artificial Intelligence

☁️ SaaS

📋 Description

• **Operational Support & Incident Response** • - Serve as a first responder for production issues, alarms, and system outages (24/7 rotation required). • - Troubleshoot Linux system issues, hardware problems, networking connectivity, and edge-device performance. • - Perform root-cause analysis (RCA) and implement corrective and preventive solutions.** Document incidents, contributing to a culture of transparency and process improvement. • **Proactive Monitoring & Observability** • - Build and maintain robust monitoring dashboards and alerts using **Prometheus**, **Grafana**, and similar tools. • - Continuously improve observability, including metrics, logs, traces, and health checks. • - Analyze trends to proactively identify reliability risks before incidents occur. • - Develop automation to reduce noise and improve actionable alert quality. • **Systems Reliability & DevOps Engineering** • - Improve deployment workflows, CI/CD pipelines, configuration management, and automated provisioning. • - Create tools and scripts in Python/Bash to streamline operational processes. • - Contribute to load testing, system validation, and network health verification for edge deployments. • - Implement best practices for secure, scalable, and maintainable infrastructure. • **Infrastructure & Application Ownership** • - Understand and operate Manevaʼs end-to-end edge AI stack: • - Jetson/embedded Linux systems • - GPU-accelerated workloads for computer vision • - Video pipelines (RTSP, camera interfaces, data ingestion) • - Local integrations (PLCs, industrial hardware, APIs, network resources) • - VPN-based connectivity (client-based or site-to-site) • - Maintain visibility into device health and fleet-wide system performance. • **Documentation & Process Development** • - Create and maintain SOPs for on-site customer teams and internal engineering workflows. • - Produce detailed incident reports and reliability documentation. • - Maintain internal knowledge bases, troubleshooting guides, and playbooks.

🎯 Requirements

• **Technical Skills** • - Strong Linux systems administration experience (Ubuntu, embedded Linux, ARM systems). • - Proficiency in Python and/or Bash for scripting and operations automation. • - Solid networking fundamentals: TCP/IP, routing, DNS, DHCP, VPNs, VLANs, firewall rules. • - Familiarity with troubleshooting tools: tcpdump, nmap, iftop, netstat, etc. • - Hands-on experience with **Prometheus**, **Grafana**, or similar monitoring/alerting platforms. • - Experience with logging/observability stacks (ELK/EFK, Loki, Fluentd, etc.) is a plus. • - Experience with Docker or containerized applications is desirable. • - Comfort supporting distributed or remote device fleets. • **Soft Skills** • - Excellent diagnostic and analytical abilities under pressure. • - Strong communication skills with both technical and non-technical stakeholders. • - High ownership mentality and ability to follow issues through to resolution. • - Comfortable working independently in a fully remote environment. • - Willingness to participate in on-call rotation, including off-hours and weekends. • Preferred Qualifications • - Experience supporting machine learning, computer vision, or GPU-accelerated systems. • - Familiarity with NVIDIA Jetson or other embedded AI hardware. • - Prior SRE/DevOps/Systems Engineer experience in a 24/7 operational environment. • - Exposure to industrial IoT, manufacturing systems, or operational technology (OT). • - Experience writing customer-facing operational documentation or SOPs.

🏖️ Benefits

• What We Offer • - Fully remote work environment with flexibility (within on-call requirements). • - Opportunities to work with cutting-edge edge compute and AI deployments. • - A high-impact role shaping reliability practices from early stages. • - Contract or full-time options, with competitive compensation. • - A collaborative team committed to transparency, improvement, and excellence.

Apply Now

Similar Jobs

DevOps/Cloud Analyst – Analyste en développement et exploitation, infonuagique

3 days ago

Esri Canada

501 - 1000

🔌 API

🤖 Artificial Intelligence

🔬 Science

Website LinkedIn All Job Openings

DevOps/Cloud Analyst supporting Azure cloud infrastructure at Esri Canada. Responsible for deployment, maintenance, and operational support of Azure environments.

🇨🇦 Canada – Remote

💵 $84.9k - $110k / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

Azure

Cloud

ElasticSearch

SQL

Apply

View Job

Mid-Market Account Executive – DevOps

4 days ago

Rewind

51 - 200

☁️ SaaS

🔐 Security

Website LinkedIn All Job Openings

Mid-Market Account Executive at Rewind handling full sales cycle for mid-market clients. Focusing on DevOps and IT leaders while driving new revenue and managing relationships.

🇨🇦 Canada – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

Apply

View Job

Senior DevOps Engineer

November 27

Datatonic

51 - 200

🤖 Artificial Intelligence

🛍️ eCommerce

📡 Telecommunications

Website LinkedIn All Job Openings

Senior DevOps Engineer contributing to AI transformation projects with a focus on Google Cloud technologies. Collaborating with teams to implement DevOps best practices and innovative solutions.

🇨🇦 Canada – Remote

💰 Pre Seed Round on 2013-01

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

Cloud

Google Cloud Platform

Java

Kubernetes

Python

Terraform

Apply

View Job