
201 - 500 employees
Founded 2002
🤖 Artificial Intelligence
🛍️ eCommerce
Artificial Intelligence • eCommerce • Customer Service
Omilia is a leader in Conversational AI, specializing in voice and chat solutions that enable natural, end-to-end customer interactions. Their Omilia Cloud Platform provides advanced AI-driven customer service tools, including real-time agent assistance, voice biometrics for fraud prevention, and data analytics to enhance customer insights. Serving industries such as finance, insurance, retail, automotive, and travel, Omilia focuses on automating customer service while ensuring a secure and personalized experience.
🔥 0 minutes ago
Ansible
AWS
Cloud
Docker
Grafana
Kubernetes
Linux
MySQL
NoSQL
Postgres
Prometheus
Python
RDBMS
Redis
TCP/IP
Terraform
VoIP
Go
Improve your chances of getting an interview by checking your resume score before you apply.

201 - 500 employees
Founded 2002
🤖 Artificial Intelligence
🛍️ eCommerce
Artificial Intelligence • eCommerce • Customer Service
Omilia is a leader in Conversational AI, specializing in voice and chat solutions that enable natural, end-to-end customer interactions. Their Omilia Cloud Platform provides advanced AI-driven customer service tools, including real-time agent assistance, voice biometrics for fraud prevention, and data analytics to enhance customer insights. Serving industries such as finance, insurance, retail, automotive, and travel, Omilia focuses on automating customer service while ensuring a secure and personalized experience.
• - Ensure platform reliability and availability across production and pre-production environments through proactive monitoring, alerting, and automation. • - First response for incidents, contribute to problem management and root cause analysis. • - Supporting the development team's effort towards reliability, creating a solid reliability culture within the development lifecycle. • - Develop troubleshooting documentation for production support resources. • - Collaborate with Engineering teams to develop optimised and productive runbooks, operational documentation and automation of operational tasks. • - Collaborate with development and cloud engineering teams to embed reliability and performance into the software delivery lifecycle. • - Design, implement, and evolve observability solutions (metrics, logs, traces, dashboards) using tools such as Prometheus, Grafana, and ELK. • - Participate in on-call rotations and continuously improve alert quality and response processes. • - Champion a culture of reliability, performance, and continuous improvement across teams.
• - Bachelor's Degree or MS in Engineering or equivalent. • - Experience in operating at least one container orchestration cluster (Kubernetes, Docker Swarm). • - Experience developing or maintaining software for production services at scale. • - Experience with ELK. • - Experience with AWS. • - Experience with Grafana/Prometheus stack. • - Strong scripting skills (Bash, Python or Go). • - Excellent communication skills. • - Thinking out of the box and anticipating challenges. It is imperative we are not simply reactive; we must expect challenges and question technologies, procedures and thinking already in place. You will be expected to constantly review and challenge at all levels. • - Versatility. We work with agile/lean methods. We'd much rather iterate and learn than assume we know all the answers. • - Being a team player. You don't (always) work in isolation and are excited by the thought of using your team whilst involving product, experience design, engineering, and more in the process. • **Will be considered as a plus:** • - Telephony knowledge (SIP, VoIP); • - Experience in Linux Administration (RedHat, CentOS, AL); • - Working knowledge in Configuration Management tools (Terraform, Ansible); • - Experience with TCP/IP and general networking concepts; • - RDBMS knowledge (MySQL, Postgres); • - NoSQL knowledge (Redis).
• - Fixed compensation; • - Long-term employment with the working days vacation; • - Development in professional growth (courses, training, etc); • - Being part of successful cutting-edge technology products that are making a global impact in the service industry; • - Proficient and fun-to-work-with colleagues; • - Apple gear.
Apply Now🕒 2 days ago
🕒 2 days ago
Customer Site Reliability Engineer managing critical services and driving reliability and customer satisfaction at Red Hat. Engaging with cross-functional teams and enhancing system resilience.
🇦🇺 Australia – Remote
💰 Corporate Round on 1999-03
⏰ Full Time
🟡 Mid-level
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)
🗣️🇯🇵 Japanese Required
Ansible
AWS
Azure
Cloud
Distributed Systems
Google Cloud Platform
Kubernetes
Linux
OpenShift
Prometheus
TCP/IP
Terraform
Go
🕒 May 8
Senior Platform Engineer at Megaport, focusing on DevOps and SRE practices across their systems. Responsible for reliability and stakeholder engagement in a collaborative tech environment.
AWS
Cassandra
Cloud
Kubernetes
Linux
Postgres
Python
Terraform
Go
🕒 April 28
Devops Engineer building decentralized network infrastructure with Sigma Prime. Assist developers and create testnets while maintaining production instances of Ethereum software.
Ansible
DNS
Firewalls
Kubernetes
Linux
Terraform
🕒 April 15
Database Reliability Engineer at CrowdStrike handling data services with technologies like Cassandra and ElasticSearch. Collaborating with Engineering and Customer Support to ensure system reliability and security.
AWS
Cassandra
Chef
Cloud
ElasticSearch
Google Cloud Platform
Kafka
Kubernetes
Linux
MySQL
Postgres
Python
Zookeeper
🕒 April 10
Site Reliability Engineer ensuring reliable, scalable cloud infrastructure for Ditto's edge-to-cloud technology. Collaborate on observability and incident management to meet enterprise demands.
🇦🇺 Australia – Remote
💵 A$165.6k - A$260k / year
⏰ Full Time
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)
AWS
Azure
Cloud
Google Cloud Platform
Grafana
Java
Prometheus
Python
Rust
Terraform
Go