AI Inference Engineer – QVAC

ITRex Group

201 - 500 employees

Founded 2009

🤖 Artificial Intelligence

🏢 Enterprise

🔧 Hardware

ITRex Group is a technology consultancy and engineering firm that builds applied AI, generative AI, data platforms, and intelligent edge/IoT solutions for enterprise clients across healthcare, logistics, manufacturing, and other regulated industries. They provide end-to-end services including AI strategy and readiness assessments, product discovery and PoCs, LLM fine-tuning, MLOps/LLMOps and governance, data architecture and platform modernization (warehouses, lakes, vector DBs), and embedded hardware and edge software development. ITRex focuses on production-ready, compliant deployments that integrate models, data infrastructure, and edge devices to deliver scalable, secure AI-driven products and operations.

📋 Description

• Deploy machine learning models to edge devices using the llama.cpp and ggml frameworks
• Collaborate closely with researchers to assist in coding, training, and transitioning models from research to production environments
• Integrate AI features into existing products, enriching them with the latest advancements in machine learning

🎯 Requirements

• Excellent programming skills in C++; experience with JavaScript is a bonus
• Strong experience with the llama.cpp and ggml inference engines, which facilitate the deployment of models to specific GPU architectures
• Good understanding of deep learning concepts and model architectures
• Experience with transformers, LLMs, and diffusion models
• Demonstrated ability to rapidly assimilate new technologies and techniques
• A degree in Computer Science, AI, Machine Learning, or a related field, complemented by a solid track record in AI R&D

🏖️ Benefits

• Remote flexibility: Work where and how you work best - we trust you to deliver
• Fair compensation: Competitive salary + benefits that matter (medical, learning)
• Ownership opportunities: See a problem worth solving? Own it. We back smart risks over bureaucratic safety
• AI enhancement: We leverage AI to make you faster and stronger - complementing your abilities, not replacing them
• Learning investment: English classes, professional development
• Career progression: Real paths up, not just sideways shuffling
• Responsive teammates: No ignored Slacks, no "not my problem" attitudes
• Supportive culture: When you're stuck, people help. When things break, we fix them together
• Human connections: Regular meetups, tech talks, and actual relationships beyond work
