Principal Machine Learning Engineer

🕒 February 11

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of BJAK

BJAK

51 - 200 employees

🛍️ eCommerce

🏪 Marketplace

eCommerce • Insurance • Marketplace

BJAK is a leading online platform in Southeast Asia that offers comprehensive automobile insurance comparison services. The company enables Malaysian users to compare and purchase auto insurance from multiple insurers efficiently, providing considerable savings and convenience. BJAK is renowned for its user-friendly digital platform that allows quick insurance and road tax renewals, offering discounts up to 11%. With a strong emphasis on customer service, BJAK also provides 24/7 roadside assistance, accident support, and replacement vehicles. It is a pioneer in the insurance comparison sector in the region and has facilitated significant savings for millions of car owners.

📋 Description

• Build and own end-to-end ML pipelines spanning data, training, evaluation, inference, and deployment. • Fine-tune and adapt models using state-of-the-art methods such as LoRA, QLoRA, SFT, DPO, and distillation. • Architect and operate scalable inference systems, balancing latency, cost, and reliability. • Design and maintain data systems for high-quality synthetic and real-world training data. • Implement evaluation pipelines covering performance, robustness, safety, and bias, in partnership with research leadership. • Own production deployment, including GPU optimization, memory efficiency, latency reduction, and scaling policies. • Collaborate closely with application engineering to integrate ML systems cleanly into backend, mobile, and desktop products. • Make pragmatic trade-offs and ship improvements quickly, learning from real usage. • Work under real production constraints: latency, cost, reliability, and safety.

🎯 Requirements

• Strong background in deep learning and transformer-based architectures. • Hands-on experience training, fine-tuning, or deploying large-scale ML models in production. • Proficiency with at least one modern ML framework (e.g. PyTorch, JAX), and ability to learn others quickly. • Experience with distributed training and inference frameworks (e.g. DeepSpeed, FSDP, Megatron, ZeRO, Ray). • Strong software engineering fundamentals – you write robust, maintainable, production-grade systems. • Experience with GPU optimization, including memory efficiency, quantization, and mixed precision. • Comfort owning ambiguous, zero-to-one ML systems end-to-end. • A bias toward shipping, learning fast, and improving systems through iteration. • Experience with LLM inference frameworks such as vLLM, TensorRT-LLM, or FasterTransformer. • Contributions to open-source ML or systems libraries. • Background in scientific computing, compilers, or GPU kernels. • Experience with RLHF pipelines (PPO, DPO, ORPO). • Experience training or deploying multimodal or diffusion models. • Experience with large-scale data processing (Apache Arrow, Spark, Ray).

🏖️ Benefits

• Our organization is very flat and our team is small, highly motivated, and focused on engineering and product excellence. All members are expected to be hands-on and to contribute directly to the company’s mission.

Apply Now

Similar Jobs

🕒 February 4

Albert Invent

51 - 200

🤖 Artificial Intelligence

🧬 Biotechnology

🔬 Science

Backend & Infrastructure Engineer shaping AI development at Albert, a materials innovation platform. Designing Kubernetes and high-performance infrastructure for AI workloads.

AWS

Azure

Cloud

Distributed Systems

Flask

Google Cloud Platform

Kubernetes

Microservices

Python

🕒 January 23

Escape Velocity Entertainment

51 - 200

🎮 Gaming

AI/Machine Learning Director responsible for AI/ML research and development at Escape Velocity Entertainment. Collaborating with teams to drive innovative solutions and unlock new experiences.

Python

PyTorch

Tensorflow

🕒 January 20

Hightouch

51 - 200

☁️ SaaS

Engineering leader at Hightouch driving machine learning efforts and overseeing product development for AI marketing solutions. Leading teams to enhance customer communication strategies utilizing data.

🕒 December 10, 2025

Airbnb

5001 - 10000

👥 B2C

🛍️ eCommerce

Staff Machine Learning Engineer at Airbnb working with ML models to enhance host and guest tools. Collaboration with data scientists and engineering teams to develop impactful solutions.

Airflow

Java

Kubernetes

Python

PyTorch

Scala

Spark

Tensorflow

🕒 November 27, 2025

Samsara

1001 - 5000

🏢 Enterprise

🚗 Transport

🔐 Security

Staff Machine Learning Engineer at Samsara building end-to-end AI solutions for physical operations using petabyte-scale data. Collaborate with ML Engineers and Scientists for critical product features.

Python

Ray

Rust

Spark