Performance Engineer – Akamai Inference Cloud

November 21

Apply Now
Logo of Akamai Technologies

Akamai Technologies

Cloud Computing • Cybersecurity • Content Delivery

Akamai Technologies is a leading cloud services provider that specializes in delivering security, cloud computing, and content delivery solutions. It offers a range of services such as API security, DDoS protection, and performance optimization for web applications, ensuring secure and reliable user experiences. With a robust global infrastructure, Akamai empowers businesses to streamline their digital presence while safeguarding against various cyber threats and enhancing application performance.

5001 - 10000 employees

🔒 Cybersecurity

💰 Post-IPO Equity on 2001-07

📋 Description

• The Performance Engineer ensures optimal benchmarking, tuning, and performance of an AI inference platform. • Responsibilities include applying advanced optimization techniques to enhance throughput, reduce latency, and improve resource efficiency. • The role involves working with models, hardware accelerators, and infrastructure. • Expertise in AI/ML performance optimization, proficiency with inference frameworks, and a passion for maximizing hardware and software performance are essential.

🎯 Requirements

• Have experience in performance engineering with hands-on expertise in AI/ML model optimization and inference performance tuning. • Demonstrate solid knowledge of inference optimization techniques including quantization (INT8, FP16), model compilation, hardware acceleration, and familiarity with compiler optimizations and ML compilers. • Show proficiency with GPU optimization and understanding of memory hierarchies and techniques to maximize hardware utilization. • Have experience with profiling and benchmarking tools for AI workloads, identifying performance bottlenecks in distributed systems. • Demonstrate problem-solving skills with ability to analyze performance data, communicate insights clearly, and drive optimization efforts. • Possess knowledge of distributed inference and model parallelism techniques. • Have experience with cost optimization for compute-intensive workloads.

🏖️ Benefits

• Your health • Your finances • Your family • Your time at work • Your time pursuing other endeavors

Apply Now

Similar Jobs

November 1

O-I

10,000+ employees

IT Service Delivery Lead responsible for ensuring client technology support across O-I locations. Collaborating with stakeholders to maintain compliance and manage relationships with IT suppliers.

November 1

Dataverse Engineer designing and maintaining enterprise-grade data solutions at Software Mind. Collaborating with DevOps and software engineering teams to enable data-driven decisions.

Azure

SQL

.NET

October 27

Palantir Forward Deployed Engineer leading solution development and implementing data-driven strategies for clients. Collaborating with cross-functional teams to innovate in data and AI solutions.

AWS

Azure

Cloud

Docker

ETL

Google Cloud Platform

Java

Kubernetes

PySpark

Python

Scala

SQL

October 22

Forward-Deployed Engineer developing AI negotiation systems for enterprise procurement. Collaborating with Fortune 500 teams to architect and implement sophisticated contract analysis and negotiation solutions.

AWS

Azure

Cloud

Docker

ERP

ETL

Google Cloud Platform

GraphQL

JavaScript

Kafka

Kubernetes

Oracle

Pulsar

Python

Terraform

TypeScript

October 21

Providing technical assistance as a Virtualization Backup Engineer at Veeam. Supporting IT professionals with troubleshooting and data recovery solutions.

🗣️🇫🇷 French Required

DNS

Firewalls

Linux

Oracle

SQL

TCP/IP

Unix

VMware

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com