Senior TTS Research Engineer

🗣️🇯🇵 Japanese Required

Cerence Inc.

1001 - 5000 employees

Founded 2019

🤖 Artificial Intelligence

🚗 Transport

💰 Grant on 2020-12

Artificial Intelligence • Transport • Automotive

Cerence Inc. is a global provider of AI-powered solutions, primarily for the automotive industry. It specializes in conversational and generative AI technologies that create intelligent, natural, and personalized interactions between people and vehicles. With innovations such as its proprietary automotive large language models, Cerence enhances user experiences across cars, two-wheelers, and trucks. Its AI technology has shipped in more than 500 million vehicles, and the company serves more than 80 OEMs and Tier 1 customers worldwide. Cerence aims to transform in-car user experiences through continuous advances in AI, fast delivery, and seamless integration of its solutions.

📋 Description

• Optimize the core NN algorithms to speed up inference and generation
• Integrate different TTS components into a flexible pipeline that supports "one engine for many (30+) languages"
• Implement markup to control the TTS engine, optimized for different languages, features and applications
• Design, implement and maintain the embedded SDK for different platforms, as well as the cloud APIs
• Build the automated test and release process to ensure that KPIs such as latency and real-time factor (RTF) are met
• Act as maintainer of the code delivered by R&D engineers
• Write well-structured release notes and SDK and API documentation

🎯 Requirements

• 3+ years of hands-on experience in TTS system development, with deep expertise in both frontend and backend components
• Proficiency in C/C++ and Python, with mastery of ML frameworks (PyTorch, TensorFlow, etc.)
• Some background in NLP techniques and/or speech signal processing is welcome
• Basic understanding of autoregressive / non-autoregressive acoustic models and neural vocoders
• Extensive experience with software releases, version control and branch maintenance
• Experience with ONNX Runtime, TensorRT, TorchScript or similar
• Experience with zero-shot/one-shot/few-shot voice cloning or emotional TTS systems
• Native or near-native Japanese is a must, plus fluent English as the working language

🏖️ Benefits

• Equal Employment Opportunity (EEO)
• Workplace security protocols and training programs
