speech synthesis

Evaluate leading text-to-speech models – US English

11/24/2024 · Podonos

Many of us watch YouTube videos whose voices are generated by AI, and some of us actively use the APIs behind them. But you may wonder how good the public text-to-speech (TTS) APIs really are. Answering that is a hard problem, even within the AI community. For such TTS evaluations, some groups use Word Error Rate (WER)…
Speech Synthesis Performance: OpenAI Text To Speech for Korean

09/22/2024 · Podonos

One of the key questions when building a new AI model is how good the model is for the target customer. AI users ask the same question: how good is this model for my use case? When it comes to Text To Speech (TTS), many of us want to know how existing TTS models…
What is subjective audio evaluation?

06/03/2024 · Podonos

Subjective audio evaluation is the assessment by human listeners of audio or speech that has been generated (e.g., by generative AI models) or processed (noise reduction, compression, echo cancellation, and so on). Human evaluators play a crucial role in determining the effectiveness and quality of the audio output. The main goals of subjective audio evaluation include: The…