1-2 of 2 results for 搜索留痕最新技术咨询邮箱：lgooxc@hotmail.com

Working with corpora and custom words for previous-generation models
in Speech to Text

2025-02-17

The service's base vocabulary covers most compound words that occur frequently (for example, 長電話 and 古新聞) but not those compound words that occur infrequently. If your corpus commonly contains compound words, add them as one word as the first step of your customization. For example, 古鉛筆 is not common in general Japanese text; if you use it often, add it to your custom model to improve transcription accuracy.

https://cloud.ibm.com/docs/speech-to-text?topic=speech-to-text-corporaWords

Working with corpora and custom words for large speech models and next-generation models
in Speech to Text

2025-02-17

The character sequences of the base model cover most compound words that occur frequently (for example, 長電話 and 古新聞) but not those compound words that occur infrequently. If your corpus commonly contains compound words, add them as one word as the first step of your customization. For example, 古鉛筆 is not common in general Japanese text; if you use it often, add it to your custom model to improve transcription accuracy.

https://cloud.ibm.com/docs/speech-to-text?topic=speech-to-text-corporaWords-ng

1-2 of 2 results for 搜索留痕最新技术咨询邮箱：lgooxc@hotmail.com

Working with corpora and custom words for previous-generation modelsin Speech to Text

Working with corpora and custom words for large speech models and next-generation modelsin Speech to Text

Working with corpora and custom words for previous-generation models
in Speech to Text

Working with corpora and custom words for large speech models and next-generation models
in Speech to Text