The service's base vocabulary covers most frequently occurring compound words (for example, 長電話 and 古新聞) but not infrequent ones. If your corpus commonly contains compound words, add each one as a single word as the first step of your customization. For example, 古鉛筆 is not common in general Japanese text; if you use it often, add it to your custom model to improve transcription accuracy.
https://cloud.ibm.com/docs/speech-to-text?topic=speech-to-text-corporaWords in Speech to Text
2025-02-17
The character sequences of the base model cover most frequently occurring compound words (for example, 長電話 and 古新聞) but not infrequent ones. If your corpus commonly contains compound words, add each one as a single word as the first step of your customization. For example, 古鉛筆 is not common in general Japanese text; if you use it often, add it to your custom model to improve transcription accuracy.
https://cloud.ibm.com/docs/speech-to-text?topic=speech-to-text-corporaWords-ng
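As a rough illustration of adding compounds as single words, the sketch below builds a JSON body of the kind the service's add-custom-words request accepts (a `words` array of objects with a `word` field). The helper name `build_add_words_body` is hypothetical, and the snippet only prepares the payload; sending it to `/v1/customizations/{customization_id}/words` with your own credentials and customization ID is assumed to happen elsewhere.

```python
import json


def build_add_words_body(compounds):
    """Build a JSON body that lists each compound as one custom word.

    Hypothetical helper: it only prepares the payload and makes no network call.
    """
    seen = set()
    words = []
    for compound in compounds:
        compound = compound.strip()
        # Skip blanks and duplicates so each compound is submitted once.
        if compound and compound not in seen:
            seen.add(compound)
            words.append({"word": compound})
    # ensure_ascii=False keeps the Japanese text readable in the output.
    return json.dumps({"words": words}, ensure_ascii=False)


# Example: 古鉛筆 is rare in general text, so it is added as a single word.
print(build_add_words_body(["古鉛筆", "古鉛筆", " "]))
# prints {"words": [{"word": "古鉛筆"}]}
```

Keeping each compound as one entry, rather than splitting it into its component words, mirrors the advice above to add compounds as one word.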