Automatic Removal of Filler Words
Filler words (unnecessary words) such as "あのー" and "えーっと" are automatically removed from the speech recognition results.
For example, if you speak as follows:
えーっと、会議があるので、えー、それまでに、あのー、資料を作成しておきます。
The recognition result for this speech will be as follows:
会議があるのでそれまでに資料を作成しておきます。
The following types of words are treated as filler words:
| Language | Filler word examples |
|---|---|
| Japanese | あー, あのー, えー, おー, えっと |
| English | ah, urm, hmm |
| Chinese | 呃, 啊, 哎呀 |
| Korean | 어, 으, 음 |
note
- Except for some special engines,users cannot add filler words.
- Filler words may change during the process of accuracy improvement, and we do not publish a list of filler words.
- For "音声入力_氏名" engine and "音声入力_住所" engine, no filler words are set by default. By registering unnecessary words in the "フィラー(文頭)" or "フィラー(文末)" class, they can be treated as filler words. For details, please see Special Word Registration in the components of word registration.
If the above speech content is included in an audio file named test-with-filler.wav, you can confirm the automatic removal of filler words by executing the following curl command. For details on this procedure, please see Transcribing Short Audio Files. For WebSocket, please see Speech Recognition Request.
curl -sS https://acp-api.amivoice.com/v1/recognize \
-F u={APPKEY} \
-F "d=-a-general" \
-F a=@test-with-filler.wav | jq
Response
{
"results": [
{
"tokens": [
{
"written": "会議",
"confidence": 0.99,
"starttime": 656,
"endtime": 1184,
"spoken": "かいぎ"
},
{
"written": "が",
"confidence": 1,
"starttime": 1184,
"endtime": 1312,
"spoken": "が"
},
{
"written": "ある",
"confidence": 1,
"starttime": 1312,
"endtime": 1536,
"spoken": "ある"
},
{
"written": "ので",
"confidence": 1,
"starttime": 1536,
"endtime": 1920,
"spoken": "ので"
},
{
"written": "それ",
"confidence": 1,
"starttime": 2384,
"endtime": 2736,
"spoken": "