Weighted Finite State Transducer–Based Endpoint Detection Using Probabilistic Decision Logic

정훈; 이성주; 이윤근

한국전자통신연구원 ETRI Journal Weighted Finite State Transducer–Based Endpoint Detection Using Probabilistic Decision Logic

KCI 등재

Weighted Finite State Transducer–Based Endpoint Detection Using Probabilistic Decision Logic

정훈, 이성주, 이윤근

한국전자통신연구원2014.10

ETRI Journal 36권 5호 714-720(7pages)

DOI http://dx.doi.org/10.4218/etrij.14.2214.0030

UCI G704-001110.2014.36.5.002

인용하기 URL 복사 보관함 담기

초록

In this paper, we propose the use of data-drivenprobabilistic utterance-level decision logic to improveWeighted Finite State Transducer (WFST)-basedendpoint detection. In general, endpoint detection is dealtwith using two cascaded decision processes. The firstprocess is frame-level speech/non-speech classificationbased on statistical hypothesis testing, and the secondprocess is a heuristic-knowledge-based utterance-levelspeech boundary decision. To handle these two processeswithin a unified framework, we propose a WFST-basedapproach. However, a WFST-based approach has thesame limitations as conventional approaches in that theutterance-level decision is based on heuristic knowledgeand the decision parameters are tuned sequentially. Therefore, to obtain decision knowledge from a speechcorpus and optimize the parameters at the same time, wepropose the use of data-driven probabilistic utteranceleveldecision logic. The proposed method reduces theaverage detection failure rate by about 14% for variousnoisy-speech corpora collected for an endpoint detectionevaluation.

키워드

Endpoint detection

speech recognition

Weighted Finite State Transducer.

참고문헌 (0)