닫기
216.73.216.214
216.73.216.214
close menu
KCI 등재
Weighted Finite State Transducer–Based Endpoint Detection Using Probabilistic Decision Logic
정훈, 이성주, 이윤근
ETRI Journal 36권 5호 714-720(7pages)
DOI http://dx.doi.org/10.4218/etrij.14.2214.0030
UCI G704-001110.2014.36.5.002

In this paper, we propose the use of data-drivenprobabilistic utterance-level decision logic to improveWeighted Finite State Transducer (WFST)-basedendpoint detection. In general, endpoint detection is dealtwith using two cascaded decision processes. The firstprocess is frame-level speech/non-speech classificationbased on statistical hypothesis testing, and the secondprocess is a heuristic-knowledge-based utterance-levelspeech boundary decision. To handle these two processeswithin a unified framework, we propose a WFST-basedapproach. However, a WFST-based approach has thesame limitations as conventional approaches in that theutterance-level decision is based on heuristic knowledgeand the decision parameters are tuned sequentially. Therefore, to obtain decision knowledge from a speechcorpus and optimize the parameters at the same time, wepropose the use of data-driven probabilistic utteranceleveldecision logic. The proposed method reduces theaverage detection failure rate by about 14% for variousnoisy-speech corpora collected for an endpoint detectionevaluation.

×