Arama Sonuçları

Listeleniyor 1 - 2 / 2
  • Yayın
    Effective semi-supervised learning strategies for automatic sentence segmentation
    (Elsevier Science BV, 2018-04-01) Dalva, Doğan; Güz, Ümit; Gürkan, Hakan
    The primary objective of sentence segmentation process is to determine the sentence boundaries of a stream of words output by the automatic speech recognizers. Statistical methods developed for sentence segmentation requires a significant amount of labeled data which is time-consuming, labor intensive and expensive. In this work, we propose new multi-view semi-supervised learning strategies for sentence boundary classification problem using lexical, prosodic, and morphological information. The aim is to find effective semi-supervised machine learning strategies when only small sets of sentence boundary labeled data are available. We primarily investigate two semi-supervised learning approaches, called self-training and co-training. Different example selection strategies were also used for co-training, namely, agreement, disagreement and self-combined. Furthermore, we propose three-view and committee-based algorithms incorporating with agreement, disagreement and self-combined strategies using three disjoint feature sets. We present comparative results of different learning strategies on the sentence segmentation task. The experimental results show that the sentence segmentation performance can be highly improved using multi-view learning strategies that we proposed since data sets can be represented by three redundantly sufficient and disjoint feature sets. We show that the proposed strategies substantially improve the average baseline F-measure of 67.66% to 75.15% and 64.84% to 66.32% when only a small set of manually labeled data is available for Turkish and English spoken languages, respectively.
  • Yayın
    Improved microphone array design with statistical speaker verification
    (Elsevier Ltd, 2021-04) Demir, Kadir Erdem; Eskil, Mustafa Taner
    Conventional microphone array implementations aim to lock onto a source with given location and if required, tracking it. It is a challenge to identify the intended source when the location of the source is unknown and interference exists in the same environment. In this study we combine speaker verification and microphone array processing techniques to localize and maximize gain on the intended speaker under the assumption of open acoustic field. We exploit the steering capability of the microphone array for more accurate speaker verification. Our first contribution is a new N-Gram based and computationally efficient feature for detecting an intended speaker. When the source and interference are localized, microphone array can be tuned further to reduce noise and increase the gain. Our second contribution is this integrated algorithm for speaker verification and localization. In the context of this study we developed SharpEar, an open source environment that simulates propagation of sound emanating from multiple sources. Our third and last contribution is this simulation environment, which is open source and available to researchers of the field.