4 sonuçlar
Arama Sonuçları
Listeleniyor 1 - 4 / 4
Yayın Generative and discriminative methods using morphological information for sentence segmentation of Turkish(IEEE-INST Electrical Electronics Engineers Inc, 2009-07) Güz, Ümit; Favre, Benoit; Hakkani Tür, Dilek; Tür, GökhanThis paper presents novel methods for generative, discriminative, and hybrid sequence classification for segmentation of Turkish word sequences into sentences. In the literature, this task is generally solved using statistical models that take advantage of lexical information among others. However, Turkish has a productive morphology that generates a very large vocabulary, making the task much harder. In this paper, we introduce a new set of morphological features, extracted from words and their morphological analyses. We also extend the established method of hidden event language modeling (HELM) to factored hidden event language modeling (fHELM) to handle morphological information. In order to capture non-lexical information, we extract a set of prosodic features, which are mainly motivated from our previous work for other languages. We then employ discriminative classification techniques, boosting and conditional random fields (CRFs), combined with fHELM, for the task of Turkish sentence segmentation.Yayın Model adaptation for dialog act tagging(IEEE, 2006) Tür, Gökhan; Güz, Ümit; Hakkani Tür, DilekIn this paper, we analyze the effect of model adaptation for dialog act tagging. The goal of adaptation is to improve the performance of the tagger using out-of-domain data or models. Dialog act tagging aims to provide a basis for further discourse analysis and understanding in conversational speech. In this study we used the ICSI meeting corpus with high-level meeting recognition dialog act (MRDA) tags, that is, question, statement, backchannel, disruptions, and floor grabbers/holders. We performed controlled adaptation experiments using the Switchboard (SWBD) corpus with SWBD-DAMSL tags as the out-of-domain corpus. Our results indicate that we can achieve significantly better dialog act tagging by automatically selecting a subset of the Switchboard corpus and combining the confidences obtained by both in-domain and out-of-domain models via logistic regression, especially when the in-domain data is limited.Yayın A new speech modeling method: SYMPES(IEEE, 2006) Güz, Ümit; Gürkan, Hakan; Yarman, Bekir Sıddık BinboğaIn this paper, the new method of speech modeling which is called SYMPES is introduced and it is compared with the commercially available methods. It is shown that for the same compression ratio or better, SYMPES yields considerably better hearing quality over the coders such as G.726 at 16 Kbps and voice excited LPC-10E of 2.4Kbps.Yayın A novel method to represent the speech signals by using language and speaker independent predefined functions sets(IEEE, 2004) Güz, Ümit; Gürkan, Hakan; Yarman, Bekir Sıddık BinboğaIn this paper a new modeling method of speech signals is introduced. The proposed method is based on the generation of the so-called Predefined Signature S={s(R)(t)} and Envelope Function E = {e(K)(t)} Sets (PSEFS). These function sets are independent of any speaker and any language. Once the speech signals are divided into frames with selected lengths, then each frame signal piece X-i(t) is synthesized by means of the mathematical form of x(i)(t)=C(i)e(K)(t)s(R)(t). In this representation, C-i is called the frame coefficient, s(R)(t) and e(K)(t) are properly assigned from the PSEFS respectively. It is shown that the proposed method provides fast reconstruction and substantial compression with acceptable hearing quality.












