Arama Sonuçları

Listeleniyor 1 - 5 / 5
  • Yayın
    İngilizce-Türkçe istatistiksel makine çevirisinde biçimbilim kullanımı
    (IEEE, 2012-04-18) Görgün, Onur; Yıldız, Olcay Taner
    Bu çalışmada, İngilizce-Türkçe dil ikilisi için biçimbilimsel çözümleme yardımı ile SIU dermecesi üzerinde istatistiksel makine çevirisi denemeleri yapılmıştır. Kelime biçimlerinin baz alındığı çeviri denemeleri İngilizce-Türkçe dil ikilisi gibi biçimbilimsel ve çekimsel olarak birbirinden uzak diller için düşük performans göstermektedir. Bu durumda, çeviri temel birimi olarak kelime formlarının yerine alt-sözcüksel temsiller kullanmak, makine çevirisi performansını önemli ölçüde arttırmaktadır.
  • Yayın
    Model adaptation for dialog act tagging
    (IEEE, 2006) Tür, Gökhan; Güz, Ümit; Hakkani Tür, Dilek
    In this paper, we analyze the effect of model adaptation for dialog act tagging. The goal of adaptation is to improve the performance of the tagger using out-of-domain data or models. Dialog act tagging aims to provide a basis for further discourse analysis and understanding in conversational speech. In this study we used the ICSI meeting corpus with high-level meeting recognition dialog act (MRDA) tags, that is, question, statement, backchannel, disruptions, and floor grabbers/holders. We performed controlled adaptation experiments using the Switchboard (SWBD) corpus with SWBD-DAMSL tags as the out-of-domain corpus. Our results indicate that we can achieve significantly better dialog act tagging by automatically selecting a subset of the Switchboard corpus and combining the confidences obtained by both in-domain and out-of-domain models via logistic regression, especially when the in-domain data is limited.
  • Yayın
    Unsupervised morphological analysis using tries
    (Springer London, 2012) Ak, Koray; Yıldız, Olcay Taner
    This article presents an unsupervised morphological analysis algorithm to segment words into roots and affixes. The algorithm relies on word occurrences in a given dataset. Target languages are English, Finnish, and Turkish, but the algorithm can be used to segment any word from any language given the wordlists acquired from a corpus consisting of words and word occurrences. In each iteration, the algorithm divides words with respect to occurrences and constructs a new trie for the remaining affixes. Preliminary experimental results on three languages show that our novel algorithm performs better than most of the previous algorithms.
  • Yayın
    A novel approach to morphological disambiguation for Turkish
    (Springer-Verlag, 2012) Görgün, Onur; Yıldız, Olcay Taner
    In this paper, we propose a classification based approach to the morphological disambiguation for Turkish language. Due to complex morphology in Turkish, any word can get unlimited number of affixes resulting very large tag sets. The problem is defined as choosing one of parses of a word not taking the existing root word into consideration. We trained our model with well-known classifiers using WEKA toolkit and tested on a common test set. The best performance achieved is 95.61% by J48 Tree classifier.
  • Yayın
    The expressions of spatial relations during interaction in American sign language, Croatian sign language, and Turkish sign language
    (Versita, 2012-11) Arik, Engin
    Signers use their body and the space in front of them iconically. Does iconicity lead to the same mapping strategies in construing space during interaction across sign languages? The present study addressed this question by conducting an experimental study on basic static and motion event descriptions during interaction (describer input and addressee re-signing/retelling) in American Sign Language, Croatian Sign Language, and Turkish Sign Language. I found that the three sign languages are similar in using classifier predicates of location, orientation, and movement, predominantly employing an egocentric (viewer) perspective but also a non-egocentric perspective, and using similar mapping strategies regardless of interlocutor positions. However, these three sign languages differ from each other in the effects of location and orientation of the objects in pictures and movies, the descriptions of picture (states) vs. movie (motion events), and describer input vs. addressee retellings in their mapping strategies. This study contributes to our knowledge of how the expressions of spatial relations are conveyed in natural human language.