Search Results

Showing 1 - 2 of 2
  • Publication
    AnlamVer: Semantic model evaluation dataset for Turkish - word similarity and relatedness
    (Association for Computational Linguistics (ACL), 2018-08-26) Ercan, Gökhan; Yıldız, Olcay Taner
    In this paper, we present AnlamVer, a semantic model evaluation dataset for Turkish designed to evaluate word similarity and word relatedness tasks while discriminating those two relations from each other. Our dataset consists of 500 word-pairs annotated by 12 human subjects, and each pair has two distinct scores for similarity and relatedness. Word-pairs are selected to enable the evaluation of distributional semantic models by multiple attributes of words and word-pair relations such as frequency, morphology, concreteness and relation types (e.g., synonymy, antonymy). Our aim is to provide insights to semantic model researchers by evaluating models across multiple attributes. We balance dataset word-pairs by their frequencies to evaluate the robustness of semantic models concerning out-of-vocabulary and rare-word problems, which are caused by the rich derivational and inflectional morphology of the Turkish language.
  • Publication
    Morpholex Turkish: A Morphological Lexicon for Turkish
    (European Language Resources Association (ELRA), 2022-06-25) Arıcan, Bilge Nas; Kuzgun, Aslı; Marşan, Büşra; Aslan, Deniz Baran; Sanıyar, Ezgi; Cesur, Neslihan; Kara, Neslihan; Kuyrukçu, Oğuzhan; Özçelik, Merve; Yenice, Arife Betül; Doğan, Merve; Oksal, Ceren; Ercan, Gökhan; Yıldız, Olcay Taner
    MorphoLex is a study in which the roots, prefixes, and suffixes of words are analyzed. With MorphoLex, many words can be analyzed according to certain rules and a useful database can be created. Because Turkish is an agglutinative language with a rich structure, it yields analyses and results that differ from those of previous MorphoLex studies. In this study, we describe the process of creating a database of 48,472 words and the results arising from the differences in language structure.