Işık Üniversitesi Kurumsal Akademik Belleği :: DSpace Angular

Arama Sonuçları

Listeleniyor 1 - 10 / 20

Visual modeling of Turkish morphology
(European Language Resources Association (ELRA), 2020-05-16) Özenç, Berke; Solak, Ercan
In this paper, we describe the steps in a visual modeling of Turkish morphology using diagramming tools. We aimed to make modeling easier and more maintainable while automating much of the code generation. We released the resulting analyzer, MorTur, and the diagram conversion tool, DiaMor as free, open-source utilities. MorTur analyzer is also publicly available on its web page as a web service. MorTur and DiaMor are part of our ongoing efforts in building a set of natural language processing tools for Turkic languages under a consistent framework.
Vikipedi ve Vikisözlük'ten Hypernym çıkarma
(IEEE, 2017-06-27) Şaşmaz, Emre; Ehsani, Razieh; Yıldız, Olcay Taner
Doğal dil işleme alanında kullanılan önemli yapılardan bir tanesi WordNet gibi büyük ölçekli sözlüklerdir. WordNet; eşanlamlı, zıt anlamlı gibi anlamsal ilişkileri de içeren kapsamlı bir sözlüktür. Bu bildiride, WordNet’in önemli bir parçası olan Hypernym-Hyponym ilişkisini çıkarmaya çalıştık. Bu amaca ulaşmak için, Vikipedi, Türkçe Sözlük ve Vikisözlük kaynaklarını kullandık. Sonlu Durum Makinelerinden ürettiğimiz kurallarla Hypernym-Hyponym ilişkilerini çıkardık.
Integrating Turkish Wordnet KeNet to Princeton WordNet: The case of one-to-many correspondences
(Institute of Electrical and Electronics Engineers Inc., 2019-10) Bakay, Özge; Ergelen, Özlem; Yıldız, Olcay Taner
In this paper, we introduce a novel approach of forming interlingual relations between multilingual wordnets. We have mapped Turkish senses in KeNet with their corresponding senses in Princeton WordNet by drawing one-To-many correspondences. As a result of language-specific properties, one synset in one language is matched with multiple synsets in the other language in some cases. Our method of integrating KeNet into a multilingual network also included mapping the most frequent 5000 senses in English with their equivalent senses in Turkish. What we demonstrate is that one-To-many interlingual correspondances are necessary to include in mappings both from Turkish-To-English and English-To-Turkish. Furthermore, one-To-many mappings give us insights into the semantic relations to be constructed in Turkish, such as hypernymy.
Kural bazlı otomatik haber etiketleme
(IEEE, 2017-06-27) Özenç, Berke; Solak, Ercan
Bu çalışmada , genel ağ kaynaklarından haber toplayan ve topladığı bu haberleri otomatik olarak etiketleyen kural tabanlı bir uygulama yapılmıştır. Çalışmanın alt amacı hangi özelliklerin etiket belirleme işine daha uygun olduğunu ölçmektir. Elle etiketlenmiş 100 haber üzerinde her bir kuralın başarısı oranı ölçülmüştür.
Problems caused by semantic drift in WordNet synset construction
(Institute of Electrical and Electronics Engineers Inc., 2019-09) Bakay, Özge; Ergelen, Özlem; Yıldız, Olcay Taner
In this study, we summarize the semantic drift problem that occur in specific synsets of KeNet, a Turkish WordNet, which is caused by mis-merging of semantically-related lexical items, morphological markings and false part of speech (POS) matchings. We present our approach to these problems in order to eliminate the semantic drift. We have re-analyzed the dictionary definitions of the items, placed those that possess different verbal markings into separate synsets, and divided synsets based on the POS of the items in them.
Türkçe kelime ağı KeNet için arayüz
(Institute of Electrical and Electronics Engineers Inc., 2019-04) Özçelik, Rıza; Uludoğan, Gökçe; Parlar, Selen; Bakay, Özge; Ergelen, Özlem; Yıldız, Olcay Taner
Kelime ağları, bir dildeki kelimeler arasındaki bağlantıları, eş anlam kümeleri oluşturarak ve bu kümeleri birbirine çeşitli anlamsal bağıntılar ile bağlayarak temsil eden bir çizge veri yapısıdır. Doğal dil işleme alanındaki en yaygın bilinen kelime ağı WordNet 1990 yılında İngilizce için oluşturulmuşken, Türkçe için en kapsamlı ağ, 2018 yılında oluşturulan KeNet’tir. Bildiğimiz kadarıyla, içinde 80000 eş anlam kümesi ve 25 farklı anlamsal bağlantı bulunan KeNet için şu ana kadar geliştirilen bir kullanıcı arayüzü yoktur. Bu çalışmada, KeNet çizgesinde, anlamsal bağlantıları kullanarak eş anlam kümeleri arasında çevrimiçi olarak gezinmeyi sağlayan bir arayüz sunuyoruz. Bu arayüz sayesinde, bir söz öbeği KeNet’te aranabilir ve eş anlam kümeleri arasındaki üst/alt anlam, parça-bütün ilişkileri gibi ilişkiler kullanılarak KeNet üzerinde gezilebilir. Ayrıca, herhangi bir eş anlam kümesinin, varsa, İngilizce karşılığının kimliği de görüntülenebilir ve bu kümeye WordNet’e ait internet sayfasından erişilebilir.
MorAz: An open-source morphological analyzer for Azerbaijani Turkish
(Association for Computational Linguistics (ACL), 2018) Özenç, Berke; Ehsani, Razieh; Solak, Ercan
MorAz is an open-source morphological analyzer for Azerbaijani Turkish. The analyzer is available through both as a website for interactive exploration and as a RESTful web service for integration into a natural language processing pipeline. MorAz implements the morphology of Azerbaijani Turkish following a two-level approach using Helsinki finite-state transducer and wraps the analyzer with python scripts in a Django instance.
Chunking in Turkish with conditional random fields
(Springer-Verlag, 2015-04-14) Yıldız, Olcay Taner; Solak, Ercan; Ehsani, Razieh; Görgün, Onur
In this paper, we report our work on chunking in Turkish. We used the data that we generated by manually translating a subset of the Penn Treebank. We exploited the already available tags in the trees to automatically identify and label chunks in their Turkish translations. We used conditional random fields (CRF) to train a model over the annotated data. We report our results on different levels of chunk resolution.
An all-words sense annotated Turkish corpus
(IEEE, 2018-06-06) Akçakaya, Sinan; Yıldız, Olcay Taner
This paper reports our efforts in constructing of a sense labeled Turkish corpus with respect to Turkish Language Institution's dictionary, using the traditional method of manual tagging. We tagged a pre-built parallel treebank which is translated from the Penn Treebank II corpus. This approach allowed us to generate a full-coverage resource, in which syntactic and semantic information merged. We also provide miscellaneous statistics about the corpus itself as well as its development process.
A multilayer annotated corpus for Turkish
(IEEE, 2018-06-06) Yıldız, Olcay Taner; Ak, Koray; Ercan, Gökhan; Topsakal, Ozan; Asmazoğlu, Cengiz
In this paper, we present the first multilayer annotated corpus for Turkish, which is a low-resourced agglutinative language. Our dataset consists of 9,600 sentences translated from the Penn Treebank Corpus. Annotated layers contain syntactic and semantic information including morphological disambiguation of words, named entity annotation, shallow parse, sense annotation, and semantic role label annotation.

Filtreler

Yazar

Konu

Tarih

İndeks

WoS Q

Scopus Q

Dil

Tür

Kategori

Bölüm

Erişim Hakkı

Tam Metin

Öğe Türü

Ayarlar

Sırala

Sayfa Başına Sonuç

Arama Sonuçları