Işık Üniversitesi Kurumsal Akademik Belleği :: DSpace Angular

Arama Sonuçları

Listeleniyor 1 - 10 / 23

Integrating Turkish Wordnet KeNet to Princeton WordNet: The case of one-to-many correspondences
(Institute of Electrical and Electronics Engineers Inc., 2019-10) Bakay, Özge; Ergelen, Özlem; Yıldız, Olcay Taner
In this paper, we introduce a novel approach of forming interlingual relations between multilingual wordnets. We have mapped Turkish senses in KeNet with their corresponding senses in Princeton WordNet by drawing one-To-many correspondences. As a result of language-specific properties, one synset in one language is matched with multiple synsets in the other language in some cases. Our method of integrating KeNet into a multilingual network also included mapping the most frequent 5000 senses in English with their equivalent senses in Turkish. What we demonstrate is that one-To-many interlingual correspondances are necessary to include in mappings both from Turkish-To-English and English-To-Turkish. Furthermore, one-To-many mappings give us insights into the semantic relations to be constructed in Turkish, such as hypernymy.
Model adaptation for dialog act tagging
(IEEE, 2006) Tür, Gökhan; Güz, Ümit; Hakkani Tür, Dilek
In this paper, we analyze the effect of model adaptation for dialog act tagging. The goal of adaptation is to improve the performance of the tagger using out-of-domain data or models. Dialog act tagging aims to provide a basis for further discourse analysis and understanding in conversational speech. In this study we used the ICSI meeting corpus with high-level meeting recognition dialog act (MRDA) tags, that is, question, statement, backchannel, disruptions, and floor grabbers/holders. We performed controlled adaptation experiments using the Switchboard (SWBD) corpus with SWBD-DAMSL tags as the out-of-domain corpus. Our results indicate that we can achieve significantly better dialog act tagging by automatically selecting a subset of the Switchboard corpus and combining the confidences obtained by both in-domain and out-of-domain models via logistic regression, especially when the in-domain data is limited.
Problems caused by semantic drift in WordNet synset construction
(Institute of Electrical and Electronics Engineers Inc., 2019-09) Bakay, Özge; Ergelen, Özlem; Yıldız, Olcay Taner
In this study, we summarize the semantic drift problem that occur in specific synsets of KeNet, a Turkish WordNet, which is caused by mis-merging of semantically-related lexical items, morphological markings and false part of speech (POS) matchings. We present our approach to these problems in order to eliminate the semantic drift. We have re-analyzed the dictionary definitions of the items, placed those that possess different verbal markings into separate synsets, and divided synsets based on the POS of the items in them.
Multi-task learning on mental disorder detection, sentiment detection and emotion detection
(Işık Üniversitesi, 2024-02-12) Armah, Courage; Dehkharghani, Rahim; Işık Üniversitesi, Lisansüstü Eğitim Enstitüsü, Bilgisayar Mühendisliği Yüksek Lisans Programı; Işık University, School of Graduate Studies, Computer Science Engineering Master Program
Suicidal behavior is a global cause of life-threatening injury and most of the time, death. Mental disorders such as depression, anxiety, and bipolar are prevalent among the youth in recent decades. Social media are popular platforms for individuals to post their thoughts and feelings on. Extracting people’s sentiments and feelings from such online platforms would help detect mental disorders of the users to treat them before it becomes too late. This thesis investigates the use of multi-task learning systems and single-task learning techniques to estimate behaviors and mental states for early diagnosis. I used data mined from Reddit, one of the popular social media platforms that provides anonymity. Anonymity increases the chances of individuals sharing what they truly feel in their real life. The obtained results by the proposed approaches open new doors to the understanding of how multi-task systems can increase the performance of text classification problems such as depression detection, emotion detection, and sentiment analysis, trained together in a multi-task learning network when compared to their training in isolation in a single-task learning network. We used the SWMH dataset, already labeled by 5 different depression labels (depression, anxiety, suicide, bipolar, and off my chest) and then added emotion and polarity labels to it and made it publicly available for researchers in the literature. The obtained results in this study are also comparable to other approaches in the field.
A new approach for named entity recognition
(IEEE, 2017) Ertopçu, Burak; Kanburoğlu, Ali Buğra; Topsakal, Ozan; Açıkgöz, Onur; Gürkan, Ali Tunca; Özenç, Berke; Çam, İlker; Avar, Begüm; Ercan, Gökhan; Yıldız, Olcay Taner
Many sentences create certain impressions on people. These impressions help the reader to have an insight about the sentence via some entities. In NLP, this process corresponds to Named Entity Recognition (NER). NLP algorithms can trace a lot of entities in the sentence like person, location, date, time or money. One of the major problems in these operations are confusions about whether the word denotes the name of a person, a location or an organisation, or whether an integer stands for a date, time or money. In this study, we design a new model for NER algorithms. We train this model in our predefined dataset and compare the results with other models. In the end we get considerable outcomes in a dataset containing 1400 sentences.
Extension of conventional co-training learning strategies to three-view and committee-based learning strategies for effective automatic sentence segmentation
(IEEE, 2018) Dalva, Doğan; Güz, Ümit; Gürkan, Hakan
The objective of this work is to develop effective multi-view semi-supervised machine learning strategies for sentence boundary classification problem when only small sets of sentence boundary labeled data are available. We propose three-view and committee-based learning strategies incorporating with co-training algorithms with agreement, disagreement, and self-combined learning strategies using prosodic, lexical and morphological information. We compare experimental results of proposed three-view and committee-based learning strategies to other semi-supervised learning strategies in the literature namely, self-training and co-training with agreement, disagreement, and self-combined strategies. The experiment results show that sentence segmentation performance can be highly improved using multi-view learning strategies that we propose since data sets can be represented by three redundantly sufficient and disjoint feature sets. We show that the proposed strategies substantially improve the average performance when only a small set of manually labeled data is available for Turkish and English spoken languages, respectively.
All-words word sense disambiguation in Turkish
(Işık Üniversitesi, 2019-09-06) Akçakaya, Sinan; Yıldız, Olcay Taner; Işık Üniversitesi, Fen Bilimleri Enstitüsü, Bilgisayar Mühendisliği Yüksek Lisans Programı
Word sense disambiguation (WSD) is the identi cation of the meaning of words in context in a computational manner. The main subject of this study is to implement and compare the WSD results of various supervised classi ers (Naive Bayes, K Nearest Neighbor, Rocchio and C4.5) in all-words setting. To this end, we have constructed an all-words sense annotated Turkish corpus, using traditional method of manual tagging. During the annotation, a pre-built parallel treebank (aligned from Penn Treebank) has been tagged with the senses of Turkish Language Institutions dictionary. The approach of annotating a treebank allowed us to generate a full-coverage resource, in which syntactic and semantic information merged. In the WSD evaluations, three distinct experiments have been organized to determine the efect of using different feature sets on the disambiguation performance. First experiment has been conducted with a simple feature set that includes the fundamental local features. In the second experiment, the initial feature set has been augmented with several effective morphological features, and in the third one, the feature set has further been extended with the syntactic features. Our test results show that all classi ers have achieved better results in parallel to growing feature set. Additionally, integration of syntactic features has proved to be useful for WSD.
Semantic role labeling for Turkish propbank
(Işık Üniversitesi, 2019-09-06) Esgel, Volkan; Yıldız, Olcay Taner; Işık Üniversitesi, Fen Bilimleri Enstitüsü, Bilgisayar Mühendisliği Yüksek Lisans Programı
People's communication with each other takes place through sentences that combine words with different purposes. Words can gain different meanings with the presence of other words in the sentences in which they take place. With the rapid development of technology, the studies on understanding of human language by computational power have gained speed. These studies are generally referred to Natural Language Processing and their main purpose is to understand the sentences in human communication. The words in the sentence ful l different purposes. Some words describe an event, while other words indicate details of that event. De ning the semantic roles of words is possible with different algorithms. This study rst started by contributing to the process of determining the semantic roles of the word groups in the sentence by manpower. In addition, the semantic roles in the English sentences were parsed and shared on a web site with the marked roles in the Turkish sentences for comparison purposes. Finally, it is tried to measure how the algorithms aiming to nd the semantic roles of the words in the sentence perform automatically for Turkish.
Shallow parsing in Turkish
(IEEE, 2017) Topsakal, Ozan; Açıkgöz, Onur; Gürkan, Ali Tunca; Kanburoğlu, Ali Buğra; Ertopçu, Burak; Özenç, Berke; Çam, İlker; Avar, Begüm; Ercan, Gökhan; Yıldız, Olcay Taner
In this study, shallow parsing is applied on Turkish sentences. These sentences are used to train and test the per-formances of various learning algorithms with various features specified for shallow parsing in Turkish.
Web service translating content into Turkish sign language
(Institute of Electrical and Electronics Engineers Inc., 2020-10-12) Gümüşçekiçci, Gizem; Ezerceli, Özay; Tek, Faik Boray
The essential communication tool for people with hearing loss is sign language. It is way more efficient for their communication. Existing systems for translating the text into sign language are offline and not practical. In this study, we propose a web service-based solution for online translation of content into Turkish Sign Language. We implemented the system and tested it using 32 sentences of 189 words as inputs. The correct word translation rate was 81.74% for the media or audio inputs and the correct word translation for the text inputs was 81.09% The results show the feasibility of the solution and the potential for improvements.

Filtreler

Yazar

Konu

Tarih

İndeks

WoS Q

Scopus Q

Dil

Tür

Kategori

Bölüm

Erişim Hakkı

Tam Metin

Öğe Türü

Ayarlar

Sırala

Sayfa Başına Sonuç

Arama Sonuçları