5 sonuçlar
Arama Sonuçları
Listeleniyor 1 - 5 / 5
Yayın Morphological analyser for Turkish(Işık Üniversitesi, 2018-01-25) Özenç, Berke; Solak, Ercan; Işık Üniversitesi, Fen Bilimleri Enstitüsü, Bilgisayar Mühendisliği Yüksek Lisans ProgramıNatural Language Processing is one one the fields of work in computer science and specializes in text summarization, machine translation and many various topics. Morphology is one of the Natural Language Processing features which analyses the words with its suxes. A words meaning can change according to the sux that it takes. Turkish is an agglutinative language with rich morphological structure and set of suxes. This features of Turkish result in complex morphology structure. In this study, we present an analyser for Modern Anatolian Turkish which has high coverage on suffixes and morphological rules of Turkish. Two-Level transformation method which is convenient to design morphology of a language, consists our base of approach. We used HFST which is a Finite State Transducer implementation, as our implementation technique. The analyser covers all morphological and phonetic rules that exist in Turkish and contains a lexicon which consist of today's Turkish words. The analyser is publicly available and can be used on http://ddil.isikun.edu.tr/mortur.Yayın Psychometric properties of the Turkish version of the eating pathology symptoms inventory (EPSI-T)(Cogent OA, 2025) Türk, Fidan; Acet, Pınar; Karabulut, Goncagül; Akay, NazlıThe purpose of this study was to examine the factor structure and psychometric properties of the Turkish version of the Eating Pathology Symptoms Inventory (EPSI‑T), and to explore gender differences in eating disorder symptoms. Participants were 473 university students in Türkiye (342 women, 113 men) who completed the EPSI‑T, along with the Modified Weight Bias Internalization Scale (WBIS‑M), Addiction‑like Eating Behaviour Scale (AEBS), Muscularity‑Oriented Eating Test (MOET), and Depression Anxiety and Stress Scales (DASS‑21). Confirmatory factor analysis supported the original eight‑factor, 45‑item structure [χ2(914) = 1994.57, χ2/df = 2.18, CFI = 0.90, RMSEA = 0.05 (0.05–0.06), SRMR = 0.07]. Women scored significantly higher on most subscales, except for Excessive Exercise, Muscle Building, and Negative Attitudes toward Obesity, where men scored higher (p < 0.005). Reliability was strong, with Cronbach’s α ranging from 0.72 to 0.90 and McDonald’s ω from 0.75 to 0.90. Convergent and discriminant validity were also supported. Overall, findings suggest that the EPSI‑T is a reliable and valid measure of eating disorder symptoms in Turkish‑speaking populations and may facilitate cross‑cultural research by providing a tool structurally consistent with the original English version.Yayın An approach to anaylse Turkish syntax at morphosyntactic level(Işık Üniversitesi, Lisansüstü Eğitim Enstitüsü, 2025-01-20) Özenç, Berke; Solak, Ercan; Işık Üniversitesi, Lisansüstü Eğitim Enstitüsü, Bilgisayar Mühendisliği Doktora Programı; Işık University, School of Graduate Studies, Ph.D. in Computer EngineeringSyntactic analysis allows us to analyse the sentence structure in various ways. Constituency parsing is one of the various ways of conducting syntactic analysis. This parsing method defines sentence structure as hierarchical relationships between words or phrases and represents them in tree form. Constituency parsing employs constituency grammar which defines how constituents combine and form other constituents. In this grammar, any syntactic structure from the sentence to the words is represented by the constituents. Although this approach is designed to focus on universal aspects of the languages, English has always been in its focus. This situation makes the constituency approach miss the details that the morphology puts in the syntax of morphologically rich languages. In this study, we implement an extension for the constituency parsing which overcomes the challenges in parsing of MRL (Morphologically Rich Language). We propose ideas tailored to Turkish, yet they can be used for any language like Turkish. Our extension enables the constituency parsing to start at the morpheme level. Thus, we involve morphemic structures in the parsing process and express their syntactic effects on the structure. We have our implementations by extending the CYK (Cocke Younger Kasami) algorithm. During parsing, we utilize extra rules to transfer the ambiguity in morphology to the parsing. In addition, we designed a morpheme-focused constituency set for Turkish. This set involves affixes, stems and phrases headed by a stem. We demonstrate our work with a mini treebank and the grammar generated from it.Yayın Türkçe için biçimbirim temelli bir bileşen grameri yaklaşımı(Beykoz Üniversitesi, 2024-12-26) Özenç, Berke; Solak, ErcanDilin modellenmesi, dil çalışmalarında önemli bir temel olarak yer alır. Farklı modelleme yöntemleri, farklı diller için uyarlanabilir olsa da bu uyarlamalar, hedef dil için her zaman yeterli olmayabilir. Bu durumdan en çok biçimbirimsel açıdan zengin diller etkilenir. Böyle bir dil için hazırlanacak model kurgulanırken dilin evrensel olarak ortak olan özelliklerinin yanı sıra, dilin kendine özgü özelliklerine odaklanılmalıdır. Bu makalede, bağımlı biçimbirim bakımından zengin bir görünüm sunan Türkçe ele alınarak uyarlanan gramer sunulmuştur. Çalışmada açıklanan gramer temelleri geleneksel üretici gramer yönteminden uyarlanmıştır. Bununla birlikte, sunulan gramer, biçimbirimleri söz dizimi elemanı olarak geleneksel söz dizimi elemanlarıyla birlikte, söz dizimine olan etkilerini ele almasıyla ve kullanılan özel bileşen kümesiyle geleneksel üretici gramer yöntemden ayrılır. Geleneksel yöntemden farklı olarak önerilen gramerde, tümce çözümlemesine sözcüklerden değil, biçimbirim elemanları olan sözcük gövdeleri, ekler, biçimbirimler ve bu gibi elemanların oluşturduğu gruplardan başlanır. Buna ek olarak Türkçenin söz dizimsel ve birimbirimsel özelliklerine göre kurgulanan bir bileşen kümesi de sunulmuştur. Sunulan bileşen kümesi, tümce, ad öbeği, eylem öbeği, belirteç öbeği gibi geleneksel sözdizimsel bileşenleri, öbek gövdesi olarak adlandırılan ara bir yapıyı ve çoğul eki, durum eki, zaman çekimi eki gibi, biçimbirimleri veya biçimbirim gruplarını temsil eden bileşenleri içerir.Yayın TURSpider: a Turkish Text-to-SQL dataset and LLM-based study(Institute of Electrical and Electronics Engineers Inc., 2024-11-25) Kanburoğlu, Ali Buğra; Tek, Faik BorayThis paper introduces TURSpider, a novel Turkish Text-to-SQL dataset developed through human translation of the widely used Spider dataset, aimed at addressing the current lack of complex, cross-domain SQL datasets for the Turkish language. TURSpider incorporates a wide range of query difficulties, including nested queries, to create a comprehensive benchmark for Turkish Text-to-SQL tasks. The dataset enables cross-language comparison and significantly enhances the training and evaluation of large language models (LLMs) in generating SQL queries from Turkish natural language inputs. We fine-tuned several Turkish-supported LLMs on TURSpider and evaluated their performance in comparison to state-of-the-art models like GPT-3.5 Turbo and GPT-4. Our results show that fine-tuned Turkish LLMs demonstrate competitive performance, with one model even surpassing GPT-based models on execution accuracy. We also apply the Chain-of-Feedback (CoF) methodology to further improve model performance, demonstrating its effectiveness across multiple LLMs. This work provides a valuable resource for Turkish NLP and addresses specific challenges in developing accurate Text-to-SQL models for low-resource languages.












