3 sonuçlar
Arama Sonuçları
Listeleniyor 1 - 3 / 3
Yayın Construction of a Turkish proposition bank(Tubitak Scientific & Technical Research Council Turkey, 2018) Ak, Koray; Toprak, Cansu; Esgel, Volkan; Yıldız, Olcay TanerThis paper describes our approach to developing the Turkish PropBank by adopting the semantic role-labeling guidelines of the original PropBank and using the translation of the English Penn-TreeBank as a resource. We discuss the semantic annotation process of the PropBank and language-specific cases for Turkish, the tools we have developed for annotation, and quality control for multiuser annotation. In the current phase of the project, more than 9500 sentences are semantically analyzed and predicate-argument information is extracted for 1330 verbs and 1914 verb senses. Our plan is to annotate 17,000 sentences by the end of 2017.Yayın Microservices-based databank for Turkish hazelnut cultivars using IoT and semantic web technologies(John Wiley and Sons Ltd, 2024-03-30) Aydın, Şahin; Aldara, DieaaInformation and communication technologies (ICTs) can play a crucial role in facilitating access to comprehensive information on the quality standards of Turkish hazelnut cultivars. In this regard, this study introduces a Hazelnut Databank System (HDS) that utilizes the microservices architecture, an integrated software system supported by the Internet of Things (IoT) and semantic web, to categorize Turkish hazelnut cultivars. The study focuses on developing microservices using various programming languages and frameworks. Specifically, C# on the.NET Core Framework was used for both microservices and the web-based application implemented through the ASP.NET Core MVC Framework. Mobile-based software applications were created using Xamarin. Forms, and the IoT application was developed using the Python programming language. The data storage is facilitated through the MS SQL Server database. Additionally, the study incorporates the implementation of a hazelnut species classification system using the DNN + ResNet50 machine learning model, achieving an impressive accuracy rate of 95.77%. The overall usability of the system was evaluated, resulting in a score of 42 out of 50. By providing detailed information on Turkish hazelnut cultivars, the HDS has the potential to greatly improve hazelnut production quality in Turkey and increase awareness of hazelnut agriculture among relevant stakeholders.Yayın Grammar or crammer? the role of morphology in distinguishing orthographically similar but semantically unrelated words(Institute of Electrical and Electronics Engineers Inc., 2025) Ercan, Gökhan; Yıldız, Olcay TanerWe show that n-gram-based distributional models fail to distinguish unrelated words due to the noise in semantic spaces. This issue remains hidden in conventional benchmarks but becomes more pronounced when orthographic similarity is high. To highlight this problem, we introduce OSimUnr, a dataset of nearly one million English and Turkish word-pairs that are orthographically similar but semantically unrelated (e.g., grammar - crammer). These pairs are generated through a graph-based WordNet approach and morphological resources. We define two evaluation tasks - unrelatedness identification and relatedness classification - to test semantic models. Our experiments reveal that FastText, with default n-gram segmentation, performs poorly (below 5% accuracy) in identifying unrelated words. However, morphological segmentation overcomes this issue, boosting accuracy to 68% (English) and 71% (Turkish) without compromising performance on standard benchmarks (RareWords, MTurk771, MEN, AnlamVer). Furthermore, our results suggest that even state-of-the-art LLMs, including Llama 3.3 and GPT-4o-mini, may exhibit noise in their semantic spaces, particularly in highly synthetic languages such as Turkish. To ensure dataset quality, we leverage WordNet, MorphoLex, and NLTK, covering fully derivational morphology supporting atomic roots (e.g., '-co_here+ance+y' for 'coherency'), with 405 affixes in Turkish and 467 in English.












