Constructing a Turkish constituency parse treeBank

dc.authorid0000-0001-5838-4615
dc.authorid0000-0002-8448-9987
dc.authorid0000-0001-7754-2033
dc.contributor.authorYıldız, Olcay Taneren_US
dc.contributor.authorSolak, Ercanen_US
dc.contributor.authorÇandır, Şemsinuren_US
dc.contributor.authorEhsani, Raziehen_US
dc.contributor.authorGörgün, Onuren_US
dc.date.accessioned2015-11-30T14:13:22Z
dc.date.available2015-11-30T14:13:22Z
dc.date.issued2016
dc.departmentIşık Üniversitesi, Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümüen_US
dc.departmentIşık University, Faculty of Engineering, Department of Computer Engineeringen_US
dc.description.abstractIn this paper, we describe our initial efforts for creating a Turkish constituency parse treebank by utilizing the English Penn Treebank. We employ a semiautomated approach for annotation. In our previouswork [18], the English parse trees were manually translated to Turkish. In this paper, the words are semi-automatically annotated morphologically. As a second step, a rule-based approach is used for refining the parse trees based on the morphological analyses of the words. We generated Turkish phrase structure trees for 5143 sentences from Penn Treebank that contain fewer than 15 tokens. The annotated corpus can be used in statistical natural language processing studies for developing tools such as constituency parsers and statistical machine translation systems for Turkish.en_US
dc.description.versionPublisher's Versionen_US
dc.identifier.citationYıldız, O. T., Solak, E., Çandır, Ş., Ehsani, R. & Görgün, O. (2016). Constructing a Turkish constituency parse treeBank. Paper presented at the Lecture Notes in Electrical Engineering, 363, 339-347. doi:10.1007/978-3-319-22635-4_31en_US
dc.identifier.doi10.1007/978-3-319-22635-4_31
dc.identifier.endpage347
dc.identifier.isbn9783319226354
dc.identifier.issn1876-1100
dc.identifier.issn1876-1119
dc.identifier.scopus2-s2.0-84945967324
dc.identifier.scopusqualityQ4
dc.identifier.startpage339
dc.identifier.urihttps://hdl.handle.net/11729/726
dc.identifier.urihttp://dx.doi.org/10.1007/978-3-319-22635-4_31
dc.identifier.volume363
dc.identifier.wosWOS:000385253500031
dc.identifier.wosqualityN/A
dc.indekslendigikaynakWeb of Scienceen_US
dc.indekslendigikaynakScopusen_US
dc.indekslendigikaynakConference Proceedings Citation Index – Science (CPCI-S)en_US
dc.institutionauthorYıldız, Olcay Taneren_US
dc.institutionauthorSolak, Ercanen_US
dc.institutionauthorÇandır, Şemsinuren_US
dc.institutionauthorEhsani, Raziehen_US
dc.institutionauthorid0000-0001-5838-4615
dc.institutionauthorid0000-0002-8448-9987
dc.language.isoenen_US
dc.peerreviewedYesen_US
dc.publicationstatusPublisheden_US
dc.publisherSpringer Verlagen_US
dc.relation.ispartofLecture Notes in Electrical Engineeringen_US
dc.relation.publicationcategoryKonferans Öğesi - Uluslararası - Kurum Öğretim Elemanıen_US
dc.rightsinfo:eu-repo/semantics/closedAccessen_US
dc.subjectCommunications engineeringen_US
dc.subjectNetworksen_US
dc.subjectComputational linguisticsen_US
dc.subjectSyntacticsen_US
dc.subjectDependency parseren_US
dc.titleConstructing a Turkish constituency parse treeBanken_US
dc.typeConference Objecten_US
dspace.entity.typePublication

Dosyalar

Orijinal paket
Listeleniyor 1 - 1 / 1
Küçük Resim Yok
İsim:
726.pdf
Boyut:
85.79 KB
Biçim:
Adobe Portable Document Format
Açıklama:
Publisher's Version
Lisans paketi
Listeleniyor 1 - 1 / 1
Küçük Resim Yok
İsim:
license.txt
Boyut:
1.71 KB
Biçim:
Item-specific license agreed upon to submission
Açıklama: