Model selection in omnivariate decision trees using Structural Risk Minimization
dc.authorid | 0000-0001-5838-4615 | |
dc.contributor.author | Yıldız, Olcay Taner | en_US |
dc.date.accessioned | 2015-01-15T23:01:51Z | |
dc.date.available | 2015-01-15T23:01:51Z | |
dc.date.issued | 2011-12-01 | |
dc.department | Işık Üniversitesi, Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü | en_US |
dc.department | Işık University, Faculty of Engineering, Department of Computer Engineering | en_US |
dc.description | The authors thank the three anonymous referees and the editor for their constructive comments, pointers to related literature, and pertinent questions, which allowed us to better situate our work, organize the manuscript, and improve the presentation. This work has been supported by the Turkish Scientific Technical Research Council (TÜBİTAK), EEEAG 107E127. | en_US |
dc.description.abstract | As opposed to trees that use a single type of decision node, an omnivariate decision tree contains nodes of different types. We propose to use Structural Risk Minimization (SRM) to choose between node types in omnivariate decision tree construction, so that the complexity of a node matches the complexity of the data reaching that node. To apply SRM for model selection, one needs the VC-dimension of the candidate models. In this paper, we first derive the VC-dimension of the univariate model and then estimate the VC-dimension of all three models (univariate, linear multivariate, and quadratic multivariate) experimentally. Second, we compare SRM with other model selection techniques, including Akaike's Information Criterion (AIC), Bayesian Information Criterion (BIC), and cross-validation (CV), on standard datasets from the UCI and Delve repositories. We see that SRM induces omnivariate trees that have a small percentage of multivariate nodes close to the root, and that these trees generalize better than, or at least as accurately as, those constructed using other model selection techniques. | en_US |
dc.description.sponsorship | TÜBİTAK | en_US |
dc.description.version | Publisher's Version | en_US |
dc.description.version | Author Pre-Print | en_US |
dc.identifier.citation | Yıldız, O. T. (2011). Model selection in omnivariate decision trees using structural risk minimization. Information Sciences, 181(23), 5214-5226. doi:10.1016/j.ins.2011.07.028 | en_US |
dc.identifier.doi | 10.1016/j.ins.2011.07.028 | |
dc.identifier.endpage | 5226 | |
dc.identifier.issn | 0020-0255 | |
dc.identifier.issn | 1872-6291 | |
dc.identifier.issue | 23 | |
dc.identifier.scopus | 2-s2.0-80052917492 | |
dc.identifier.scopusquality | Q1 | |
dc.identifier.startpage | 5214 | |
dc.identifier.uri | https://hdl.handle.net/11729/400 | |
dc.identifier.uri | http://dx.doi.org/10.1016/j.ins.2011.07.028 | |
dc.identifier.volume | 181 | |
dc.identifier.wos | WOS:000295760600007 | |
dc.identifier.wosquality | N/A | |
dc.indekslendigikaynak | Web of Science | en_US |
dc.indekslendigikaynak | Scopus | en_US |
dc.indekslendigikaynak | Science Citation Index Expanded (SCI-EXPANDED) | en_US |
dc.institutionauthor | Yıldız, Olcay Taner | en_US |
dc.institutionauthorid | 0000-0001-5838-4615 | |
dc.language.iso | en | en_US |
dc.peerreviewed | Yes | en_US |
dc.publicationstatus | Published | en_US |
dc.publisher | Elsevier Science Inc | en_US |
dc.relation.ispartof | Information Sciences | en_US |
dc.relation.publicationcategory | Article - International Peer-Reviewed Journal - Institutional Faculty Member | en_US |
dc.rights | info:eu-repo/semantics/closedAccess | en_US |
dc.subject | Classification | en_US |
dc.subject | Machine learning | en_US |
dc.subject | Model selection | en_US |
dc.subject | VC-dimension | en_US |
dc.subject | Structural risk minimization | en_US |
dc.subject | Decision tree | en_US |
dc.title | Model selection in omnivariate decision trees using Structural Risk Minimization | en_US |
dc.type | Article | en_US |
dspace.entity.type | Publication |
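
The SRM-based choice among node types described in the abstract could be sketched roughly as follows. This is a minimal illustration, not the paper's implementation. It assumes one commonly quoted form of the VC generalization bound for classification, with constants a1 = 4 and a2 = 2 and a confidence parameter eta that are assumptions here. The function names `vc_bound` and `select_node_type`, as well as the training errors and VC-dimension values in the usage example, are hypothetical placeholders rather than figures from the article.

```python
import math


def vc_bound(train_error, h, n, a1=4.0, a2=2.0, eta=0.05):
    """Upper bound on generalization error for a model with VC-dimension h.

    Assumed form (not taken from the article):
        eps   = a1 * (h * (ln(a2 * n / h) + 1) - ln(eta / 4)) / n
        bound = train_error + (eps / 2) * (1 + sqrt(1 + 4 * train_error / eps))
    """
    if h <= 0 or n <= 0:
        raise ValueError("h and n must be positive")
    eps = a1 * (h * (math.log(a2 * n / h) + 1.0) - math.log(eta / 4.0)) / n
    return train_error + (eps / 2.0) * (1.0 + math.sqrt(1.0 + 4.0 * train_error / eps))


def select_node_type(candidates, n):
    """Return the candidate name with the smallest bound.

    candidates maps a node-type name to (training error, VC-dimension);
    n is the number of training instances reaching the node.
    """
    return min(candidates, key=lambda name: vc_bound(*candidates[name], n))


if __name__ == "__main__":
    # Hypothetical errors and VC-dimensions for one node with 1000 instances.
    node_candidates = {
        "univariate": (0.40, 3),
        "linear": (0.05, 11),
        "quadratic": (0.04, 66),
    }
    print(select_node_type(node_candidates, n=1000))
```

In this sketch a more complex node type is chosen only when its lower training error outweighs the larger complexity penalty. Because the penalty shrinks as more instances reach a node, complex nodes are easier to justify where data are plentiful.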