Animal sound classification using a convolutional neural network
dc.authorid | 0000-0002-8649-6013 | |
dc.contributor.author | Şaşmaz, Emre | en_US |
dc.contributor.author | Tek, Faik Boray | en_US |
dc.date.accessioned | 2019-03-16T21:58:05Z | |
dc.date.available | 2019-03-16T21:58:05Z | |
dc.date.issued | 2018-12-06 | |
dc.department | Işık Üniversitesi, Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü | en_US |
dc.department | Işık University, Faculty of Engineering, Department of Computer Engineering | en_US |
dc.description.abstract | In this paper, we investigate the problem of animal sound classification using deep learning and propose a system based on convolutional neural network architecture. As the input to the network, sound files were preprocessed to extract Mel Frequency Cepstral Coefficients (MFCC) using LibROSA library. To train and test the system we have collected 875 animal sound samples from an online sound source site for 10 different animal types. We report classification confusion matrices and the results obtained by different gradient descent optimizers. The best accuracy of 75% was obtained by Nesterov-accelerated Adaptive Moment Estimation (Nadam). | en_US |
dc.description.version | Publisher's Version | en_US |
dc.identifier.citation | Şaşmaz, E. & Tek, F. B. (2018). Animal sound classification using A convolutional neural network. Paper presented at the 2018 3rd International Conference on Computer Science and Engineering (UBMK), 625-629. doi:10.1109/UBMK.2018.8566449 | en_US |
dc.identifier.doi | 10.1109/UBMK.2018.8566449 | |
dc.identifier.endpage | 629 | |
dc.identifier.isbn | 9781538678930 | |
dc.identifier.isbn | 9781538678923 | |
dc.identifier.isbn | 9781538678947 | |
dc.identifier.scopus | 2-s2.0-85060595306 | |
dc.identifier.scopusquality | N/A | |
dc.identifier.startpage | 625 | |
dc.identifier.uri | https://hdl.handle.net/11729/1453 | |
dc.identifier.uri | http://dx.doi.org/10.1109/UBMK.2018.8566449 | |
dc.identifier.wos | WOS:000459847400120 | |
dc.identifier.wosquality | N/A | |
dc.indekslendigikaynak | Web of Science | en_US |
dc.indekslendigikaynak | Scopus | en_US |
dc.indekslendigikaynak | Conference Proceedings Citation Index – Science (CPCI-S) | en_US |
dc.institutionauthor | Şaşmaz, Emre | en_US |
dc.institutionauthor | Tek, Faik Boray | en_US |
dc.institutionauthorid | 0000-0002-8649-6013 | |
dc.language.iso | en | en_US |
dc.peerreviewed | Yes | en_US |
dc.publicationstatus | Published | en_US |
dc.publisher | IEEE | en_US |
dc.relation.ispartof | 2018 3rd International Conference on Computer Science and Engineering (UBMK) | en_US |
dc.relation.publicationcategory | Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı | en_US |
dc.rights | info:eu-repo/semantics/closedAccess | en_US |
dc.subject | Animal sound classification | en_US |
dc.subject | Mel frequency cepstral coefficient (MFCC) | en_US |
dc.subject | Convolution neural network (CNN) | en_US |
dc.subject | Confusion matrix (CF) | en_US |
dc.subject | Birds | en_US |
dc.subject | Acoustics | en_US |
dc.subject | Acoustic indices | en_US |
dc.subject | Convolution | en_US |
dc.subject | Deep learning | en_US |
dc.subject | Image resolution | en_US |
dc.subject | Network architecture | en_US |
dc.subject | Neural networks | en_US |
dc.subject | Animal types | en_US |
dc.subject | Confusion matrices | en_US |
dc.subject | Convolution neural network | en_US |
dc.subject | Convolutional neural network | en_US |
dc.subject | Gradient descent | en_US |
dc.subject | Mel-frequency cepstral coefficients | en_US |
dc.subject | Moment estimation | en_US |
dc.subject | Sound classification | en_US |
dc.subject | Animals | en_US |
dc.title | Animal sound classification using a convolutional neural network | en_US |
dc.type | Conference Object | en_US |