36 sonuçlar
Arama Sonuçları
Listeleniyor 1 - 10 / 36
Yayın EEG signal compression based on classified signature and envelope vector sets(Wiley, 2009-03) Gürkan, Hakan; Güz, Ümit; Yarman, Bekir Sıddık BinboğaIn this paper, a novel method to compress electroencephalogram (EEG) signal is proposed. The proposed method is based on the generation process of the classified signature and envelope vector sets (CSEVS), which employs an effective k-means clustering algorithm. It is assumed that both the transmitter and the receiver units have the same CSEVS. In this work, on a frame basis, EEG signals are modeled by multiplying only three factors called as classified signature vector, classified envelope vector, and gain coefficient (GC), respectively. In other words, every frame of an EEG signal is represented by two indices R and K of CSEVS and the GC. EEG signals are reconstructed frame by frame using these numbers in the receiver unit by employing the CSEVS. The proposed method is evaluated by using some evaluation metrics that are commonly used in this area such as root-mean-square error, percentage root-mean-square difference, and measuring with visual inspection. The performance of the proposed method is also compared with the other methods. It is observed that the proposed method achieves high compression ratios with low-level reconstruction error while preserving diagnostic information in the reconstructed EEG signal.Yayın A New speech coding algorithm using zero cross and phoneme based SYMPES(IEEE, 2013-07-11) Şişman, Burak; Güz, Ümit; Gürkan, Hakan; Yarman, Bekir Sıddık BinboğaIn this work, a new low bit rate hybrid speech coding approach which combines the benefits of the SYMPES (Systematic Procedure for Predefined Envelope and Signature Sequences) and zero cross and phoneme based segmentation is proposed. In the new approach, the SYMPES structure is developed in the phoneme based fashion. In order to achieve lower bit rates, some drawbacks such as computational complexity, relatively high encoding times etc. of the SYMPES are also eliminated in the new version. Experimental results show that in almost same bit rates very promising speech quality is obtained compared to the other conventional methods such as CELP (Code Excited Linear Predictive) coding algorithm.Yayın Cascaded model adaptation for dialog act segmentation and tagging(Elsevier Ltd, 2010-04) Güz, Ümit; Tür, Gökhan; Hakkani Tür, Dilek; Cuendet, SebastienThere are many speech and language processing problems which require cascaded classification tasks. While model adaptation has been shown to be useful in isolated speech and language processing tasks, it is not clear what constitutes system adaptation for such complex systems. This paper studies the following questions: In cases where a sequence of classification tasks is employed, how important is to adapt the earlier or latter systems? Is the performance improvement obtained in the earlier stages via adaptation carried on to later stages in cases where the later stages perform adaptation using similar data and/or methods? In this study, as part of a larger scale multiparty meeting understanding system, we analyze various methods for adapting dialog act segmentation and tagging models trained on conversational telephone speech (CTS) to meeting style conversations. We investigate the effect of using adapted and unadapted models for dialog act segmentation with those of tagging, showing the effect of model adaptation for cascaded classification tasks. Our results indicate that we can achieve significantly better dialog act segmentation and tagging by adapting the out-of-domain models, especially when the amount of in-domain data is limited. Experimental results show that it is more effective to adapt the models in the latter classification tasks, in our case dialog act tagging, when dealing with a sequence of cascaded classification tasksYayın A novel computed tomography image compression method based on classified energy and pattern blocks(IEEE, 2013) Gökbay, İnci Zaim; Gezer, Murat; Güz, Ümit; Gürkan, Hakan; Yarman, Bekir Sıddık BinboğaIn this work, a new biomedical image compression method is proposed based on the classified energy and pattern blocks (CEPB). CEPB based compression method is specifically applied on the Computed Tomography (CT) images and the evaluation results are presented. Essentially, the CEPB is uniquely designed and structured codebook which is located on the both the transmitter and receiver part of a communication system in order to implement encoding and decoding processes. The encoding parameters are block scaling coefficient (BSC) and the index numbers of energy (IE) and pattern blocks (IP) determined for each block of the input images based on the CEPB. The evaluation results show that the newly proposed method provides considerable image compression ratios and image quality.Yayın Generative and discriminative methods using morphological information for sentence segmentation of Turkish(IEEE-INST Electrical Electronics Engineers Inc, 2009-07) Güz, Ümit; Favre, Benoit; Hakkani Tür, Dilek; Tür, GökhanThis paper presents novel methods for generative, discriminative, and hybrid sequence classification for segmentation of Turkish word sequences into sentences. In the literature, this task is generally solved using statistical models that take advantage of lexical information among others. However, Turkish has a productive morphology that generates a very large vocabulary, making the task much harder. In this paper, we introduce a new set of morphological features, extracted from words and their morphological analyses. We also extend the established method of hidden event language modeling (HELM) to factored hidden event language modeling (fHELM) to handle morphological information. In order to capture non-lexical information, we extract a set of prosodic features, which are mainly motivated from our previous work for other languages. We then employ discriminative classification techniques, boosting and conditional random fields (CRFs), combined with fHELM, for the task of Turkish sentence segmentation.Yayın Model adaptation for dialog act tagging(IEEE, 2006) Tür, Gökhan; Güz, Ümit; Hakkani Tür, DilekIn this paper, we analyze the effect of model adaptation for dialog act tagging. The goal of adaptation is to improve the performance of the tagger using out-of-domain data or models. Dialog act tagging aims to provide a basis for further discourse analysis and understanding in conversational speech. In this study we used the ICSI meeting corpus with high-level meeting recognition dialog act (MRDA) tags, that is, question, statement, backchannel, disruptions, and floor grabbers/holders. We performed controlled adaptation experiments using the Switchboard (SWBD) corpus with SWBD-DAMSL tags as the out-of-domain corpus. Our results indicate that we can achieve significantly better dialog act tagging by automatically selecting a subset of the Switchboard corpus and combining the confidences obtained by both in-domain and out-of-domain models via logistic regression, especially when the in-domain data is limited.Yayın Effective semi-supervised learning strategies for automatic sentence segmentation(Elsevier Science BV, 2018-04-01) Dalva, Doğan; Güz, Ümit; Gürkan, HakanThe primary objective of sentence segmentation process is to determine the sentence boundaries of a stream of words output by the automatic speech recognizers. Statistical methods developed for sentence segmentation requires a significant amount of labeled data which is time-consuming, labor intensive and expensive. In this work, we propose new multi-view semi-supervised learning strategies for sentence boundary classification problem using lexical, prosodic, and morphological information. The aim is to find effective semi-supervised machine learning strategies when only small sets of sentence boundary labeled data are available. We primarily investigate two semi-supervised learning approaches, called self-training and co-training. Different example selection strategies were also used for co-training, namely, agreement, disagreement and self-combined. Furthermore, we propose three-view and committee-based algorithms incorporating with agreement, disagreement and self-combined strategies using three disjoint feature sets. We present comparative results of different learning strategies on the sentence segmentation task. The experimental results show that the sentence segmentation performance can be highly improved using multi-view learning strategies that we proposed since data sets can be represented by three redundantly sufficient and disjoint feature sets. We show that the proposed strategies substantially improve the average baseline F-measure of 67.66% to 75.15% and 64.84% to 66.32% when only a small set of manually labeled data is available for Turkish and English spoken languages, respectively.Yayın Biometric identification using fingertip electrocardiogram signals(Springer London Ltd, 2018-07) Güven, Gökhan; Gürkan, Hakan; Güz, ÜmitIn this research work, we present a newly fingertip electrocardiogram (ECG) data acquisition device capable of recording the lead-1 ECG signal through the right- and left-hand thumb fingers. The proposed device is high-sensitive, dry-contact, portable, user-friendly, inexpensive, and does not require using conventional components which are cumbersome and irritating such as wet adhesive Ag/AgCl electrodes. One of the other advantages of this device is to make it possible to record and use the lead-1 ECG signal easily in any condition and anywhere incorporating with any platform to use for advanced applications such as biometric recognition and clinical diagnostics. Furthermore, we proposed a biometric identification method based on combining autocorrelation and discrete cosine transform-based features, cepstral features, and QRS beat information. The proposed method was evaluated on three fingertip ECG signal databases recorded by utilizing the proposed device. The experimental results demonstrate that the proposed biometric identification method achieves person recognition rate values of 100% (30 out of 30), 100% (45 out of 45), and 98.33% (59 out of 60) for 30, 45, and 60 subjects, respectively.Yayın A novel biometric identification system based on fingertip electrocardiogram and speech signals(Elsevier Inc., 2022-03) Güven, Gökhan; Güz, Ümit; Gürkan, HakanIn this research work, we propose a one-dimensional Convolutional Neural Network (CNN) based biometric identification system that combines speech and ECG modalities. The aim is to find an effective identification strategy while enhancing both the confidence and the performance of the system. In our first approach, we have developed a voting-based ECG and speech fusion system to improve the overall performance compared to the conventional methods. In the second approach, we have developed a robust rejection algorithm to prevent unauthorized access to the fusion system. We also presented a newly developed ECG spike and inconsistent beats removal algorithm to detect and eliminate the problems caused by portable fingertip ECG devices and patient movements. Furthermore, we have achieved a system that can work with only one authorized user by adding a Universal Background Model to our algorithm. In the first approach, the proposed fusion system achieved a 100% accuracy rate for 90 people by taking the average of 3-fold cross-validation. In the second approach, by using 90 people as genuine classes and 26 people as imposter classes, the proposed system achieved 92% accuracy in identifying genuine classes and 96% accuracy in rejecting imposter classes.












