A Novel Image Compression Method Based on Classified Energy and Pattern Blocks: Initial Results U... more A Novel Image Compression Method Based on Classified Energy and Pattern Blocks: Initial Results Umit Guz1, Hakan Gurkan1, and B. Siddik Yarman2 1 Isik University, Engineering Faculty, Department of Electronics Engineering, Sile, Istanbul, Turkey 2 Istanbul University, Engineering ...
2018 IEEE Spoken Language Technology Workshop (SLT), 2018
The objective of this work is to develop effective multiview semi-supervised machine learning str... more The objective of this work is to develop effective multiview semi-supervised machine learning strategies for sentence boundary classification problem when only small sets of sentence boundary labeled data are available. We propose three-view and committee-based learning strategies incorporating with co-training algorithms with agreement, disagreement, and self-combined learning strategies using prosodic, lexical and morphological information. We compare experimental results of proposed three-view and committee-based learning strategies to other semi-supervised learning strategies in the literature namely, self-training and co-training with agreement, disagreement, and self-combined strategies. The experiment results show that sentence segmentation performance can be highly improved using multi-view learning strategies that we propose since data sets can be represented by three redundantly sufficient and disjoint feature sets. We show that the proposed strategies substantially improve the average performance when only a small set of manually labeled data is available for Turkish and English spoken languages, respectively.
In this paper, a novel method to compress ElectroEncephaloGram (EEG) Signal is proposed. The prop... more In this paper, a novel method to compress ElectroEncephaloGram (EEG) Signal is proposed. The proposed method is based on the generation Classified Signature and Envelope Vector Sets (CSEVS) by using an effective k-means clustering algorithm. In this work, on a frame basis, any EEG signal is modeled by multiplying three parameters as called the Classified Signature Vector, Classified Envelope Vector, and Frame-Scaling Coefficient. In this case, EEG signal for each frame is described in terms of the two indices R and K of CSEVS and the frame-scaling coefficient. The proposed method is assessed through the use of root-mean-square error (RMSE) and visual inspection measures. The proposed method achieves good compression ratios with low level reconstruction error while preserving diagnostic information in the reconstructed EEG signal.
In this research work, we present a newly fingertip electrocardiogram (ECG) data acquisition devi... more In this research work, we present a newly fingertip electrocardiogram (ECG) data acquisition device capable of recording the lead-1 ECG signal through the right- and left-hand thumb fingers. The proposed device is high-sensitive, dry-contact, portable, user-friendly, inexpensive, and does not require using conventional components which are cumbersome and irritating such as wet adhesive Ag/AgCl electrodes. One of the other advantages of this device is to make it possible to record and use the lead-1 ECG signal easily in any condition and anywhere incorporating with any platform to use for advanced applications such as biometric recognition and clinical diagnostics. Furthermore, we proposed a biometric identification method based on combining autocorrelation and discrete cosine transform-based features, cepstral features, and QRS beat information. The proposed method was evaluated on three fingertip ECG signal databases recorded by utilizing the proposed device. The experimental results demonstrate that the proposed biometric identification method achieves person recognition rate values of 100% (30 out of 30), 100$$\%$$% (45 out of 45), and 98.33$$\%$$% (59 out of 60) for 30, 45, and 60 subjects, respectively.
A Novel Image Compression Method Based on Classified Energy and Pattern Building Blocks, 2011
In this paper, a novel image compression method based on generation of the so-called classified e... more In this paper, a novel image compression method based on generation of the so-called classified energy and pattern blocks (CEPB) is introduced and evaluation results are presented. The CEPB is constructed using the training images and then located at both the transmitter and receiver sides of the communication system. Then the energy and pattern blocks of input images to be reconstructed are determined by the same way in the construction of the CEPB. This process is also associated with a matching procedure to determine the index numbers of the classified energy and pattern blocks in the CEPB which best represents (matches) the energy and pattern blocks of the input images. Encoding parameters are block scaling coefficient and index numbers of energy and pattern blocks determined for each block of the input images. These parameters are sent from the transmitter part to the receiver part and the classified energy and pattern blocks associated with the index numbers are pulled from the CEPB. Then the input image is reconstructed block by block in the receiver part using a mathematical model that is proposed. Evaluation results show that the method provides considerable image compression ratios and image quality even at low bit rates.
The primary objective of sentence segmentation process is to determine the sentence boundaries of... more The primary objective of sentence segmentation process is to determine the sentence boundaries of a stream of words output by the automatic speech recognizers. Statistical methods developed for sentence segmentation requires a significant amount of labeled data which is time-consuming, labor intensive and expensive. In this work, we propose new multi-view semi-supervised learning strategies for sentence boundary classification problem using lexical, prosodic, and morphological information. The aim is to find effective semi-supervised machine learning strategies when only small sets of sentence boundary labeled data are available. We primarily investigate two semi-supervised learning approaches, called self-training and co-training. Different example selection strategies were also used for co-training, namely, agreement, disagreement and self-combined. Furthermore, we propose three-view and committee-based algorithms incorporating with agreement, disagreement and self-combined strategies using three disjoint feature sets. We present comparative results of different learning strategies on the sentence segmentation task. The experimental results show that the sentence segmentation performance can be highly improved using multi-view learning strategies that we proposed since data sets can be represented by three redundantly sufficient and disjoint feature sets. We show that the proposed strategies substantially improve the average baseline F-measure of 67.66% to 75.15% and 64.84% to 66.32% when only a small set of manually labeled data is available for Turkish and English spoken languages, respectively.
A Novel Image Compression Method Based on Classified Energy and Pattern Blocks: Initial Results U... more A Novel Image Compression Method Based on Classified Energy and Pattern Blocks: Initial Results Umit Guz1, Hakan Gurkan1, and B. Siddik Yarman2 1 Isik University, Engineering Faculty, Department of Electronics Engineering, Sile, Istanbul, Turkey 2 Istanbul University, Engineering ...
2018 IEEE Spoken Language Technology Workshop (SLT), 2018
The objective of this work is to develop effective multiview semi-supervised machine learning str... more The objective of this work is to develop effective multiview semi-supervised machine learning strategies for sentence boundary classification problem when only small sets of sentence boundary labeled data are available. We propose three-view and committee-based learning strategies incorporating with co-training algorithms with agreement, disagreement, and self-combined learning strategies using prosodic, lexical and morphological information. We compare experimental results of proposed three-view and committee-based learning strategies to other semi-supervised learning strategies in the literature namely, self-training and co-training with agreement, disagreement, and self-combined strategies. The experiment results show that sentence segmentation performance can be highly improved using multi-view learning strategies that we propose since data sets can be represented by three redundantly sufficient and disjoint feature sets. We show that the proposed strategies substantially improve the average performance when only a small set of manually labeled data is available for Turkish and English spoken languages, respectively.
In this paper, a novel method to compress ElectroEncephaloGram (EEG) Signal is proposed. The prop... more In this paper, a novel method to compress ElectroEncephaloGram (EEG) Signal is proposed. The proposed method is based on the generation Classified Signature and Envelope Vector Sets (CSEVS) by using an effective k-means clustering algorithm. In this work, on a frame basis, any EEG signal is modeled by multiplying three parameters as called the Classified Signature Vector, Classified Envelope Vector, and Frame-Scaling Coefficient. In this case, EEG signal for each frame is described in terms of the two indices R and K of CSEVS and the frame-scaling coefficient. The proposed method is assessed through the use of root-mean-square error (RMSE) and visual inspection measures. The proposed method achieves good compression ratios with low level reconstruction error while preserving diagnostic information in the reconstructed EEG signal.
In this research work, we present a newly fingertip electrocardiogram (ECG) data acquisition devi... more In this research work, we present a newly fingertip electrocardiogram (ECG) data acquisition device capable of recording the lead-1 ECG signal through the right- and left-hand thumb fingers. The proposed device is high-sensitive, dry-contact, portable, user-friendly, inexpensive, and does not require using conventional components which are cumbersome and irritating such as wet adhesive Ag/AgCl electrodes. One of the other advantages of this device is to make it possible to record and use the lead-1 ECG signal easily in any condition and anywhere incorporating with any platform to use for advanced applications such as biometric recognition and clinical diagnostics. Furthermore, we proposed a biometric identification method based on combining autocorrelation and discrete cosine transform-based features, cepstral features, and QRS beat information. The proposed method was evaluated on three fingertip ECG signal databases recorded by utilizing the proposed device. The experimental results demonstrate that the proposed biometric identification method achieves person recognition rate values of 100% (30 out of 30), 100$$\%$$% (45 out of 45), and 98.33$$\%$$% (59 out of 60) for 30, 45, and 60 subjects, respectively.
A Novel Image Compression Method Based on Classified Energy and Pattern Building Blocks, 2011
In this paper, a novel image compression method based on generation of the so-called classified e... more In this paper, a novel image compression method based on generation of the so-called classified energy and pattern blocks (CEPB) is introduced and evaluation results are presented. The CEPB is constructed using the training images and then located at both the transmitter and receiver sides of the communication system. Then the energy and pattern blocks of input images to be reconstructed are determined by the same way in the construction of the CEPB. This process is also associated with a matching procedure to determine the index numbers of the classified energy and pattern blocks in the CEPB which best represents (matches) the energy and pattern blocks of the input images. Encoding parameters are block scaling coefficient and index numbers of energy and pattern blocks determined for each block of the input images. These parameters are sent from the transmitter part to the receiver part and the classified energy and pattern blocks associated with the index numbers are pulled from the CEPB. Then the input image is reconstructed block by block in the receiver part using a mathematical model that is proposed. Evaluation results show that the method provides considerable image compression ratios and image quality even at low bit rates.
The primary objective of sentence segmentation process is to determine the sentence boundaries of... more The primary objective of sentence segmentation process is to determine the sentence boundaries of a stream of words output by the automatic speech recognizers. Statistical methods developed for sentence segmentation requires a significant amount of labeled data which is time-consuming, labor intensive and expensive. In this work, we propose new multi-view semi-supervised learning strategies for sentence boundary classification problem using lexical, prosodic, and morphological information. The aim is to find effective semi-supervised machine learning strategies when only small sets of sentence boundary labeled data are available. We primarily investigate two semi-supervised learning approaches, called self-training and co-training. Different example selection strategies were also used for co-training, namely, agreement, disagreement and self-combined. Furthermore, we propose three-view and committee-based algorithms incorporating with agreement, disagreement and self-combined strategies using three disjoint feature sets. We present comparative results of different learning strategies on the sentence segmentation task. The experimental results show that the sentence segmentation performance can be highly improved using multi-view learning strategies that we proposed since data sets can be represented by three redundantly sufficient and disjoint feature sets. We show that the proposed strategies substantially improve the average baseline F-measure of 67.66% to 75.15% and 64.84% to 66.32% when only a small set of manually labeled data is available for Turkish and English spoken languages, respectively.
Uploads
Papers by UMIT GUZ
procedure to determine the index numbers of the classified energy and pattern blocks in the CEPB which best represents (matches) the energy and pattern blocks of the input images. Encoding parameters are block scaling coefficient and index numbers of energy and pattern blocks determined for each block of the input images. These parameters are sent from the transmitter part to the receiver part and the classified energy and pattern blocks associated with the index numbers are pulled from the CEPB. Then the input image is reconstructed block by block in the receiver part using a mathematical model that is proposed. Evaluation results show that the method provides considerable image compression ratios and image quality even at low bit rates.
procedure to determine the index numbers of the classified energy and pattern blocks in the CEPB which best represents (matches) the energy and pattern blocks of the input images. Encoding parameters are block scaling coefficient and index numbers of energy and pattern blocks determined for each block of the input images. These parameters are sent from the transmitter part to the receiver part and the classified energy and pattern blocks associated with the index numbers are pulled from the CEPB. Then the input image is reconstructed block by block in the receiver part using a mathematical model that is proposed. Evaluation results show that the method provides considerable image compression ratios and image quality even at low bit rates.