Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

Hjorth Derivation

Download as pdf or txt
Download as pdf or txt
You are on page 1of 125

Methods to Improve the Signal Quality of

Corrupted Multi-Parameter Physiological Signals


by
Gartheeban Ganeshapillai
Submitted to the Department of Electrical Engineering and Computer
Science
in partial fulllment of the requirements for the degree of
Master of Science in Computer Science and Engineering
at the
MASSACHUSETTS INSTITUTE OF TECHNOLOGY
June 2011
c Massachusetts Institute of Technology 2011. All rights reserved.
Author . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Department of Electrical Engineering and Computer Science
May 09, 2011
Certied by. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
John J. Guttag
Professor, Electrical Engineering and Computer Science
Thesis Supervisor
Accepted by . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Leslie. A. Kolodziejski
Chair of the Committee on Graduate Students
2
Methods to Improve the Signal Quality of Corrupted
Multi-Parameter Physiological Signals
by
Gartheeban Ganeshapillai
Submitted to the Department of Electrical Engineering and Computer Science
on May 09, 2011, in partial fulllment of the
requirements for the degree of
Master of Science in Computer Science and Engineering
Abstract
A modern Intensive Care Unit (ICU) has automated analysis systems that depend
on continuous uninterrupted real-time monitoring of physiological signals such as
Electrocardiogram (ECG), Arterial Blood Pressure (ABP), and the Photo Plethys-
mogram (PPG). Unfortunately, these signals are often corrupted by noise, artifacts,
and missing data, which can result in a high incidence of false alarms.
We present a novel approach to improve the Signal Quality of a multi-parameter
physiological signal by identifying the corrupted regions in the signal, and recon-
structing them using the information available in correlated signals. The method is
specically designed to preserve the clinically most signicant aspects of the signals.
We use template matching to jointly segment the multi-parameter signal, morpho-
logical dissimilarity to estimate the quality of the signal segment, similarity search to
nd the closest match from a database of templates, and time-warping to reconstruct
the corrupted segment using the matching template.
Experiments carried out on the MIT-BIH Arrhythmia Database, a multi-parameter
ECG database with many clinically signicant arrhythmias, demonstrate the eec-
tiveness of the method. Our method improved the classication accuracy of the beat
type by more than 700% on the signal corrupted with white Gaussian noise, and
increased the similarity to the original signal, as measured by the normalized residual
distance, by more than 250%. When the method was applied to the multi-parameter
physiological signal data from Cinc Challenge 2010 database at Physionet.org, our
method improved the classication accuracy of beat type by more than 33 times on
a signal corrupted with white Gaussian noise, and increased the similarity to the
original signal by more than 280%.
Thesis Supervisor: John J. Guttag
Title: Professor, Electrical Engineering and Computer Science
3
4
Acknowledgments
This thesis would not have been possible without the help and guidance of many.
As part of my graduate school application at MIT, I had to choose a professor I
wanted to work with. John Guttag was not the advisor I chose, but the advisor I was
assigned to. It is often spur-of-the-moment decisions, sometimes made by others,
that can change our whole lives. John Guttags inspiration, guidance, and, more
than anything, empathy have so far helped me survive the graduate school. He has
been an excellent teacher, mentor, friend, and, of course, the thesis advisor. It has
been a pleasure to work with John, and I hope to continue in the coming years.
I am thankful to the valuable comments from the members of the Cardiac group,
especially that from Prof Collin Stultz, whose overt sarcasms and views kept the
humor oating on our otherwise dry technical decisions.
I am thankful to my lab mates, Ali Shoeb - the eldest, Alaa Kharbouch, Jessica
Liu, Asfandyar Qureshi, Eugene Shih, Anima Singh, Zeeshan Syed - the wisest, and
Jenna Weins. Of these individuals, I would like to extend a special thanks to Alaa
Kharbouch and Zeeshan Syed for helping me with the technical materials. I am also
in great debt to Jessica Liu for patiently running the experiments.
This thesis would not have been possible without the dedication and kindness of
Dorothy Curtis, Sheila Marian, and Janet Fischer. I owe a great debt to Dorothy
Curtis. With such exuberance, she has always been a delight to converse with.
I am thankful to many friends at and around MIT. The friends I met in the rst
year at MIT; the friends who introduced various lifestyles of Boston; the friends I
met at EECS-GSA, who made me want to attend the events at GSA, volunteer at
the events, and eventually lead and organize the events for GSA. I am also indebted
to fellow Sri Lankans in Boston, including the graduate students at CSAIL, who
regularly help resurrect the Sri Lankan in me.
Finally, I am grateful to my parents and grandparents, who have always taken
interest in my academic progress, supported me along the way. If it were not for the
love of my mother, and the support of my grand parents, I would not be here.
5
6
Contents
1 Introduction 19
1.1 Motivation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
1.2 Goal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
1.3 Proposed Solution . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
1.4 Problem Decomposition . . . . . . . . . . . . . . . . . . . . . . . . . 22
1.5 Contributions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
1.6 Organization of Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 24
2 Background 25
2.1 Redundancy and Data fusion . . . . . . . . . . . . . . . . . . . . . . 25
2.2 Filling the Gap . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
2.3 Signal Quality Estimation . . . . . . . . . . . . . . . . . . . . . . . . 28
2.3.1 ECG Signal Quality . . . . . . . . . . . . . . . . . . . . . . . 29
2.3.2 ABP Signal Quality . . . . . . . . . . . . . . . . . . . . . . . . 31
2.3.3 PPG Signal Quality . . . . . . . . . . . . . . . . . . . . . . . . 32
2.4 Distance Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
2.4.1 Euclidean Distance . . . . . . . . . . . . . . . . . . . . . . . . 33
2.4.2 Dynamic Time Warping . . . . . . . . . . . . . . . . . . . . . 33
2.4.3 Longest Common Sub Sequence . . . . . . . . . . . . . . . . . 37
2.4.4 Edit Distance on Real Sequence . . . . . . . . . . . . . . . . . 39
2.4.5 Comparison . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
2.5 Similarity Search . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
2.6 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
7
3 Temporal Segmentation of Multi-parameter Physiological Signals 43
3.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
3.2 Background . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45
3.3 Method . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45
3.3.1 Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45
3.3.2 weighted time warping . . . . . . . . . . . . . . . . . . . . . . 49
3.3.3 Path Constraint . . . . . . . . . . . . . . . . . . . . . . . . . . 49
3.3.4 Templates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51
3.3.5 Long Segments . . . . . . . . . . . . . . . . . . . . . . . . . . 53
3.4 Experimental Results . . . . . . . . . . . . . . . . . . . . . . . . . . . 55
3.4.1 Experiment 1 : Raw MIMIC Data . . . . . . . . . . . . . . . . 58
3.4.2 Experiment 2: Articially Corrupted MIMIC Data - Additive
Noise . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59
3.4.3 Experiment 3: Articially Corrupted MIMIC Data - Transient
Corruption . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61
3.5 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63
4 Quality Estimation 65
4.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65
4.2 Previous work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66
4.3 New morphological dissimilarity Metric . . . . . . . . . . . . . . . . . 67
4.3.1 Method . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68
4.3.2 Comparison . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69
4.3.3 morphological dissimilarity Metric with LCSS . . . . . . . . . 71
4.3.4 Adaptive Signal Quality Estimates . . . . . . . . . . . . . . . 73
4.4 Implementation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75
4.5 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75
5 Reconstruction of Multi-parameter Physiological Signals 77
5.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77
5.2 Related Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 78
8
5.3 Method . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 78
5.3.1 Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79
5.3.2 A segment . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 80
5.3.3 Identifying corrupted regions . . . . . . . . . . . . . . . . . . . 80
5.3.4 Feature Representation . . . . . . . . . . . . . . . . . . . . . . 82
5.3.5 Dimensionality Reduction . . . . . . . . . . . . . . . . . . . . 83
5.3.6 Reconstruction . . . . . . . . . . . . . . . . . . . . . . . . . . 86
5.4 Experimental Results . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
5.4.1 Experiment 1 : Eectiveness of Reconstruction . . . . . . . . . 89
5.4.2 Experiment 2 : Dierent SNR Levels . . . . . . . . . . . . . . 91
5.4.3 Experiment 3 : Simulated Real-World Corruptions . . . . . . 94
5.4.4 Experiment 4 : Size of the Database . . . . . . . . . . . . . . 96
5.4.5 Experiment 5 : Learning . . . . . . . . . . . . . . . . . . . . . 96
5.5 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99
6 Data Fusion 101
6.1 Data Fusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101
6.2 Global Trend . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104
6.3 Experimental Results . . . . . . . . . . . . . . . . . . . . . . . . . . . 106
6.3.1 Experiment 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . 107
6.3.2 Experiment 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . 109
6.4 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 110
7 Implementation 111
7.1 Database . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111
7.1.1 Structure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 112
7.1.2 Lookup . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 112
7.1.3 Compaction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113
7.2 Computational Complexity . . . . . . . . . . . . . . . . . . . . . . . . 114
7.3 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 115
9
8 Conclusion and Future Work 117
8.1 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 117
8.2 Contributions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 117
8.3 Future Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 119
8.4 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 120
10
List of Figures
1-1 Key components of the system. . . . . . . . . . . . . . . . . . . . . . 22
2-1 An autoencoder (b) is built using two RBMs (a) stacked together [47]. 27
2-2 Input channels U
i
are passed through adaptive lters, and summed
up. The adaptive lter coecients are estimated from the negative
feedback through a closed loop [25]. . . . . . . . . . . . . . . . . . . . 28
2-3 A comparison of alignments induced by a) Euclidean distance b) Time
warping and c) Longest common subsequence. Euclidean matching
doesnt degenerate, but incurs the highest cost (total area of dier-
ence). Time Warping achieves the minimum cost, but degenerates.
Degeneration is noticeable between time 40-60. Longest common sub-
sequence, an alternative to time warping, though not optimal, mini-
mizes the cost, and avoids the degenerative matchings [56]. . . . . . . 35
2-5 Two Sequences (A) are matched using DTW (B) and DDTW (C).
DTW tries to match the peaks with peaks and valleys with valleys
by extremely warping the X-axis. Further, the alignment is obtained
by globally minimizing the warped distance between two sequences,
which often result in degenerative matchings. For example, in (B), one
peak is ultimately matched with a valley. In (C), DDTW matches a
local maxima with a local maxima, and hence minimizes the warped
distance locally. [33] . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
11
2-6 The common subsequences for two sequences as identied by LCSS.
The gray region denes the area that falls within the thresholds
x
and
y
of the rst sequence. The similarity is maximized by nding
the longest sub-sequences from both sequences that share the common
area [56]. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
3-1 An example template. It contains clean ECG and Arterial Blood Pres-
sure (ABP) waveforms. The positions of the segment boundaries are
denoted by
1
,
2
and . The template is a little longer than two seg-
ments. It contains two full segments of length
2

1
and
2
; the
length of the template is . . . . . . . . . . . . . . . . . . . . . . . . . 46
3-2 The accumulated distance matrix between two sequences A and B
when Equation 3.4 is used to compute the accumulated distances. The
lighter the region, the higher the cost. Because of the innitely large
cost involved, the optimal path that is shown by arrows, will avoid
the white regions. Thereby, the path is constrained globally between
the two marked boundaries. Here, sequence A is 180 samples long
and sequence B is 220 samples long. Under these constraints, only a
subsequence of B that is longer than 180/E can legally match the full
sequence of A. Similarly only a subsequence of A that is longer than
220/E can legally match the full sequence of B. The expansion factor
E is 3 for Equation 3.4. . . . . . . . . . . . . . . . . . . . . . . . . . . 50
3-5 The Electrocardiogram (ECG), Arterial Blood Pressure (ABP) and
the Photo Plethysmogram (PPG) extracted from MIMIC, record 039.
Over time period A, ECG is corrupted. Over time period B, ABP is
unavailable. Over time period C, PPG is corrupted. . . . . . . . . . . 56
3-6 Record 213 contains a) ECG channel I, b) ECG channel II, c) ECG
channel V, d) ABP and e) PPG. The plethysmogram is completely
absent over the 12 hour period. A severe corruption spans across all
the channels for signicant amount of time also. . . . . . . . . . . . . 57
12
3-7 Test 2: distribution of errors on the raw data. WTW was able to
segment 4.121 10
5
segments accurately while, the QRS detector was
able to segment 4.111 10
5
segments correctly. It should be noted
that, while the QRS detector is using only one ECG channel, WTW is
making use of other channels as well. . . . . . . . . . . . . . . . . . . 59
3-8 Optional caption for list of gures . . . . . . . . . . . . . . . . . . . . 60
4-1 Record 200 from MIT-BIH Arrhythmia Database. Ideally, we expect
signal qualities at segments AF to be [C, D] > B > [A, F] > E. The
rationale is that two normal beats will be more similar to each other
compared to a PVC beat. A normal beat with noise must be more
dissimilar than a beat with PVC and a normal beat. Finally, when
there is a corruption or signal interruption, the signal quality must be
the lowest. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67
4-2 Record 201, channel MLII from MIT-BIH Arrhythmia database (a).
The original signal is added with AWGN at 0dB between 8 to 15
minute and 23 to 30 minutes. For the corrupted signal, SQI using
morphological dissimilarity (c) and ECGSQI (b) are computed. . . . . 70
4-3 An example recording of a multi-parameter physiological signal (record
039 from MIMIC) with 20dB AWGN added to ABP and PPG signal.
This record is used illustrate the application of RSQEs in Figure 4-4. 71
4-4 Relative Signal Quality Estimates (RSQE) of a multi-parameter physi-
ological signal (Figure 4-3). When all three signals are corruption-free,
RSQEs are approximately equal, and take an average of 0.33. When
20dB AWGN is added to ABP and PPG signals, it results in an RSQE
5 times higher (average RSQE for ECG is 0.8) for ECG than others
(average RSQE for ABP is 0.16). At 10dB AWGN the dierence is
signicant. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72
13
4-5 For the example in Figure 4-1, the SQE estimated using LCSS. The
SQE generated by LCSS is bounded between 0 and 1. Through LCSS
we achieve the signal quality estimates in the order [C, D] > B >
[A, E] > F, which is close to the ideal situation. . . . . . . . . . . . . 74
5-1 Optional caption for list of gures . . . . . . . . . . . . . . . . . . . . 81
5-2 An example distribution of the corruption. When one signal is cor-
rupted we try to reconstruct them. When both signals are corrupted,
the reconstruction process only ags that segment, but does not try to
reconstruct it. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82
5-3 Optional caption for list of gures . . . . . . . . . . . . . . . . . . . . 85
5-4 Record 200 from MIT-BIH Arrhythmia database. The rst channel is
corrupted with AWGN at SNR 0dB (a), and reconstructed using our
method (b). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 90
5-5 Record 123, from MIT-BIH Arrhythmia database. The original signal
is added with AWGN at SNR = 10dB, 0dB and -10dB. . . . . . . . . 93
5-6 Record 123, from MIT-BIH Arrhythmia database. The original signal
is added with the following types of corruptions at SNR 10dB : Elec-
tromagnetic Interference (EM), Muscle Artifact (MA), and Baseline
Wander (BW). The highlighted area indicates the corrupted region. . 95
5-7 Last 20%, 50%, and 80% of the rst channel of the signal are cor-
rupted with AWGN at 0dB SNR. The average disagreement () and
the residual distance (r) are computed for all 39 records. . . . . . . . 97
14
5-8 In experiment 4, we build the database using the rst 20%, 50%, and
80% of the signals. The remainder of the signals is corrupted with
AWGN at SNR 0dB. The average disagreement () and the residual
distance (r) are computed for all 39 records. In experiment 5, we build
the database using the rst 20% of the signals. On the remainder,
three randomly chosen non-overlapping regions (each 5 minute long),
are corrupted with AWGN at SNR 0dB. Hence, totally 50% of the data
is corrupted. Here, we also learn the new morphologies over the regions
free of corruption. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98
6-1 The picture outlines the reconstruction process on a multi-parameter
timeseries. The multi-parameter signal contains several correlated phys-
iological signals such as ECG, ABP, and PPG. First, we simultaneously
perform the joint segmentation and the Signal Quality estimation on
the multi-parameter signal. Then we learn the pair wise relationship
between the signal-to-be-reconstructed and another correlated signal
in the multi-parameter signal and build the database. This is repeated
for all the correlated signals. We nd the best matches separately for
each correlated signal and nally combine them. . . . . . . . . . . . . 102
7-1 The structure of the database. It contains three elds: features, Signal
A, and Signal B. The feature, itself, is a xed length vector of dimension
30, and serves as the key. . . . . . . . . . . . . . . . . . . . . . . . . . 112
15
16
List of Tables
2.1 Reported reconstruction performances of the methods submitted to
PhysioNet/CinC challenge 2010. . . . . . . . . . . . . . . . . . . . . . 28
2.2 Distance Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
3.1 The summary of the variables used in the algorithm. The rst row
contains the variable at the initialization of the algorithm. The second
row contains the list of variables used in an iteration. The last row
contains the output variable. . . . . . . . . . . . . . . . . . . . . . . . 47
3.2 Test 1: Comparison of WTW against the QRS detector on 10 randomly
selected records from the raw MIMIC data. . . . . . . . . . . . . . . . 58
3.3 Test 3, 4, and 5: Summary of the experimental results on the articially
corrupted data. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61
5.1 Set of features that are used to represent a segment. . . . . . . . . . . 84
5.2 Experiment 1 : Summary . . . . . . . . . . . . . . . . . . . . . . . . 91
5.3 Experiment 1 : Eect of reconstruction . . . . . . . . . . . . . . . . . 92
5.4 Experiment 2 : Eect of SNR levels . . . . . . . . . . . . . . . . . . . 94
5.5 Performance against dierent types of real-world corruptions . . . . . 94
6.1 Experiment 1 : Eectiveness of data fusion on dierent sets of records 108
6.2 Experiment 1 : Summary . . . . . . . . . . . . . . . . . . . . . . . . 108
6.3 Experiment 2 : Summary . . . . . . . . . . . . . . . . . . . . . . . . 109
17
18
Chapter 1
Introduction
In this thesis, we present a set of novel computational tools to improve the Signal
Quality of physiological signals. We use an unsupervised learning framework that does
real-time estimation of the correct values of corrupted signals by fusing information
from correlated signals. There are three main components of our work: segmenting a
multi-parameter quasiperiodic signal in the presence of noise and transient corruption,
developing a Signal Quality Estimate that is comparable across dierent type of
signals, and reconstructing the segments of low signal quality.
1.1 Motivation
A modern Intensive Care Unit (ICU) employs several bedside monitors to track the
state of patients. They allow continuous monitoring of a patient, and inform medical
sta of changes in the status of the patient [26]. Automated analysis systems are
typically used to analyze these signals in real-time. They operate by triggering alarms
when one or more physiological parameters fall outside the predetermined normal
region [48, 52, 40].
These systems depend on continuous uninterrupted real-time monitoring of physi-
ological signals such as electrocardiogram (ECG), arterial blood pressure (ABP), and
the photo plethysmogram (PPG) [40]. Unfortunately, these signals are often cor-
rupted by noise, artifacts, and missing data, which can result in a high incidence of
19
false alarms [1, 26, 6]. Frequent false alarms lead to the desensitization of clinical
sta to real alarms [13]. On the other hand, in the event of missed alerts, the patient
remains unattended, and the state of the patient might eventually deteriorate into
serious medical condition.
It would be advantageous if an algorithm could estimate the correct values of
the corrupted signals by fusing the information from correlated signals from multiple
monitors. In this thesis, we address the problem of identifying the corrupted regions
in a multi-parameter signal, and reconstructing them in a clinically useful way using
the information available in the correlated signals.
1.2 Goal
We consider a multi-parameter signal represented by a matrix S
nm
, where each
column represents a signal (e.g., ECG) and each row represents a point in time.
There are m synchronous single parameter signals in S. Each cell s
i,j
contains one
sample. Samples may be corrupted in an unknown fashion. Our goal is to identify
the corrupted regions, and estimate the sample values of that region.
A typical multi-parameter physiological signal consists of various ECG channels,
ABP, PPG, respiration, central venous pressure (CVP), pulmonary arterial pressure
(PAP) and intracranial pressure (ICP). We mainly consider ECG channels, ABP and
PPG, because they are commonly available. In future we hope to extend the work to
other correlated signals.
1.3 Proposed Solution
First, we identify the segment boundaries of the multi-parameter signal in the presence
of signicant amounts of transient corruption spanning multiple columns and rows of
the matrix S.
We use a template, a short multi-parameter signal, and match it with a slid-
ing window of the multi-parameter signal. The initial template is derived from an
20
archived signal, and is regularly updated to reect the time evolution of the signal.
We continuously extract a non-overlapping window from S, and identify the bound-
ary in the window by nding the prex of the window that most closely matches the
template. The matching is done using weighted time warping (WTW), which mini-
mizes the weighted morphological dissimilarity across all the parameters. The warped
distance between two single parameter signals gives the morphological dissimilarity.
The weight represents the estimated quality of the signal, which is again computed
by the morphological dissimilarity of the signal with its counterpart in the template.
The signal quality estimated from the morphological dissimilarity is also used to nd
the corrupted regions.
Second, using a database of templates, we reconstruct the corrupted regions. Here,
a template is a segment of the multi-parameter signal that was chosen from previously
seen regions that were believed to be free of signal corruption. When we come across
the segments of high signal quality, we add them to the database; thus, we learn new
morphologies.
The method is based on nding the closest match (template) to the corrupted
segment from the database, time-warping the template to t the corrupted segments
interval, and replacing the corrupted segment with the result. The closest match
is found using the DTW cost. As a preliminary step, we represent the segments
by features. This has the dual advantages of providing a level of abstraction that
preserves the clinically relevant information and speeding up the matching.
Finally, we combine the matches based on the quality of each match, and re-
construct the segment. In this process, we time-warp the matches with the target
segment, so that they will be of the same length.
The proposed framework is patient specic. Here, we learn the relationships
between the correlated signals for a patient, and reconstruct the corrupted regions
using the history of the patient and the correlated signals. However, in future work,
we would like to extend this, by sharing the databases of the same signals among the
patients. Thus, we will be able to reconstruct the beats of types, previously unseen
for a patient, but prevalent on others.
21
1.4 Problem Decomposition


"#$%&'(
)*'&+$'&,-
./0+/-'$'&,-
1/$'#2/
3/42/*/-'$'&,-
5/$2-&-0 )*'&+$'&,-
Figure 1-1: Key components of the system.
In order to modularize the framework, we structure our problem into following
subproblems.
Segmentation to limit the analysis to a segment (a beat) of the signal. The
quasiperiodic physiological signals have a repetitive morphological structure.
By restricting the focus to a segment we limit the error propagation to that
segment.
Estimating the quality of the signals using the change in signal morphology.
Feature representation of the segments to reduce the time complexity of the
comparison of two sequences and provide a level of abstraction.
Learning the relationship between the correlated signals by building a database
of templates.
22
Estimating the actual values in the corrupted region using the signals history
and the correlated signals.
These subproblems depend on each other sequentially as shown in Figure 1-1. The
modularity of the framework allows us to independently optimize the parts . Fur-
ther, modularization is important for extending the system. The only component
that depends on the signal type is the feature representation. Hence, the proposed
framework can be extended to support other set of correlated quasiperiodic signals.
1.5 Contributions
We briey discuss some of the major contributions of our work. A detailed discussion
is deferred to subsequent parts of the thesis.
We present a novel approach to jointly segment a multiparameter signal
in the presence of signicant amounts of noise and transient corruption. The
key idea is that by simultaneously considering all the signals one can segment
them more accurately than would be possible by considering each signal inde-
pendently. Further, we formulate the problem of segmentation as a dissim-
ilarity minimization problem, and exploit the repetitive morphology of the
quasiperiodic physiological signals to segment them. We use template matching
to segment the quasiperiodic units. We use a signal template that eliminates
the need of any prior knowledge of the specic properties of the signal, and the
dynamic adaptation of the template, which allows us to accommodate the time
evolution of the signal.
We propose a new method, Weighted Time Warping (WTW), to mea-
sure the similarity between two multi-parameter sequences of dierent lengths.
WTW is an extension of Dynamic Time Warping (DTW) to multiple dimen-
sions with varying inuence along each dimension. The extent of the inuence
exerted by each dimension is determined by the quality of the signal in that
dimension.
23
We propose a new method, Morphological dissimilarity (MD) to estimate
the signal quality at the granularity of a beat. MD is comparable across dierent
types of signals, and is therefore, useful to obtain the relative signal qualities
across multiple signals in a multi-parameter signal.
We propose a new method to reconstruct the corrupted segments of a
signal, by matching the corresponding segment from the correlated signal with
a set of templates in a database.
We propose a new method for similarity search on a variable length phys-
iological time series data by using features. This serves dual purposes. First,
it provides a level of abstraction and thus generalizes. Second, it reduces the
computational complexity of the database lookups for a variable length signal.
1.6 Organization of Thesis
The remainder of this thesis is organized as follows. Chapter 2 presents the previous
work and background. Chapter 3 introduces the problem of segmenting a quasiperi-
odic multi-parameter signal in the presence of corruption. Chapter 4 presents a
method to estimate the signal qualities of dierent types of quasiperiodic physiologi-
cal signals. Chapter 5 describes the methods to identify the corrupted segments and
reconstruct them. In Chapter 6, we discuss the fusion of multiple signals and the
regression on the global trends. Chapter 7 details the implementation of the system.
Chapter 8 concludes with a summary and discussion of future work.
24
Chapter 2
Background
In this chapter, we review the related clinical applications, and provide the technical
background for our work. In Section 2.1, we start with a short discussion on the at-
tempts to counteract noise and transient corruption on physiological signals by fusing
data from multiple sources. In Section 2.2, we discuss the related work, especially
the methods submitted to Mind the Gap contest. We devote the remainder of the
chapter for the discussion on the technical material related to our work. In Section
2.3, we present the existing methods for signal quality estimation. In Section 2.4, we
discuss the characteristics of dierent distance functions. In Section 2.5, we look at
dierent methods for ecient lookups on multi-dimensional data.
2.1 Redundancy and Data fusion
Corruptions by noise, artifact, and missing data in physiological signals lead to serious
errors in automated medical systems and early warning systems. Researchers have
proposed several methods to counteract this. A survey of strategies that address
this problem is provided in [38]. Recent attempts to mitigate this problem focus on
using redundant measurements, and fusing data from multiple sensors [1, 13, 38, 12].
Typically, they employ independent methods on dierent channels and combine the
results only at the nal stage.
Recently there have been attempts to robustly estimate heart rate (HR) by fusing
25
information from multiple signals [38]. In addition to various ECG channels, re-
searchers also make use of Arterial Blood Pressure (ABP) and Photo Plethysmogram
(PPG) signals for HR estimation [12]. In this context, segmentation methods for
ABP and PPG by detecting the parts of the waveform corresponding to the onset of
a pulse have been proposed [1, 13]. These methods independently segment the phys-
iological signals. Hence, they do not preserve the alignment between the segments
across dierent signals.
2.2 Filling the Gap
The problem of reconstructing a corrupted multi-parameter physiological signal was
formally posed in the 11
th
annual PhysioNet/CinC challenge - Mind the Gap. Contes-
tants were asked to reconstruct, using any combination of available prior and concur-
rent information, 30-second segments of ECG, continuous blood pressure waveforms,
respiration, and other signals that had been removed from recordings of patients in
intensive care units. The data collected from MIMIC II project was used for the
contest. The submissions were rated by two criteria Q
1
and Q
2
.
V
res
(t) = V
rec
(t) V
ref
(t), where t = t
0
, t
0
+ t, ..., (n 1)t. (2.1)
E
ref
=
n

i=0
V
ref
(t
0
+it)
2

1
n
(
n

i=0
V
ref
(t
0
+it))
2
(2.2)
Q
1
= max(1
n

i=0
V
res
(t
0
+it)
2
/E
ref
, 0) (2.3)
Q
2
= cov(V
ref
(t), V
rec
(t))/
V
ref

Vrec
(2.4)
Here, the target signal V
ref
(t), is subtracted from the reconstruction, V
rec
(t) to obtain
the residual signal, V
res
(t). E
ref
is the energy (sample variance) of the target signal
[41].
In the decreasing order of the scores earned, the approaches of some of the more
successful contestants are discussed below.
The highest scored method [47] uses a multilayer perceptron neural network
26
Where is a small constant and < . > represents the aver-
age value on the data.
In a second step of the RBM learning procedure, to ac-
celerate reduction of reconstruction error, we use Mean
Field (deterministic) learning [11]. The difference to CD-
learning is: instead of generating the new values, s
i
, from
the sampled 0-1 values, h
i
, of the hidden units, we gene-
rate them from the probabilities, (b
j
+

i
v
i
w
ij
). The
updating rule for the weights remains the same.
2.6.3. Stacking two RBMs to build an au-
toencoder
To initialize the weights and bias of a signals autoen-
coder, the rst RBM we build has the signal as its visible
layer; the values of the hidden layer from the rst RBM
will be used as the visible layer of the second RBM. There-
fore, the rst RBM has linear visible units, and, the second
has logistic visible units.
Afterwords, the bidirectional weights of the rst and
second RBM are used for the directional weights of the
autoencoder. In a general RBM, we will note W and W
T
,
respectively, the matrices of the weights from the visible to
the hidden layer, and, from the hidden to the visible layer.
In this way, we get the matrices W
1
, W
T
1
, W
2
and W
T
2
.
a) b)
W
1
W
2
input
W
1
W
2
W
T
2
W
T
1
output

r
s
t
R
B
M
s
e
c
o
n
d
R
B
M
Figure 3. a) the stack of RBMS and b) the autoencoder we
build from it
With these weights, we build an MLP whose input layer
is the visible layer of the rst RBM, the rst hidden layer
is the hidden layer from the rst RBM, the second hidden
layer is the hidden layer from the second RBM, the third
hidden layer is again the hidden layer from the rst RBM
and the output layer is the visible layer from the rst RBM.
The bias from the RBMs units are used in the obvious way
in these MLP. This is the way we initialize the autoencoder.
To improve accuracy on reconstructing the input, we up-
date the weights using standard backprop algorithm.
2.6.4. Building the nal four hidden layer
MLP from two autoencoders
To get our nal four layer MLP, we connect, the encoder
from the input signals, with the decoder from the target
signal, using a simple perceptron. This perceptron receives
as input, the code from the input signals, and should output
the code from the target signal.
input signals encoder
input signal
input signal
code
target code
perceptron
target signal
target decoder
Figure 4. Initializing the nal 4-hidden layer MLP from
the input signals and target autoencoders
The perceptron is initialize with random weights, and
after, we update the weights according to the rule:
w
ij
=< input
i
output
j
> < input
i

output
j
>
where input
i
and output
j
are data values and

output
j
is the 0-1 output produced by the perceptron with the
present weights, sampling from the Bernoulli distribution
with p = (b
j
+

i
s
i
w
ij
).
In a second step we use a similar learning rule, but set

output
j
= (b
j
+

i
s
i
w
ij
).
After learning the perceptron weights, we improve the
weights between the rst, second and third layers, of our
nal MLP. For that, we use backprop algorithm on a new
MLP, with one hidden layer, builded from those layers and
the weights between them.
To nish training, we use backprop on the nal four
layer MLP.
2.7. Using the patches to reconstruct the
missing signal
To get the 30-second missing signal, we averaged, for
each data point, the contribution of all the reconstructed
patches that contain the point.
Figure 2-1: An autoencoder (b) is built using two RBMs (a) stacked together [47].
(MLP) that outputs the signal to be reconstructed, the target signal. The MLP maps
all the channels present to the channel to be reconstructed. The neural network has
four hidden layers, and uses the ideas from [30] to train the network. An auto-encoder
is used for MLP whose output layer reproduces the input (Figure 2-1). The middle
layer acts as a code to represent the input. The layers until the middle layer dene
an encoder, and the layers after the middle layer, inclusive, form a decoder. Two
Restricted Boltzmann Machines (RBM) are stacked to built the auto-encoder.
Silva [49] produces M reconstruction signals from each of the M channels using
a bank of GALL lters [17]. The signals are then linearly combined using a Kalman
lter. The weights of the Kalman lter are learned in the training phase, and they
remain constant over the remainder of the signal.
Similarly, another method [25] that uses a bank of adaptive lter, maps each of the
M channels to the channel that needs to be reconstructed, and averages the output
(Figure 2-2).
Li et al. [39] proposes to reconstruct the corrupted signals by matching patterns.
The paper uses a combination of correlation coecient and Euclidean distance to nd
the match, a template from the signals history, and replaces the corrupted region
with the template. Since the region of reconstruction is likely to be of dierent length
compared to the length of the template, the authors ll the gaps with 0s, or trim the
27
2.1. The model
Our approach to identication was a gray-box tech-
nique. The model structure was derived from the principle
that since the signals originate from the same physiolog-
ical system, there has to be a strong interconnection be-
tween them. The connections were identied in the form
of a Multi-Input/Single-Output (MISO) system. Third or-
der Innite Impulse Response (IIR) lters were used on
the inputs, then the ltered signals were summarized in a
linear model (See Figure 1).
Figure 1. Model structure, where U
1
...U
n
are the input
signals and Y is the output.
2.2. The algorithm
The algorithm relies on the following assumptions:
1. There is a strong linear interconnection between the sig-
nals of the record.
2. Although there might be a time-variation in the con-
nection between physiological signals, in short segments
(<1 minute) this can be disregarded and the connection
considered as time-invariant.
3. If we manage to identify a good model on the data avail-
able preceding the gap (prior), we will be able to recon-
struct the target signal using the signals measured parallel
to the target (concurrent signals).
The parameters of the model (the actual lter coef-
cients) were estimated using the prior as learning set. For a
faster convergence, the mean was removed from the prior,
Figure 2. Block diagram of the identication method,
where U is the input prior, Y is the history of the target
signal, Y

is its estimate and E is an error for feedback to


the learning algorithm.
and later added to the reconstructed signal. A genetic algo-
rithm was used for parameter identication (see e.g. in [5])
with mean crossover strategy and mutation. The tness
function, f = SD MSE was found to maximize both
Q
1
and Q
2
scores. The block diagram of the identication
is shown in Figure 2.
Assuming that the connection is time-invariant, we were
able to predict the target by using the identied model
on the concurrent signals. In agreement with the 2nd as-
sumption, the identication of the transfer function was re-
stricted to the time interval 10 to 20 seconds preceding the
target. The reconstructions tted to different lengths were
evaluated on the 30 second prior on a survival of the ttest
basis. The algorithm was runned multiple times in order to
overcome local optima.
A sample MATLAB implementation of the algo-
rithm is freely available under the GNU Public Li-
cense version 3 (GPLv3) from the authors homepage:
http://researcherscorner.com/users/ahartmann.
3. Results
In all cases, the identied lters generated stable output.
On most of the signals we found the presence of strong
linear interconnection. A typical reconstruction and curve
of parameter learning is presented in Figure 3.
Table 1 shows the detailed results grouped by the type
of the target signals. As it can be seen, we resulted in
an overall good reconstruction:

Q
1
= 69.6591 and

Q
2
= 81.3236 out of the possible 100. The Q
2
scores
Figure 2-2: Input channels U
i
are passed through adaptive lters, and summed up.
The adaptive lter coecients are estimated from the negative feedback through a
closed loop [25].
trailing signal.
In the contest, these methods were evaluated using the residual distances on a
set of multi-parameter database containing ECG, ABP, and PPG signals. Table 2.1
summarizes the performances reported by these methods.
Table 2.1: Reported reconstruction performances of the methods submitted to Phy-
sioNet/CinC challenge 2010.
Entrant Q
1
Q
2
Neural newtork [47] 83.00 90.51
Gall Filter + Kalman Filter [49] 70.55 84.09
Adaptive Filter + Averaging [25] 69.66 81.32
2.3 Signal Quality Estimation
Our framework requires the assessment of the signal quality at multiple stages of
the system. In the joint segmentation of the multi-parameter signals (Chapter 3),
we weigh the individual signals inuence on the joint segmentation of the multi-
parameter signal by the estimate of the quality of the signal. In the reconstruction of
28
the morphological features of the physiological signals (Chapter 5), we use the signal
quality estimates to identify the presence of noise and corruption in the signal.
Researchers have investigated the signal quality assessments of many physiological
signals including ECG, ABP and PPG [11, 38, 44, 63]. These papers use the perfor-
mance of the QRS detectors that rely on distinct properties of the ECG signal, and
the spectral power characteristics of the ECG, ABP and PPG signals to estimate the
signal quality. More recently, researchers have started to look into the morphological
dissimilarity to identify the presence of the artifacts [22, 57].
2.3.1 ECG Signal Quality
Signal quality assessment for a single channel ECG is presented in [57]. It estimates
the signal quality by computing the area dierences (mismatch) between successive
QRS complexes. A cumulative histogram of mismatches is then generated, and the
signal quality is determined based on how fast the cumulative histogram curves rise.
Signal quality assessment for a multi-lead ECG is presented in [38]. It combines
the following four distinct methods to obtain an ECG signal quality metric.
bSQI
This metric is based on the hypothesis that the dierent QRS detection algorithms are
sensitive to dierent types of noise. For a window of length w seconds, it derives the
quality estimates from the ratio of the beats detected synchronously by two dierent
algorithms to all the detected beats [44].
bSQI(k) = N
matched
(k, w)/N
all
(k, w) (2.5)
The window length w is set to 10 seconds in this and the following equations [38].
N
matched
is the numbers of beats both methods agreed upon and N
all
is the number
of all beats detected by either method. The method uses a digital ltering and
integration (DF) based QRS detector [24], and a length transform (LT) based QRS
detector [62]. Here, LT is more sensitive to noise than DF.
29
iSQI
This measures the inter-channel signal quality using the ratio of the number of
matched beats to the number of all detected beats between a given lead i and all
other synchronous ECG leads. While other constraints remain the same as for bSQI,
iSQI uses only DF.
iSQI(k)
i
= max{N
matched
(k, w)
i,j
/N
all
(k, w)
i,j
}, j, j = i (2.6)
kSQI
This metric is based on the kurtosis of the ECG signal. It works on the premise
that the samples from random noise is uncorrelated and tends to have a Gaussian
distribution, and the samples from a signal such as ECG are correlated and tend
to have non-Gaussian distributions. Kurtosis measures the related peakedness of a
distribution with respect to a Gaussian distribution.
kSQI(k)
i
=
_

_
1 if kurtosis(k, w) > 5
0 if kurtosis(k, w) 5
(2.7)
sSQI
sSQI uses Power Spectral Density (PSD) distribution to determine the quality of the
signals. For ECG, QRS energy is centered around 10 Hz. sSQI compares the PSD
in this region to the whole signal. If P gives the signal power, Spectral Distribution
Ratio (SDR), and sSQI (from SDR) are calculated by,
SDR(k) =
_
f=14
f=5
P(k, w)
__
f=50
f=5
P(k, w) (2.8)
sSQI(k) =
_

_
1 if SDR(k) > 0.5 and SDR(k) 0.8
0 if SDR(k) < 0.5 or SDR(k) > 0.8
(2.9)
30
ECGSQI
The combined signal quality index (ECGSQI) is then given by
ECGSQI(k) =
_

_
max(bSQI(k), iSQI(k)) if kSQI(k) = 1 & sSQI(k) = 1
max(bSQI(k), iSQI(k)) if kSQI(k) = 0 & sSQI(k) = 1
bSQI(k) if kSQI(k) = 1 & sSQI(k) = 0
bSQI(k) if kSQI(k) = 0 & sSQI(k) = 0
(2.10)
where = 0.7. Each metric contributes to a lowered overall ECGSQI during the
noisy period.
2.3.2 ABP Signal Quality
Li et al. [38] combines wSQI and jSQI to obtain the arterial blood pressure signal
quality (ABPSQI). The wSQI algorithm uses ABP pulse detection and fuzzy reasoning
to estimate the signal quality [63]. The wSQI takes a continuous value between zero
and one (poor to excellent quality) for each beat. The jSQI algorithm uses the same
ABP pulse detection routine, and series of features such as ABP amplitudes, slopes,
and beat-to-beat variations in each pulse to generate the abnormality index, jSQI. It
takes 0 for normal beats, and 1 for physiologically abnormal beats or noise artifacts
[51].
ABSQI is given by,
ABPSQI(k) =
_

_
wSQI(k) if jSQI(k) = 0
wSQI(k) if jSQI(k) = 1.
(2.11)
where = 0.7.
31
2.3.3 PPG Signal Quality
Gil et al. [18] identies the periods of major artifact in photo plethysmogram by
thresholding the Hjorth parameters of the waveform. The Hjorth parameters repre-
sent a time signal by its activity (H
0
), mobility (H
1
), and complexity (H
2
) [31]. To
obtain the corruption free PPG signal, Deshmane [13] proposes to threshold H
1
and
H
2
values calculated on a 5 s window of the PPG signal, between

H
1
1 and

H
1
+1.4,
and below

H
2
+3 respectively. Here

H
1
and

H
2
are the medians of the H
1
and H
2
for
the whole signal. A detailed discussion on the computation of the Hjorth parameters
and PPG signal quality estimation are given in [13].
2.4 Distance Functions
The critical component of time series analysis is the choice of the distance function.
A distance function is often used to capture the notion of similarity between two
sequences in these analyses [42].
We use distance functions extensively in our methods. For instance, in Chapter
3, we use template matching for the segmentation, where we minimize the distance
between the template and a window. In Chapter 4, we use distance function to
estimate the morphological dissimilarity, and thus quantify the signal quality. In
Chapter 5, we obtain the template closest to the current segment from the database
using a distance function.
A diverse set of measures exists for estimating the distance between two sequences.
They can be divided into several categories. Elasticity is a criterion that can be used
to categorize these measures. Euclidean distance is a non-elastic distance measure,
where as, Dynamic Time Warping (DTW), Longest Common Subsequence (LCSS),
and Edit Distance on Real Sequence (EDR) are elastic measures. Elastic measures are
useful to compare two sequences of dierent lengths by time-warping the sequences.
The resulting extent of warping is usually dependent on the sequence, and is often
exacerbated by the presence of noise. Another categorization is based on the objective
functions of the measures. Euclidean distance and DTW are based on L
2
norm. LCSS
32
and EDR compute a similarity score based on a matching threshold . All of the
elastic distance measures rely on dynamic programming for their ecient evaluation.
Nevertheless, typical time-space complexity of these methods is quadratic on the
lengths of the sequences. To address these problems, a number of techniques have been
developed that impose restrictions on the warping length of the dynamic programming
evaluation [33, 43, 42, 56].
2.4.1 Euclidean Distance
Euclidean distance is the most basic distance measure. It sums the pair-wise dis-
tances between the points in two time series. Given two time series, A and B of
length n, Euclidean distance is given by
_

n
i=1
(A
i
B
i
)
2
. It is extended to higher
dimensions by taking the L
p
norm of the pair wise distances as in
p
_

n
i=1
|A
i
B
i
|
p
.
For example, for two dimensional sequences A
n2
and B
n2
, under L
2
norm,
Euclidean distance is given by
_

n
i=1
(A
1,i
B
1,i
)
2
+ (A
2,i
B
2,i
)
2
. This measure
can be used to compare only two sequences of equal length. More sophisticated tech-
niques that employ time-warping are therefore needed to compare two sequences of
dierent lengths [14].
2.4.2 Dynamic Time Warping
Given two time series, A and B of length n and m, DTW nds the optimal alignment
between A and B. It achieves this by minimizing the warped distance between A and
B.
The element wise distance between two points from A and B is given by c
(x,y)
=
(A
x
B
y
)
2
. We construct the pairwise distance matrix by applying this process be-
tween all pairs of elements between A and B. The optimal alignment W = w
1
w
2
..w
k
is found by choosing the pairs incrementally from (1, 1) to (n, m) such that the cu-
mulative distance of the chosen pairs is minimized. Here, k is the alignment length.
Dynamic programming is used to compute the cumulative distance eciently and
the resulting matrix is called the accumulated distance matrix. An element d
x,y
in the
33
accumulated distance matrix gives the minimum distance between two sub-sequences
A
x
= a
1
a
2
..a
x
and B
y
= b
1
b
2
..b
y
. Further, d
m,n
gives the minimal distance between
the complete sequences A and B. Dynamic programming reduces the time complexity
from exponential to O(nm). Dynamic programming employs the following recursion.
d
x,y
= c
x,y
+min{d
x1,y1
, d
x,y1
, d
x1,y
}. (2.12)
The following three fundamental conditions constrain the computation of DTW.
Continuity: In DTW, we ensure that no element from either sequence is
skipped in the alignment. Hence, two consecutive points w
t
= (x, y) and
w
t+1
= (x

, y

) in an alignment can only vary by at most one element in each


sequence, i.e., (x x

) = 0 or 1, and (y y

) = 0 or 1.
Monotonicity: In DTW, we incrementally choose the pairs for the alignment.
For instance, after aligning a
x
with b
y
, we can only match a
x
for x

x with
b
y
for y

y.
Boundary conditions: In DTW, we ensure that w
1
= (1, 1) and w
k
= (n, m).
The alignment should start at the rst elements of the sequences and end at
the last elements of the sequences.
Applications
Researchers are increasingly using Dynamic Time Warping (DTW) for temporal seg-
mentation problems [35, 61]. DTW has also been used in locating motion clips [20]
and temporal segmentation of human motions [61]. Vlachos et al. [56] provides an
overview on the implementation of multidimensional Dynamic Time Warping (DTW)
over the L
p
norm. Based on this work, we propose weighted time warping (WTW),
a novel method that uses time warping to perform segmentation in Section 3.3.2.
34
Limitations
In its general form, DTW is extremely exible in matching two warped sequences.
Hypothetically, in an extreme case, Equation 2.12 allows all but the last point of
sequence A to be matched with the rst point of B, while the last point of A is
matched with all but the rst point of B. This exibility is generally benecial.
However, under noise, it can result in degenerative matchings (Figure 2-3). This has
prompted researchers to look for additional constraints on or alternatives to DTW
[32].
0 20 40 60 80 100 120
Euclidean Matching
0 20 40 60 80 100 120
Time Warping
0 20 40 60 80 100 120
Longest Common Subsequence
Figure 1: A lucid example about the quality matching of the LCSS compared to other distance functions.
The Euclidean distance performs an inexible matching, while the DTW gives many superuous and spurious
matchings, in the presence of noise.
our model, our algorithms are automatically challenged with
quadratic execution time. Moreover, these exible functions
are typically non-metric, which makes dicult the design of
indexing structures. To speed up the execution of a similar-
ity function, one can devise a low cost, upper bounding func-
tion (since the LCSS model captures the similarity, which
is inversely analogous to the distance). We utilize a fast
preltering scheme that will return upper bound estimates
for the LCSS similarity between the query and the indexed
trajectories. In addition to providing similarity measures
that guarantee no false dismissals, we also propose approxi-
mate similarity estimates that signicantly reduce the index
response time. Finally, we show that the same index can
support other distance measures as well.
Our technique works by splitting the trajectories in multi-
dimensional MBRs and storing them in an R-tree. For a
given query, we construct a Minimum Bounding Envelope
(MBE) that covers all the possible matching areas of the
query under warping conditions. This MBE is decomposed
into MBRs and then probed in the R-tree index. Using the
index we can discover which trajectories could potentially
be similar to the query. The index size is compact and its
construction time scales well with the trajectory length and
the database size, therefore our method can be utilized for
massive datamining tasks.
The main contributions of the paper are:
We present the rst external memory index for mul-
tidimensional trajectories, that supports multiple distance
functions (such as LCSS, DTW and Euclidean), without the
need to rebuild the index.
We give ecient techniques for upper(lower) bounding
and for approximating the LCSS(DTW) for a set of trajec-
tories. We incorporate these techniques in the design of an
ecient indexing structure for the LCSS and the DTW.
We provide a exible method that allows the user to
specify queries of variable warping length, and the technique
can be tuned to optimize the retrieval time or the accuracy
of the solution.
2. RELATED WORK
There has been a wealth of papers that use an Lp dis-
tance family function to perform similarity matching for
1D time-series. Work on multidimensional sequences can
be found in [14, 9]. However, they support only Euclidean
distance, which, as mentioned in the introduction, cannot
capture exible similarities.
Although the vast majority of database/data mining re-
search on time series data mining has focused on Euclidean
distance, virtually all real world systems that use time series
matching as a subroutine, use a similarity measure which al-
lows warping. In retrospect, this is not very surprising, since
most real world processes, particularly biological processes,
can evolve at varying rates. For example, in bioinformat-
ics, it is well understood that functionally related genes will
express themselves in similar ways, but possibly at dierent
rates. Because of this, DTW is used for gene expression data
mining [1, 3]. Dynamic Time Warping is a ubiquitous tool
in the biometric/surveillance community. It has been used
for tracking time series extracted from video [7], classifying
handwritten text [16] and even ngerprint indexing [13].
While the above examples testify to the utility of a time
warped distance measure, they all echo the same complaint;
DTW has serious scalability issues. Work that attempted to
mitigate the large computational cost has appeared in [12]
and [21], where the authors use lower bounding measures to
speed up the execution of DTW. However, the lower bounds
can be loose approximations of the original distance, when
the data are normalized. In [15] a dierent approach is used
for indexing Time Warping, by using sux trees. Nonethe-
less, the index requires excessive disk space (about 10 times
the size of the original data).
The exibility provided by DTW is very important, how-
ever its eciency deteriorates for noisy data, since by match-
ing all the points, it also matches the outliers distorting the
true distance between the sequences. An alternative ap-
proach is the use of Longest Common Subsequence (LCSS),
which is a variation of the edit distance. The basic idea is
to match two sequences by allowing them to stretch, with-
out rearranging the order of the elements but allowing some
elements to be unmatched. Using the LCSS of two se-
quences, one can dene the distance using the length of this
subsequence [6]. In [20] an internal memory index for the
LCSS has been proposed. It also demonstrated that while
the LCSS presents similar advantages to DTW, it does not
share its volatile performance in the presence of outliers.
Closest in spirit to our approach, is the work of [10] which,
however, only addresses 1D time-series. The author uses
constrained DTW as the distance function, and surrounds
the possible matching regions by a modied version of a
Piecewise Approximation, which is later stored as equi-length
MBRs in an R-tree. However, by using DTW, such an
approach is susceptible to high bias of outliers. Also, the
217
Figure 2-3: A comparison of alignments induced by a) Euclidean distance b) Time
warping and c) Longest common subsequence. Euclidean matching doesnt degen-
erate, but incurs the highest cost (total area of dierence). Time Warping achieves
the minimum cost, but degenerates. Degeneration is noticeable between time 40-60.
Longest common subsequence, an alternative to time warping, though not optimal,
minimizes the cost, and avoids the degenerative matchings [56].
Alternatives such as LCSS and MLM have been proposed to address this issue at
the expense of the exibility oered by DTW [37, 56]. For acoustic signal processing,
Myers et al. [43] proposes a set of local continuity and global path constraints (Figure
2-4) . These constraints help avoid degenerative matching while retaining the useful
properties of the DTW.
Derivative Dynamic Time Warping
DTW nds the alignment between two sequences by minimizing the global time-
warped distances between two sequences. Therefore, it fails to match the local vari-
35
Exact indexing of dynamic time warping 367
son is to prevent pathological warpings, where a relatively small section of one se-
quence maps onto a relatively large section of another. The importance of global
constraints was documented by the originators of the DTW algorithm, who were
exclusively interested in aligning speech patterns (Sakoe and Chiba 1978). However,
it has been empirically conrmed in other settings, including nance, medicine, bio-
metrics, chemistry, astronomy, robotics, and industry.
3.3.2. Local constraints on time warping
In addition to the global constraints listed above, there has been active research on
local constraints (Itakura 1975; Myers et al. 1980; Rabiner and Juang 1993; Sakoe
and Chiba 1978; Tappert and Das 1978) for several decades. The basic idea is to
limit the permissible warping paths, by providing local restrictions on the set of
alternative steps considered. For example, we can visualize Eq. (5) as a diagram of
admissible step patterns, as in Fig. 7a. The lines illustrate the permissible steps the
warping path may take at each stage. We could replace Eq. (5) with (i, j) = d(i, j)+
min {(i 1, j 1), (i 1, j 2), (i 2, j 1)}, which corresponds with the step
pattern shown in Fig. 7c. Using this equation, the warping path is forced to move
one diagonal step for each step parallel to an axis. The effective intensity of the
slope constraint can be measured by P = n/m. Figures 7.a to 7.d illustrate the four
original constraints suggested by Sakoe and Chiba (1978); in addition, many others
have been suggested, including asymmetric ones. The constraint might be designed
based on domain knowledge, or from experience learned through trial and error.
Rabiner and Juangs classic paper contains an extensive review (Rabiner and Juang
1993). The important implication of local constraints for our work is the fact that
they can be reinterpreted as global constraints. Figure 7.e shows an example.
Fig. 7. a to d Four local constraints on dynamic time warping, as suggested by Sakoe and Chiba. a) cor-
responds to the trivial case of no constraint, and is therefore equivalent to Eq. (5), (i, j) = d(i, j) +
min {(i1, j 1), (i1, j), (i, j 1)}. In contrast, c) corresponds to (i, j) = d(i, j)+min {(i1, j 1),
(i 1, j 2), (i 2, j 1)}. Local constraints can be reinterpreted as global constraints, as an example,
d) can be reinterpreted as the global constraint shown in e), which looks supercially like the Itakura paral-
lelogram constraint
To reinterpret a local constraint as a global constraint, we can do the following.
Create a shadow matrix, which is the same size as the DTW matrix. Initialize all
the elements of the shadow matrix as unreachable. Call the DTW function, using the
relevant constraint; every time the recurrence visits a new cell (i, j) in the original
Figure 2-4: Four dierent local constraints corresponding to the following recursions
a) d
(x,y)
= c
(x,y)
+min{d
(x1,y1)
, d
(x,y1)
, d
(x1,y)
}
b) d
(x,y)
= c
(x,y)
+ min{d
(x1,y1)
, d
(x1,y2)
+ c
(x,y1)
, d
(x2,y1)
+ c
(x1,y)
, d
(x1,y3)
+
c
(x,y1)
+c
(x,y2)
, d
(x1,y3)
+c
(x,y1)
+c
(x,y2)
}
c) d
(x,y)
= c
(x,y)
+min{d
(x1,y1)
, d
(x1,y2)
+c
(x,y1)
, d
(x2,y1)
+c
(x1,y)
}
d) d
(x,y)
= c
(x,y)
+min{d
(x1,y1)
, d
(x2,y3)
+c
(x,y1)
+c
(x1,y2)
, d
(x3,y2)
+c
(x1,y)
+
c
(x2,y1)
} and
e) the resulting global path constraint resulting by the local continuity constraint
given in d). [32]
ations. Derivative Dynamic Time Warping (DDTW) [33] addresses this particular
shortcoming of the DTW in computing the warped distance.
The reasoning behind this is that, if two sequences have variabilities in Y-axis,
the DTW is likely to produce less desirable alignments. If those variabilities are
global dierences, such as osets, scalings or linear trends, they can be removed in
preprocessing. However, if the dierences are local it fails to produce the correct
alignment. For instance, when one local feature (e.g. valley) is more prominent in
one sequence than in the other sequence, this would result in an extreme warping to
minimize the global distances between two sequences that DTW is trying to optimize.
DTW only considers the data-points in Y-axis. In contrast, DDTW considers the
rst derivative, which include the changes (variability) in the Y-axis. For instance,
in Figure 2-5, DTW is unable to align the local maxima, where as DDTW provides
the desired alignment [33].
In DDTW, we obtain the derivatives of the sequences, and compute the warped
36
Figure 2-5: Two Sequences (A) are matched using DTW (B) and DDTW (C). DTW
tries to match the peaks with peaks and valleys with valleys by extremely warping the
X-axis. Further, the alignment is obtained by globally minimizing the warped distance
between two sequences, which often result in degenerative matchings. For example,
in (B), one peak is ultimately matched with a valley. In (C), DDTW matches a local
maxima with a local maxima, and hence minimizes the warped distance locally. [33]
distance on the derivatives.
A

t
=
(A
t
A
t1
) + (A
t+1
A
t1
)/2
2
1 < t < n (2.13)
B

t
=
(B
t
B
t1
) + (B
t+1
B
t1
)/2
2
1 < t < m (2.14)
c
i,j
= (A

i
B

j
)
2
(2.15)
d =
_

_
c
1,1
c
1,2
.. c
1,m
c
2,1
c
2,2
.. c
2,m|
... .. .. ..
c
n,1
.. .. c
m,n
_

_
(2.16)
Here A
n
and B
m
are two sequences, and we want to compute the alignment
between them; A

and B

are the derivatives of the original sequences.


2.4.3 Longest Common Sub Sequence
LCSS is a variation of the edit distance that is extended to real sequences. It allows
two sequences to be stretched for matching without rearranging the order of the
elements by allowing some elements to be unmatched. While DTW requires all the
elements, including the outliers, to be matched, LCSS allows few elements to be
37
unmatched. This gives the method the resilience against the noise [56].
2) The LCSS model allows a more efcient approximate
computation, as will be shown later (whereas in DTW you
need to compute some costly Norm).
In gure 2 we can see the clustering produced by the
distance. The sequences represent data collected
through a video tracking process. Originally they represent
2d series, but only one dimension is depicted here for clar-
ity. The fails to distinguish the two classes of words,
due to the great amount of outliers, especially in the begin-
ning and in the end of the trajectories. Using the Euclidean
distance we obtain even worse results. The produces
the most intuitive clustering as shown in the same gure.
Generally, the Euclidean distance is very sensitive to small
variations in the time axis, while the major drawback of the
is that it has to pair all elements of the sequences.
Therefore, we use the model to dene similarity
measures for trajectories. Nevertheless, a simple extension
of this model into 2 or more dimensions is not sufcient,
because (for example) this model cannot deal with paral-
lel movements. Therefore, we extend it in order to address
similar problems. So, in our similarity model we consider
a set of translations in 2 or more dimensions and we nd
the translation that yields the optimal solution to the
problem.
The rest of the paper is organized as follows. In section 2
we formalize the new similarity functions by extending the
model. Section 3 demonstrates efcient algorithms
to compute these functions and section 4 elaborates on the
indexing structure. Section 5 provides the experimental
validation of the accuracy and efciency of the proposed
approach and section 6 presents the related work. Finally,
section 7 concludes the paper.
2 Similarity Measures
In this section we dene similarity models that match the
user perception of similar trajectories. First we give some
useful denitions and then we proceed by presenting the
similarity functions based on the appropriate models. We
assume that objects are points that move on the -plane
and time is discrete.
Let and be two trajectories of moving objects with
size and respectively, where
and . For
a trajectory , let be the sequence
.
Denition 1 Given an integer and a real number
, we dene the as follows:
A B
a and and
The constant controls howfar in time we can go in order to
match a given point fromone trajectory to a point in another
trajectory. The constant is the matching threshold (see
gure 3).
The rst similarity function is based on the and
the idea is to allow time stretching. Then, objects that are
close in space at different time instants can be matched if
the time instants are also close.
Denition 2 We dene the similarity function between
two trajectories and , given and , as follows:
We use this function to dene another similarity measure
that is more suitable for trajectories. First, we consider the
set of translations. A translation simply shifts a trajectory in
space by a different constant in each dimension. Let be
the family of translations. Then a function belongs to
if .
Next, we dene a second notion of the similarity based on
the above family of functions.
0 200 400 600 800 1000 1200
100
200
300
400
500
600
700
800
900
1000
Figure 3. The notion of the matching within a
region of & for a trajectory. The points of the 2
trajectories within the gray region can be matched by
the extended function.
Figure 2-6: The common subsequences for two sequences as identied by LCSS. The
gray region denes the area that falls within the thresholds
x
and
y
of the rst
sequence. The similarity is maximized by nding the longest sub-sequences from
both sequences that share the common area [56].
Given two time series, A and B of length n and m, LCSS maximizes the similarity
between A and B by nding the longest subsequences from both sequences whose
elements fall within
x
and
y
of each other in X-axis and Y-axis respectively (Figure 2-
6). The following dynamic programming function on two sub-sequences A
x
= a
1
a
2
..a
x
and B
y
= b
1
b
2
..b
y
is used to achieve this [56].
LCSS(A
x
, B
y
) =
_

_
0 if A or B is empty
1 +LCSS(A
x1
, B
y1
) if |a
x
b
y
| <
y
and
max{LCSS(A
x1
, B
y
), |x y| <
x
LCSS(A
x
, B
y1
)} otherwise
(2.17)
Threshold
x
controls the time-warping permitted and
y
bounds the pair wise dier-
ences of two similar elements. Finally, the similarity (S) and the distance (D) between
38
two sequences A and B are obtained from LCSS by
S(A, B) =
LCSS(A, B)
min(n, m)
(2.18)
D(A, B) = 1 S(A, B) (2.19)
LCSS is extended to higher dimensions by maximizing the common volume, in-
stead of the common area [56]. For two sub-sequences A
x
= a
1
a
2
..a
x
and B
y
= b
1
b
2
..b
y
of A
nd
and B
md
, the resulting dynamic programming function is
LCSS(A
x
, B
y
) =
_

_
0 if A or B is empty
1 +LCSS(A
x1
, B
y1
) if |a
x,j
b
y,j
| <
y
j = 1, 2..d
and |x y| <
x
max{LCSS(A
x1
, B
y
),
LCSS(A
x
, B
y1
)} otherwise
(2.20)
LCSS, like other edit distance based measures, is highly resilient to noise. Further,
not only it lets us control the extent of warping by
x
, it also allows us to decide the
pair wise similarity between two elements in the sequences by
y
. Owing to these
properties, we choose LCSS for generating the morphological dissimilarity in Section
4.3, which in return estimates the signal quality.
2.4.4 Edit Distance on Real Sequence
EDR is based on Edit Distance (ED), which is used on strings to measure the simi-
larity. For two strings A and B, ED measures the number of edits (insert, delete or
replace operations) required to convert A into B. EDR extends this to real sequences,
by quantifying the match between two real values and dening the edit operations
[9].
Given two time series, A and B of length n and m, EDR maximizes the similarity
between A and B by nding the minimal number of edits required to convert A into
B. The following dynamic programming function on two sub-sequences A
x
= a
1
a
2
..a
x
39
and B
y
= b
1
b
2
..b
y
is used to achieve this [9].
EDR(A
x
, B
y
) =
_

_
x if y = 0
y if x = 0
min{EDR(A
x1
, B
y1
) +subcost otherwise
EDR(A
x1
, B
y
) + 1,
EDR(A
x
, B
y1
) + 1}
(2.21)
subcost =
_

_
0 if |a
x
b
y
| <
1 otherwise
(2.22)
Here, the costs of all edit distances are assumed to be one, and the threshold bounds
the pair wise similarity between two elements in the sequences. Similar to LCSS, EDR
can be extended to higher dimensions as well.
2.4.5 Comparison
DTW and its variant DDTW, LCSS, and EDR support elastic matching between
two sequences of dierent lengths. While providing this feature, they incur quadratic
computational complexities. There have been constraints proposed to reduce this
computational complexity. Since these constraints usually limit the time-warping
permitted, their use often results in improved performances [15].
Table 2.2: Distance Functions
Measure Time Shifting Noise X-threshold Y-threshold Computational Cost
Euclidean O(n)
DTW O(n
2
)
LCSS O(n
2
)
EDR O(n
2
)
Table 2.2 summarizes the characteristics of these distance functions [9]. DTW has
the highest classication accuracy (nds the most similar sequence), and the least
40
resilience against noise [15]. LCSS, like other edit distance based measures, oers
the best performance against the noise. EDR, while oering the noise performance
similar to LCSS, achieves the classication accuracy close to DTW [15]. We use
DTW, LCSS, and their extensions at dierent places in our work.
2.5 Similarity Search
Lookup for the closest match on a database of time series data is addressed by the
Nearest Neighbor Search (NNS) optimization problem that is expressed in the metric
space. The problem can be formulated as : given a set S of samples in a metric space
M, to the query q M, nd the closet sample in S.
The closest match can be simply found by exhaustively traversing through the
database. This naive approach has the time complexity of O(nc), where n is the
size of the database, and c is the computational complexity of the comparison of
any two entries [58]. For an elastic distant function such as DTW, LCSS, or EDR,
c is proportional to d
2
, where d is the dimensionality of the entries. There exist
methods for lookups on a database in the metric space with the guaranteed time
bounds. The two popular methods are partitioning into trees, and hashing into
buckets. Locality Sensitive Hashing (LSH), Grid, Quadtree, R-tree, PR-tree, and
kd-tree are few methods addressing this problem [10, 28, 29, 58].
2.6 Summary
In this chapter, we reviewed the related clinical applications. We discussed the exist-
ing signal quality metrics, and reviewed a set of distance functions. We also looked at
few related tools and methods. With this context in place, we now present our work
on developing novel methods to identify the corrupted regions on a multi-parameter
physiological signal. In subsequent chapters, we show that our work can improve the
accuracy of the classication tasks on these physiological signals in the presence of
corruption.
41
42
Chapter 3
Temporal Segmentation of
Multi-parameter Physiological
Signals
This chapter discusses the development of a method to segment the physiological
signals in the presence of noise and transient corruption. Specically, we detail the
solution to the rst subproblem, the segmentation of a multi-parameter physiological
signal, proposed in Section 1.4. We introduce the problem in Section 3.1 and detail the
preliminaries. In Section 3.2, we present the background. We begin Section 3.3 with
an overview of the algorithm followed by a detailed discussion on the subcomponents
of the algorithm. Section 3.4 gives the details of the experiments carried out, and the
results. Finally, we conclude this chapter with the discussion and summary.
3.1 Introduction
We address the problem of online segmentation of a quasiperiodic multi-parameter
physiological signal in the presence of noise and transient corruption.
We consider a multi-parameter signal represented by a matrix S
nm
, where each
column represents a signal (e.g., ECG) and each row represents a point in time. There
are m synchronous single parameter signals in S. Each cell s
i,j
contains one sample.
43
For simplicity, we assume that all the signals are sampled at the same rate. Because
the matrix S represents a quasi-periodic multi-parameter signal, it has a repetitive
structure that is shared by all the columns. This is common in situations, where each
signal is generated by the same underlying system. In the case of the cardiovascular
system, the periodicity of the blood pressure waveforms, and the plethysmograph
signals are related to the heart rate.
The goal is to segment S in to quasiperiodic units Y. However, in practice,
samples may be corrupted in an unknown fashion. We present a novel approach
to identifying segment boundaries in the presence of signicant amounts of transient
corruption spanning multiple columns and rows of the matrix S. The key idea is that,
by simultaneously considering all the signals one can segment them more accurately
than would be possible by considering each signal independently.
In our method, we use a template Z
m
(Figure 3-1), a short multi-parameter
signal that is regularly updated based upon recent estimates. The template length
is roughly twice the length of the estimate of current segment. The initial template
is derived from an archived signal.
The method is based on matching a sliding window W
(+e)m
of the signal to the
template. Here, e is some buer length. We continuously extract non-overlapping
windows from S and attempt to identify the boundary in the window, nding the
prex of the window that most closely matches the template. The matching is done
using a new method, weighted time warping (WTW) that minimizes the weighted
morphological dissimilarity across all the parameters. The warped distance between
two single parameter signals gives the morphological dissimilarity. The weight repre-
sents the estimated quality of the signal.
The method can accurately segment a multi-parameter signal, even when all the
individual channels are so corrupted that they cannot be individually segmented.
Further, our method accommodates the evolution of the signals by updating the
template regularly.
44
3.2 Background
The analysis of the physiological signals typically begins with the temporal segmen-
tation of these signals in order to modularize the process. The diculty of segment-
ing non-stationary signals such as EEG and acoustic signals generally depends prior
knowledge of the stationary structure of the kind of the signal. Whereas, the seg-
mentation of quasi-periodic signals such as ECG generally makes use of the prior
knowledge (e.g, QRS complex) [38].
QRS detection is widely used for the segmentation of ECG signals. Further, it
is used for Heart Rate (HR) estimation, and as an entry point for classication of
the types of cardiac cycles [12]. Kohler [34] provides an extensive survey on dierent
software QRS detection methods. In the presence of noise and transient corruption,
the prior knowledge will be invalid, and the accuracy of these methods deteriorates
signicantly. Therefore, the use of other correlated signals has been proposed [1, 12,
13, 38].
3.3 Method
First, we present the overview of the algorithm, followed by the discussion of a single
iteration of the loop. Additional details are provided later in this section. Table
3.1 lists the variables introduced in the description of the algorithm. Bold capital
letters represent a matrix S, non-bold capital letters denote a column vector A, and
lower-case letters denote a scalar w.
3.3.1 Overview
Goal : Let S
nm
be a multi-parameter timeseries consisting of set of m
number of single parameter physiological signal, and Z
m
= {Z
j

}
m
an initial
template. The goal is to segment S into a set of quasiperiodic units Y = {Y
i
} where
Y
i
def
= S
[p
i
,p
i.+1
)
, and each unit corresponds to a single heart beat. Here, S
[p
i
,p
j
)
denotes
the window in the target sequence S from time t = p
i
to t = p
j
1.
45
0
50
100
0.2
0
0.2
0.4
0.6
0.8
1
1.2
0.6
0.4
0.2
0
0.2
0.4
0.6
0.8
1

Relative Amplitude

R
e
l
a
t
i
v
e

A
m
p
l
i
t
u
d
e
ABP
ECG

2
Figure 3-1: An example template. It contains clean ECG and Arterial Blood Pressure
(ABP) waveforms. The positions of the segment boundaries are denoted by
1
,
2
and
. The template is a little longer than two segments. It contains two full segments of
length
2

1
and
2
; the length of the template is .
46
Table 3.1: The summary of the variables used in the algorithm. The rst row contains
the variable at the initialization of the algorithm. The second row contains the list
of variables used in an iteration. The last row contains the output variable.
n Length of the multi-parameter signal.
m Number of parameters in the multi-dimensional signal.
S The multi-dimensional signal that is to be segmented. It is of size n m.
It contains m single parameter signals.
Z The initial template (a short multi-parameter signal). It is of size l m. It
contains m single parameter signal templates.
p
start
An arbitrary starting point of the iteration.
p
i
The start of the segment at the i
th
iteration.
Length of the current template.
e Buer size.
W The window extracted from S. It is of size ( +e) m. It contains m single
parameter signal windows.
pD
j
The pairwise distance matrix between W
j
and Z
j
. The cell (x, y) contains
(W
x,j
Z
y,j
)
2
.
q
j
Quality of the signal.
D Distance matrix produced by WTW.
aD Accumulated distance matrix obtained by applying dynamic programming
on D.
W Prex of the window W that matches the template Z.
W.f The row in the window W that is aligned with the row in the template
Z.
2
.
p
i+1
The end of the segment at the i
th
iteration.
Y Output : segments (set of quasi-periodic units).
47
We require the template (Figure 3-1) to be comprised of at least two segments.
These segments are used to nd the quasiperiodic unit Y
i
in Y. We also assume that
we know the locations of the segment boundaries Z.
1
, Z.
2
and Z. in the template.
Here, Z.k denotes the k
th
row in Z. It is a vector of samples corresponding to time
k. The prototypical template is initially obtained from an archive.
Procedure : Before applying our method, we remove the baseline wander from
the signals. Then, we start the process at some arbitrary point in time p
start
on the
signal that is to be segmented. This need not be an actual segment boundary. We
run the algorithm starting at p
start
, continuously segment S, and add the segments
to Y. We also update Z to reect the evolution of the time series. This enables us
to accommodate gradual changes in the morphology of the signal.
An iteration : We start each iteration with the extraction of a window W =
S
[p
i
,p
i
+v)
from the time series data S at p
i
. Here, the window length is given by
v = + e, where e is the buer length. In the following discussion, we use j to
index the single parameter signals. The window is then detrended and normalized,
and the morphological quality estimates {q
j
}
m
are computed, where q
j
represents
the morphological similarity between the channel W
j
from W and the corresponding
channel from the template Z
j
. For each channel j, a pairwise Euclidean distance
matrix pD
j
is calculated between Z
j
and W
j
(Equations 3.1-3.2). The nal distance
matrix Dis obtained by weighting the pairwise distance matrix pD
j
with q
j
(Equation
3.3).
c
x,y
= (W
x,j
Z
y,j
)
2
(3.1)
pD
j
=
_

_
c
1,1
c
1,2
.. c
1,|Z
j
|
c
2,1
c
2,2
.. c
2,|Z
j
|
... .. .. ..
c
|W
j
|,1
.. .. c
|W
j
|,|Z
j
|
_

_
(3.2)
D =

_
m

j=1
q
j
pD
j
(3.3)
48
The accumulated distance matrix aD is then computed from D using dynamic pro-
gramming. We trim W to obtain the prex W so that it contains only the portion
of the signal that matches the template. If the prex W does not contain two
segments, the buer length e is increased and the process is repeated.
We next use the accumulated distance matrix aD to nd the optimal path align-
ment between Z and W, as in DTW. From the alignment, we extract the point W.f
in W that is matched with the segment boundary Z.
2
in Z. This corresponds to
the segment boundary we are interested in, because Z.
2
marks the end of the rst
segment in the template. Then, using the corresponding length f, we update p
i+1
to
p
i
+ f. Finally S
[p
i
,p
i+1
)
is added to Y. Following the template update, the process
is repeated to nd the next segment boundary.
3.3.2 weighted time warping
DTW matches two one-dimensional sequences. We extend it to higher dimensions by
introducing a weighted norm (Equation 3.3) over the parameter signals to vary the
inuence exerted by each parameter. The morphological quality metric q
j
captures
the morphological similarity between the parameter signal W
j
and the template Z
j
.
We discuss the quality estimation in detail in Section 4.
3.3.3 Path Constraint
In a typical formulation of DTW, the distance function that is used to solve DTW
(Equation 2.12), allows any path to be taken from (A
1
, B
1
) to (A
n
, B
m
). This makes
DTW susceptible to degenerate matchings, especially in the presence of noise. For
example, a long subsequence of a signal might be matched with a signicantly shorter
subsequence of another signal.
Therefore, we use local continuity and global path constraints (proposed as Type
III and Type IV local continuity constraints in [43]) to prevent such physiologically
49
Sequence A (220 samples long)
S
e
q
u
e
n
c
e

B

(
1
8
0

s
a
m
p
l
e
s

l
o
n
g
)


20 40 60 80 100 120 140 160 180 200 220
20
40
60
80
100
120
140
160
180
2
4
6
8
10
12
14
x 10
7
220/E
180/E
Figure 3-2: The accumulated distance matrix between two sequences A and B when
Equation 3.4 is used to compute the accumulated distances. The lighter the region,
the higher the cost. Because of the innitely large cost involved, the optimal path
that is shown by arrows, will avoid the white regions. Thereby, the path is constrained
globally between the two marked boundaries. Here, sequence A is 180 samples long
and sequence B is 220 samples long. Under these constraints, only a subsequence of
B that is longer than 180/E can legally match the full sequence of A. Similarly only
a subsequence of A that is longer than 220/E can legally match the full sequence of
B. The expansion factor E is 3 for Equation 3.4.
50
implausible alignments by updating Equation 2.12 with
aD(A
i
, B
i
) = D(A
i
, B
i
) +min{D(A
i1
, B
i1
),
aD(A
i1
, B
i2
) +D(A
i
, B
i1
),
aD(A
i1
, B
i3
) +D(A
i
, B
i1
) +D(A
i
, B
i2
),
aD(A
i2
, B
i1
) +D(A
i1
, B
i
),
aD(A
i3
, B
i1
) +D(A
i1
, B
i
) +D(A
i2
, B
i
)}. (3.4)
This ensures that there are no long horizontal or vertical paths in the matrix aD
along the alignment. It also results in global path constraints, as shown in Figure
3-2, by excluding certain parts on the accumulated distance matrix in which optimal
warping paths could lie.
The shortest possible match allowed in Equation 3.4, is a third of the smallest
length of the sequence. The typical range of the human heart rate can be bounded
by three times of the average heart rate, and one third of the average heart rate.
Therefore, while avoiding physiologically implausible matching, it allows us to capture
the full valid range of human heart rate. For people with certain cardiovascular
conditions, a sudden long pause can occur; our method does not address this special
case.
3.3.4 Templates
The templates (Figure 3-1) are initially derived from an archive of the prototypical
multi-parameter signal. hey are then updated (Figure 3-3) using the recent segment
boundaries.
Template Length
When searching for the segment boundaries we only assume the approximate location
of the starting point (p
i
). This allows us to start the algorithm at an arbitrary location
(p
start
). It also makes the detection of the segment boundary p
i+1
less sensitive to
51
A B C D
(a) Record 106 from MIT-BIH Arrhythmia database (b) Region C
(c) Template over region A (d) Region A
(e) Template over region B (f) Region B
(g) Template over region D (h) Region D
Figure 3-3: The evolution of the template over dierent regions in the record 106 (a)
from MIT-BIH Arrhythmia database. In Region A (d), the template (c) reects the
morphology of the signal in that region. When the process moves to Region B (f),
the template (e) evolves to follow the morphology in the region. Particularly, S wave
in the QRS complex becomes less noticeable. The eect of the template update is
prominent in Region D (h). Further, in Region C (f), the template was not updated,
because the signal quality was deemed to be poor.
52
an inaccuracy in determining the previous segment boundary (p
i
). To accommodate
this we use templates that are more than two segments long.
Template Update
To follow the gradual changes that are common in the physiological signals, we update
the template regularly. However, we update the templates only when the variation
of the segment lengths in the neighborhood is small, and the quality estimates of all
the channels are above a threshold. We require the dierence between the maximum
segment length and the minimum segment length to be less than 25% of the mean
segment length over a moving time period of 60 seconds. We also require the min-
imum signal quality of any channel in the segment to be greater than a threshold.
Empirically we set this threshold at 0.6.
We update the template by averaging the excerpt of the last two segments with
the current template time-warped with the excerpt (Equation 3.6). The time warping
is necessary, since the current template Z and the excerpt of the last two segments
Z

can be of dierent lengths. Finally, we normalize the template by the sum of the
absolute value of the samples.
Z

= S
[(p
(i1)
),p
(i+1)
]
; 1 (3.5)
Z

= warp(Z) +Z

; 0 (3.6)
Z
j
=
Z

i
|Z
(j,i)
|
j = 1, 2..m (3.7)
Here, ensures that the template consists of at least two segments. We vary the
inuence of the recent segments on the template through the constant 0 1,
where = 0 implies a static template. We chose = 0.06, but the results are not
sensitive on the choice of .
3.3.5 Long Segments
Spurious matching occurs when the segments in the window Ware signicantly longer
than the segments in the template Z; therefore, the current buer length e is not long
53
0 50 100 150


X: 121
Y: 190.2
Samples
PPG
ABP
ECG III
(a) Template Z of 121 samples long
0 50 100 150
Samples


PPG
ABP
ECG III
(b) Window W
1
of 126 samples long
0 50 100 150 200
Samples


PPG
ABP
ECG III
(c) Window W
2
of 174 samples long
0 50 100 150
0
500
1000
1500
2000
k


d
w
d
z
(d) Cost of matching for W
1
0 50 100 150 200
0
500
1000
1500
2000
k
C
o
s
t


d
w
d
z
(e) Cost of matching for W
2
Figure 3-4: The template Z (a) is initially matched with a window W
1
(b). However,
the minimum cost to match the full template to a prex of the window (min d
z
) is
greater than the minimum cost to match the full window to the prex of the template
(min d
w
) (d). This is considered a spurious matching. Therefore, the process is
repeated with a longer window W
2
(c). For W
2
, the minimum cost to match the
full template to a prex of the window (min d
z
) is less than the minimum cost to
match the full window to the prex of the template (min d
w
) (e). Thus, the desired
matching is obtained.
54
enough to span at least two segments in the window. In such cases, the window
W = S
[p
i
,p
i
+v)
would not contain two full segments and does not give the expected
matching. In such situations, we have to repeat the matching with a longer window
(Figure 3-4).
We use the following equations on the accumulated distance matrix aD to check
for spurious matching.
d
z
= min
k
1
1
k
1
aD(|Z|, k
1
) (3.8)
z = argmin
k
1
1
k
1
aD(|Z|, k
1
) (3.9)
d
w
= min
k
2
1
k
2
aD(k
2
, |W|) (3.10)
w = argmin
k
2
1
k
2
aD(k
2
, |W|) (3.11)
Here d
z
is the minimum cost to match the full template (Z) with any prex string
(W
[0,w]
) of window W. Similarly, d
w
is the minimum cost to match the full window
with any prex string of the template. Further, k
1
and k
2
are the proxies for the
corresponding alignment lengths, |Z| is the length of the template and |W| is the
length of the window.
If d
z
< d
w
then the window is shorter than two segments. Therefore, we increase
v through e and repeat the process with a longer window. Otherwise, we trim the
window and obtain W, the portion of the window that matches the template.
3.4 Experimental Results
In this section, we present the experimental results for our method. We begin with
the description of our dataset and then present the performance of the segmentation
method.
We applied our method to multi-parameter physiological signal data (Figure 3-5) from
MIMIC at Physionet.org [21]. The database has 72 waveform records with several
annotation sets including ECG beat labels. It includes recordings from multiple
55
ECG channels, Arterial Blood Pressure (ABP) and Photo Plethysmogram (PPG).
The signals are sampled at 125 Hz. From this database, we selected all 70 records
that contained one continuous hour of at least one ECG channel, and ABP, PPG or
both signals.
1000 1200 1400 1600 1800 2000 2200
5000
4000
3000
2000
1000
0
1000
2000
3000
4000
Sample number
M
a
g
n
i
t
d
u
e
An excerpt from MIMIC dataset


ECG
ABP
Plethysmogram
B
A
C
Figure 3-5: The Electrocardiogram (ECG), Arterial Blood Pressure (ABP) and the
Photo Plethysmogram (PPG) extracted from MIMIC, record 039. Over time period
A, ECG is corrupted. Over time period B, ABP is unavailable. Over time period C,
PPG is corrupted.
We compared our method to a widely used QRS detection based segmentation
method [45, 55]. This method provides high specicity and sensitivity with low com-
putational load [34]. Further, it is publicly available in the open source software
package EPlimited
2
.
We conducted the following experiments
1. Raw Data : on the raw MIMIC data we compared our algorithm to the QRS
detector (test 1 and 2).
2
http://www.eplimited.com/
56
Figure 3-6: Record 213 contains a) ECG channel I, b) ECG channel II, c) ECG
channel V, d) ABP and e) PPG. The plethysmogram is completely absent over the
12 hour period. A severe corruption spans across all the channels for signicant
amount of time also.
2. Synthetic Data
AWGN noise : we selected clean excerpts from the raw data, and articially
corrupted them with AWGN noise (test 3 and 4).
Transient Corruption : we selected clean excerpts from the raw data, and
articially corrupted them with transient corruption (test 5).
We evaluate the performance using the number of errors and the average error in
milliseconds. The number of errors is dened as the total number of the estimated
segment boundaries that are dierent from the actual boundaries by more than one
sample. This also includes the cases where a segment boundary is completely missed
and an erroneous boundary is found. The average error is the mean distance between
an actual segment boundary and the closest segment boundary found by the method.
57
3.4.1 Experiment 1 : Raw MIMIC Data
We performed two tests on the raw data. In the rst test (test1), we randomly
selected 10 records and ran the tests for the minimum of 12 hours and the end of the
record. We compared the results of our method and the QRS detector to the segment
boundaries marked in the data.
The results are presented in Table.3.2. The results exhibit signicant inter-record
dierences. This is mainly due to the presence of sporadic corruptions in some records,
which results in a burst of errors in these records. For example, in record 213 (Figure
3-6), the corruption is severe, and it spans all the channels. It causes signicantly
large number of errors for both our method and the QRS detector. The median
number of errors for our algorithm was 65, whereas it was 330 for the QRS detector.
Table 3.2: Test 1: Comparison of WTW against the QRS detector on 10 randomly
selected records from the raw MIMIC data.
WTW (ours) QRS
Rec avg Sgmt Err avg Err Err avg Err
ID len (ms) # (ms) # (ms)
39 506.25 6 0.01 30 0.25
41 737.91 2 0.00 31 0.47
55 671.13 19 0.05 385 28.27
208 642.67 215 20.63 699 135.39
216 771.33 293 1.10 330 1.64
220 632.88 5 1.65 4 1.20
253 864.40 35 0.15 346 35.13
262 824.93 65 0.30 249 3.55
284 651.70 119 0.23 191 0.96
226 532.17 311 0.55 675 6.25
213 660.31 1921 3.19 2927 42.22
Median number of beats in one record is 56007
In the second test (test 2), we ran the test across all 70 records for an hour.
The results are presented in Figure 3-7. WTW was able to segment 4.121 10
5
segments accurately while the QRS detector was able to segment 4.11110
5
segments
correctly. Although our method made fewer mistakes compared to that made by the
58
QRS detector, the dierence in the accuracies of both the methods is not signicant
because of the sparsity of the occurrence of corruption in the raw data.
100 0 100 200 300 400 500 600 700
0
100
200
300
400
500
600
700
800
900
Error (ms)
F
r
e
q
u
e
n
c
y


QRS
WTW
Figure 3-7: Test 2: distribution of errors on the raw data. WTW was able to segment
4.12110
5
segments accurately while, the QRS detector was able to segment 4.111
10
5
segments correctly. It should be noted that, while the QRS detector is using only
one ECG channel, WTW is making use of other channels as well.
3.4.2 Experiment 2: Articially Corrupted MIMIC Data -
Additive Noise
In this experiment, we evaluated our algorithms performance on noisy data. We
obtained clean excerpts from the raw MIMIC data and corrupted it with articial
59
0 50 100 150 200
1000
0
1000
2000
PPG
0 50 100 150 200
1000
500
0
500
1000
1500
ABP
0 50 100 150 200
0
500
1000
1500
2000
ECG III
Samples
(a) AWGN on all 3 parameter signals
0 50 100 150 200
1000
0
1000
PPG
0 50 100 150 200
500
0
500
1000
ABP
0 50 100 150 200
0
500
1000
1500
Samples
ECG III
(b) AWGN on all but the last parameter signal
Figure 3-8: (a) Test 3 is carried out on the data altered with AWGN on all m
parameters. (a) In Test 4 only m1 parameters are altered with AWGN.
60
Table 3.3: Test 3, 4, and 5: Summary of the experimental results on the articially
corrupted data.
Average error (ms) on articially corrupted data
Type QRS WTW
all m any m1
AWGN 20 dB 12.87 0.87 0.0097
(SNR) 10 dB 188.11 3.27 0.0011
0 dB 303.48 5.81 0.008
Transient Corruption 387.32 - 2.89
Average segment length is 521 ms.

noise. We randomly extracted 1000 high quality 5-minute excerpts for which both
our method and the QRS detector were 100% accurate. We added AWGN noise at
dierent Signal/Noise (SNR) levels to these excerpts and tested both the methods on
them.
We calculate the power of the noise added P
n
= P
s
SNR, using the signal
variance P
s
= 20log(std(S)) instead of the absolute signal power P
s
= 10log(
P
n
i=1
S
2
i
n
)
to remove the eects of the oset.
We carried out two tests. In the rst test (test 3), we added noise to all m
channels, whereas in the second test (test 4), we added noise to only m1 channels.
The results of these tests are presented in Table 3.3. Our method was able to identify
the segment boundaries with reasonable accuracy even in the presence of signicant
additive noise; further, if any one of the channels is free of noise, the performance is
comparable to the performance on the clean raw data.
3.4.3 Experiment 3: Articially Corrupted MIMIC Data -
Transient Corruption
In this test (test 5) we evaluated our algorithms performance on the data altered
with transient corruption. Again, we randomly extracted 1000 high quality 5-minute
excerpts and articially corrupted them. On each of these excerpts, we corrupt m1
single parameter signals. For each these single parameter signals, we randomly se-
61
0 0.5 1 1.5 2 2.5 3 3.5 4
x 10
4
3000
2000
1000
0
1000
2000
PPG
0 0.5 1 1.5 2 2.5 3 3.5 4
x 10
4
1500
1000
500
0
500
1000
1500
ABP
0 0.5 1 1.5 2 2.5 3 3.5 4
x 10
4
500
0
500
1000
1500
ECG III
Samples
(a) Synthetic transient corruption on a 5 minute long excerpt
1 1.02 1.04 1.06 1.08 1.1 1.12 1.14 1.16 1.18 1.2
x 10
4
2000
1500
1000
500
0
500
1000
1500
PPG
1 1.02 1.04 1.06 1.08 1.1 1.12 1.14 1.16 1.18 1.2
x 10
4
20
0
20
40
60
80
ABP
1 1.02 1.04 1.06 1.08 1.1 1.12 1.14 1.16 1.18 1.2
x 10
4
500
0
500
1000
1500
Samples
ECG III
(b) Transient corruption over a 16s long region
Figure 3-9: An example distribution of the synthetic corruptions applied on an ex-
cerpt. A high quality 5 minute long excerpt from Record 039 is altered with transient
corruption. (a) There are m = 2 single parameter signal. Two of them are corrupted
with signal interruption, exponential damping, overshooting and clipping, superim-
position of high frequency sinusoidal signal, and superimposition of low frequency
sinusoidal signal. In this particular example, PPG signal was not corrupted. (b) The
eect of corruption is clearly noticeable in this 16s long region. ABP is corrupted
with exponential damping. ECG III is corrupted with amplication and clipping.
PPG is left out of the corruption.
62
lected 5 non-overlapping 1 minute long regions and randomly applied one of the
following types of corruptions: signal interruption, exponential damping, overshoot-
ing and clipping, and superimposition of articial low frequency signals and high
frequency signals. The ECG channels and one of ABP or PPG were corrupted.
The results are presented in Table 3.3. Under transient corruption, the average
error was 2.89 ms for WTW. This is comparable to that of the data with AWGN
at SNR 10 dB on all m parameters. Under severe transient corruption, the QRS
detector becomes totally unusable.
3.5 Summary
In this chapter, we presented a novel online method for temporal segmentation of
quasiperiodic multi-parameter physiological signals in the presence of noise and tran-
sient corruption. Our method uses weighted time warping to exploit the relationship
between the partially correlated channels and the repetitive morphology of the time
series. Our method has a greater constant overhead in computational complexity
relative to QRS detection based segmentation algorithm. For a window of length ,
our method uses dynamic time warping which runs in O(
2
) time and space. A QRS
detection based segmentation runs in O() time. In the examples used in this paper
is less than 500.
Our method is particularly useful when the system suers noise and transient
corruption. For corruptions, we tested our method on few articial corruptions; but
the real world corruptions could be dierent, and perhaps adversarial. We havent
tested our method on these specic types of corruptions. In the case of additive noise,
AWGN is the most dicult to handle because it spans the entire frequency spectrum
and has the highest uncertainty. Our method performs well against AWGN.
We chose ABP, PPG and ECG for testing, because they are commonly available
in the recordings of ICU patients. However, our method should be applicable to
any set of correlated physiological signals such as Central Venous Pressure (CVP),
Pulmonary Arterial Pressure (PAP) and respiratory signal.
63
We showed that our method performs as well as an excellent QRS detector on rel-
atively clean ECG data. On 1 hour long test data across 70 records, our method
achieved 99.56% accuracy, whereas the QRS detector achieved 99.41% accuracy.
When AWGN is synthetically added, the dierence in the performance between our
method and the QRS detector becomes signicant. Our method was able to limit
the average error to 5.81 ms when all m parameters were corrupted with AWGN
at SNR: 0dB, and to 0.008 ms when m 1 parameters were corrupted at the same
SNR. The average error for the QRS detector rose to 303.48 ms when the ECG chan-
nel is corrupted with the same AWGN. Similar results were observed on transient
corruption.
64
Chapter 4
Quality Estimation
This chapter discusses the assessment of signal quality in our framework. After giving
an introduction, we relate to the background. In Section 4.3, we present a novel signal
quality metric that can be used across dierent types of signals. Finally, in Section
4.4, we explain how our system uses the signal quality estimates.
4.1 Introduction
Our framework requires the assessment of the signal quality at multiple stages of
the system. In the joint segmentation of the multi-parameter signals (Chapter 3),
we weigh the individual signals inuence on the joint segmentation of the multi-
parameter signal by the estimate of the quality of the signal. In the reconstruction
of the morphological features of the physiological signals (Chapter 5), we use the
signal quality estimates to identify the presence of noise and corruption in the signal.
In Chapter 6, we use the signal quality of the reconstructed signals to combine the
reconstructions from multiple sources.
If we had the model of the source, the signal quality assessment would be straight-
forward, the signal to noise ratio (SNR) gives an estimate of the signal quality. SNR
is the ratio between the source signal power, and the dierence between the observed
signal power and source signal power. However, for physiological signals, the source
signal power is hard to obtain. Therefore, researchers use indirect measures to esti-
65
mate the signal quality of the physiological signals. They use various characteristics
of the physiological signals to identify the presence of the artifacts and the level of cor-
ruption. Here, the assumptions are that the characteristics of the signals are known
a priori, and the noise artifacts cause any deviation present in the signal.
The signal quality estimates are calculated for a region in the signal, ranging
from a single sample to tens of seconds long windows. Our framework operates at the
granularity of a heartbeat. Hence, we require the signal quality estimates at heartbeat
granularity. In addition, we have the following requirements.
We require the signal quality estimates to be independent of the beat type and
the location of the segment. For instance, a segment with signicant noise must
have a lower signal quality than a clean segment irrespective of the location,
signal amplitude, segment length, and the type of the beat.
We require the signal quality estimates to be comparable across dierent type
of signals, such as ECG, ABP and PPG signals. For a multi-parameter signal,
the signal quality estimate is computed separately for each signal. These signal
quality estimates must be comparable, i.e., using the estimates, we should be
able to assess the individual signals relative quality.
4.2 Previous work
Researchers have come up with various the signal quality estimates for various physio-
logical signals including ECG, ABP and PPG [11, 38, 44, 63]. ECGSQI estimates the
signal quality of a multi-channel ECG signal [38]. ABPSQI [38] combines wSQI and
jSQI to obtain the arterial blood pressure signal quality. PPG signal quality [13] es-
timation is done using Hjorth [31] parameters. These methods generate independent
quality estimations, and hence they are not comparable with each other.
66
4.3 New morphological dissimilarity Metric
We hypothesize that the corrupted regions of the signals will be morphologically dis-
similar from clean signals. We use an evolving template to represent the hypothesized
corruption-free signal (Section 3.3.4). The templates (Figure 3-1) are initially derived
from an archive of the prototypical multi-parameter signal. They are then updated
using the recent signal estimates. We then compute the morphological dissimilarity
between a window from the signal, and the template. The inverse of the morpho-
logical dissimilarity gives the quality estimate. Ideally, we expect the signal quality
estimates derived from morphological dissimilarity to provide the ordering in Figure
4-1.
A
B C
F
D E
Figure 4-1: Record 200 from MIT-BIH Arrhythmia Database. Ideally, we expect
signal qualities at segments A F to be [C, D] > B > [A, F] > E. The rationale is
that two normal beats will be more similar to each other compared to a PVC beat. A
normal beat with noise must be more dissimilar than a beat with PVC and a normal
beat. Finally, when there is a corruption or signal interruption, the signal quality
must be the lowest.
67
4.3.1 Method
Consider a multi-parameter signal S and the template Z. The signal quality of the
k
th
signal at segment p
i
is calculated from the morphological dissimilarity between
the window A = S
k
[p
i
,p
i+1
)
, and the corresponding signal in the template B = Z
k
. Here
A and B are one-dimensional signals of length n and m respectively.
c
i,j
= (A
i
B
j
)
2
(4.1)
D =
_

_
c
1,1
c
1,2
.. c
1,m
c
2,1
c
2,2
.. c
2,m
... .. .. ..
c
n,1
.. .. c
m,n
_

_
(4.2)
We obtain the morphological dissimilarity by computing the warped distance between
two sequences, for example, the window from the signal and the template. Using
dynamic programming, we compute the accumulated distance matrix aD (Equation
4.3). The morphological dissimilarity is given by md = aD(A
n
, B
m
).
aD(A
i
, B
i
) = D(A
i
, B
i
) +min{D(A
i1
, B
i1
),
aD(A
i1
, B
i2
) +D(A
i
, B
i1
),
aD(A
i1
, B
i3
) +D(A
i
, B
i1
) +D(A
i
, B
i2
),
aD(A
i2
, B
i1
) +D(A
i1
, B
i
),
aD(A
i3
, B
i1
) +D(A
i1
, B
i
) +D(A
i2
, B
i
)}. (4.3)
The inverse of md gives the signal quality estimate q (Equation 4.4).
q =
1
md
(4.4)
We prefer Derivative Dynamic Time Warping (DDTW) to DTW, for computing
the warped distance. The reasoning behind this is that if a local feature (e.g. valley)
is more prominent in one sequence than in the other sequence, when using DTW,
68
this would result in extreme warping to minimize the global distances between two
sequences. DTW only considers the data-points in Y-axis. In contrast, DDTW
considers the rst derivative, which include the changes (variability) in the Y-axis as
well.
Therefore instead of using the sample values, we use the rst derivative of them
A

t
=
(A
t
A
t1
) + (A
t+1
A
t1
)/2
2
1 < t < n (4.5)
B

t
=
(B
t
B
t1
) + (B
t+1
B
t1
)/2
2
1 < t < m (4.6)
c
i,j
= (A

i
B

j
)
2
(4.7)
(4.8)
Here A
n
and B
m
are two sequences, and we want to compute the mor-
phological dissimilarity between them; A

and B

are the derivatives of the original


sequences.
In DDTW computation, we again employ the path constraints (Section 3.3.3) to
avoid degenerate matching. Taking the derivative usually amplies the distortions
present in the signal. Therefore, DDTW helps us nd the corrupted regions better.
However, the amplication of distortions also increases the probability of the degen-
erative matching. Therefore, the path constraints are essential. The morphological
dissimilarity generated by DDTW is unbounded, and is dependent on the scaling of
the samples.
4.3.2 Comparison
morphological dissimilarity metric estimates the quality of the signal beat by beat. In
contrast, previously discussed methods estimate the quality of the signal over a region
[38]. For instance, ECGSQI is calculated on a 10 s window. The ability to obtain a
more ne-grained quality estimate helps us handle the bordering regions (start and
end of the corrupted region) better. For instance in Figure 4-2, the SQI obtained from
morphological dissimilarity is able to identify the start and the end of the corrupted
69
1 2 3 4 5 6
x 10
5
150
100
50
0
50
100
150
200
250
300
# sample
A
m
p
l
i
t
u
d
e
(a) AWGN added signal data
(b) ECGSQI
(c) SQI from morphological dissimilarity
Figure 4-2: Record 201, channel MLII from MIT-BIH Arrhythmia database (a). The
original signal is added with AWGN at 0dB between 8 to 15 minute and 23 to 30
minutes. For the corrupted signal, SQI using morphological dissimilarity (c) and
ECGSQI (b) are computed.
70
region better than ECGSQI. Further, when the R-peaks in the ECG signals are
clearly identiable, ECGSQI tends to overestimate the signal quality irrespective
of the distortion in the overall morphology of the beat; it is noticeable during last few
hundred beats of the record where both wqrs and ep limited agreed mostly.
Unlike the related methods, the morphological dissimilarity is applicable to all of,
ECG, ABP and PPG signals. Therefore, we are able to use a single metric across
dierent types of signals (Figures 4-3, 4-4). Particularly, in Section 3.3.2, we use
the relative signal quality estimates obtained from the morphological dissimilarity to
weigh the individual signals inuence on the joint segmentation of a multi-parameter
signal. We normalize the signal qualities estimated for individual signals by their sum
to obtain the relative signal quality estimates (RSQE). Other previously discussed
methods cannot be directly used for this purpose.
8350 8400 8450 8500 8550 8600 8650 8700 8750
1500
1000
500
0
500
1000
# sample
A
m
p
l
i
t
u
d
e


ABP
PPG
ECG
Figure 4-3: An example recording of a multi-parameter physiological signal (record
039 from MIMIC) with 20dB AWGN added to ABP and PPG signal. This record is
used illustrate the application of RSQEs in Figure 4-4.
4.3.3 morphological dissimilarity Metric with LCSS
When using the morphological dissimilarity obtained from the DTW distance on the
gradient of the signal samples to estimate the quality, we observe that it can accurately
estimate the relative signal quality (Figures 4-3, 4-4). However, it is unbounded and
71
0 100 200 300 400 500 600 700
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
# beat
R
e
l
a
t
i
v
e

M
o
r
p
h
o
l
o
g
i
c
a
l

D
i
s
s
i
m
i
l
a
r
i
t
y


ABP
PPG
ECG
(a) Signals are corruption-free
0 100 200 300 400 500 600 700
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
# beat
R
e
l
a
t
i
v
e

M
o
r
p
h
o
l
o
g
i
c
a
l

D
i
s
s
i
m
i
l
a
r
i
t
y


ABP
PPG
ECG
(b) ABP and PPG signals are added with 20dB AWGN
0 100 200 300 400 500 600 700
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
# beat
R
e
l
a
t
i
v
e

M
o
r
p
h
o
l
o
g
i
c
a
l

D
i
s
s
i
m
i
l
a
r
i
t
y


ABP
PPG
ECG
(c) ABP and PPG signals are added with 10dB AWGN
Figure 4-4: Relative Signal Quality Estimates (RSQE) of a multi-parameter physi-
ological signal (Figure 4-3). When all three signals are corruption-free, RSQEs are
approximately equal, and take an average of 0.33. When 20dB AWGN is added to
ABP and PPG signals, it results in an RSQE 5 times higher (average RSQE for ECG
is 0.8) for ECG than others (average RSQE for ABP is 0.16). At 10dB AWGN the
dierence is signicant.
72
dependent on the scaling of the sample values. Therefore, we cannot infer a hard
threshold to identify the regions of corruptions. For example, in Figure 4-2(c), the
morphological dissimilarity has a large variation.
Therefore, we use LCSS, instead of DTW, to estimate the morphological dissimi-
larity. We still use the gradient of the samples and take the variability in y-axis into
account.
LCSS(A

x
, B

y
) =
_

_
0 if A or B is empty
1 +LCSS(A

x1
, B

y1
) if |A

x
B

y
| <
y
and |x y| <
x
max{LCSS(A

x1
, B

y
)
LCSS(A

x
, B

y1
)} otherwise
(4.9)
q =
LCSS(A, B)
min(n, m)
(4.10)
(4.11)
Furthermore, using LCSS has several other advantages. It is bounded, and hence,
is comparable across beats at dierent locations and of dierent types. It lets us
obtain a more accurate signal quality estimate (Figure 4-5) at a ner granularity of
heartbeat.
4.3.4 Adaptive Signal Quality Estimates
To estimate the signal quality, we compare a window of the signal against the tem-
plate. This would be valid if any dierence between the template and the window
were due to the noise or corruption presents in the signal. However, the signal mor-
phology evolves along the time. For instance, heart rate may increase, the amplitude
of the ABP signal might drop, etc. We regularly update the template to accommo-
date the time evolution of the signal. The template update was discussed in Section
3.3.4.
73
D
F
B
A C
E
0.9
0.82
0.7
0.53
0.41
0.48
Figure 4-5: For the example in Figure 4-1, the SQE estimated using LCSS. The SQE
generated by LCSS is bounded between 0 and 1. Through LCSS we achieve the signal
quality estimates in the order [C, D] > B > [A, E] > F, which is close to the ideal
situation.
74
4.4 Implementation
First, in the joint segmentation of the multi-parameter physiological signals (Section
3.3.2), we require the Relative Signal Quality Estimates (RSQE) beat by beat. We
compute the morphological dissimilarity between the window and the current tem-
plate, and obtain the signal quality estimate (Equation 4.10). We normalize the signal
quality indices across all the signals by their sum. The resulting relative signal quality
estimates {q
j
}
m
, are then used in WTW (Equation 3.3).
In Chapter 5, we require the signal quality estimates to identify the corrupted
regions. We use the previously estimated signal quality estimates to identify the
corrupted regions.
4.5 Summary
We gave an overview of the signal quality indices available for ECG, ABP and PPG
signals. Then, we introduced a novel morphological dissimilarity metric to estimate
the signal quality of any quasiperiodic physiological signal. After that, we discussed
the improvements on this method. Finally, we summarized the details of the imple-
mentation of the signal quality estimation in our framework.
75
76
Chapter 5
Reconstruction of Multi-parameter
Physiological Signals
This chapter discusses the methods for reconstructing the corrupted regions in the
physiological signals. Specically, we detail the solutions to the subproblems listed
in Section 1.4, feature representation, learning, and estimation. We introduce the
problem in Section 5.1. In Section 5.2, we present the background, and discuss the
related work. We begin Section 5.3 with an overview of the algorithm followed by
a detailed discussion on the subcomponents of the algorithm. Section 5.4 gives the
details of the experiments carried out, and the results. Finally, we conclude this
chapter with the discussion and summary.
5.1 Introduction
We address the problem of identifying the corrupted regions in a multi-parameter
signal, and reconstructing them using the information available in the correlated
signals.
As in earlier chapters, we consider a multi-parameter signal represented by a
matrix S
n2
. It contains two correlated synchronized physiological signals. Each cell
s
i,j
contains one sample. Samples may be corrupted in an unknown fashion. Our
goal is to identify the corrupted regions, and estimate the actual sample values on
77
that region. In this chapter, we limit our focus to reconstructing the morphological
features of the corrupted ECG signals. We do not consider the baseline and global
trends in this chapter.
In our method, we use a database of templates. Here, a template is a segment of
the multi-parameter signal that was chosen from previously seen regions free of signal
corruption. When we come across the segments of high signal quality, we add them
to the database; thus, we learn new morphologies.
The method is based on nding the closest match (template) to the corrupted
segment from the database, time-warping the template to t the corrupted segments
interval, and replacing the corrupted segment with it. The signal and the template
contain two channels. The channel in the template corresponding to the corrupted
channel in the signal is used for replacement, and the other channel in both the
template and the signal is used for matching. The closest match is found using the
DTW cost. As a preliminary step, we represent the segments by features to reduce
the dimensions of the lookup entry, and preserve the clinically relevant information.
5.2 Related Work
There have been attempts to exploit the information available in the correlated chan-
nels of a multi-parameter physiological signal to assist automated medical systems to
produce results that are more reliable. For example, researchers have tried to fuse
information from various ECG channels, and other signals to robustly estimate the
heart rate [1, 13, 38]. Further, the problem of reconstruction was the theme of the
11
th
annual PhysioNet/CinC challenge [41].
5.3 Method
First, we present a quick overview of our method. A detailed description of the sub-
parts is provided later. Bold capital letters represent a matrix S, non-bold capital
letters denote a column vector A, and lower-case letters denote a scalar w.
78
5.3.1 Overview
Goal : Let S
n2
be a multi-parameter time series consisting of set of two single
parameter physiological signals. The goal is to identify the corrupted regions {U
i
},
where U
i
= S
[t
i
1
,t
i
2
)
, and reconstruct the corrupted samples in that region with the
estimates {V
i
} where V
i
= S
[t
i
1
,t
i
2
)
.
Procedure : First, we detrend the signal, and remove the baseline wander
using a low pass lter
1
. Then, we extract the segments using the method described
in Chapter 3. We run the algorithm starting at the rst segment U
1
, continuously
evaluating the signal qualities of the segments, and determining whether the segment
needs reconstruction. If the segment needs reconstruction, we nd the entry from
the database that closely matches the current segment, and reconstruct the current
segment using the signal segment in that entry. We also add new segments to our
database if they are of high signal quality. This process is iterated over all the
segments.
An iteration : We start each iteration with a segment U
i
= S
[t
i
1
,t
i
2
)
from S,
where the length of the segment is
i
= t
i
2
t
i
1
. Using Morphological Dissimilarity, we
determine whether the segment is corrupted, and whether it requires reconstruction.
If the signal needs reconstruction, i.e., if the Signal Quality Estimate (SQE) is
below a threshold (q
i
<
low
), we proceed with the reconstruction process. First, we
build the feature representation F
i
of the segment. The signal S is a 2-parameter
signal. Hence, U
i
contains two correlated synchronized signals, and F
i
is the joint
representation of the both. We search the database, using F
i
as the key, and nd
the top 20 matches. We nd the best match on this set using the DTW distance
between the clean channel of the segment U
i
and the corresponding channel in the
top matches. If the cost of the match is above a threshold, i.e., c
i
> , we abort the
reconstruction process on the current segment, and continue with the next segment.
Otherwise, we use the best match as a template for reconstructing the corrupted
signal. We time-warp the channel V
a
i
from the matching template V
i
, with the clean
channel U
a
i
from the segment U
i
to obtain the alignment w. Then, we replace the
1
http://www.mit.edu/ gari/CODE/FILTERS/Baseline%20Wander%20Removal.htm
79
corrupted channel U
b
i
of the current segment U
i
, with V
b
i
that is obtained from the
template V
i
by warping V
b
i
using the alignment w.
If the signal doesnt need reconstruction, i.e, if the SQE is above a threshold
(q
i
>
high
), we try to learn the morphology from that segment. First, we build
the feature representation F
i
of the segment, and add the entry (F
i
U
i
) to the
database.
5.3.2 A segment
A segment for reconstruction is the unit of the signal from the mid point of one R-R
interval to the mid point of the next R-R interval.
The template we use treats the R-R interval a segment. Since the rate of change
is the highest at R peaks, this choice allows us to identify the segment boundaries
with the highest resolution, i.e., up to single sample. Though using R-R intervals
as segments is common, it is not appropriate for this application. First, we want a
smooth transition between segments, which suggests that segment boundaries should
occur in places where the signal is changing slowly. Secondly, since the shape of the
QRS complex is important in many applications, we need to reconstruct it accurately.
This is easier if the entire complex lies within a single segment. For this reason, we
use the R peaks as the center of segments rather than as the border.
We start with the segment boundaries {p
i
} obtained using the joint segmentation
(Section 3), and construct the segments, as illustrated by Figure 5-1, so that they are
centered on R-peak p
i
.
5.3.3 Identifying corrupted regions
We reconstruct only the corrupted regions, i.e., those with a poor SQE. First, we
use the SQE obtained from Morphological Dissimilarity (Section 4) to identify the
corrupted regions. Based on the SQE values, there are three possibilities (Figure 5-
2). If neither signal appears to be corrupted, we update the templates, but make no
change to the signals. If the SQE for exactly one signal in the segment falls below a
80
0 200 400 600 800 1000 1200 1400 1600 1800
50
0
50
100
150
200
# sample
A
m
p
l
i
t
u
d
e
R
1
R
post R
2
R
pre
P
i1 P
i P
i+1 P
i2
P
i+2
R
2
/2
R
1
/2
(a) Segment extraction and common features
800 850 900 950 1000 1050
0
50
100
150
200
# sample
A
m
p
l
i
t
u
d
e
P
i
(b) One Segment
Figure 5-1: Figure (a) illustrates how a segment is extracted from the signal S using
the segment boundaries {p
i
} obtained in Section 3. While extracting the segment,
the features that depend on R-R lengths (R
pre
, R
1
, R
2
, R
post
) are also constructed.
Figure (b) denes one segment. The i
th
segment U
i
is centered around p
i
.
81


ECG MLII
ECG V
Both signals are
corrupted
ECG MLII is
corrupted
Corruptionfree
Figure 5-2: An example distribution of the corruption. When one signal is corrupted
we try to reconstruct them. When both signals are corrupted, the reconstruction
process only ags that segment, but does not try to reconstruct it.
threshold, we try to reconstruct it. Since the correlated signal is used to estimate the
sample values of the corrupted signal, when both signals are corrupted, we dont have
enough information left in the signal to reconstruct them. Therefore, if the corruption
spans both the signals, we abort the process, and continue with the next segment.
In such case, we ag that segment, so that any systems that depend on these signals
could avoid triggering false alarms.
5.3.4 Feature Representation
In reconstruction, we search the database for the closest match to the current seg-
ment. Since we want to do it in real-time with a growing database, we need a fast
method of retrieval that could scale. A method with linear time complexity (O())
for comparison between two sequences of length , and sub-linear time complexity
for lookup on database of size n is highly desirable. In addition, we also want the
method to provide a level of abstraction that increases the probability of nding a
physiologically relevant match.
Since the segments are usually of dierent lengths, a direct comparison function,
82
such as the Euclidean distance is not suitable. On the other hand, variable length
metrics such as DTW, LCSS, and Edit Distance On Real Sequence (EDR) are of
quadratic time complexity. Further, lookups using these methods for comparison
do not easily scale with large databases. We, therefore, resort to an intermediate
representation of the segments using features.
Feature representation helps us represent a segment with a xed length vector,
hence two sequences can be compared in linear time in feature space. Further, it
provides the level of abstraction desired, thus helps us generalize. The feature rep-
resentation also helps our method preserve the clinically relevant information such
as PVCs. Two beats with Premature Ventricular Contraction (PVC) could dier in
morphology, but they might still indicate the same clinical event. The set of features
we chose helps us represent this information better, and avoid over-tting. Hence,
it is used to nd the set of closest matches from the databases to an entry, loosely
serving as a hash-code.
Every segment U
i
contains two correlated synchronized signals, and F
i
is the
joint representation of the both. Table 5.1 lists the set of features in the feature
representation F
i
= {f}. First four rows of the table contain the features related to
R-R intervals, followed by the features of the signal in the segment.
5.3.5 Dimensionality Reduction
The database may contain thousands of entries. Because signal in each record is of
varying length, we use the DTW distance to measure the similarity. The computation
of the DTW distance has a quadratic time complexity. Therefore, to lookup an entry,
a pairwise comparison with all the records in the database can be unacceptably slow.
We speed up the lookup by reducing the dimensionality of the search. We represent
the signal with the features, and use the features to do similarity search. For a xed
length feature vector, the comparison can be done in time that is sub-linear on the size
of the database. We achieve further speed up through other optimizations discussed
in Chapter 7.
The top K matches are found through k-nearest neighbors search (KNN) with a
83
Table 5.1: Set of features that are used to represent a segment.
ID Feature Description
f
1
R
pre
The length of the previous R-R interval (Figure 5-1)
f
2
R
1
The length of the R-R interval constituting the rst part of the
segment (Figure 5-1)
f
3
R
2
The length of the R-R interval constituting the latter part of the
segment (Figure 5-1)
f
4
R
post
The length of the next R-R interval (Figure 5-1)
f
5
E
total
Square root of the total energy of the rst signal in the current
segment (Figure 5-3)
f
6
.. f
15
E
k
The fraction of the square root of the energy in the k
th
section of
the rst signal in the segment (Figure 5-3)
f
16
K Kurtosis of the sample values of rst signal in the segment
f
17
D
total
DTW distance between the rst signal in the segment and the me-
dian of the same signal
f
18
.. f
27
D
k
DTW distance between the k
th
subsequence of the rst signal in the
segment and the k
th
subsequence of the median of the same signal
f
28
SDR Fraction of spectral energy in the QRS complex of the rst signal in
the segment
f
29
MAX The maximum sample value of the rst signal in the segment
f
30
MIN The minimum sample value of the rst signal in the segment
84
0 50 100 150 200 250 300 350 400
200
100
0
100
200
300
400
500
# sample
A
m
p
l
i
t
u
d
e
(a) The signal from a segment
50 100 150 200 250 300 350
0
50
100
150
200
250
300
350
400
450
S
q
u
a
r
e

r
o
o
t

o
f

p
o
w
e
r
# sample
f
5
(b) f
5
: Square root of the total energy of the signal
0 50 100 150 200 250 300 350 400
0
50
100
150
200
250
300
350
400
450
# sample
S
q
u
a
r
e

r
o
o
t

o
f

t
h
e

p
o
w
e
r
f
11
(c) f
6
.. f
15
: The fraction of the square root of the energy in the k
th
section
of the signal
Figure 5-3: Estimation of features f
5
(b), and f
11
(c) on an ECG signal segment(a).
85
normalized euclidean distance function.
d(

X,

X
i
) = (

X

X
i
)
T
(

X

X
i
) (5.1)
K = argmin
iN
X
d(

X,

X
i
) (5.2)
Here, N
X
is the set of neighbors for the entry

X.
We select the top twenty matches, and from them, we nd the candidate for
replacement V
i
using both the DTW distance and the feature distance.
V
i
= argmin
k
_

j
(F
k,j
F
i,j
)
2
+.dtw(U
a
k
, U
a
i
)
_
(5.3)
c
i
= min
k
_

j
(F
k,j
F
i,j
)
2
+.dtw(U
a
k
, U
a
i
)
_
(5.4)
Here, F
k
and F
i
are the feature vectors of the segments (top 20 matches) {U
k
},
and the current segment U
i
respectively. The summation is performed over all the
features in the feature vector. The DTW distance is computed only between the
clean channels of the segments (U
a
k
and U
a
i
). The regularizer = 25 was empirically
chosen. The combination helps us avoid over-tting, and balance the preservation of
clinically relevant events against the morphological similarity.
5.3.6 Reconstruction
We want to reconstruct the corrupted channel of the current segment U
b
i
with the
corresponding channel from the replacement candidate V
b
i
that was found.
We rst verify the correctness of the match found. We accept the reconstruction
only if the cost of the match c
i
(Equation 5.4) is less than a threshold. If the cost c
i
is greater, we ag the segment U
i
so that automated systems could avoid producing
false alarms in those regions.
Since the length of the current segment U
i
and the length of the candidate found
(template) V
i
are typically unequal, we next time-warp the template with the cur-
rent segment. Time-warping is done by nding the optimal alignment (k) between
86
the clean channel of the current segment U
a
i
and the corresponding channel of the
template V
a
i
(Equation 5.5-5.7).
(k) = (
1
(k),
2
(k)), 1 k K (5.5)
C(V
a
i
, U
a
i
) = min

(V
a
i
, U
a
i
) (5.6)
C

(V
a
i
, U
a
i
) =
K

k=1
d(V
a
i
[
1
(k)], U
a
i
[
2
(k)]) (5.7)
We then replace each sample of the corrupted channel U
b
i
[x] with the time-warped
sample V
b
i
[x], which is obtained from the median of the samples with which it is
aligned.
x = median(
2
(k)), 1 k K and
1
(k) = x (5.8)
5.4 Experimental Results
In this section, we present the experimental results for our method. We begin with
the description of our dataset, and then present the results of the experiments.
In our experiments, we use the multi-parameter ECG data from MIT-BIH Ar-
rhythmia Database at Physionet.org [21]. The database has 48 ECG waveform
records, each contains two channels and is 30 minutes long. The recordings are
digitized at 360 Hz with 11-bit resolution over 10mV range. The recordings were
selected to include less common but clinically signicant arrhythmias. This helps us
evaluate the robustness of our method.
We add synthetic corruption that is intended to mimic real-world corruption to the
data (selected channel of a record). We evaluate our method by quantifying the eec-
tiveness of the reconstruction on this corrupted data. We compare the reconstructed
data, and the corrupted data with the original data to evaluate the performance of
our method.
We used only 39 records from this set. Records 100-108 were excluded from the
experiments, because they contain signicant amount of noise and transient corrup-
87
tion on both channels. We would not able to use those records eectively, because we
require a gold standard to compare against, and the second channel to be corruption
free so that it can be used to reconstruct the articially corrupted rst channel. Other
records might also contain sporadic transient corruptions. Although it is unfavorable
to our test cases, we do not exclude any region in these 39 records, even when both
channels are corrupted.
We use the following criteria for comparison.
1. Q1 : Residual distance : We measure the similarity between the recon-
structed data (S
b
), and the original uncorrupted data (S
b
) by measuring the
Euclidean residual distance r of the reconstructed data.
r =

n
k
(S
b
[k] S
b
[k])
2
n
2
S
(5.9)
We normalize the Euclidean distance to make it comparable across the records.
2. Q2 : Reproducibility : Our ultimate goal is to enable the automated analysis
systems produce more reliable results. Hence, we test our methods ability to
improve the classication accuracy of a common task. We run a widely used
Premature Ventricular Contraction (PVC) detector
2
on the original data (S
b
),
the articially corrupted data (S
b
#) and the reconstructed data (S
b
), and
record their agreements. If the PVCs are detected within 150 ms on two signals,
we consider it an agreement. We quantify the ability to preserve the clinically
relevant events by counting the disagreements (Algorithm 1). The number of
disagreement n
disagreement
is evaluated between the original data (S
b
), and the
articially corrupted data (S
b
#), and between the original data (S
b
), and the
reconstructed data (S
b
). The disagreement is nally expressed in terms of
the fraction between the total number of disagreements n
disagreement
, and the
total number of beats n
beats
in the region.
= n
disagreement
/n
beats
(5.10)
2
http://www.eplimited.com/software.htm
88
Algorithm 1 Disagreement
n
disagreement
= 0
for all beat in beats do
if there is PVC in S
1
then
if there is no PVC in S
2
then
n
disagreement
= n
disagreement
+ 1
end if
else if there is PVC in S
2
then
if there is no pVC in S
1
then
n
disagreement
= n
disagreement
+ 1
end if
end if
end for
5.4.1 Experiment 1 : Eectiveness of Reconstruction
We corrupt the rst channel of the records with the additive white gaussian noise
(AWGN) at 0dB SNR. We add the articial corruption to only the last 20% of the
data, and attempt to reconstruct it. We build the database from the rst 80% of the
data.
Table 5.2 summarizes the results. It shows that our method reduces the residual
distance (Q
1
) by 250% for a signal corrupted at SNR 0dB. Further, on average, it
was able to improve the classication accuracy (Q
2
) by more than seven fold. Figure
5-4, shows the eectiveness of the reconstruction on Record 200.
Table 5.3 lists the performance of our method on each record. The rst three
columns describe the record, the next two columns provide the results for the Q
1
criterion on both corrupted and reconstructed channels, and the last six columns
provide the results for the Q
2
criterion on both corrupted and reconstructed channels.
Q
1
is measured by the normalized residual distance, and Q
2
is dependent on the total
number of disagreements, which is the sum of misses and inventions. A miss is
counted, when the PVC found on the original signal S
b
is missed in S
b
# or S

. An
invention is counted, when a new PVC is found in S
b
# or S

. The reconstruction not


only improves the morphological similarity, but also preserves the clinically relevant
information. On some records, e.g., 207 and record 217, our method performs poorly.
89


Corrupted
Original
(a) Corrupted Vs Original


Original
Reconstructed
(b) Reconstructed Vs Original
Figure 5-4: Record 200 from MIT-BIH Arrhythmia database. The rst channel is
corrupted with AWGN at SNR 0dB (a), and reconstructed using our method (b).
90
This is mainly due to the poor quality of the correlated channel (channel 2). On record
217 in particular, channel 2 is extremely noisy, and at times completely corrupted.
Table 5.2: Experiment 1 : Summary
Q1 Q2
PVC r
S
b
#
r
S
b


S
b
#

S
b
Median 4 1.01 0.39 0.09 0
Average 47.64 1.01 0.40 0.14 0.02
5.4.2 Experiment 2 : Dierent SNR Levels
On all 39 records, we corrupt the last 20% of the rst channel with AWGN at SNR
levels of 10 dB, 0 dB, and 10 dB (Figure 5-5). We build the database from the rst
80% of the data. For the remainder, we check whether the segment is corrupted and
reconstruct the segment if it is corrupted.
Table 5.4 summarizes the average disagreement (), and the residual distance
(r) for the reconstructed signal (S
b
). Somewhat surprisingly, we get the worst per-
formance at the highest signal to noise ratio. In addition, at low SNR levels, the
performance does not deteriorate with decreasing signal quality.
Our method makes a binary decision to either reconstruct the corrupted region,
or leave it. It does not support partial recovery of the signal. This is both the
strength and the weakness of the algorithm. By design, once the region is determined
to be corrupted, the algorithm will choose to reconstruct it completely using the
correlated channel and the history of the signal. Therefore, at low SNR levels, the
level of corruption does not aect the performance of the reconstruction process. In
addition, at higher SNR levels, when the signal is of high quality, the algorithm will
choose not to reconstruct the signal, and, hence the original performance will be
retained. However, in the middle, for instance, at SNR 10dB, corrupted regions are
not easily distinguishable, and hence it results in poorer performance compared to
that of lower signal qualities.
91
Table 5.3: Experiment 1 : Eect of reconstruction
In last 6 minutes Q1 Q2-unreconstructed Q2- reconstructed
Record Beats PVC r
S
b
#
r
S
b

miss invention miss invention


ID # # # # # #
109 504 6 1.33 0.59 27 3 0.06 0 0 0
111 424 1 0.93 0.50 25 1 0.06 1 1 0
112 507 0 0.95 0.30 33 0 0.07 0 0 0
113 356 0 1.02 0.28 32 0 0.09 0 0 0
114 375 0 1 0.66 38 0 0.10 1 0 0
115 389 0 0.98 0.47 34 0 0.09 0 0 0
116 481 11 0.60 0.42 36 9 0.09 0 1 0
117 305 0 0.93 0.16 24 0 0.08 0 0 0
118 457 4 1.07 0.40 31 4 0.08 1 0 0
119 416 87 0.98 0.19 32 66 0.24 0 0 0
121 372 0 0.97 0.55 36 0 0.10 0 0 0
122 493 0 1.01 0.15 51 0 0.10 0 0 0
123 301 2 1.02 0.21 22 2 0.08 0 0 0
124 324 3 0.96 0.30 18 3 0.06 0 0 0
200 555 149 0.94 0.44 23 120 0.26 8 7 0.03
201 405 23 1.08 0.42 21 18 0.10 0 7 0.02
202 426 1 1.16 0.41 38 1 0.09 0 0 0
203 619 71 0.94 0.80 35 61 0.16 11 41 0.08
205 531 36 1.04 0.49 42 31 0.14 0 8 0.02
207 473 224 1 0.73 5 170 0.37 4 131 0.29
208 605 235 0.98 0.44 15 187 0.33 3 3 0.01
209 607 2 1 0.61 41 2 0.07 0 2 0
210 534 31 1.02 0.38 41 25 0.12 10 4 0.03
212 549 0 0.97 0.30 32 0 0.06 0 0 0
213 655 53 0.98 0.26 35 42 0.12 3 2 0.01
214 456 45 0.99 0.42 40 34 0.16 2 4 0.01
215 676 40 1.03 0.39 52 32 0.12 2 1 0
217 453 436 1.09 0.37 0 374 0.83 0 116 0.26
219 459 11 1.03 0.21 30 7 0.08 0 0 0
220 411 0 1.02 0.29 17 0 0.04 0 0 0
221 489 32 0.89 0.29 39 24 0.13 0 0 0
222 523 2 1.08 0.76 37 2 0.07 2 2 0.01
223 525 97 1.09 0.33 29 74 0.20 2 5 0.01
228 425 74 1.02 0.39 20 59 0.19 0 1 0
230 490 1 1.12 0.50 35 1 0.07 0 0 0
231 399 0 1.06 0.32 10 0 0.03 0 0 0
232 361 0 1.03 0.34 23 0 0.06 0 0 0
233 627 180 1 0.32 16 141 0.25 0 6 0.01
234 550 1 1.04 0.29 28 0 0.05 0 0 0
92


MLII
V5
(a) SNR 10dB


MLII
V5
(b) SNR 0dB


MLII
V5
(c) SNR -10dB
Figure 5-5: Record 123, from MIT-BIH Arrhythmia database. The original signal is
added with AWGN at SNR = 10dB, 0dB and -10dB.
93
Table 5.4: Experiment 2 : Eect of SNR levels
Similarity : r
S
b

Disagreement :
S
b
10 0.410 0.031
0 0.401 0.021
-10 0.402 0.022
However, on all three noise noise levels, our method improved the classication
accuracy, as measured by the disagreement, by more than 460%, and the increased
the similarity to the original signal, as measured by the normalized residual distance,
by more than 200%.
5.4.3 Experiment 3 : Simulated Real-World Corruptions
Table 5.5: Performance against dierent types of real-world corruptions
Similarity : r
S
b

Disagreement :
S
b
AWGN 0.410 0.031
EM 0.36 0.023
MA 0.19 0.003
BW 0.05 0.001
We alter the rst 20% of the rst channel with the following types of corruptions at
SNR = 10 dB: Additive White Gaussian Noise (AWGN), Electromagnetic Interference
(EM), Muscle Artifact (MA), and Baseline Wander (BW) (Figure 5-6). We use MIT-
BIH Noise Stress Test Database
3
and nstdbgen
4
to generate the rst three types of
noise models. We chose to corrupt at SNR 10dB, because at lower signal qualities,
the distinction between dierent noise models become less apparent and signals with
high signal quality does not require reconstruction.
Table 5.5 summarizes the average disagreement (), and the residual distance
(r) for the reconstructed signal (S
b
). We achieve the best performance for Baseline
3
http://www.physionet.org/physiobank/database/nstdb
4
http://www.physionet.org/physiobank/database/nstdb/nstdbgen-
94


ML II
V5
(a) Electromagnetic Interference (EM)


ML II
V5
(b) Muscle Artifact (MA)


ML II
V5
(c) Baseline Wander (BW)
Figure 5-6: Record 123, from MIT-BIH Arrhythmia database. The original signal is
added with the following types of corruptions at SNR 10dB : Electromagnetic Inter-
ference (EM), Muscle Artifact (MA), and Baseline Wander (BW). The highlighted
area indicates the corrupted region.
95
Wander. In preprocessing, we remove baseline wander using a low-pass lter
5
. The
baseline wander removal algorithm was eective at 10 dB SNR, and was able to
cancel the noise itself. The worst performance was observed for AWGN, and EM
noise. Hence, on all four noise noise models, our method improved the classication
accuracy, as measured by the disagreement, by more than 500%, and the increased
the similarity to the original signal, as measured by the normalized residual distance,
by more than 250%. Further, it must be noted that this is the worst case performance,
and BW and MA are more common than EM and AWGN in practice.
5.4.4 Experiment 4 : Size of the Database
In this experiment, we test the inuence of the length of the database and learning.
We corrupt the rst channel of the records with AWGN at 0dB SNR. We corrupt the
last 20%, 50%, and 80% of the rst channel of the signal. Therefore, the length of
signal available to build the database shrinks from 80% to 20% of the original signal.
For the remainder, we check whether the segment is corrupted, at the granularity of
a beat, and reconstruct the segment if it is corrupted.
Figure 5-7 summarizes the average disagreement (), which is the measure of re-
producibility, and the residual distance (r) for the reconstructed signal (S
b
), which is
the measure of the similarity. Both metrics worsen with increasing length of corrupted
regions. The average disagreement rises exponentially. There are two contributing
factors for this observation. First, we are unable to reconstruct the PVC beats prop-
erly with shrinking database. Another reason is that with larger corrupted regions,
we are estimating the sample values farther from where we learned the relationships.
5.4.5 Experiment 5 : Learning
We corrupt three randomly chosen non-overlapping, 5 minute long windows (50%
of the signal) starting from the 6
th
minute (20% of data) on the rst channel of
the records with AWGN at 0dB SNR. We build our database using the rst 20% of
5
http://www.mit.edu/ gari/CODE/FILTERS
96
!"# %"# &"#
"'"(
"'!)
"'**
"'+
"'+*
"'%&
!"##$%&'( #'*+",
,-./01234546478 ,-9412:6 ;497:<3-
;49:=/-->-<7
Figure 5-7: Last 20%, 50%, and 80% of the rst channel of the signal are corrupted
with AWGN at 0dB SNR. The average disagreement () and the residual distance
(r) are computed for all 39 records.
97
!"# %"# &"# &"#
'()*+,-*./ 0 '()*+,-*./ %
"10
"102
"1%!
"1%3
"1"4
"1&5
"122
"136
!"#$%& () *+$#,- .*"/ %( $"#"0,%" %&" /,%,1,*"
7*8,9:;< =,8/;.>* 7*)+?9:>,@,<,/A =,8;B+**-*./
Figure 5-8: In experiment 4, we build the database using the rst 20%, 50%, and 80%
of the signals. The remainder of the signals is corrupted with AWGN at SNR 0dB.
The average disagreement () and the residual distance (r) are computed for all 39
records. In experiment 5, we build the database using the rst 20% of the signals. On
the remainder, three randomly chosen non-overlapping regions (each 5 minute long),
are corrupted with AWGN at SNR 0dB. Hence, totally 50% of the data is corrupted.
Here, we also learn the new morphologies over the regions free of corruption.
98
the signal. For the remainder, we check whether the segment is corrupted, at the
granularity of a beat, and reconstruct the segment if it is corrupted. If the segment
is determined to be free of corruption in both channels, and is of high signal quality,
we add that to the database, thus learn the new morphologies.
Figure 5-8 compares the results to that of Experiment 4, and demonstrates the
benets of learning new relationships and morphologies on the y. Though, we start
with a database built from only 20% of the signal, we are able to achieve the per-
formance similar to that, where we start with a database generated from 50% of the
signal. The dierence in residual distance is not large. However, there is a signi-
cant improvement in disagreement. We attribute this to adding new segments with
PVCs to the database, and being able to learn the morphologies closer to the place
of reconstruction.
5.5 Summary
In this chapter, we presented a method for reconstructing a corrupted signal in a
multi-parameter physiological signal using the information available in a correlated
signal. The method uses template matching to reconstruct the signal. The best match
is found from the database of templates using a combination of DTW cost and the
feature distance.
Using the data from the MIT-BIH Arrhythmia Database, we conducted a series
of experiments to test the eectiveness of our method. We added synthetic corrup-
tion to the data, and used this articially corrupted data to evaluate our method.
We quantify the eectiveness of the reconstruction by comparing the reconstructed
data, and the corrupted data with the original data. Our evaluation criteria were
normalized residual distance and classication accuracy.
On 39 records from MIT-BIH Arrhythmia Database, our method improved the
classication accuracy by more than 700%, and increased the similarity to the original
signal, as measured by the normalized residual distance, by more than 250%. We also
compared the performance for various types of real-world corruptions, and dierent
99
SNR levels. Finally, we showed the eectiveness of dynamically determining the signal
quality, at the granularity of a beat, and learning the signal morphology on the y.
Our method is particularly useful when the system suers noise and transient
corruption. We used only ECG signals in this chapter, because we were able to
obtain the database containing handpicked, less common but clinically signicant
events only for ECG. However, we believe that our method is applicable to any set
of correlated physiological signals such as d ABP, PPG, and CVP as well.
100
Chapter 6
Data Fusion
This chapter discusses the details of fusing estimations from multiple signals. We
begin Section 6.1 with details of the method. In Section 6.2, we discuss the methods
to handle the baseline. In Section 6.3, we discuss the measures of performance used
to evaluate our method, and present the results of a series of tests in which compar-
isons are made using each of the performance measures. Finally, in Section 6.4, we
summarize our work.
6.1 Data Fusion
Goal : Let S
nm
be a multi-parameter time series consisting of set of m single
parameter physiological signals. The goal is to nd the corrupted regions in signal
S
1
, and reconstruct it using estimations obtained separately from the other correlated
signals S
j
from S, where j = 2, 3..m.
Procedure : First, we preprocess the signals. Then we jointly segment them,
and estimate the SQE of each segment for each of the signals. Using the segment
boundaries, we divide the signal into quasiperiodic units, which serve as the units of
replacement. We build a database using the segments of signal quality greater than a
threshold. The database is built for each pair of signals: a signal that has corruptions
and a correlated signal. Then we run the algorithm presented in Section 5.3, starting
at the rst segment U
1
, continuously evaluating the signal qualities of the segments,
101










"#$%&
'()*(%&+,#%

-
- - - -
!"##$%&'(
*+,-./0
'$)%+. /0+.$&1
23,*+,#%
4(+5%$%) 6(7#%3&507,#%
1'2"-0&#$2&'(
3$&%$&
-
- - - -









8+&+ 903$#%
Figure 6-1: The picture outlines the reconstruction process on a multi-parameter
timeseries. The multi-parameter signal contains several correlated physiological sig-
nals such as ECG, ABP, and PPG. First, we simultaneously perform the joint seg-
mentation and the Signal Quality estimation on the multi-parameter signal. Then we
learn the pair wise relationship between the signal-to-be-reconstructed and another
correlated signal in the multi-parameter signal and build the database. This is re-
peated for all the correlated signals. We nd the best matches separately for each
correlated signal and nally combine them.
102
and determining whether each segment needs reconstruction. If the segment needs
reconstruction, for each correlated signal, we nd an entry from the corresponding
database that closely matches the current segment using the method explained in
Section 5.3.5. We weight each entry by the quality of reconstruction, and combine
them. Figure 6-1 summarizes the complete process.
For the segment U
1
i
, let {V
j
i
|j = 2, 3..m} be the matches found, and {c
j
|j =
2, 3..m} be the costs of the matches. The reconstruction V
i
is obtained by fusing the
matches,
V
i
=

m
j=2
V
j
(w
j
)
p

m
j=2
(w
j
)
p
(6.1)
w
j
= c
j
.q
j
i
.r
j
. (6.2)
Here, q
j
i
is the SQE of the i
th
segment of j
th
signal, and r
j
is the correlation coecient
between the signals (corrupted signal and j
th
correlated signal) estimated around the
segment V
j
i
.
The choice of p inuences the performance of our method. For instance, when
the correlation between the corrupted signal and a correlated signal is high (e.g., two
ECG channels), p tends to perform better. This is similar to selecting the most
useful signal for the reconstruction of the segment U
i
,
k = argmax
j=2,3..m
w
j
(6.3)
V
i
= V
k
. (6.4)
However, when the correlation between the corrupted signal and the correlated signals
is low (e.g., an ECG channel and an ABP signal), lowering p to one seems to improve
the performance.
103
6.2 Global Trend
In our analysis, we rst remove the global trends and constant osets. In ECG
signals, the baseline represents noise, and it can be removed without aecting the
analysis. Further, removing the trend negates the eects of baseline wander, and
improves the performance of the automated analysis. However, signals such as ABP,
and PPG contain their primary information in the baseline. Hence, our framework
must accommodate the baseline of these signals.
In our framework, we rst extract the baseline using a rst order zero-phase
lowpass lter. For ECG signals the 3dB cuto frequency is set at 0.2Hz, and
for ABP and PPG it is set at 0.07Hz. Then we subtract the baseline from the
signal, and reconstruct the corrupted segments on this baseline-subtracted signal.
We reconstruct the baselines independently, and add them to the signals. Thus, we
modularize the reconstruction into two subproblems :
1. Reconstruction of the baseline-subtracted signals. This is a quasiperiodic multi-
parameter signal, and hence we focus on reconstructing the repetitive morpho-
logical structures in the signal. We discussed the methods to do this in Chapter
5 and Section 6.1.
2. Prediction of the global trends. Using the baselines of the multi-parameter
signal, we try to estimate the values of the baseline of the corrupted signal over
the segments of low signal quality.
The baseline of a multi-parameter signal is a multi-parameter time-series. We
treat the reconstruction of the baseline as a regression problem. We take the mean
of the samples on a segment as a sample point in our regression. Thus we model the
relationship between a scalar variable, the mean of the samples of the segment from
the signal that is to be reconstructed, and one or more variables, the means of the
samples of the segments from the correlated signals. For instance, if the rst signal
S
1
in the multi-parameter signal S
nm
= {S
1
, S
2
..S
m
} contains corruption, the data
104
X
n(m1)
, Y
n1
for the regression is,
x
(j1)
i
= mean(S
j
[p
i
,p
i+1
)
)j = 2, ..m (6.5)
y
i
= mean(S
1
[p
i
,p
i+1
)
) (6.6)
X =
_

_
x
2
1
x
3
1
.. x
m
1
x
2
2
x
3
2
.. x
m
2
... .. .. ..
x
2
n
x
3
n
.. x
m
n
_

_
(6.7)
Y =
_

_
y
1
y
2
...
y
n
_

_
(6.8)
Here, p
i
is the i
th
segment boundary.
Further, we hypothesize that the baseline is both autocorrelated and cross corre-
lated. Hence, for the model order p, we model the regression as
y
i
=
1
(
1
x
1
i
+
2
x
2
i
+.. +
m
x
m
i
+
i
)
+
2
(
1
x
1
i1
+
2
x
2
i1
+.. +
m
x
m
i1
+
i2
)
+ ...
+
p+1
(
1
x
1
ip
+
2
x
2
ip
+.. +
m
x
m
ip
+
ip
) (6.9)
= w
1
z
1
+w
2
z
2
+.. +w
m
z
m
+w
m+1
z
m+1
+.. +w
(p+1).m
z
(p+1).m
+
i
(6.10)
where
a
,
b
, w
b
c
are coecients, z
a.m+b
= x
a
b
, and w
a.m+b
=
a
.
b
. This results in a
linear regression, where y
i
is the target variable, z
i
is the regressor, and W = {w
a
|a =
1, 2..(p + 1).m} are the coecients.
We want to learn the coecients when we build the database. There are several
methods addressing the learning problem. Ridge regression [54] and Support Vector
Regression (SVR) [50] provide robust solutions to regression, especially when the data
contain the outliers [5]. We use Ridge regression to obtain the coecients w
b
a
=
a
.
b
.
105
We obtain the regression coecients from,
W = (Z
T
.Z +kI)
1
Z
T
Y (6.11)
where I is an identity matrix. The positive Ridge regularizer k controls the condi-
tioning of the problem by reducing the variance of the estimates [54].
We want to update the coecients as we add new entries to the database. We
use online gradient descent to achieve this [7]. The regularization function of the
regression is
R(W) =
k
2
||W||
2
+||W
T
Z Y ||
2
(6.12)
At time t, for update coecient , and regularization function R(w) the standard
gradient descent update is given by,
W
t+1
= W
t
2(W
T
Z
t+1
Y
t+1
)Z
t+1
(6.13)
Implicit update,
W
t+1
= W
t

2
1 + 2||Z
t+1
||
2
(W
T
Z
t+1
Y
t+1
)Z
t+1
(6.14)
modies the learning rate, and makes the method more robust to scaling of the data
[36].
6.3 Experimental Results
In this section, we present the experimental results for our method. We begin with
the description of our dataset, and then present the results of the experiments.
In our experiments, we use the multi-parameter data from Challenge 2010 Test Set
B at Physionet.org [21]. The database has 100 ECG waveform records, each contains
six, seven, and eight-channel records of ECG, ABP and PPG signals, each 10 minutes
long, to be used as a test set for the PhysioNet/Computers in Cardiology Challenge
2010. The recordings are digitized at 125 Hz.
106
We add AWGN to the rst ECG channel of the data. We evaluate our method by
quantifying the eectiveness of the reconstruction on this corrupted data. We compare
the reconstructed data, and the corrupted data with the original data to evaluate
the performance of our method. In the experiments, we compare the data with their
baselines included. Therefore, the reconstructed signal is rst obtained by merging the
morphological reconstruction (Section 6.1), and the baseline reconstruction (Section
6.2).
We use the residual distance (Equation 5.9), and the disagreement (Equation 5.10)
to evaluate our performance.
6.3.1 Experiment 1
We corrupt the rst channel of the records with the additive white gaussian noise
(AWGN) at 0dB SNR. We add the articial corruption to only the last 20% of the
data, and attempt to reconstruct it. We build the database from the rst 80% of the
data.
Table 6.1 lists the results of the experiment for the best 10 and the worst 10
records. In some records, e.g., record 37, the corruption, including total signal inter-
ruption, spans more than 50% of more than half of the signals in the record. Further,
some records have signals misaligned with each other. Since we use correlated signals
to t and align the templates on the corrupted segment, this leads to large errors.
Therefore, we report the results separately for 70 records that are relatively free of
corruption and misalignment, and the rest.
Table 6.2 shows that our method performs well on records with enough correlated
signals and correct alignment. Our method improves the signal quality on those
records by reducing the average disagreement by 3300%, and the average residual
distance by 280%.
107
Table 6.1: Experiment 1 : Eectiveness of data fusion on dierent sets of records
In the record In last 2 mins Q1 Q2
Record Beats PVC PVC r
S
b
#
r
S
b


S
b
#

S
b

ID # # #
best 10 records
4 660 156 32 0.98 0.01 0.20 0.23
18 744 1319 309 0.95 0.01 0.79 0.00
23 852 0 0 1.04 0.01 0.09 0.00
27 880 0 0 0.96 0.00 0.04 0.00
58 809 0 0 0.99 0.01 0.01 0.00
63 909 0 0 0.97 0.00 0.14 0.00
78 729 0 0 0.97 0.00 0.25 0.09
87 700 1 0 0.97 0.01 0.07 0.00
93 671 515 147 0.98 0.00 0.74 0.12
99 729 6 0 0.97 0.00 0.06 0.18
worst 10 records
24 848 28 21 0.92 0.41 0.17 0.16
25 963 30 3 0.64 0.92 0.13 0.02
33 1069 172 95 0.95 0.47 0.52 0.20
37 837 0 0 0.74 1.24 0.02 0.05
42 730 22 2 1.01 0.41 0.01 0.08
45 1290 57 25 1.06 0.33 0.09 0.25
55 814 9 0 1.12 0.53 0.04 0.00
61 922 1 0 0.99 0.39 0.00 0.03
75 775 89 22 1.44 0.85 0.18 0.21
77 741 80 23 2.17 1.09 0.20 0.19
Table 6.2: Experiment 1 : Summary
Records Q1 Q2
# r
S
b
#
r
S
b


S
b
#

S
b
On good records 70
Mean 1.00 0.03 0.14 0.05
Median 0.98 0.02 0.09 0.00
On poor records 30
Mean 1.27 0.33 0.13 0.11
Median 0.96 0.22 0.09 0.05
108
6.3.2 Experiment 2
In this experiment we use only the 70 records that are relatively free of transient
corruption and misalignment. We corrupt the rst channel of the records with additive
white gaussian noise (AWGN) at 0dB SNR. We add articial corruption to the last
10% of the data. We also add the articial corruption to a region as long as the 10%
of the data starting at 20%, 40%, and 60% of the data. The idea is to keep the span of
the corrupted region to 20% of the data, and evaluate the performance on identifying
the corrupted regions, estimating the signal qualities correctly, and learning from the
regions free of corruption. We build the database from the rst 20% of the data, and
add new entries when we come across a clean segment.
Table 6.3: Experiment 2 : Summary
Corruption starts at Q1 Q2
% r
S
b
#
r
S
b


S
b
#

S
b
20
Mean 0.97 0.042 0.15 0.08
Median 0.97 0.03 0.14 0.00
40
Mean 0.97 0.041 0.14 0.06
Median 0.97 0.03 0.14 0.00
60
Mean 0.97 0.038 0.14 0.05
Median 0.97 0.03 0.14 0.00
Table 6.3 presents the results of the experiment. Our frameworks performance
seems independent of the location of the corruption. However, there are two factors
inuencing the inferior performance compared to experiment 1. First, we have a
smaller region to analyze (20% of the data, compared to 80% of the data). Second, our
framework automatically identies the corrupted region and reconstructs it. Another
issue is that there are segments in the original record that are of poor signal quality,
and our algorithm attempts to improve their signal quality. Our evaluation criteria
is oblivious to this. It considers the original signal the gold standard and penalizes
our method for cleaning up the original signal.
109
6.4 Summary
In this chapter, we presented a method for reconstructing a corrupted signal in a
multi-parameter physiological signal by using information from correlated signals.
The method uses database of templates for each pair of correlated signals, and -
nally combines the reconstructions from each correlated signal. We also predict the
estimates of the baseline by treating them as a regression problem.
110
Chapter 7
Implementation
This chapter discusses the details of the implementation and analyzes the compu-
tational complexity of the system. Since our framework runs in real-time, we want
to do the database operations quickly, and hence we provide methods to optimize
these operations. We provide the details of the database construction in Section 7.1.
In Section 7.2, we discuss the running time of each subcomponent and the over all
computational complexity of our framework. Finally, in Section 7.3, we summarize
our work.
7.1 Database
In our framework, we use a database of templates to provide the estimates for re-
construction. The database contains the corruption free segments learned, and the
corresponding features. The use of a data structure helps us abstract the lookups,
maintenance, and transfer of the morphological structures learnt.
A separate database is maintained for each pair of signals. This allows us to
use the databases across the records. For instance, when we bootstrap, we begin
with the database in the archive for the corresponding pair of signals, probably built
from another patients signals, and add the segments from the current record as we
learn new morphologies. This allows us to transfer knowledge, and then adapt to the
patients signal morphologies.
111
7.1.1 Structure
Each database contains three elds: a set of features (Section 5.3.4), raw segments
that are used to provide the estimates for reconstruction (Signal A), and the corre-
sponding segments from the correlated signals (Signal B) as shown in Figure 7-1.
Figure 7-1: The structure of the database. It contains three elds: features, Signal A,
and Signal B. The feature, itself, is a xed length vector of dimension 30, and serves
as the key.
7.1.2 Lookup
In reconstruction, to nd the estimates for the corrupted segment, we lookup for an
entry in the table that closely matches the current segment. We use the correlated
signal (Signal A) to nd the estimates of the signal that contains corruptions (Signal
B). The entry contains the segments from both the signals. Since the lookup query
and the entries in the table are of dierent lengths, we need to use an elastic distance
measure, such as DTW. DTW takes O(
2
) time, where is the length of the segment.
The naive approach of comparing the query with every entry would take O(n
d
.
2
)
time, for the database of n
d
entries and segments of length . We use features to
linearize this (O(n
d
.)) and provide a level of abstraction. The matching is rst done
through features, and then using DTW.
112
Our method uses the features to obtain the best 20 matches in the database. The
lookup on this database is done using a nearest neighbor search. In nearest neighbor
search, we nd the entry closest to the query according to some distance function. We
use Euclidean distance to compare two features, and hence, each dimension is weighed
equally. To speed up the nearest neighbor searches, a multi-dimensional database is
normally indexed by space partitioning or locality sensitive hashing (LSH). We use
an algorithm that nds the approximate nearest neighbors in high dimensions. It has
provably near-optimal time in the class of LSH [3].
The use of LSH adds another level of indirection. LSH indexes the data so that
the queries for nearest neighbor can be served quickly. LSH is typically implemented
by creating a hash table that puts similar items into the same bucket. To serve a
query, we apply the hash function to the query, obtain the buckets ID, and do an
exhaustive search inside the bucket. The idea here is that the hash function returns
a set smaller than the original data to search for. This set contains a smaller number
of false positives, and may miss some positives. For any two vectors F
1
and F
2
, our
expectation is that
LSH(F
1
) = LSH(F
2
) if ||F
1
F
2
|| < (7.1)
for some threshold . LSH also provides an indirect clustering of the data by grouping
the similar items together.
7.1.3 Compaction
We begin with a database in an archive, and we add the entries to the database as
we learn the morphologies. Since we want to limit the size of the database, we evict
old, presumably less useful, entries from the database. This also helps us capture the
most recent information well.
This is achieved by performing the compaction on the database when its size grows
over a limit u
upper
. We use Least Recently Used (LRU) algorithm to evict the old
entries. During the compaction, the entries with the least counts are evicted, and the
113
size of the database is brought back to u
lower
.
7.2 Computational Complexity
We expect to run the system in real-time. Therefore, the computational complexity
of the framework is relevant.
Segmentation : Segmentation uses DTW to compare a window and the template
of a multi-parameter signal. If the length of the segment is , it takes O(
2
) to
nd a segment boundary. If the signal length is n, the number of segments in
the signal is O(n). Hence, it takes O(n.
2
) time to segment the signal.
Signal Quality estimation : Similar to segmentation, the signal quality estima-
tion uses an elastic function for comparison. However, it is done for all the
signals in a multi-parameter signal, and hence, it takes O(n.m.
2
) time for a
multi-parameter signal with m parameters.
Database Lookup : In Section 7.1.2, we showed that a lookup takes O(
2
). The
total number of segments requiring reconstruction is O(n), and hence the total
time complexity is O(n.
2
)
Reconstruction : In reconstruction, we time-warp the template to t the seg-
ment, and it takes O(
2
) time for each reconstruction. Therefore, the total
amount of time required is O(n.
2
).
The total time complexity of our framework is O(n.m.
2
), where n is the length of
the signal, m is the number of parameters, and is the average segment length. Since
m is typically a constant, the overall time complexity of our framework is O(n
2
).
Typically the sampling rate f of the signal determines the segment length . Hence,
we can give the running time of the system on the sampling rate as O(n.f
2
).
We carried out our experiments using the unoptimized code written in Matlab
2010a, on a machine congured at 2.93GHz Intel Core i7 processor and 8GB Ram.
The experiments (Sections 6.3.1, 6.3.2) conducted in Matlab R14, on 10 minute long
114
records from Challenge 2010 Test Set B at Physionet.org [21] took on average 51
seconds per record.
7.3 Summary
In this chapter, we discussed the details of the implementation of the system. First,
we presented the description of the data structure that holds the templates, and
the methods to access the data in the data structure. Second, we analyzed the
computational complexity of the system. In the next chapter, we provide the summary
and conclude.
115
116
Chapter 8
Conclusion and Future Work
We summarize our work in Section 8.1, review the major concepts and methods in
Section 8.2, and outline some future research in Section 8.3.
8.1 Summary
We provided a set of novel computational tools to improve the signal quality of the
physiological signals. We focused on: segmenting a multi-parameter quasiperiodic
signal in the presence of noise and transient corruption, developing a signal quality
estimate that is consistent across dierent signals, and reconstructing the segments of
low signal quality. We formulated the problems, developed the models, and evaluated
our solutions to the problems.
8.2 Contributions
We presented a novel method to jointly segment a quasiperiodic signal using template
matching. The key idea of our work is that by simultaneously considering all the
signals one can segment them more accurately than would be possible by considering
each signal independently, especially in the presence of noise. Although we tested
and applied our method only on physiological signals, the approach is generic and
can be applied to any quasiperiodic signal. Template matching uses DTW, an elastic
117
distance function, to compare variable length sequences.
In order to extend the template matching to multi-dimensional signals we intro-
duced weighted time warping (WTW) that allows the use of weights to vary the
inuence of each one-dimensional signal in the nal alignment. We use the estimated
quality of the signal for the weight. WTW measures the similarity between two
multi-parameter sequences, even when the sequences are of dierent lengths.
We developed a new method, morphological dissimilarity, to estimate the quality
of the signal at the granularity of a segment (heartbeat). We use morphological
dissimilarity evaluated by an elastic distance function (LCSS) on the gradient of the
signal to determine the signal quality. The signal quality Estimate obtained from the
morphological dissimilarity is also comparable across dierent types of signals (ECG,
ABP, and PPG).
We approached the problem of reconstruction as an unsupervised learning and
discovery problem. We provided methods to learn and archive useful morphological
features that can be used later in reconstruction. We provided methods to convert
the variable length sequence into a xed length feature vector.
We also demonstrated the eectiveness of our method. In the experiments car-
ried out on the MIT-BIH Arrhythmia Database, a multi-parameter ECG database
with many clinically signicant arrhythmias, our method improved the classication
accuracy of the beat type by more than 700% on signals corrupted with white Gaus-
sian noise, and increased the similarity to the original signal, as measured by the
normalized residual distance, by more than 250%. When the method was applied to
the multi-parameter physiological signal data from Cinc Challenge 2010 database at
Physionet.org, our method improved the classication accuracy of the beat type by
more than 3300% on a signal corrupted with white Gaussian noise, and increased the
similarity to the original signal by more than 280%.
118
8.3 Future Work
In the near future, we hope to evaluate the inuence of the choice of design parameters
on the performance of the system. These parameters include template update rate,
length of the template database, signal quality estimate thresholds, etc. We set the
parameters by locally optimizing the performance on MIT-BIH Arrhythmia dataset.
First, we could like to nd globally optimal values for these parameters through an
exhaustive search on the same dataset. Second, we would like to test whether the
same values give the best performance on another dataset, for example, on MIMIC
II dataset.
We would also like to address a limitation of our present approach. Our method
handles severe corruptions better than mild corruptions. The process of reconstruc-
tion is based on a binary decision, i.e., the method either decides to reconstruct a
segment or chooses to leave it as it is. When the signal is mildly corrupted, for
instance, at SNR 10dB as in Section 5.4.2, our method chooses not to reconstruct
the signal. We would like to explore alternative approaches that work well on mild
corruptions.
In our framework, we do not consider the temporal relationships between seg-
ments. Our hypothesis is that the correlated signals contain sucient information to
reconstruct the corrupted segment. Therefore, we do not map any temporally distant
relationships between segments. We segment the signals to limit the error propaga-
tion, and hence, the temporal relationship between samples is limited to individual
segments. We would like to investigate the eectiveness of incorporating temporal
relationships between segments.
We hope to investigate the utility of transfer learning across patients. For instance,
we would like to test the eectiveness of beginning the reconstruction process with a
generic template database. We would then adapt to the current record by learning
new morphological relationships in the current record. This is useful at the beginning
of the reconstruction process when the database would be otherwise empty. The
design of our framework supports this extension.
119
We tested our framework on ECG, ABP and PPG signals. We believe they are ap-
plicable to CVP and ICP as well. We hope to try our methods on other quasiperiodic
multi-parameter signals to evaluate its usefulness.
In building our framework, we developed a set of tools, such as WTW, MD, and
feature based indirection. We believe they will be useful in other areas as well. We
would like to explore their use in areas such as video segmentations, multichannel
audio segmentation, multi-parameter sensor data, etc.
8.4 Conclusion
The fundamental premise of this thesis is that given a multi-parameter signal, in-
formation in some signals can be used to infer the value of other signals. Based on
this premise, we developed methods to jointly process a set of multiple signals, and
improve their signal quality.
The key technical idea is to use templates to capture the relationships between the
morphologies of correlated signals. These templates are learned on a patient-specic
basis and continue to evolve as data is processed.
We showed that the joint segmentation could accurately segment a multi-parameter
signal, even when each of the individual channels are so corrupted that they cannot
be individually segmented. We also showed that correlated signals can be used to
accurately reconstruct corrupted signals.
Our results strongly suggest that our initial premise is valid and that templates
are indeed a good way to capture relationships among correlated signals. They also
suggest that it is critical that the database of templates be allowed to evolve over
time, perhaps because the signals themselves evolve over time.
120
Bibliography
[1] A Aboukhalil, L Nielsen, M Saeed, RG Mark, and GD Cliord. Reducing false
alarm rates for critical arrhythmias using the arterial blood pressure waveform.
Journal of biomedical informatics, 41(3):442451, 2008.
[2] MG Akritas. Linear regression for astronomical data with measurement errors
and intrinsic scatter. arXiv.org, 1996.
[3] A Andoni and P Indyk. Near-optimal hashing algorithms for near neighbor prob-
lem in high dimensions. Proceedings of the Symposium on Foundations of . . . ,
2006.
[4] L Arge, M de Berg, and HJ Haverkort. The priority R-tree: A practically ecient
and worst-case optimal R-tree. In Proceedings of the 2004 . . . , 2004.
[5] D Basak and S Pal. Support vector regression. Neural Information Processing-
. . . , 2003.
[6] Hanqing Cao, Larry Eshelman, Nicolas Chbat, Larry Nielsen, Brian Gross, and
Mohammed Saeed. Predicting ICU hemodynamic instability using continuous
multiparameter trends. Engineering in Medicine and Biology Society, 2008.
EMBS 2008. 30th Annual International Conference of the IEEE, pages 3803
3806, 2008.
[7] N Cesa-Bianchi. Analysis of two gradient-based algorithms for on-line regression.
In Proceedings of the tenth annual conference on . . . , 1997.
[8] Samprit Chatterjee and Ali S Hadi. Regression analysis by example. LibreDigital,
2006.
[9] L Chen, MT

Ozsu, and V Oria. Robust and fast similarity search for mov-
ing object trajectories. Proceedings of the 2005 ACM SIGMOD international
conference on Management of data, pages 491502, 2005.
[10] KL Cheung. Enhanced nearest neighbour search on the R-tree. ACM SIGMOD
Record, 1998.
[11] G Cliord, F Azuaje, and P McSharry. Advanced methods and tools for ECG
data analysis. gbv.de, 2006.
121
[12] Qiao Li Cliord and Gari D. Suppress False Arrhythmia Alarms of ICU Monitors
Using Heart Rate Estimation Based on Combined Arterial Blood Pressure and
Ecg Analysis. pages 13, mar 2008.
[13] AV Deshmane. False Arrhythmia Alarm Suppression Using ECG, ABP, and
Photoplethysmogram. 2009.
[14] Elena Deza. Encyclopedia of Distances.
[15] H Ding, G Trajcevski, P Scheuermann, X Wang, and E Keogh. Querying and
mining of time series data: experimental comparison of representations and dis-
tance measures. Proceedings of the VLDB Endowment, 1(2):15421552, 2008.
[16] AL Edwards. An introduction to linear regression and correlation, 1984.
[17] Z Fejzo. Adaptive Laguerre-lattice lters. Signal Processing, 2002.
[18] E Gil and J Mara Vergara. Detection of decreases in the amplitude uctuation
of pulse photoplethysmography signal as indication of obstructive sleep apnea
syndrome in children. Biomedical Signal Processing and . . . , 2008.
[19] A Gionis and P Indyk. Similarity search in high dimensions via hashing. In
Proceedings of the 25th International . . . , 1999.
[20] M Gleicher and L Kovar. Automated Extraction and Parameterization of Mo-
tions in Large Data Sets. ACM transactions on graphics, 2004.
[21] A L Goldberger, L A N Amaral, L Glass, J M Hausdor, P Ch Ivanov, R G
Mark, J E Mietus, G B Moody, C-K Peng, and H E Stanley. PhysioBank,
PhysioToolkit, and PhysioNet: Components of a New Research Resource for
Complex Physiologic Signals. Circulation, 101(23):e215e220, 2000. Circulation
Electronic Pages: http://circ.ahajournals.org/cgi/content/full/101/23/e215.
[22] J Guttag and Z Syed. Computational methods for physiological data.
dspace.mit.edu, 2009.
[23] JD Hamilton. Time series analysis. Princeton Univ Pr, 1994.
[24] Patrick S Hamilton and Willis J Tompkins. Quantitative Investigation of QRS
Detection Rules Using the MIT/BIH Arrhythmia Database. Biomedical Engi-
neering, IEEE Transactions on, BME-33(12):11571165, 1986.
[25] A Hartmann. Reconstruction of missing cardiovascular signals using adaptive
ltering. Computing in Cardiology, 2010.
[26] T Heldt, B Long, GC Verghese, P Szolovits, and RG Mark. Integrating Data,
Models, and Reasoning in Critical Care. Engineering in Medicine and Biology
Society, 2006. EMBS 06. 28th Annual International Conference of the IEEE,
pages 350353, 2006.
122
[27] WJ Hemmerle. An explicit solution for generalized ridge regression. Technomet-
rics, 1975.
[28] Zhang Hengfei, Zeng Zhiyuan, Tan Xiaojun, and Chen Jixiong. 2010 2nd Inter-
national Conference on Advanced Computer Control. In 2010 2nd International
Conference on Advanced Computer Control, pages 489493. IEEE.
[29] A Henrich. The LSDh-tree: An access structure for feature vectors. Data Engi-
neering, 1998.
[30] G Hinton. Learning multiple layers of representation. Trends in cognitive sci-
ences, 2007.
[31] B Hjorth. The physical signicance of time domain descriptors in EEG analysis.
Electroencephalography and Clinical Neurophysiology, 1973.
[32] E Keogh and CA Ratanamahatana. Exact indexing of dynamic time warping.
Knowledge and Information Systems, 7(3):358386, 2005.
[33] EJ Keogh and MJ Pazzani. Derivative dynamic time warping. First SIAM
international conference on data mining, 2001.
[34] B-U Kohler, C Hennig, and R Orglmeister. The principles of software QRS
detection. Engineering in Medicine and Biology Magazine, IEEE, 21(1):4257,
2002.
[35] L Kovar and M Gleicher. Automated extraction and parameterization of motions
in large data sets. ACM Transactions on Graphics (TOG), 2004.
[36] B Kulis. Implicit online learning. Proc Intl Conf Machine Learning, 2010.
[37] LJ Latecki, V Megalooikonomou, Qiang Wang, R Lakaemper, CA Ratanama-
hatana, and E Keogh. Partial elastic matching of time series. Data Mining, Fifth
IEEE International Conference on, pages 4 pp. EP, 2005.
[38] Q Li, R Mark, and G Cliord. Robust heart rate estimation from multiple
asynchronous noisy sources using signal quality indices and a Kalman lter.
Physiological measurement, 2008.
[39] Y Li, Y Sun, P Sondhi, L Sha, and C Zhai. Reconstructing Missing Signals in
Multi-Parameter Physiologic Data by Mining the Aligned Contextual Informa-
tion.
[40] RG Mark and M Saeed. Temporal pattern recognition in multiparameter ICU
data. 2007.
[41] GB Moody. The physionet/computing in cardiology challenge 2010: Mind the
gap. Computing in Cardiology, pages 305308, 2010.
123
[42] MD Morse and JM Patel. An ecient and accurate method for evaluating time
series similarity. Proceedings of the 2007 ACM SIGMOD international conference
on Management of data, pages 569580, 2007.
[43] C Myers, L Rabiner, and A Rosenberg. Performance tradeos in dynamic time
warping algorithms for isolated word recognition. Acoustics, Speech and Signal
Processing, IEEE Transactions on, 28(6):623635, 1980.
[44] M Oenger. Monitoring transient repolarization segment morphology deviations
in Mouse ECG. dspace.mit.edu, 2006.
[45] Jiapu Pan and Willis J Tompkins. A Real-Time QRS Detection Algorithm.
Biomedical Engineering, IEEE Transactions on, BME-32(3):230236, 1985.
[46] Rao. Linear Statistical Inference and Its Applications, feb 1998.
[47] R Rodrigues. Filling in the Gap: a General Method using Neural Networks.
Computing in Cardiology, 37, 2010.
[48] Mohammed Saeed and Roger Mark. A novel method for the ecient retrieval of
similar multiparameter physiologic time series using wavelet-based symbolic rep-
resentations. AMIA Annual Symposium proceedings / AMIA Symposium AMIA
Symposium, pages 679683, 2006.
[49] I Silva. PhysioNet 2010 Challenge: A Robust Multi-Channel Adaptive Filtering
Approach to the Estimation of Physiological Recordings. Computing in Cardi-
ology, 37, 2010.
[50] A Smola. A tutorial on support vector regression. Statistics and computing,
2004.
[51] JX Sun, AT Reisner, and RG Mark. A signal abnormality index for arterial
blood pressure waveforms. Computers in Cardiology, 2006, pages 1316, 2008.
[52] L Tarassenko, A Hann, and D Young. Integrated monitoring and analysis for
early warning of patient deterioration. Medical Applications of Signal Processing,
2005. The 3rd IEE International Seminar on (Ref. No. 2005-1119), pages 6468,
2006.
[53] AN Tichonov. Solutions of ill-posed problems. Vh Winston, 1977.
[54] A N Tikhonov and Andre Nikolaevich Tikhonov. Ill-posed problems in natural
sciences. proceedings of the International Conference held in Moscow, August
19-25, 1991. Vsp, dec 1992.
[55] J Urrusti and W Tompkins. Performance evaluation of an ECG QRS complex
detection algorithm. PROC ANNU CONF ENG MED BIOL, 1993.
124
[56] Michail Vlachos, Marios Hadjieleftheriou, Dimitrios Gunopulos, and Eamonn
Keogh. Indexing Multidimensional Time-Series. The VLDB Journal The
International Journal on Very Large Data Bases, 15(1), 2006.
[57] J Wang. A new method for evaluating ECG signal quality for multi-lead ar-
rhythmia analysis. COMPUTERS IN CARDIOLOGY, 2002.
[58] R Weber and HJ Schek. A quantitative analysis and performance study for
similarity-search methods in high-dimensional spaces. In Proceedings of the In-
ternational . . . , 1998.
[59] SM Wu. Time Series and System Analysis Modeling and Applications, 1979.
[60] G UDNY YULE. On a method of investigating periodicities in disturbed series,
with special reference to Wolfers sunspot numbers., Philos. Transactions (A),
1927.
[61] Feng Zhou, F Torre, and JK Hodgins. Aligned Cluster Analysis for temporal
segmentation of human motion. Automatic Face & Gesture Recognition, 2008.
FG 08. 8th IEEE International Conference on, pages 17, 2008.
[62] W Zong and G Moody. A robust open-source algorithm to detect onset and
duration of QRS complexes. COMPUTERS IN CARDIOLOGY, 2004.
[63] W Zong, G Moody, and R Mark. Reduction of false blood pressure alarms by
use of electrocardiogram blood pressure relationships. Computers in Cardiology
1999, pages 305308, 1999.
125

You might also like