Systolic Blood Pressure Estimation from Electrocardiogram and Photo Plethysmogram Signals Using Convolutional Neural Networks
2 School of Innovation, Design, and Engineering, Mälardalen University, Västerås, Sweden
Received Date: May 15, 2020 / Accepted Date: Jul 03, 2020 / Published Date: Jul 10, 2020
Citation: Moghaddam Mansouri SR, Gholamhosseini H, Lowe A, Linden M (2020). Systolic Blood Pressure Estimation from Electrocardiogram and Photoplethysmogram Signals Using Convolutional Neural Networks. EJBI. 16(2): 18-23 DOI: 10.24105/ejbi.2020.16.2.18-23
This open-access article is distributed under the terms of the Creative Commons Attribution Non-Commercial License (CC BY-NC) (http://creativecommons.org/licenses/by-nc/4.0/), which permits reuse, distribution and reproduction of the article, provided that the original work is properly cited and the reuse is restricted to noncommercial purposes. For commercial reuse, contact [email protected]
Background: Digital continuous blood pressure (BP) monitoring is increasingly being used in clinical and remote settings. Although it could significantly help clinicians in vital signs monitoring, the analyzing of such amount of BP data is challenging. Objective: This study is aimed to investigate the feasibility of applying deep convolutional neural network (CNN) to the estimation of the systolic blood pressure (SBP) using electrocardiogram (ECG) and photo plethysmography (PPG) signals.
Method: A total of 62500 ECG and PPG signals, sampled at 125 Hz, with 250 corresponding SBP, sampled at 1 Hz, were selected from Medical Information Mart for Intensive Care (MIMIC-III) Waveform Database. The collected signals from 22 subjects were divided into training (80%) and testing (20%) datasets. A CNN-based model was designed with five convolutional layers, one fully connected layer, and one regression layer to predict the SBP. Two different methods of applying data to the input of the CNN model was evaluated. In the first method, the continuous wavelet transform of the data was used while in the second method, the raw ECG and PPG signals without preprocessing were used as the input dataset.
Results: The results showed a high accuracy of 87.42% for the first method and 90.31% for the second method. Moreover, the mean square errors of 5.43 mmHg and 4.82 mmHg were measured for first and second models, respectively.
Conclusions: Both methods are capable of learning how to extract relevant features from the ECG and PPG signals and estimate SBP within an acceptable error margin set by Association for the Advancement of Medical Instrumentation (AAMI).
Blood pressure (BP) is a vital physiological parameter of the human body that can develop a highly prevalent risk factor worldwide . BP could be varying over time due to physical condition, physiological rhythm, environmental conditions, and so many other factors. Early detection of diseases related to high BP could dramatically decrease disability and mortality rate by maintaining control at the earlier stages of the disease. Continuous BP monitoring is one of the most promising and accurate methods to monitor BP through home-based or clinically measurement . Moreover, there is a high demand for global healthcare toward a wearable sensor and personalized medicine. Thus, the wearable continuous BP devices give the chances of early diagnosis and proper clinical treatments for cardiovascular patients .
The most accurate and clinically accepted continuous BP measurement methods mostly include arterial tonometry tension and volume compensation parameters, and pulse wave velocity (PWV) parameters . The arterial tonometry method applicable to radial artery and does not need calibration and accurate for longtime monitoring; however, the utilize sensor could be so sensitive to motion artifacts . Employing volume compensation-based method for BP monitoring devices required a reference pressure setting which makes the measurement complicated. Moreover, the patients will experience considerable pains due to continuous pressure in venous congestion . The relationship between blood pressure and PWV was first theoretically validated by Moens and Korteweg (M-K) . The PWV method achieved a more straightforward design for BP measuring devices and solved the somberness of patient’s comfortability in long-term monitoring.
With recent developments in artificial intelligence as applied to healthcare, the potential in achieving significant improvements and providing real-time, and better personalized medical devices at lower costs is increasing . The researchers have recognized the remarkable potential of artificial intelligence in healthcare to improve the people wellbeing towards cuff-less BP measurement [8-12]. Considering the ability of machine learning to learn the function of the complex system makes it a promising method for BP estimation. A well-trained model can address the latent affecting parameters that cannot be measured in the analytical model. The main idea is to use machine learning to extract surrogate cardiovascular features from time-domain or frequency-domain of physiological signals, then train the data with a machine learning-based model, and finally estimate the BP through the developed model .
Deep learning is a subfield of machine learning that has a powerful ability to handle supervised and unsupervised learning, besides solve different type of classification and identification problems. Deep learning has shown promising results in object detection , speech recognition , and image recognition . This technique also used in biomedical technologies such as risk assessment for hypertension [17,18], and echocardiography images analysis [19,20]. But to the availability of a vast amount of medical data, deep learning is considered as a powerful tool in the mining of medical data collected from monitoring and wearables devices. In general, deep belief networks (DBN), stacked auto encoders (SAE), and CNN are typical structures of deep learning models .
In this study, a CNN-based architecture to estimate the SBP from ECG and PPG signals was designed in order to eliminate the engineered feature extraction process. This paper is an extension of work initially presented at the pHealth-2019 conference . We increased the size of the training data, modified the CNN design, and compared the new models with selected studies in the same field.
Materials and Methods
The database has been used for this study is Medical Information Mart for Intensive Care (MIMIC-III) Waveform Database Matched Subset , which is a subset of the MIMIC-III Waveform Database. This database is included in multiple physiologic waveforms and numeric time series of vital sign measurements. It contains 22,317 waveform records, and 22,247 numeric records collected from 10,282 distinct ICU patients that have been matched and time-aligned with MIMIC-III Clinical Database records.
To evaluate both proposed methods, we chose subjects with available ECG, PPG and SBP signals, and after time synchronization, the related waveforms were extracted to use as training and testing data. The data collected form a total of 22 patients in ICU. Ultimately, after removing the missing and uncompleted data, 62500 ECG and PPG samples, sampled 125 times per second, with 250 corresponding SBP, sampled one time per second were collected for the experiment.
Convolutional Neural Networks (CNN) Architecture
Deep learning has shown promising results in healthcare applications such as risk assessment for hypertension [17,18], and echocardiography images analysis [19,20]. CNN can be categorized as a class of deep neural network , and it is known as the most popular technique in deep learning . Like other deep learning models, CNN is a hierarchical machine learning tool, commonly consists of input and output layers as well as multiple hidden convolutional layers in sequence. Inspired by the success of CNN in the fields of speech analysis and recognition as well as emotion recognition [26-29], the CNN-based architecture has been applied to continuous and cuff-less BP monitoring in this study. One of the significant benefits of such method is the ability of CNNs to perform perception tasks, which allows them to learn the BP relevant features from ECG and PPG signals and skip the complicated feature extraction step.
Although there are a variety of pre-trained CNN networks such as AlexNet, GoogLeNet, and ResNet, we designed a CNN from scratch due to the complexity of the selected training dataset. The software environment used for this research was MATLAB R2020-a with Intel Core i7-6700 GPU.
We developed two methods based on CNN to estimate BP. The first method employs continuous wavelet transform (CWT) of the data while in the second method, the raw ECG and PPG signals are used without preprocessing as the input data.
The First Method
In this method, the CWT of the selected ECG and PPG dataset was calculated and presented as the input to the proposed CNN for estimating the SBP. The scalogram of the ECG and PPG signals was created using CWT and employed to train and test the proposed CNN model.
ECG and PPG Scalograms using CWT: The CWT is one of the effective techniques that performs time-frequency analysis of the desired signals. This method is suitable for extracting highfrequency components of the signals in a short period. The CWT of a signal X(t) ∈ L2(R) could be expressed as:
where X(t) is the signal in the time domain, ‘∗’ operator denotes the complex conjugate and ψ(t) is the mother wavelet scaled by a factor a, a>0, and dilated by a factor b.
The absolute value of the CWT coefficients of a signal can be considered as the scalogram, which is a function of time and frequency. Since the ECG is related to heart activity and PPG is a fusion of the microcirculation system that detects the changes in blood volume, their time-frequency domain parameters are essential in this study. Therefore, CWT is used to produce
Red-Green-Blue (RGB) images from the selected ECG and PPG signals. Each cardiac cycle converted to a scalogram over both time and frequency, which can locate different frequency components of the signals.
The absolute value of CWT was obtained using the analytic Morse wavelet through the Wavelet toolbox of MATLAB with the symmetry parameter (gamma) equal to 3 and the timebandwidth product equal to 60. Each scalogram was generated in size of 41×250×1- pixel RGB image. In total, 500 scalogram images were generated from time-synchronized ECG and PPG as well as 250 SBP corresponding reading.
A sample of ECG, and PPG and its corresponding scalograms created for this study are illustrated in Figure 1.
The Proposed CNN Architecture and Training Model: The proposed CNN has five convolutional layers (C1 to C5), one fully connected layer, and one regression layer. The ECG and PPG scalograms passed through the proposed network to extract the relevant features. The training data sorted as 41×250×2 array, where 41 is the height and 250 is the width of the pixels in the grey-scaled images, and 2 is numbers of channels (ECG and PPG). The first convolutional layer (C1) has eight kernels of size (3×3×1) with applied at stride setting of 2 pixels. It is followed by batch normalization layer with eight channels to normalize the prediction of the network when training starts. A ReLU layer used to improve the efficiency of the training process and an average pooling layer with a stride of two implied respectively. The second convolutional layer (C2) has 16 kernels of size (3×3×8) which applied to the input with a stride of two pixels. Correspondingly, followed by batch normalization, ReLU, and average pooling layers. The third convolutional layer (C3) has 32 kernels of size (3×3×16) and followed as the previous one. The fourth convolutional layer (C4) has 32 kernels of size (3×3×32), followed by batch normalization, ReLU, and average pooling layers. The last convolutional layer (C5) has 64 kernels of size (3×3×64) and followed by a fully connected layer of one neuron. To reduce the overfitting, a dropout layer that performs regularization with a dropout ratio of 20% is used before fully connected layer. A regression layer is used to estimate the BP from the final fully connected layer in Figure 2.
The Second Method
In the second method, we used raw ECG and PPG signals without preprocessing as the input data. Although the PPG signals are noisy, and the preprocessing need to be considered, the PPG signals from the MIMIC III database are highly filtered and further preprocessing is not required.
The same architecture for the CNN as described in the previous section repeated for this experiment. During the training process, the convolutional layers of the network only detect part of training data that resemble different features. While, the first convolutional layers of network detecting very low-level features, the last layers are more sensitive to complex level features. A fully connected layer at the end of the network takes the high-level features learned in the earlier layers of CNN.
A total of 62500 samples of ECG and PPG signals along with synchronized 250 SBP reading were extracted from MIMIC III waveform dataset for this experiment.
Comparison and Discussion
The experimental results demonstrated a low error rate with high accuracy, confirming that the proposed CNN models are suitable and feasible where the features are not immediately apparent or difficult to extract. Moreover, the preliminary results illustrated an increase in the performance if we employ raw signals for training data compared to using the scalograms of the data.
Also, the experimental results were compared with selected works in the literature. Table 1, summarizes the comparison results of the proposed two methods with two previous works of the authors for SBP estimation. The first column shows the SBP estimation model used and the following column illustrates the error rate, where the last column indicates the use of engineered feature extraction. From Table 1, it is clear that the proposed CNN-based-SBP estimation models showed the least errors in compare to the other methods. The engineered features extraction approach was implemented for the other two studies. Still, we did not employ such a technique as our network learns to extract the related features from ECG and PPG signals intelligently. Accordingly, signal preprocessing and complex engineering feature extraction are not required for the proposed methods, which decrease the complexity and improve the processing time.
|Model||RMSE||Engineered Feature extraction|
|PTT (Pulse Transit Time) ||11||Yes|
|Pulse wave analysis (PWA) and pulse arrival time (PAT) ||10.6||Yes|
|The first method (previous study)||7.79||No|
|The second method (previous study)||8.1||No|
Table 1: Comparison of our modified model with our previous work.
Moreover, a comparison between this study and our previous study  has been conducted. In the current study, we created a larger database and collected 100 more scalograms images for method one and 12500 more samples for method two. As the complexity of the new approach increased, we added one more convolutional layer to the CNN model to achieve better results. The first 80% of data used for training and the rest used for testing. Table 2, illustrates the results of both methods and shows the number of samples. It confirms that this study, which used a larger database and modified CNN structure, outperforms the previous study.
|Model||RMSE||Accuracy||Number of samples|
|First method (this study)||5.43||87.42||500 samples|
|Second method (this study)||4.82||90.31||62500 samples|
|First method (previous study) ||7.79||82.76||400 samples|
|Second method (previous study) ||8.1||89.66||50000 samples|
Table 2: Comparison of the current and previous study results with a different number of samples.
Moreover, we compared the results with one of the latest study , which employed extensive signal processing and feature engineering. The authors extracted a total of 39 features from time domain and six features from frequency, then normalized the features before training their models. They employed three types of machine learning methods to estimate the BP and calculated the RMSE for SBP. It is reported that the RMSE of 5.86 mmHg, 9.50 mmHg, and 10.30 mmHg were calculated for the random forest, support vector machine, and k-nearest neighbors respectively. Compared to the RMSE of 5.43 mmHg obtained for our first method, and 4.82 for the second method, we have achieved better results with less error.
In this paper, we investigated the ability of CNN for continuous SBP monitoring and compared its performance with other classical methods. This study presented two deep learning estimation methods using the scalogram of signals and raw signals. It is important to remark that the proposed methods eliminate the engineered or manual feature extraction step and has low computational complexity and faster training time.
The proposed methods can be employed in wearable and smart devices and integrated with cloud computing technology. The wearable devices could collect signals and information, then the cloud platforms can process the data using embedded CNN models. So, it can play a major role in continuous Blood Pressure monitoring and early diagnosis of hypertension, leading to a decrease in the rate of morbidity and mortality associated with hypertension diseases.
Although the proposed method showed promising results and outstanding potential in BP monitoring, there is still room to improve the accuracy and performance of this study. Even though we increased the training data, a larger database will be beneficial for such models. Further validation and optimization with clinical databases will be considered for future work.
- Banet MJ, Zhou Z, Dhillon MS, Kopotic RJ, Terry AS, Visser IH. Vital sign monitor for measuring blood pressure using optical, electrical and pressure waveforms. Google Patents; 2013.
- Zhang Q, Zeng X, Hu W, Zhou D. A machine learning-empowered system for long-term motion-tolerant wearable monitoring of blood pressure and heart rate with ear-ECG/PPG. IEEE Access. 2017; 5: 10547-10561.
- Guk K, Han G, Lim J, Jeong K, Kang T, Lim E, et al. Evolution of wearable devices with real-time disease monitoring for personalized healthcare. Nanomaterials. 2019; 9(6): 813.
- Li X, Li S, Fang Z, Zhou Q. Noninvasive continuous blood pressure estimation algorithm based on features of pulse waves. 12th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI). 2019: 1-6.
- Wu H, Ji Z, Li M. Non-invasive continuous blood-pressure monitoring models based on photoplethysmography and electrocardiography. Sensors. 2019; 19(24): 5543.
- Jadooei A, Zaderykhin O, Shulgin V. Adaptive algorithm for continuous monitoring of blood pressure using a pulse transit time. IEEE XXXIII International scientific conference electronics and nanotechnology (ELNANO). 2013: 297-301.
- Ahmed Z, Mohamed K, Zeeshan S, Dong X. Artificial intelligence with multi-functional machine learning platform development for better healthcare and precision medicine. Database. 2020.
- Kachuee M, Kiani MM, Mohammadzade H, Shabany M. Cuff-less high-accuracy calibration-free blood pressure estimation using pulse transit time. IEEE International Symposium on Circuits and Systems (ISCAS). 2015; 1006-1009.
- Li Y, Wang Z, Zhang L, Yang X, Song J. Characters available in photoplethysmogram for blood pressure estimation: beyond the pulse transit time. Australasian Phys Eng Sci Med. 2014; 37(2): 367-376.
- Monte-Moreno E. Non-invasive estimate of blood glucose and blood pressure from a photoplethysmograph by means of machine learning techniques," Artificial intelligence in medicine. 2011; 53(2): 127-138.
- Xing X, Sun M. Optical blood pressure estimation with photoplethysmography and FFT-based neural networks. Biomedical Optics Express. 2016; 7(8): 3007-3020.
- Ruiz-Rodríguez JC, Ruiz-Sanmartín A, Ribas V, Caballero J, García-Roche A, Riera J, et al. Innovative continuous non-invasive cuffless blood pressure monitoring based on photoplethysmography technology. Intensive care medicine. 2013; 39(9): 1618-1625.
- Miao F, Fu M, Ting Y, Ding XR, Hong X, He Q, et al. A novel continuous blood pressure estimation approach based on data mining techniques. IEEE Journal of Biomedical and Health Informatics. 2017; 21(6): 1730-1740.
- Zhao ZQ, Zheng P, Xu ST, Wu X. Object detection with deep learning: A review," IEEE transactions on neural networks and learning systems. 2019; 30(11): 3212-3232.
- Nassif AB, Shahin I, Attili I, Azze M, Shaalan K. Speech recognition using deep neural networks: A systematic review. IEEE Access. 2019; 7: 19143-19165.
- Antipov G, Baccouche M, Berrani SA, Dugelay JL. Effective training of convolutional neural networks for face-based gender and age prediction. Pattern Recognition. 2017; 72: 15-26.
- Rodrigues JC, Amadu AM, Dastidar AG, McIntyre B, Szantho GV, Lyen S, et al. ECG strain pattern in hypertension is associated with myocardial cellular expansion and diffuse interstitial fibrosis: a multi-parametric cardiac magnetic resonance study. European Heart J Cardiovascular Imaging. 2016; 8(4): 441-450.
- Saeki M, Sato N, Kawasaki M, Tanaka R, Nagaya M, WatanabeT, et al. Left ventricular layer function in hypertension assessed by myocardial strain rate using novel one-beat real-time three-dimensional speckle tracking echocardiography with high volume rates. Hypertension Res. 2015; 38(8): 551.
- Madani A, Arnaout R, Mofrad M, Arnaout R. Fast and accurate view classification of echocardiograms using deep learning," npj Digital Medicine. 2018; 1(1): 6.
- Everdingen WM, Maass AH, Vernooy K, Meine M, Allaart CP, Lange FJ, et al. Comparison of strain parameters in dyssynchronous heart failure between speckle tracking echocardiography vendor systems. Cardiovascular Ultrasound. 2017; 15(1): 25.
- Liang Y, Chen Z, Ward R, Elgendi M. Photoplethysmography and deep learning: enhancing hypertension risk stratification. Biosensors. 2018; 8(4): 101.
- Rastegar S, Gholamhosseini H, Lowe A, Mehdipour F, Lindén M. Estimating Systolic Blood Pressure Using Convolutional Neural Networks. Studies in health technology and informatics. 2019; 261: 143-149.
- Goldberger Al, Amaral LA, Glass L, Hausdorff JM, Ivanov PC, Mark RG, et al. PhysioBank, PhysioToolkit, and PhysioNet: components of a new research resource for complex physiologic signals. Circulation. 2000; 101(23): e215-e220.
- Teramoto A, Tsukamoto T, Kiriyama Y, Fujita H. Automated classification of lung cancer types from cytological images using deep convolutional neural networks," BioMed research international. 2017.
- Tsinalis O, Matthews PM, Guo Y, Zafeiriou S. Automatic sleep stage scoring with single-channel EEG using convolutional neural networks. Cornell University, 2016.
- Badshah AM, Ahmad J, Rahim N, Baik SW. Speech emotion recognition from spectrograms with deep convolutional neural network. 2017 International Conference on Platform Technology and Service (PlatCon). IEEE. 2017; 1-5.
- Mao Q, Dong M, Huang Z, Zhan Y. Learning salient features for speech emotion recognition using convolutional neural networks. IEEE Transactions on Multimedia. 2014; 16(8): 2203-2213.
- He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. Proceedings of the IEEE conference on computer vision and pattern recognition. 2016; 770-778.
- Sainath TN, Mohamed AR, Kingsbury B, Ramabhadran B. Deep convolutional neural networks for LVCSR. IEEE international conference on Acoustics, speech and signal processing (ICASSP). IEEE. 2013; 8614-8618.
- Nidigattu GR, Mattela G, Jana S. Non-invasive modeling of heart rate and blood pressure from a photoplethysmography by using machine learning techniques. International Conference on COMmunication Systems & NETworkS (COMSNETS). 2020: 7-12.
- Mishra B, Thakkar N. Cuffless blood pressure monitoring using PTT and PWV methods. International Conference on Recent Innovations in Signal processing and Embedded Systems (RISE) 2017; 395-401.
- Yoon YZ, Kang JM, Kwon Y, Park S, Noh S, Kim Y, et al. Cuff-less blood pressure estimation using pulse waveform analysis and pulse arrival time. IEEE journal of biomedical and health informatics. 2018; 22(4): 1068-1074.