Javascript is required
Beck, J. C. (1999). Geriatrics Review Syllabus: A Core Curriculum in Geriatric Medicine. Kendall/Hunt Publishing Company, Dubuque, IA, US. [Google Scholar]
Canayaz, M. (2021). C+EffxNet: A novel hybrid approach for COVID-19 diagnosis on CT images based on CBAM and EfficientNet. Chaos, Solitons Fractals, 151, 111310. [Google Scholar] [Crossref]
Chan, H. L., Ouyang, Y., Chen, R. S., Lai, Y. H., Kuo, C. C., Liao, G. S., Hsu, W. Y., & Chang, Y. J. (2023). Deep neural network for the detections of fall and physical activities using foot pressures and inertial sensing. Sensors, 23(1), 495. [Google Scholar] [Crossref]
Gao, W. D. & Zhao, Z. W. (2021). Gait phase recognition using fuzzy logic regulation with multisensor data fusion. J. Sens., 2021, 1–13. [Google Scholar] [Crossref]
Li, B. C., Yao, Z. M., Wang, J. G., Wang, S. N., Yang, X. J., & Sun, Y. N. (2020). Improved deep learning technique to detect freezing of gait in Parkinson’s disease based on wearable sensors. Electronics, 9(11), 1919. [Google Scholar] [Crossref]
Liang, S. Y., Liu, Y. M., Li, G. L., & Zhao, G. R. (2019). Elderly fall risk prediction with plantar center of force using convlstm algorithm. In 2019 IEEE International Conference on Cyborg and Bionic Systems (CBS), Munich, Germany (pp. 36–41). [Google Scholar] [Crossref]
Liang, S. Y., Ning, Y. K., Li, H. Q., Wang, L., Mei, Z. Y., Ma, Y. N., & Zhao, G. R. (2015). Feature selection and predictors of falls with foot force sensors using KNN-Based algorithms. Sensors, 15(11), 29393–29407. https://doi.org/ [Google Scholar] [Crossref]
Liu, X. G., Chen, M., Liang, T., Lou, C. G., Wang, H. R., & Liu, X. L. (2022). A lightweight double-channel depthwise separable convolutional neural network for multimodal fusion gait recognition. Math. Biosci. Eng., 19, 1195–1212. [Google Scholar] [Crossref]
Montanini, L., Del Campo, A., Perla, D., Spinsante, S., & Gambi, E. (2017). A footwear-based methodology for fall detection. IEEE Sens. J., 18(3), 1233–1242. [Google Scholar] [Crossref]
Pardoel, S., Shalin, G., Nantel, J., Lemaire, E. D., & Kofman, J. (2021). Early detection of freezing of gait during walking using inertial measurement unit and plantar pressure distribution data. Sensors, 21(6), 2246. [Google Scholar] [Crossref]
Peng, X. H., Feng, Y. K., Ji, S. J., Amos, J. T., Wang, W. N., Li, M., Ai, S. L., Qiu, X. Z., Dong, Y. Y., Ma, D., Yao, D. Z., Valdes-Sosa, P. A., & Ren, P. (2021). Gait analysis by causal decomposition. IEEE Trans. Neural. Syst. Rehabil. Eng., 29, 953–964. [Google Scholar] [Crossref]
Podsiadlo, D. & Richardson, S. J. (1991). The Timed “Up & Go”: A test of basic functional mobility for frail elderly persons. J. Am. Geriatrics Soc., 39(2), 142–148. [Google Scholar] [Crossref]
Raîche, M., Hébert, R., Prince, F., & Corriveau, H. (2000). Screening older adults at risk of falling with the Tinetti balance scale. The Lancet, 356(9234), 1001–1002. [Google Scholar] [Crossref]
Shalin, G., Pardoel, S., Lemaire, E. D., Nantel, J., & Kofman, J. (2021). Prediction and detection of freezing of gait in Parkinson’s disease from plantar pressure data using long short-term memory neural-networks. J. Neuroengineering Rehabil., 18(1), 1–15. [Google Scholar] [Crossref]
Shumway-Cook, A., Brauer, S., & Woollacott, M. (2000). Predicting the probability for falls in community-dwelling older adults using the Timed Up & Go Test. Phys. Therapy, 80(9), 896–903. [Google Scholar] [Crossref]
Tan, M. & Le, Q. V. (2019). EfficientNet: Rethinking model scaling for convolutional neural networks. arXiv Preprint, 11946, arXiv: 1905.11946. [Google Scholar]
Taylor, A. J., Menz, H. B., & Keenan, A. M. (2004). Effects of experimentally induced plantar insensitivity on forces and pressures under the foot during normal walking. Gait Posture, 20(3). [Google Scholar] [Crossref]
Tinetti, M. E. (1986). Performance-oriented assessment of mobility problems in elderly patients. J. Am. Geriatrics Soc., 34(2), 119–126. [Google Scholar] [Crossref]
Wang, H. X., Zhang, Z. G., Cui, X., & Cui, R. Y. (2022). Chinese-Korean Weibo sentiment classification based on pre-trained language model and transfer learning. In 2022 IEEE 2nd International Conference on Computer Communication and Artificial Intelligence (CCAI), Beijing, China (pp. 49–54). [Google Scholar] [Crossref]
Wang, Z. G. & Oates, T. (2015). Imaging time-series to improve classification and imputation. arXiv Preprint, 00327, arXiv:1506.00327. [Google Scholar] [Crossref]
Woo, S., Park, J., Lee, J. Y., & Kweon, I. S. (2018). Cbam: Convolutional block attention module. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany (pp. 3–19). [Google Scholar] [Crossref]
Yu, J. C., Gao, W. D., & Jiang, W. N. (2020). Foot pronation detection based on plantar pressure measurement. J. Phys. Conf. Ser., 1646(1), 012041. [Google Scholar] [Crossref]
Zhao, S. M., Liu, R., Fei, C. W., Zia, A. W., & Jing, L. X. (2020). Flexible sensor matrix film-based wearable plantar pressure force measurement and analysis system. PLoS One, 15(8), e0237090. [Google Scholar] [Crossref]
Search
Open Access
Research article
Special Issue

Enhancing Fall Risk Assessment in the Elderly: A Study Utilizing Transfer Learning in an Improved EfficientNet Network with the Gramian Angular Field Technique

congcong li1,2*,
yueting cai1,
laith jaafer habeeb3,
Atta-ur Rahman4,
ritzkal5
1
School of Information Science and Technology, Hebei Agricultural University, 071001 Baoding, China
2
Hebei Key Laboratory of Agricultural Big Data, Hebei Agricultural University, 071001 Baoding, China
3
Training and Workshop Center, University of Technology-Iraq, 10001 Baghdad, Iraq
4
College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, 31441 Dammam, Saudi Arabia
5
Informatics Department, Science & Engineering Faculty, Universitas Ibn Khaldun Bogor, 16162 Bogor, Indonesia
Healthcraft Frontiers
|
Volume 1, Issue 1, 2023
|
Pages 1-14
Received: 09-11-2023,
Revised: 11-14-2023,
Accepted: 11-22-2023,
Available online: 11-30-2023
View Full Article|Download PDF

Abstract:

Recent years have seen a significant increase in the incidence of falls among the elderly, leading to accidental injuries and fatalities. This trend underscores the critical need for accurate fall risk assessment, a major concern for public health and safety. In addressing this challenge, a novel approach has been developed, leveraging a pressure sensor placed on the foot's sole to gather gait data from elderly individuals. This method provides a precise analysis of gait stability on a daily basis. The research introduced here utilizes the gramian angular summation field (GASF) technique for converting this data into two-dimensional images, which are then processed using an enhanced EfficientNet model. The innovation lies in the integration of a convolutional block attention module (CBAM) into this model, resulting in a CBAM-EfficientNet algorithm. This approach includes freezing the first four stages of the EfficientNet model, focusing training on the deeper layers that incorporate CBAM. This strategy is aimed at augmenting the network's ability to extract critical features from foot pressure data, consequently improving the accuracy of fall risk classification. Experimental evaluation of this optimized model demonstrates a classification accuracy of 98.5% and a sensitivity of 99.0%, indicating its practical efficacy and strong generalization capacity. These findings reveal that the methodology significantly enhances the classification of plantar pressure data, offering valuable support in medical diagnosis and has substantial practical implications.

Keywords: EfficientNet, Convolutional block attention module, Gram angular field algorithm, Wearable devices, Fall risk, Transfer learning

1. Introduction

Global population aging presents a significant challenge, with China experiencing particularly rapid and profound demographic shifts. As per international standards, societies with over 7%-14% of the population aged 65 and above are considered aging. Recent statistics place China's aging population in the upper echelons globally, with over 190 million individuals aged 65 and above in 2022, surpassing the 14% threshold and marking China's entry into an advanced aging society. This demographic shift has heightened societal focus on the health issues of the elderly, particularly the risk of falls (L​i​a​n​g​ ​e​t​ ​a​l​.​,​ ​2​0​1​9). Accurate fall risk assessment is paramount for enhancing the health and safety of the elderly. Existing research in this domain underscores the need for a comprehensive fall prevention assessment system and effective fall prevention information management strategies to mitigate fall incidences. Clinical assessments traditionally rely on fall risk scales and tests (L​i​a​n​g​ ​e​t​ ​a​l​.​,​ ​2​0​1​5; S​h​u​m​w​a​y​-​C​o​o​k​ ​e​t​ ​a​l​.​,​ ​2​0​0​0). However, these methods necessitate medical professionals' involvement and are impractical for daily life monitoring.

Wearable sensors, especially plantar pressure sensors, offer an innovative solution, unencumbered by environmental constraints and boasting substantial concealment, making them ideal for continuous motion monitoring of the elderly. Notable contributions in this field include the system leveraging plantar pressure sensors of Y​u​ ​e​t​ ​a​l​.​ ​(​2​0​2​0​) for inversion detection, and the wearable plantar pressure measurement and analysis system (WPPFMA) of Z​h​a​o​ ​e​t​ ​a​l​.​ ​(​2​0​2​0​), and P​e​n​g​ ​e​t​ ​a​l​.​ ​(​2​0​2​1​) use of plantar pressure signals in gait health analysis. Such studies highlight the significant potential of plantar pressure data in gait analysis, though its application in fall risk assessment remains underexplored. M​o​n​t​a​n​i​n​i​ ​e​t​ ​a​l​.​ ​(​2​0​1​7​) proposed an approach using three pressure sensors on the sole, setting a threshold at one-third of the maximum pressure value during activity. The sensors' analog signals are translated into binary data, effectively delineating the gait cycle phase. G​a​o​ ​&​ ​Z​h​a​o​ ​(​2​0​2​1​) analyzed plantar pressure data from 48 subjects to study gait line symmetry and temporal consistency, achieving an 87.5% accuracy on their test dataset.

In the realm of fall risk assessment using plantar pressure data, L​i​a​n​g​ ​e​t​ ​a​l​.​ ​(​2​0​1​9​) demonstrated the potential of raw plantar force data, applying the convolutional long short-term memory (ConvLSTM) model with a notable classification accuracy of 94%. Similarly, S​h​a​l​i​n​ ​e​t​ ​a​l​.​ ​(​2​0​2​1​) explored gait freezing identification through features extracted from plantar pressure data, utilizing long short-term memory (LSTM) networks. Their model exhibited an average sensitivity of 82.1% and specificity of 89.5%. Further advancing the field, C​h​a​n​ ​e​t​ ​a​l​.​ ​(​2​0​2​3​) developed an intelligent shoe system integrating pressure and inertial sensors. They crafted a hybrid model combining convolutional neural network (CNN) and recurrent neural network (RNN) techniques, achieving a remarkable F1-score of 99.8%. These studies collectively highlight plantar pressure sensors' concealment and comfort, affirming their suitability for everyday fall risk monitoring.

Despite these advancements, challenges persist with the substantial data size generated by plantar pressure sensors, imposing significant demands on data transmission and processing. Direct use of raw sensor data as input signals for fall risk detection models often leads to high computational costs and complex model architectures, hindering mobile deployment. Moreover, the nascent stage of research in this domain means that robust feature extraction from plantar pressure signals and the enhancement of model accuracy in fall risk detection are ongoing concerns.

Addressing these issues requires the development of robust fall risk detection methods based on plantar pressure signals that also prioritize data compactness. This necessitates specific data processing and feature extraction techniques tailored for plantar pressure sensor data. In response, a model has been developed utilizing deep learning techniques. This model introduces a CBAM-EfficientNet technique, based on the EfficientNet model and augmented with the CBAM. It processes plantar pressure data transformed into the GASF format. The inclusion of the CBAM module notably enhances the network's feature extraction capabilities from plantar pressure data. Practical transfer learning is then employed to expedite network training and improve classification performance. Experimental evaluations have demonstrated the model's effectiveness in reliably extracting characteristics from plantar pressure signals.

2. Materials and Methods

2.1 Data Collection and Data Label

For this study, plantar pressure sensors were placed in the shoes of 24 elderly individuals residing in or near nursing homes to capture human plantar pressure signals. These individuals, all aged 65 or above, were selected excluding those with significant mobility impairments or conditions impacting gait. The aim was to assess fall risk through a comprehensive walking test, focusing on gait patterns and potential indicators of fall risk during various walking tasks.

The experimental setup was straightforward, requiring no special arrangements. The commencement and conclusion of each walking test were signaled by the recorder. Participants were instructed to walk at a comfortable, regular pace on flat ground, following a straight path or executing turns. Each subject repeated the walking experiment 3-4 times. The human foot was divided into five distinct areas for data analysis: the first toe (T1), the second to fifth toes (T2-5), metatarsals (M1-5), midfoot (MF), and heel (HM, HL) as delineated in Figure 1 (T​a​y​l​o​r​ ​e​t​ ​a​l​.​,​ ​2​0​0​4). However, during data collection, a significant number of missing values were observed in the T2-5 region, leading to the decision to exclude this data and focus only on the remaining foot areas.

Figure 1. Plantar partition

In the data labeling phase, a supervised learning approach was employed (Figure 2). The Timed Up and Go Test (TUGT) (P​o​d​s​i​a​d​l​o​ ​&​ ​R​i​c​h​a​r​d​s​o​n​,​ ​1​9​9​1) was initially used to categorize participants. Following the classification criteria set by P​a​r​d​o​e​l​ ​e​t​ ​a​l​.​ ​(​2​0​2​1​), 15 individuals were identified as high-risk, 5 as low-risk, and 4 were placed in a 'grey zone' where TUGT alone did not yield a clear classification. Subsequently, those in the grey zone underwent further assessment using the Tinetti Balance and Gait Scale (B​e​c​k​,​ ​1​9​9​9; R​a​î​c​h​e​ ​e​t​ ​a​l​.​,​ ​2​0​0​0; T​i​n​e​t​t​i​,​ ​1​9​8​6). The culmination of these assessments resulted in a final labeling of the data, with 17 volunteers classified as low risk and 7 as high risk for falling.

Figure 2. Data labeling flow chart
2.2 Processing of Plantar Pressure Data

In the processing of plantar pressure data, the inherent one-dimensional nature of sensor signals poses a challenge for deep learning methods, which are more adept at handling multi-dimensional data. To address this, W​a​n​g​ ​&​ ​O​a​t​e​s​ ​(​2​0​1​5​) introduced a novel approach of converting one-dimensional time series into two-dimensional images via the Gramian Angular Field (GAF) algorithm. This method effectively preserves the time-dependence and relevance of the original data.

The plantar pressure sequence data were divided utilizing the sliding window method (L​i​ ​e​t​ ​a​l​.​,​ ​2​0​2​0; L​i​u​ ​e​t​ ​a​l​.​,​ ​2​0​2​2), with each window set to a duration of 1 second and an overlap rate of 50%. This approach was chosen to ensure the intact preservation of gait characteristics in the plantar pressure data. Subsequently, the data format was transformed by the GAF algorithm, which utilizes a polar coordinate system, derived from the Kerr matrix, instead of a Cartesian system. The GAF algorithm's implementation involved three critical steps (T​a​n​ ​&​ ​L​e​,​ ​2​0​1​9):

Normalization: The time series X = {x1, x2,…xN}, containing N real-valued observations, underwent normalization to ensure that X ranged between [-1, 1].

$x_{i-1}^i=\frac{\left(x_i-\max (X)\right)+\left(x_i-\min (X)\right)}{\max (X)-\min (X)}$
(1)

Polar coordinate conversion: The normalized one-dimensional time series maintained its absolute temporal relationship using polar coordinates, where values were encoded as angle cosines and timestamps as radii.

$\left\{\begin{array}{c}\varphi_i=\arccos \left(x_i\right),-1 \leq x_i \leq 1, x_i \in X \\ r_i=\frac{t_i}{N}, t_i \in N\end{array}\right.$
(2)

where xi is each observation in the time series X, ti is the corresponding timestamp, N represents the total length of the timestamp. When these normalized values are mapped into a polar coordinate system, the cosine value of each data point, previously normalized to the range [-1, 1], is interpreted within the angular boundaries of [0, π].

GASF: The temporal correlation between different time intervals was angularly identified, following the transformation of the rescaled time series into a polar coordinate system. The GAF is defined by calculating the sum of trigonometric functions between each sample point in this new coordinate system.

$G A S F=\left\{\begin{array}{cccc}\cos \left(\varphi_1+\varphi_1\right) & \cos \left(\varphi_1+\varphi_2\right) & \ldots & \cos \left(\varphi_1+\varphi_N\right) \\ \cos \left(\varphi_2+\varphi_1\right) & \cos \left(\varphi_2+\varphi_2\right) & \ldots & \cos \left(\varphi_2+\varphi_N\right) \\ \ldots & \ldots & \ldots & \ldots \\ \cos \left(\varphi_N+\varphi_1\right) & \cos \left(\varphi_N+\varphi_2\right) & \ldots & \cos \left(\varphi_N+\varphi_N\right)\end{array}\right.$
(3)

Figure 3 illustrates the complete process of converting the rescaled time series into a coded mapping in polar coordinates. The initial step involved normalizing the time series using Eqs. (1) and (2), followed by a conversion to a polar coordinate system. The final images were obtained using the GAF method, as detailed in Eq. (3). The result of this process was the generation of 2085 high-risk samples and 826 low-risk samples, classified based on the processed plantar pressure data.

Figure 3. GASF
2.3 Fall Risk Assessment Modeling Algorithm
2.3.1 EfficientNet

CNNs have been recognized for their capacity to autonomously extract features from images through convolutional kernel manipulation. The enhancement of a network’s depth, width, and image resolution typically facilitates the acquisition of multiscale features. However, increasing network depth can lead to complexities like vanishing gradients, while augmenting network width and image resolution significantly elevates computational and memory demands. EfficientNet (W​o​o​ ​e​t​ ​a​l​.​,​ ​2​0​1​8), an innovative architecture, was developed to balance these aspects by uniformly scaling the network’s depth, width, and resolution with fixed scaling factors. With variants ranging from B0 to B9, EfficientNet allows for model parameter adjustments to suit diverse tasks and computational resources. For the task of fall risk classification using plantar pressure data, the EfficientNetB0 architecture was selected. This architecture primarily comprises Mobile Inverted Convolution (MBConv) modules, utilizing 3×3 and 5×5 convolutional kernels. Each MBConv module includes two 1×1 convolutional layers (Conv), a depthwise separable convolutional layer (DWConv), and a Squeeze-and-Excitation (SE) module, as illustrated in Figure 4. The structure of EfficientNet is detailed in Table 1, showcasing its ability to effectively extract and utilize features from plantar pressure data for fall risk classification.

Table 1. Efficientnet network architecture

Stage I

Operator F

Resolution H*W

Channels C

Layers L

1

Conv3*3

224*224

32

1

2

MBConv1, k3*3

112*112

16

1

3

MBConv6, k3*3

112*112

24

2

4

MBConv6, k5*5

56*56

40

2

5

MBConv6, k3*3

28*28

80

3

6

MBConv6, k5*5

14*14

112

3

7

MBConv6, k5*5

14*14

192

4

8

MBConv6, k3*3

7*7

320

1

9

Conv1*1 & Pooling & FC

7*7

1280

1

Figure 4. The MBConv module
2.3.2 Transfer learning

Transfer learning refers to the transfer of parameters from a trained model (pre-trained model) to a new model to facilitate training (C​a​n​a​y​a​z​,​ ​2​0​2​1; W​a​n​g​ ​e​t​ ​a​l​.​,​ ​2​0​2​2). It is predicated on the premise that many data sets or tasks share correlations, allowing for the transfer of previously learned model parameters (akin to transferring knowledge) to a new model. This accelerates and optimizes the model’s learning process, eliminating the need to learn from scratch, as is common with most networks. The transferred parameters serve as a robust feature set, reducing the reliance on new data sets and shortening training durations. In this study, the weights from the EfficientNet model, trained on a large public dataset, were utilized as pre-trained model weights for the binary classification task of fall risk. The optimal parameters of the new model were obtained through retraining, demonstrating the efficiency and effectiveness of transfer learning in this context.

2.3.3 CBAM

CBAM, as proposed by W​o​o​ ​e​t​ ​a​l​.​ ​(​2​0​1​8​), is an enhancement mechanism for CNNs, involving spatial and channel-wise attention processes. The module, depicted in Figure 5, comprises both a spatial attention module and a channel attention module, offering a plug-and-play feature that allows for integration at any point within the network without altering the input feature characteristics.

Figure 5. CBAM

The channel attention module, illustrated in Figure 6, operates on the principle of discerning inter-channel relationships. It processes the input channel features F through both Maximum Pooling (MaxPool) and Average Pooling (AvgPool) to derive MaxPool(F) (MaxPool (F)∈R1×1×c) and AvgPool(F) (AvgPool (F)∈R1×1×c) features. These are subsequently passed through a multi-layer perceptron (MLP), which performs dimensionality reduction followed by enhancement. The output of the MLP, after undergoing a sigmoid activation function, yields the channel attention feature Mc(F), as formulated in Eq. (4), where W0 and W1 represent the respective weights.

Figure 6. Channel attention module
$\begin{aligned} & M_C(F)=\operatorname{Sigmoid}(\operatorname{MLP}(\operatorname{MaxPool}(F))+(\operatorname{AvgPool}(F))) \\ & =\operatorname{Sigmoid}\left(W_1\left(W_0(\operatorname{MaxPool}(F))\right)+W_1\left(W_0(\operatorname{AvgPool}(F))\right)\right)\end{aligned}$
(4)

In parallel, the spatial attention module, shown in Figure 7, focuses on the relationships between spatial attributes to generate spatial attention maps. The input features F' undergo spatially-oriented MaxPool and AvgPool operations, resulting in MaxPool(F') (MaxPool (F')∈R1×1×c) and AvgPool(F') (AvgPool(F') ∈ R1×1×c) features. These features are first concatenated channel-wise, followed by a convolution operation with a 7×7 kernel. A subsequent sigmoid activation operation produces the spatial attention feature Ms(F), as outlined in the computational process.

Figure 7. Spatial attention module
$M_S(F)=\sigma\left(\mathrm{f}^{7 \times 7}\left(\left[\operatorname{AvgPool}\left(F^{\prime}\right) ; \operatorname{MaxPool}\left(F^{\prime}\right)\right]\right)\right)=\sigma\left(\mathrm{f}^{7 \times 7}\left(\left[F_{\text {avg }}^S ; F_{\max }^S\right]\right)\right)$
(5)

where, σ represents the sigmoid function, $F_{\text {avg }}^S$ and $F_{\max }^S$ denote the mean and maximum pooling features, respectively.

2.3.4 CBAM-EfficientNet Framework

The integration of CBAM into the EfficientNet framework represents a pivotal enhancement in the processing of plantar pressure data. CBAM, combining the channel attention module and the spatial attention module, facilitates the extraction of global feature information, thereby augmenting the network's classification accuracy. While shallow networks are proficient in extracting low-level features with high generalization capabilities, they entail higher computational costs. Deep networks, conversely, are adept at extracting more specific, finer details pertinent to particular datasets. By embedding the CBAM module within MBConv of the deep network, an emphasis on salient features of plantar pressure data is achieved. This strategic placement, post DWConv convolution, allows the network to broaden its scope in learning diverse features.

Figure 8 illustrates the structure of the MBConv module integrated with the CBAM module. Comparative experimental analyses were conducted to ascertain the optimal network layers for the application of the attention mechanism, culminating in the integration of the CBAM module into Stages 5, 6, 7, and 8 of the network architecture, as detailed in Table 1.

Figure 8. The MBConv module integrated into CBAM module

The CBAM-EfficientNet model, as proposed in this study, encompasses both transfer learning and the CBAM mechanism within the EfficientNet architecture. The initial weights of the EfficientNet network, pretrained on the ImageNet dataset, were subsequently applied to the foot pressure sensor dataset (Figure 9). The model adopts a strategy wherein the shallow layers of the network are frozen, allowing only the deeper layers to undergo weight parameter updates. This approach is predicated on the understanding that while shallow layers in convolutional networks excel in extracting low-level features with strong generalization capabilities, deeper layers are more focused on dataset-specific, finer details. Consequently, in this transfer learning classification task, only the weight parameters of the deep layers are permitted adjustment, while those of the shallow layers remain static. This methodology significantly enhances the classification performance on the two-dimensional representation of foot pressure data, effectively addressing fall risk assessment challenges. The first four stages of the network were frozen, and the subsequent feature extraction layers, inclusive of those with CBAM integration, underwent training to update weight parameters, culminating in the refined classification of fall risk assessment.

Figure 9. CBAM- Efficientnet model

3. Experiments and Analyses

3.1 Experimental Setting

The experiment was designed to evaluate human fall risk using plantar pressure data and to verify the accuracy of this approach. Initially, EfficientNet network models with and without transfer learning were trained and compared. Subsequently, the attention mechanism, specifically the CBAM module, was integrated into the MBConv module at various stages (8; 7, 8; 6, 7, 8; and 5, 6, 7, 8) of the EfficientNet network. The impact of these different configurations was then assessed through training. The final phase involved comparing the optimal model with traditional classical networks and existing methods.

The deep learning framework PyTorch 2.5.0 was utilized for model construction, supported by hardware comprising an Intel Core i9-7900 processor and NVIDIA Geforce GTX3090 graphics card. For classification purposes, the 'high' category was treated as a positive example, and the 'low' category as negative. Optimal model parameter settings, as validated through experimental testing, are presented in Table 2.

Table 2. Model parameter setting

Setting Item

Parameter Value

Learning Rate

0.1

Batch Size

32

Epoch

200

Dropout

0.2

Evaluation metrics such as accuracy, precision, sensitivity, and F1-score were employed to assess model performance.

$\text { Accuracy }=\frac{T P+T N}{T P+T N+F P+F N}$
(6)
$\text {Precision} =\frac{T P}{T P+F P}$
(7)
$\text {Sensitivity} =\frac{T P}{T P+F N}$
(8)
$\text {F1- score} =\frac{2 * Precision * Sensitivity }{ Precision + Sensitivity }$
(9)
3.2 Comparison of Transfer Learning Model Results

Figure 10 illustrates the model accuracy and loss value curves for configurations with and without transfer learning. At equivalent epochs, the model employing transfer learning exhibited higher accuracy, lower loss values, and faster convergence. Without transfer learning, accuracy peaked at approximately 93.8%, with a final loss value of 0.221. Post-transfer learning implementation, accuracy increased to 96.2%, and the loss value decreased to 0.143. Table 3 presents a comparison of the performance metrics, with transfer learning demonstrating superior results, thereby enhancing the classification performance of the fall risk assessment model and increasing classification efficiency.

(a)
(b)
Figure 10. (a) Comparison of accuracy; (b) Comparison of loss value
Table 3. Performance comparison of baseline models

Module

Precision

Sensitivity

F1-Score

EfficientNet

96.9%

96.8%

96.8%

Pre-EfficientNet

98.0%

97.2%

97.6%

3.3 Impact of Attention Mechanism Placement on Model Performance

Investigations were conducted on networks with pre-training and varying placements of the CBAM module. It was observed that models augmented with the CBAM module exhibited enhanced performance in terms of accuracy compared to the initial Pre-EfficientNet model without the module. The inclusion of the CBAM module resulted in an increase in accuracy by 0.3-2%, precision by 0.6-1.2%, and F1-score by 0.6-1.5%. These results underscore the efficacy of transfer learning in expediting network training and improving classification accuracy, particularly in scenarios with limited data.

Table 4. Classification results of different improved models

Module

Accuracy

Precision

Sensitivity

F1-Score

Pre-EfficientNetB0

96.2%

98.0%

97.2%

97.6%

Stage 8

96.5%

98.6%

97.8%

98.2%

Stage 7 and 8

96.7%

98.7%

98.0%

98.3%

Stage 6, 7 and 8

97.2%

99.1%

98.4%

98.7%

Stage 5, 6, 7 and 8

98.5%

99.2%

99.0%

99.1%

Table 4 presents the classification results of the various improved models, revealing that the network achieves optimal average evaluation metrics when the CBAM module is added at stages 5, 6, 7, and 8. The average accuracy in this configuration was recorded at 98.2%, marking a significant enhancement in the model’s classification capabilities. This improvement can be attributed to the CBAM attention mechanism's focus on crucial features in plantar pressure data from both spatial and channel dimensions. Figure 11 compares the accuracy and loss value variation curves of the optimal model (CBAM added at stages 5, 6, 7, 8) with the base model (Pre-EfficientNet). The optimal model demonstrated higher accuracy and lower loss values, stabilizing at an epoch of approximately 110, thereby confirming the added value of CBAM in learning pivotal features to improve classification accuracy.

(a)
(b)
Figure 11. (a) Comparison of model accuracy with or without CBAM; (b) Comparison of model loss values with and without CBAM
3.4 Evaluation of Model Effectiveness

Comparative experiments were conducted with classical networks such as ShuffleNet, ResNet50, and ResNet101. Figure 12 showcases the variation in accuracy among these models. The CBAM-EfficientNet approach achieved higher accuracy and faster convergence compared to other classical networks, attributed to the efficiencies brought by transfer learning and the inherent advantages of the EfficientNet network architecture (uniform tuning of network width, depth, and accuracy) complemented by the CBAM mechanism for extracting significant features from plantar pressure data. Table 5 displays the evaluation metrics for the classical network models and the proposed method in this study. The CBAM-EfficientNet method attained the highest accuracy at 98.5%, exhibiting superior performance in precision, sensitivity, and F1-score. Additionally, this model is characterized by a reduced number of parameters, thereby necessitating fewer resources.

Table 5. Classification results of different models

Module

Accuracy

Precision

Sensitivity

F1-Score

CBAM-EfficientNet

98.5%

99.2%

99.0%

99.1%

Shufflenet

91.1%

96.1%

95.0%

95.3%

Resnet50

94.3%

98.3%

96.0%

97.0%

Resnet101

93.9%

98.2%

95.5%

96.7%

(a)
(b)
Figure 12. (a) Comparison of accuracy with classical network models; (b) Comparison of loss values with classical network models
3.5 Comparison with Contemporary Research Findings

A comparative analysis was conducted to evaluate various methods used in current research for assessing fall risk through plantar pressure sensors, as summarized in Table 6. L​i​a​n​g​ ​e​t​ ​a​l​.​ ​(​2​0​1​9​) applied the ConvLSTM method to extract spatiotemporal features from raw plantar pressure data, demonstrating its effectiveness in predicting fall risk among the elderly and establishing a correlation between heel force and fall risk. S​h​a​l​i​n​ ​e​t​ ​a​l​.​ ​(​2​0​2​1​) focused on the efficacy of gait freeze detection and prediction using plantar pressure sensors, validating the suitability of LSTM networks for this type of data. C​h​a​n​ ​e​t​ ​a​l​.​ ​(​2​0​2​3​) explored fall detection using a hybrid CNN and RNN model, proving the model's applicability to both plantar pressure and inertial sensor data.

The comparative experiments, conducted on the dataset assembled in this study, indicate that the proposed CBAM-EfficientNet model outperforms existing methods in accuracy, precision, sensitivity, and F1-score, while also being more resource-efficient. Specifically, when compared to ConvLSTM, improvements of 6.2% in accuracy, 2.6% in precision, 3.4% in sensitivity, and 2.9% in F1-score were observed. Relative to LSTM, enhancements in accuracy, precision, sensitivity, and F1-score were 6.7%, 2.8%, 3.8%, and 3.4%, respectively. Compared with the hybrid CNN and RNN model, the CBAM-EfficientNet model showed an increase of 4.2% in accuracy, 1.6% in precision, 2.4% in sensitivity, and 0.9% in F1-score. The lightweight nature of the proposed network further substantiates the CBAM-EfficientNet model's superior performance in accurately classifying fall risk.

Table 6. Comparison of contemporary research findings

Module

Accuracy

Precision

Sensitivity

F1-Score

CBAM-EfficientNet

98.5%

99.2%

99.0%

99.1%

convLSTM (L​i​a​n​g​ ​e​t​ ​a​l​.​,​ ​2​0​1​9)

92.3%

96.6%

95.6%

96.0%

LSTM (Y​u​ ​e​t​ ​a​l​.​,​ ​2​0​2​0)

91.8%

96.4%

95.2%

95.7%

CNN+RNN (C​h​a​n​ ​e​t​ ​a​l​.​,​ ​2​0​2​3)

94.3%

97.6%

96.6%

98.0%

4. Conclusion

In pursuit of robust fall risk detection methods suitable for daily life applications, this research undertook an extensive examination of existing fall assessment methodologies. The selection of a particular data processing technique was necessitated by the voluminous nature of plantar pressure data. A novel neural network model for fall risk assessment was proposed, incorporating an attention mechanism, and was built upon a gait dataset gathered using plantar pressure sensor devices. Utilizing the EfficientNet model as a base, transfer learning was applied, and the CBAM module was integrated, enhancing the accuracy of risk classification. Initial network weights, trained on a large dataset, were employed to expedite network training and improve learning efficiency. The incorporation of the CBAM module at higher stages of the model augmented the network's capacity to discern critical features in plantar pressure data, thus boosting classification accuracy. The optimized model achieved a classification accuracy of 98.5%, a precision of 99.2%, and a sensitivity of 99.0%. These outcomes affirm the efficacy of the proposed method in enhancing the classification of plantar pressure data, which holds significant practical implications in assisting medical professionals in fall risk diagnosis.

The study leveraged the GASF algorithm to transform foot pressure sensor data into two-dimensional images, achieving data simplification and robustness. The integration of transfer learning markedly improved the efficiency of fall risk assessment classification. The experimental results validated the ability of GASF images to maintain accuracy in fall risk assessment while reducing data scale, making the model more suitable for practical deployment in fall risk monitoring scenarios.

Despite its contributions, this study has limitations. The complexity of the human body's motor system cannot be fully captured by plantar pressure sensors alone. Future work could involve the integration of additional sensors or diverse sensor types, ensuring practical applicability in daily life, to acquire more comprehensive limb movement data. This would enable more nuanced analysis and processing of complex movement behaviors. Additionally, the current model predicts fall risk as high or low but lacks the granularity to determine specific risk severity or fall probability. Future endeavors could focus on more precise data labeling to address this limitation.

Ethical Approval

This study adheres to strict ethical guidelines, ensuring the rights and privacy of participants. Informed consent was obtained from all participants, and personal information was protected throughout the study. The methodology and procedures of this research have been approved by the appropriate ethics committee. Participants were informed of their rights, including the right to withdraw from the study at any time. All collected data is used solely for the purpose of this research and is stored and processed in a secure and confidential manner.

Data Availability

The data used to support the research findings are available from the corresponding author upon request.

Conflicts of Interest

The authors declare no conflict of interest.

References
Beck, J. C. (1999). Geriatrics Review Syllabus: A Core Curriculum in Geriatric Medicine. Kendall/Hunt Publishing Company, Dubuque, IA, US. [Google Scholar]
Canayaz, M. (2021). C+EffxNet: A novel hybrid approach for COVID-19 diagnosis on CT images based on CBAM and EfficientNet. Chaos, Solitons Fractals, 151, 111310. [Google Scholar] [Crossref]
Chan, H. L., Ouyang, Y., Chen, R. S., Lai, Y. H., Kuo, C. C., Liao, G. S., Hsu, W. Y., & Chang, Y. J. (2023). Deep neural network for the detections of fall and physical activities using foot pressures and inertial sensing. Sensors, 23(1), 495. [Google Scholar] [Crossref]
Gao, W. D. & Zhao, Z. W. (2021). Gait phase recognition using fuzzy logic regulation with multisensor data fusion. J. Sens., 2021, 1–13. [Google Scholar] [Crossref]
Li, B. C., Yao, Z. M., Wang, J. G., Wang, S. N., Yang, X. J., & Sun, Y. N. (2020). Improved deep learning technique to detect freezing of gait in Parkinson’s disease based on wearable sensors. Electronics, 9(11), 1919. [Google Scholar] [Crossref]
Liang, S. Y., Liu, Y. M., Li, G. L., & Zhao, G. R. (2019). Elderly fall risk prediction with plantar center of force using convlstm algorithm. In 2019 IEEE International Conference on Cyborg and Bionic Systems (CBS), Munich, Germany (pp. 36–41). [Google Scholar] [Crossref]
Liang, S. Y., Ning, Y. K., Li, H. Q., Wang, L., Mei, Z. Y., Ma, Y. N., & Zhao, G. R. (2015). Feature selection and predictors of falls with foot force sensors using KNN-Based algorithms. Sensors, 15(11), 29393–29407. https://doi.org/ [Google Scholar] [Crossref]
Liu, X. G., Chen, M., Liang, T., Lou, C. G., Wang, H. R., & Liu, X. L. (2022). A lightweight double-channel depthwise separable convolutional neural network for multimodal fusion gait recognition. Math. Biosci. Eng., 19, 1195–1212. [Google Scholar] [Crossref]
Montanini, L., Del Campo, A., Perla, D., Spinsante, S., & Gambi, E. (2017). A footwear-based methodology for fall detection. IEEE Sens. J., 18(3), 1233–1242. [Google Scholar] [Crossref]
Pardoel, S., Shalin, G., Nantel, J., Lemaire, E. D., & Kofman, J. (2021). Early detection of freezing of gait during walking using inertial measurement unit and plantar pressure distribution data. Sensors, 21(6), 2246. [Google Scholar] [Crossref]
Peng, X. H., Feng, Y. K., Ji, S. J., Amos, J. T., Wang, W. N., Li, M., Ai, S. L., Qiu, X. Z., Dong, Y. Y., Ma, D., Yao, D. Z., Valdes-Sosa, P. A., & Ren, P. (2021). Gait analysis by causal decomposition. IEEE Trans. Neural. Syst. Rehabil. Eng., 29, 953–964. [Google Scholar] [Crossref]
Podsiadlo, D. & Richardson, S. J. (1991). The Timed “Up & Go”: A test of basic functional mobility for frail elderly persons. J. Am. Geriatrics Soc., 39(2), 142–148. [Google Scholar] [Crossref]
Raîche, M., Hébert, R., Prince, F., & Corriveau, H. (2000). Screening older adults at risk of falling with the Tinetti balance scale. The Lancet, 356(9234), 1001–1002. [Google Scholar] [Crossref]
Shalin, G., Pardoel, S., Lemaire, E. D., Nantel, J., & Kofman, J. (2021). Prediction and detection of freezing of gait in Parkinson’s disease from plantar pressure data using long short-term memory neural-networks. J. Neuroengineering Rehabil., 18(1), 1–15. [Google Scholar] [Crossref]
Shumway-Cook, A., Brauer, S., & Woollacott, M. (2000). Predicting the probability for falls in community-dwelling older adults using the Timed Up & Go Test. Phys. Therapy, 80(9), 896–903. [Google Scholar] [Crossref]
Tan, M. & Le, Q. V. (2019). EfficientNet: Rethinking model scaling for convolutional neural networks. arXiv Preprint, 11946, arXiv: 1905.11946. [Google Scholar]
Taylor, A. J., Menz, H. B., & Keenan, A. M. (2004). Effects of experimentally induced plantar insensitivity on forces and pressures under the foot during normal walking. Gait Posture, 20(3). [Google Scholar] [Crossref]
Tinetti, M. E. (1986). Performance-oriented assessment of mobility problems in elderly patients. J. Am. Geriatrics Soc., 34(2), 119–126. [Google Scholar] [Crossref]
Wang, H. X., Zhang, Z. G., Cui, X., & Cui, R. Y. (2022). Chinese-Korean Weibo sentiment classification based on pre-trained language model and transfer learning. In 2022 IEEE 2nd International Conference on Computer Communication and Artificial Intelligence (CCAI), Beijing, China (pp. 49–54). [Google Scholar] [Crossref]
Wang, Z. G. & Oates, T. (2015). Imaging time-series to improve classification and imputation. arXiv Preprint, 00327, arXiv:1506.00327. [Google Scholar] [Crossref]
Woo, S., Park, J., Lee, J. Y., & Kweon, I. S. (2018). Cbam: Convolutional block attention module. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany (pp. 3–19). [Google Scholar] [Crossref]
Yu, J. C., Gao, W. D., & Jiang, W. N. (2020). Foot pronation detection based on plantar pressure measurement. J. Phys. Conf. Ser., 1646(1), 012041. [Google Scholar] [Crossref]
Zhao, S. M., Liu, R., Fei, C. W., Zia, A. W., & Jing, L. X. (2020). Flexible sensor matrix film-based wearable plantar pressure force measurement and analysis system. PLoS One, 15(8), e0237090. [Google Scholar] [Crossref]

Cite this:
APA Style
IEEE Style
BibTex Style
MLA Style
Chicago Style
Li, C. C., Cai, Y. T., Habeeb, L. J., Rahman, A., & Ritzkal (2023). Enhancing Fall Risk Assessment in the Elderly: A Study Utilizing Transfer Learning in an Improved EfficientNet Network with the Gramian Angular Field Technique. Healthcraft. Front., 1(1), 1-14. https://doi.org/10.56578/hf010101
C. C. Li, Y. T. Cai, L. J. Habeeb, A. Rahman, and Ritzkal, "Enhancing Fall Risk Assessment in the Elderly: A Study Utilizing Transfer Learning in an Improved EfficientNet Network with the Gramian Angular Field Technique," Healthcraft. Front., vol. 1, no. 1, pp. 1-14, 2023. https://doi.org/10.56578/hf010101
@research-article{Li2023EnhancingFR,
title={Enhancing Fall Risk Assessment in the Elderly: A Study Utilizing Transfer Learning in an Improved EfficientNet Network with the Gramian Angular Field Technique},
author={Congcong Li and Yueting Cai and Laith Jaafer Habeeb and Atta-Ur Rahman and Ritzkal},
journal={Healthcraft Frontiers},
year={2023},
page={1-14},
doi={https://doi.org/10.56578/hf010101}
}
Congcong Li, et al. "Enhancing Fall Risk Assessment in the Elderly: A Study Utilizing Transfer Learning in an Improved EfficientNet Network with the Gramian Angular Field Technique." Healthcraft Frontiers, v 1, pp 1-14. doi: https://doi.org/10.56578/hf010101
Congcong Li, Yueting Cai, Laith Jaafer Habeeb, Atta-Ur Rahman and Ritzkal. "Enhancing Fall Risk Assessment in the Elderly: A Study Utilizing Transfer Learning in an Improved EfficientNet Network with the Gramian Angular Field Technique." Healthcraft Frontiers, 1, (2023): 1-14. doi: https://doi.org/10.56578/hf010101
cc
©2023 by the author(s). Published by Acadlore Publishing Services Limited, Hong Kong. This article is available for free download and can be reused and cited, provided that the original published version is credited, under the CC BY 4.0 license.