A CNN-Transformer Hybrid Recognition Approach for sEMG-Based Dynamic Gesture Prediction

Authors: Liu, Y., Li, X., Yang, L., Bian, G. and Yu, H.

Journal: IEEE Transactions on Instrumentation and Measurement

Volume: 72

eISSN: 1557-9662

ISSN: 0018-9456

DOI: 10.1109/TIM.2023.3273651

Abstract:

As a unique physiological electrical signal in the human body, surface electromyography (sEMG) signals always include human movement intention and muscle state. Through the collection of sEMG signals, different gestures can be effectively recognized. At present, the convolutional neural network (CNN) has been widely applied to different gesture recognition systems. However, due to its inherent limitations in global context feature extraction, it exists a certain shortcoming on high-precision prediction tasks. To solve this issue, a CNN-transformer hybrid recognition approach is proposed for high-precision dynamic gesture prediction. In addition, the continuous wavelet transform (CWT) is proposed for to acquire the time-frequency maps. To realize effective feature representation of local features from the time-frequency maps, an attention fusion block (AFB) is proposed to build the deep CNN network branch to effectively extract key channel information and spatial information from local features. Faced with the inherent limitations in global context feature extraction of CNNs, a transformer network branch is proposed to model the global relationship between pixels, called convolution and transformer (CAT) network branch. In addition, a multiscale feature attention (MFA) block is proposed for effective feature aggregation of local features and global contexts by learning adaptive multiscale features and suppressing irrelevant scale information. The experimental results on the established multichannel sEMG signal time-frequency map dataset show that the proposed CNN transformer hybrid recognition network has competitive recognition performance compared with other state-of-the-art recognition networks, and the average recognition speed of each spectrogram on the test set is only 14.7 ms. The proposed network can effectively improve network performance and identification efficiency.

Source: Scopus