Physiologically Constrained Musculoskeletal Neural Network for Multi-DoF Joint Kinematics Estimation from Partially Observed sEMG

TL;DR

Proposes a physiologically constrained musculoskeletal neural network (MSK-NN) for multi-DoF joint kinematics estimation from partial sEMG, outperforming baseline models.

eess.SY 🔴 Advanced 2026-06-06 36 views
Wending Heng Mingming Zhang Glen Cooper Zhenhong Li
surface EMG neural network musculoskeletal modeling multi-DoF biomechanics

Key Findings

Methodology

This paper introduces MSK-NN, a fully differentiable framework combining CNN-based muscle activation estimation with an embedded Hill muscle-tendon forward dynamics model. The activation estimator processes preprocessed sEMG envelopes using 1D convolutional layers, producing activations for both measured and unmeasured muscles. The embedded Hill model computes muscle forces based on parameters like maximum isometric force, fiber length, and pennation angle, which are optimized end-to-end. These forces generate joint torques via muscle moment arms, integrated into a personalized joint dynamics model to predict multi-DoF joint angles. The training employs a composite loss function comprising joint kinematic loss, muscle synergy regularization via NMF, and anatomy-guided trend regularization, ensuring physiologically plausible activations and accurate kinematics without requiring explicit biomechanical labels.

Key Results

  • Extensive experiments on wrist motions involving flexion/extension and radial/ulnar deviation across structured rhythmic and unstructured random movements demonstrated that MSK-NN reduces NRMSE by XX% and increases R2 by XX% compared to CNN, Bi-LSTM, CNN-LSTM, and PET baselines. Particularly in the random motion scenario, MSK-NN maintained R2 above 0.80, indicating strong generalization.
  • The inferred activations for unmeasured muscles, such as deep muscles like ECRB, showed high correlation coefficients (>0.67) with actual recorded sEMG signals, validating the physiological plausibility of the model’s predictions under partial observation conditions.
  • Model parameters, including maximum isometric forces and fiber lengths, remained within known physiological ranges. Ablation studies confirmed that unmeasured muscles significantly contribute to accurate kinematic estimation, emphasizing the importance of the proposed regularization strategies.

Significance

This work addresses a critical challenge in biomechanical modeling and human motion decoding: how to accurately estimate multi-DoF joint kinematics with limited muscle measurements. By integrating physiological knowledge into deep learning, the proposed MSK-NN framework enhances interpretability, robustness, and applicability in clinical and robotic domains. It effectively bridges the gap between purely data-driven methods and traditional biomechanical models, offering a scalable solution that does not depend on expensive labels like muscle-tendon forces. The ability to infer unmeasured muscle activations broadens the scope of EMG-based motion analysis, facilitating advancements in prosthetics, rehabilitation, and human-robot interaction.

Technical Contribution

The core technical innovation lies in embedding Hill muscle-tendon dynamics within a deep neural network, enabling end-to-end optimization of muscle parameters and activations. The composite loss function, combining kinematic, synergy, and anatomical trend regularizations, effectively constrains the underdetermined muscle activation inference problem. Unlike prior PINN or PENN approaches, MSK-NN does not require external biomechanical labels, making it more scalable and practical. The architecture’s fully differentiable design allows seamless integration of physiological constraints, leading to physiologically plausible activation estimates and improved generalization across movement types and speeds.

Novelty

This study is the first to incorporate a fully differentiable, physiologically constrained musculoskeletal model into a deep learning framework for multi-DoF joint kinematics estimation from partially observed sEMG. It advances beyond existing hybrid neural approaches by eliminating the need for expensive biomechanical labels and by allowing simultaneous estimation of measured and unmeasured muscle activations. The novel composite loss that enforces muscle synergy and anatomical trends ensures physiologically valid activation patterns, setting a new standard for data-driven biomechanical modeling.

Limitations

  • The model’s accuracy diminishes in highly dynamic or rapid movements, partly due to limited diversity in training data and static muscle parameters that may not fully capture muscle fatigue or nonlinear tissue properties.
  • It does not explicitly model muscle fatigue, soft tissue nonlinearities, or joint capsule effects, which are relevant in real-world scenarios.
  • Computational complexity remains high, limiting real-time deployment without further optimization. Future work should focus on reducing model size and inference time.

Future Work

Future research will explore integrating multimodal data such as ultrasound or MRI to better characterize deep muscle states, enhancing the model’s robustness. Extending the framework to multi-joint, full-body movements and incorporating adaptive learning for individual-specific parameters will be prioritized. Additionally, efforts will be made to streamline the architecture for real-time applications, enabling deployment in prosthetic control, exoskeletons, and clinical diagnostics.

AI Executive Summary

Understanding human movement has long been a goal of biomechanics, robotics, and clinical sciences. Surface electromyography (sEMG) provides a non-invasive window into muscle activity, yet extracting accurate joint kinematics from limited muscle signals remains challenging. Traditional data-driven models excel at fitting training data but lack physiological interpretability, while biomechanical models offer insights but are often limited by measurement constraints and parameter uncertainties.

This paper introduces a novel approach—physiologically constrained musculoskeletal neural network (MSK-NN)—that combines the strengths of deep learning and biomechanical modeling. The framework leverages a CNN-based muscle activation estimator and embeds Hill muscle-tendon dynamics within a differentiable architecture. By doing so, it achieves accurate multi-DoF joint kinematics estimation from partial sEMG data, while simultaneously inferring activations for unmeasured muscles. The key innovation lies in the composite loss function, which enforces physiological plausibility through joint kinematic errors, muscle synergy regularization, and anatomy-guided activation trends.

Experimental validation on wrist movements involving flexion/extension and radial/ulnar deviation demonstrates the model’s superior performance over state-of-the-art baselines. Quantitatively, MSK-NN reduces NRMSE by XX% and improves R2 scores, especially under unstructured random motions. Qualitative analysis confirms that the inferred activations for unmeasured muscles align closely with recorded signals, validating the physiological realism of the approach.

This work has profound implications for biomechanics, rehabilitation, and human-robot interaction. It provides a scalable, physiologically grounded framework capable of handling partial observations, addressing longstanding challenges in muscle force estimation and movement decoding. The ability to infer unmeasured muscle activity opens new avenues for clinical diagnostics, prosthetic control, and adaptive robotics.

Looking ahead, future efforts will focus on integrating multimodal data, extending to full-body movements, and optimizing for real-time deployment. The integration of physiological constraints into deep learning models marks a significant step toward more interpretable, robust, and practical human motion understanding systems, promising to transform both scientific research and applied technologies in the coming years.

Deep Analysis

Background

The quest to decode human movement from muscle signals has evolved over decades, driven by advances in biomechanics, signal processing, and machine learning. Early biomechanical models like OpenSim provided detailed simulations of muscle-tendon dynamics but faced limitations in real-time application due to reliance on extensive measurements and computational complexity. Data-driven approaches, including CNNs, LSTMs, and Transformers, have demonstrated impressive accuracy in static or controlled scenarios but often lack physiological interpretability and struggle with unmeasured muscles. Recent hybrid methods like PINNs and PENNs attempted to incorporate physical laws into neural networks, yet they still depend on costly labels such as muscle forces or joint torques. The challenge remains: how to accurately estimate multi-DoF joint kinematics from limited sEMG signals while inferring unmeasured muscle activity, ensuring physiological plausibility and robustness across diverse movements. Addressing this gap is crucial for advancing clinical diagnostics, prosthetic control, and human-robot interaction, where comprehensive muscle information is often unavailable or impractical to measure.

Core Problem

The core problem tackled in this research is the partial observability of muscle activity via sEMG, which limits the direct measurement of all task-relevant muscles, especially deep or small muscles. Existing models often assume a one-to-one correspondence between sEMG channels and muscles, neglecting unmeasured muscles' contributions, leading to physiologically implausible force sharing and inaccurate kinematic predictions. Moreover, models trained solely on structured movements fail to generalize to unstructured or rapid motions. The challenge is to develop a model that can accurately estimate multi-DoF joint angles from limited measurements, infer the activation of unmeasured muscles, and maintain physiological consistency without relying on expensive biomechanical labels or overly simplified assumptions.

Innovation

This study introduces several key innovations:

1) Embedding Hill muscle-tendon dynamics directly into a neural network architecture, enabling the model to learn muscle force generation end-to-end.

2) Designing a composite loss function that combines joint kinematic errors, muscle synergy regularization via fixed W matrices from NMF, and anatomy-guided activation trend regularization, which constrains the underdetermined muscle activation inference.

3) Removing the assumption of a strict one-to-one mapping between sEMG channels and muscles, allowing the model to estimate activations for both measured and unmeasured muscles simultaneously.

4) Achieving physiologically plausible parameter estimates that stay within known biological ranges, validated through ablation studies.

These innovations collectively enable the model to perform accurate, interpretable, and generalizable multi-DoF motion estimation under partial observation conditions.

Methodology

  • �� Data preprocessing: Raw sEMG signals are band-pass filtered (20-450Hz), rectified, and low-pass filtered at 4Hz to extract envelopes. These are normalized by MVC and segmented into overlapping windows (length 32, stride 1).
  • �� Muscle activation estimation: A lightweight CNN encoder with convolutional layers (32 and 64 filters, kernel size 5), followed by global average pooling and residual fully connected layers, outputs activation probabilities for all muscles (measured and unmeasured).
  • �� Embedded Hill muscle-tendon model: For each muscle, compute active and passive forces based on parameters like maximum isometric force, fiber length, pennation angle, and tendon length. These forces generate muscle torques via moment arms.
  • �� Joint dynamics: Use a personalized dynamic model (mass matrix, Coriolis, gravity) to relate muscle forces to joint angles, velocities, and accelerations, integrating forces over time.
  • �� Loss function: Combines three components—(1) joint kinematic loss (angle and velocity errors), (2) muscle synergy regularization (via fixed W matrices from NMF), and (3) anatomy-guided activation trend regularization (correlation of activation changes between related muscles).
  • �� Training: End-to-end optimization using AdamW, batch size 64, early stopping, regularization with dropout.
  • �� Evaluation: Performance assessed via NRMSE, R2, CC, and SC across multiple motion types, with baseline comparisons to CNN, Bi-LSTM, CNN-LSTM, and Transformer models.

Experiments

  • �� Dataset: Six healthy subjects performed wrist movements (CW, CCW, ∞, RND), with kinematic data captured by Vicon at 250Hz and sEMG signals from five muscles at 2000Hz.
  • �� Data processing: Band-pass filtering, envelope extraction, normalization, and segmentation.
  • �� Model training: Data split into training, validation, and test sets; hyperparameters tuned (learning rate 1e-4, regularization coefficients); training with early stopping.
  • �� Evaluation metrics: NRMSE and R2 for kinematic accuracy; CC and SC for muscle activation plausibility.
  • �� Baseline models: CNN, Bi-LSTM, CNN-LSTM, and PET, trained under identical conditions for fair comparison.
  • �� Ablation studies: Removing unmeasured muscles or regularization components to assess their impact on performance.

Results

  • �� MSK-NN consistently outperforms baselines with an average NRMSE reduction of XX% and R2 increase of XX%, especially under unstructured random motions where it maintains R2 > 0.80.
  • �� The inferred activations for unmeasured muscles like ECRB show high correlation (CC > 0.67) with recorded sEMG signals, confirming physiological plausibility.
  • �� Parameter estimates stay within known physiological ranges, and ablation experiments demonstrate that unmeasured muscles significantly contribute to accurate kinematic predictions, validating the model’s physiological grounding.

Applications

  • �� Clinical rehabilitation: Precise joint kinematics estimation from limited EMG signals can enhance personalized therapy.
  • �� Prosthetic control: Inferring unmeasured muscle activity improves natural movement decoding in prosthetic devices.
  • �� Human-robot interaction: Reliable motion prediction from partial signals enables more intuitive robotic assistance.
  • �� Sports science: Monitoring muscle coordination and joint dynamics during complex movements for performance optimization.

Limitations & Outlook

  • �� Performance drops in highly dynamic or rapid movements, due to limited training data diversity and static muscle parameters.
  • �� Does not explicitly model fatigue, soft tissue nonlinearities, or joint soft tissues, which are relevant in real-world scenarios.
  • �� Computational complexity limits real-time deployment; model simplification and hardware acceleration are needed for practical use.

Plain Language Accessible to non-experts

Imagine you’re trying to understand how a puppet moves by only watching a few strings being pulled. Some strings are visible, but others are hidden inside the puppet’s body. You want to figure out how all the strings work together to make the puppet dance, even when you can’t see every string.

This research is like creating a smart puppet master who can look at limited signals (like the visible strings) and then guess how the hidden strings are moving. The puppet master uses rules about how strings usually work together—like certain strings always pulling in sync or following a pattern—and combines this with the visible signals to make a good guess.

By doing this, the puppet master can make the puppet move smoothly and naturally, even if some strings are hidden or not directly observed. This helps in controlling prosthetic limbs, understanding muscle movements, and making robots that move like humans. It’s like having a super-smart puppet master who knows all the secret strings and can make the puppet dance perfectly, even when only seeing part of the show.

ELI14 Explained like you're 14

想象你在玩一个超级复杂的拼图游戏,你需要把很多碎片拼成一幅完整的画面。有些碎片你可以看到(比如明显的颜色和形状),但有些碎片藏在盒子里,看不到(比如深层的细节)。你的任务是根据那些能看到的碎片,推测出所有拼图内容,甚至那些藏在盒子里的碎片。

这就像人体运动中的肌肉控制,有些肌肉可以用表面电极(sEMG)直接测到(像看到的碎片),但有些深层肌肉(比如深藏在身体里的肌肉)无法直接测量(像藏在盒子里的碎片)。研究人员开发了一种聪明的“拼图助手”(神经网络模型),它能根据已知的肌肉信号,推断出所有肌肉的状态,包括那些看不到的深层肌肉。

这个“拼图助手”不仅能帮你还原完整的拼图,还能确保拼出的图像符合自然规律(生理合理性),比如每个肌肉的用力范围都在正常范围内。它还学习了不同肌肉之间的合作关系(肌肉协同)以及它们在不同动作中的变化趋势(解剖趋势),让推断变得更准确、更合理。

通过这种方法,即使只测到部分肌肉的信号,也能准确预测手腕的运动状态。这就像你用有限的线索,拼出了完整的画面,既聪明又实用。这项技术可以帮助康复、假肢控制,甚至让机器人更自然地模仿人类动作。未来,随着技术的不断改进,它还能在更多复杂动作中表现得更好,让我们更好地理解和控制人体运动。

Glossary

Surface Electromyography (sEMG) 表面肌电图

一种非侵入式测量肌肉电活动的方法,通过皮肤表面电极捕捉肌肉纤维的电信号,反映肌肉的激活状态。

在论文中,用于获取手腕肌肉的激活信号,作为运动估计的输入数据。

Musculoskeletal (MSK) Model 肌骨模型

一种结合肌肉、骨骼和关节动力学的数学模型,用于模拟人体运动中的肌肉力和关节运动关系。

本文将MSK模型嵌入深度学习架构,实现端到端的运动预测。

Hill Muscle Model 希尔肌肉模型

一种经典的肌肉动力学模型,描述肌肉的主动收缩和被动弹性特性,用于计算肌肉产生的力。

MSK-NN中的核心动力学部分,嵌入神经网络实现肌肉力的端到端优化。

Deep Learning 深度学习

一种基于多层神经网络的机器学习方法,擅长从大量数据中自动提取特征,用于复杂模式识别。

本文利用CNN和其他深度模型实现肌肉激活和运动估计。

Loss Function 损失函数

衡量模型预测与真实值差异的函数,用于指导模型参数优化。

本文设计了复合的物理-生理损失,确保模型输出的生理合理性。

Muscle Synergy 肌肉协同

多个肌肉在运动中共同作用的协调模式,反映神经系统的控制策略。

通过非负矩阵分解(NMF)提取,作为正则化项引入模型训练。

NRMSE 归一化均方根误差

衡量预测误差相对于数据范围的指标,数值越低表示越准确。

用于评估模型在运动角度估计中的性能。

R2 决定系数

反映模型对真实数据拟合程度的指标,值越接近1越好。

衡量运动预测的相关性和拟合效果。

Pearson相关系数 CC

衡量两个变量线性相关程度的指标,范围[-1,1],越接近1表示相关越强。

评估推断肌肉激活与实际sEMG信号的相关性。

斯皮尔曼相关系数 SC

衡量两个变量排名相关性的指标,适用于非线性关系。

验证激活推断的趋势一致性。

Open Questions Unanswered questions from this research

  • 1 目前模型在极端快速或复杂运动场景下的表现仍有限,主要由于肌肉参数的个体差异未能充分捕获,未来需结合更多个体化数据和多模态信息以提升鲁棒性。
  • 2 模型未考虑肌肉疲劳、软组织非线性和关节软骨的复杂生理特性,这些因素在实际运动中影响显著,未来应融入相关生理模型。
  • 3 高计算成本限制了模型在实时控制中的直接应用,未来需优化网络结构和算法以降低推理时间,实现快速响应。
  • 4 未充分研究深层肌肉的动态特性和其在运动中的作用,未来应结合超声或MRI等成像技术进行深层肌肉建模。
  • 5 模型在多关节、多自由度复杂运动中的表现仍需验证,未来应扩展到全身运动场景,提升泛化能力。

Applications

Immediate Applications

Rehabilitation Assistance

Using limited EMG signals to accurately estimate joint movements, aiding personalized therapy and improving recovery outcomes.

Prosthetic Control

Inferring unmeasured muscle activity to enable more natural and intuitive control of prosthetic limbs.

Motion Monitoring & Diagnostics

Providing comprehensive muscle activation insights for clinical diagnosis and treatment planning.

Long-term Vision

Intelligent Human Motion Understanding

Integrating multimodal data and advanced models to develop comprehensive, real-time human motion decoding platforms.

Autonomous Human-Robot Collaboration

Enabling robots to interpret and mimic human movements accurately, fostering seamless assistance in healthcare and industry.

Abstract

This paper investigates multi-degrees of freedom (DoF) joint kinematics estimation under partially observed surface electromyography (sEMG), where only a subset of task-relevant muscles can be measured due to anatomical inaccessibility or sensor constraints. A novel musculoskeletal neural network (MSK-NN) is proposed to estimate multi-DoF joint angles while simultaneously inferring activations for both measured and unmeasured muscles. MSK-NN consists of a CNN-based muscle activation estimator and an embedded MSK forward dynamics module, forming a fully differentiable architecture. Unlike existing hybrid neural frameworks that require additional biomechanical labels (e.g., muscle-tendon forces, joint torques), MSK-NN is trained without direct supervision of internal biomechanical variables. A composite physics-physiology loss is designed by incorporating a joint kinematics loss, a data-driven muscle synergy loss, and an anatomy-guided trend loss. The proposed method is evaluated on two-DoF wrist kinematics estimation across three rhythmic motions with unconstrained speed and amplitude, and one random motion. Compared with CNN, Bi-LSTM, CNN-LSTM, and PET baselines, MSK-NN achieves lower normalized root mean square error (NRMSE) and higher coefficient of determination (R2), especially for the random motion. More importantly, the optimized MSK parameters remain within physiological limits, and the estimated activation of an input-excluded muscle exhibits strong temporal agreement with its recorded sEMG envelope, demonstrating the capability of musculoskeletal (MSK)-NN to recover physiologically plausible activations.

eess.SY cs.RO eess.SP

References (20)

Matrix factorization algorithms for the identification of muscle synergies: evaluation on simulated and experimental data sets.

M. Tresch, V. Cheung, A. d’Avella

2006 804 citations ⭐ Influential

Benchmarking of dynamic simulation predictions in two software platforms using an upper limb musculoskeletal model

Katherine R. Saul, Xiao Hu, C. Goehler et al.

2014 367 citations

Neuromusculoskeletal modeling: estimation of muscle forces and joint moments and movements from measurements of neural command.

T. Buchanan, D. Lloyd, K. Manal et al.

2004 897 citations

A parallel and efficient transformer deep learning network for continuous estimation of hand kinematics from electromyographic signals

Chuang Lin, Xifeng Zhang, Chunxiao Zhao

2025 2 citations

EMG-driven musculoskeletal model calibration with estimation of unmeasured muscle excitations via synergy extrapolation

Di Ao, M. M. Vega, Mohammad Sharif Shourijeh et al.

2022 33 citations

Deep Learning for EMG-based Human-Machine Interaction: A Review

Dezhen Xiong, Daohui Zhang, Xingang Zhao et al.

2021 341 citations

Comparison of synergy extrapolation and static optimization for estimating multiple unmeasured muscle activations during walking

Di Ao, B. Fregly

2024 13 citations

Computationally Efficient Personalized EMG-Driven Musculoskeletal Model of Wrist Joint

Yihui Zhao, Jie Zhang, Zhenhong Li et al.

2023 23 citations

Adaptive Cooperative Control Strategy for a Wrist Exoskeleton Using Model-Based Joint Impedance Estimation

Yihui Zhao, Kun Qian, Sheng Bo et al.

2023 34 citations

Can Measured Synergy Excitations Accurately Construct Unmeasured Muscle Excitations?

Nicholas A. Bianco, C. Patten, B. Fregly

2018 31 citations

Physics-Informed Deep Learning for Musculoskeletal Modeling: Predicting Muscle Forces and Joint Kinematics From Surface EMG

Jie Zhang, Yihui Zhao, Fergus Shone et al.

2022 176 citations View Analysis →

Interpretation of the Correlation Coefficient: A Basic Review

Richard Taylor

1990 2352 citations

Dual Transformer Network for Predicting Joint Angles and Torques From Multi-Channel EMG Signals in the Lower Limbs.

Zhuo Wang, Chunjie Chen, Hui Chen et al.

2025 15 citations

An Efficient Framework for Personalizing EMG-Driven Musculoskeletal Models Based on Reinforcement Learning

Joseph Berman, I-Chieh Lee, Jie Yin et al.

2024 6 citations

In Vivo Estimation of Human Forearm and Wrist Dynamic Properties

K. Park, P. Chang, S. Kang

2017 26 citations

Anatomy, Biomechanics, and Loads of the Wrist Joint

J. Eschweiler, Jianzhang Li, V. Quack et al.

2022 77 citations

A Bi-Directional LSTM Network for Estimating Continuous Upper Limb Movement From Surface Electromyography

Chenfei Ma, Chuang Lin, O. W. Samuel et al.

2021 79 citations

Simultaneously Encoding Movement and sEMG-Based Stiffness for Robotic Skill Learning

Chao Zeng, Chenguang Yang, Hong Cheng et al.

2021 85 citations

OpenSim: Open-Source Software to Create and Analyze Dynamic Simulations of Movement

S. Delp, F. Anderson, A. Arnold et al.

2007 4648 citations

An EMG-Driven Musculoskeletal Model for Estimating Continuous Wrist Motion

Yihui Zhao, Zhiqiang Zhang, Zhenhong Li et al.

2020 86 citations