Detecting posture changes of athletes in sports is an important task in teaching and training competitions, but its detection remains challenging due to the diversity and complexity of sports postures. This paper introduces a single-stage pose estimation algorithm named yolov8-sp. This algorithm enhances the original yolov8 architecture by incorporating the concept of multi-dimensional feature fusion and the attention mechanism for automatically capturing feature importance. Furthermore, in this paper, angle extraction is conducted for three crucial motion joints in the motion scene, with polynomial corrections applied across successive frames. In comparison with the baseline yolov8, the improved model significantly outperforms it in AP50 (average precision) aspects. Specifically, the model’s performance improves from 84.5 AP to 87.1 AP, and the performance of AP50–95, APM, and APL aspects also shows varying degrees of improvement; the joint angle detection accuracy under different sports scenarios is tested, and the overall accuracy is improved from 73.2% to 89.0%, which proves the feasibility of the method for posture estimation of the human body in sports and provides a reliable tool for the analysis of athletes’ joint angles.
[1]
L. Wang,et al.
Human Action Recognition Based on Skeleton Information and Multi-Feature Fusion
,
2023,
Electronics.
[2]
J. Jeong,et al.
Real-Time Pose Estimation Based on ResNet-50 for Rapid Safety Prevention and Accident Detection for Field Workers
,
2023,
Electronics.
[3]
Michael J. Black,et al.
SMPL: A Skinned Multi-Person Linear Model
,
2023
.
[4]
H. Qi,et al.
DetPoseNet: Improving Multi-Person Pose Estimation via Coarse-Pose Filtering
,
2022,
IEEE Transactions on Image Processing.
[5]
Daniil Osokin,et al.
Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose
,
2018,
ICPRAM.
[6]
Lukasz Kaiser,et al.
Attention is All you Need
,
2017,
NIPS.
[7]
G. Hua,et al.
Robust Pose Estimation in Crowded Scenes with Direct Pose-Level Inference
,
2021,
NeurIPS.
[8]
Ilya Sutskever,et al.
Language Models are Unsupervised Multitask Learners
,
2019
.