GitHub - Magicherry/Pose_Estimation: An exercise standardness detection system based on Google Movenet pose estimation model.

Name	Name	Last commit message	Last commit date
Latest commit History 22 Commits
base_data	base_data
movenet	movenet
utils	utils
.gitattributes	.gitattributes
.gitignore	.gitignore
LICENSE	LICENSE
Readme.md	Readme.md
cameraTest.py	cameraTest.py
main.py	main.py
setup.py	setup.py

Real-time Pose Estimation for Exercise Form Analysis

Python Version TensorFlow OpenCV MoveNet Performance License

A real-time exercise form analysis system using Google's MoveNet pose estimation model to evaluate movement quality and count repetitions for fitness exercises.

Overview

This project implements an intelligent exercise monitoring system that uses computer vision to analyze human movement patterns. The system provides real-time feedback on exercise form, counts repetitions, and scores movement quality for common exercises like squats and pull-ups.

Features

Core Capabilities

Utilizes Google's MoveNet model to detect 17 human keypoints with high accuracy
Compares detected poses against reference movements using cosine similarity matching
Currently supports squats and pull-ups with extensible architecture for additional exercises
Automatically counts completed exercise repetitions based on movement patterns
Provides real-time scoring based on movement accuracy and form
Works with webcam input for real-time analysis
Supports batch processing of pre-recorded exercise videos
Saves annotated videos with pose overlays and performance metrics

Feature Comparison Matrix

Feature	Squat Analysis	Pull-up Analysis	Camera Mode	Video Mode
Real-time Detection	✅	✅	✅	✅
Form Scoring	✅	✅	✅	✅
Rep Counting	✅	✅	✅	✅
Live Switching	✅	✅	✅	❌
Video Export	✅	✅	✅	✅
Batch Processing	❌	❌	❌	✅

Technical Architecture

System Architecture Diagram

graph TD
 A[Video Input] --> B[Frame Capture]
 B --> C[MoveNet Model]
 C --> D[Pose Detection]
 D --> E[Keypoint Extraction]
 E --> F[Pose Normalization]
 F --> G[Similarity Matching]
 G --> H[Exercise Classification]
 H --> I[Repetition Counting]
 I --> J[Quality Scoring]
 J --> K[Real-time Display]
 K --> L[Video Output]
 
 M[Reference Database] --> G
 N[Exercise Models] --> H
 O[Scoring Algorithms] --> J

Pose Estimation Pipeline

Input Processing: Captures video frames from camera or file input
Pose Detection: Applies MoveNet model to extract 17 skeletal keypoints
Normalization: Normalizes pose vectors for scale and position invariance
Similarity Matching: Compares poses against reference database using cosine similarity
State Tracking: Monitors exercise phases (up/down positions) for repetition counting
Scoring: Calculates form quality scores based on anatomical angle analysis

Human Pose Keypoints (MoveNet)

ID	Keypoint	Body Region	ID	Keypoint	Body Region
0	nose	Head	9	left_wrist	Left Arm
1	left_eye	Head	10	right_wrist	Right Arm
2	right_eye	Head	11	left_hip	Left Leg
3	left_ear	Head	12	right_hip	Right Leg
4	right_ear	Head	13	left_knee	Left Leg
5	left_shoulder	Left Arm	14	right_knee	Right Leg
6	right_shoulder	Right Arm	15	left_ankle	Left Leg
7	left_elbow	Left Arm	16	right_ankle	Right Leg
8	right_elbow	Right Arm

Components

MoveNet Integration: TensorFlow Hub model wrapper for pose estimation
Similarity Engine: Ball Tree implementation for efficient pose matching
Exercise Evaluators: Specialized scoring algorithms for different exercise types
Video Pipeline: Multi-threaded video processing with real-time display

Installation

Prerequisites

Python 3.8+
CUDA-compatible GPU (recommended for optimal performance)
Webcam (for real-time analysis)

Dependencies

pip install tensorflow
pip install tensorflow-hub
pip install opencv-python
pip install scikit-learn
pip install numpy
pip install matplotlib

Setup

Clone the repository:

git clone https://github.com/Magicherry/Pose_Estimation.git
cd Pose_Estimation

Download MoveNet models (automatically handled by TensorFlow Hub)
Prepare reference pose data in base_data/ directory following the existing structure

Usage

Real-time Camera Analysis

python main.py

Configuration Options

Modify the following parameters in main.py:

use_camera: Set to True for live camera input, False for video file processing
sport_type: Choose between "squat" or "pull_up"
model_name: Select "movenet_lightning" (faster) or "movenet_thunder" (more accurate)

Keyboard Controls

1: Switch to squat analysis mode
2: Switch to pull-up analysis mode
ESC: Exit the application

Camera Testing

Test your camera setup:

python cameraTest.py

Project Structure

Pose_Estimation/
├── main.py # Main application entry point
├── cameraTest.py # Camera functionality testing
├── setup.py # Build configuration
├── base_data/ # Reference pose datasets
│ ├── squat/
│ │ ├── up/ # Reference images for squat up position
│ │ └── down/ # Reference images for squat down position
│ └── pull_up/
│ ├── up/ # Reference images for pull-up up position
│ └── down/ # Reference images for pull-up down position
├── movenet/ # MoveNet model files and utilities
│ ├── movenet.py # MoveNet model wrapper
│ └── singlepose/ # Pre-trained model weights
├── utils/
│ └── utils.py # Utility functions for pose visualization
└── result/ # Output directory for processed videos

Algorithm Details

Pose Normalization

The system normalizes detected poses to handle variations in:

Scale: Different distances from camera
Position: Various locations within frame
Orientation: Minor rotational differences

Exercise-Specific Scoring Methodology

Squat Form Assessment

The squat evaluation measures movement depth by analyzing the relationship between hip, knee, and ankle positions during the exercise.

Core Algorithm:

Calculate vertical distances from ankle to hip and ankle to knee
Compute depth ratio: knee_ankle_distance / hip_ankle_distance
Convert to 0-100 score scale

Scoring Criteria:

Score Range	Form Quality	Depth Ratio	Description
80-100	Excellent	0.8-1.0	Full depth squat
60-79	Good	0.6-0.8	Moderate depth
0-59	Poor	<0.6	Insufficient depth

Pull-up Range of Motion Assessment

The pull-up evaluation tracks face position relative to arm joints to determine completion quality.

Core Algorithm:

Calculate face center from 5 facial keypoints (nose, eyes, ears)
Compare face position to elbow and wrist levels
Score based on range of motion completion

Scoring Zones:

Score	Face Position	Range of Motion	Description
100	Above wrist level	Full ROM	Complete pull-up
1-99	Between wrist-elbow	Partial ROM	Proportional scoring
0	Below elbow level	No ROM	Incomplete movement

Algorithm Implementation

# Squat scoring example
def calculate_squat_score(hip_y, knee_y, ankle_y):
 hip_ankle_dist = abs(hip_y - ankle_y)
 knee_ankle_dist = abs(knee_y - ankle_y)
 depth_ratio = min(knee_ankle_dist / (hip_ankle_dist + 1e-6), 1.0)
 return int(depth_ratio * 100)
# Pull-up scoring example 
def calculate_pullup_score(face_y, elbow_y, wrist_y):
 if face_y < wrist_y:
 return 100
 elif face_y >= elbow_y:
 return 0
 else:
 return int((elbow_y - face_y) / (elbow_y - wrist_y) * 100)

Repetition Detection State Machine

stateDiagram-v2
 [*] --> Idle
 Idle --> DownDetected: Pose matches "down"
 DownDetected --> DownConfirmed: Sustained > 10 frames
 DownConfirmed --> UpDetected: Pose matches "up"
 UpDetected --> UpConfirmed: Sustained > 10 frames
 UpConfirmed --> RepComplete: Count++
 RepComplete --> Idle: Reset for next rep
 
 DownDetected --> Idle: Noise/false detection
 UpDetected --> DownConfirmed: Return to down

Similarity Matching Process

Step	Process	Formula
1	Pose Vector Extraction	`pose_vector = [x1,y1,x2,y2,...,x17,y17]`
2	L2 Normalization	`normalized = vector /
3	Cosine Similarity	`similarity = dot(v1,v2) / (
4	Distance Calculation	`distance = 2 ×ばつ (1 - similarity)`
5	Threshold Filtering	`match = distance < 0.3 ? valid : invalid`

Performance Optimization

Model Performance Comparison

Model Variant	Input Size	Speed (FPS)	Accuracy	Use Case
MoveNet Lightning	192x192	~30-50	Good	Real-time applications
MoveNet Thunder	256x256	~15-25	Excellent	High accuracy analysis

System Performance Metrics

Processing Pipeline Breakdown

Processing Stage	Execution Time	CPU Usage	Percentage	Bottleneck Level
Frame Capture	~2ms	Low	5%	⚪ Minimal
Pose Detection	~20ms	High	50%	🔴 Critical
Similarity Match	~5ms	Medium	12%	🟡 Moderate
Score Calculation	~3ms	Low	8%	⚪ Minimal
Video Rendering	~10ms	Medium	25%	🟡 Moderate
Total per Frame	~40ms	-	100%	25 FPS

General Hardware Requirements

Component	Minimum Spec	Recommended Spec	Performance Impact
CPU	Intel i5-8th gen / AMD Ryzen 5	Intel i7-10th gen / AMD Ryzen 7	Frame processing speed
RAM	8GB	16GB	Model loading & caching
GPU	Integrated	NVIDIA GTX 1060 / RTX 2060	Pose detection acceleration
Storage	2GB free space	5GB SSD	Model storage & video output
Camera	720p @ 30fps	1080p @ 30fps	Input quality & smoothness

macOS Hardware Compatibility

Component	Minimum Spec (Intel Mac)	Recommended Spec (Apple Silicon)	Performance Impact
CPU	Intel i5 (8th gen)	Apple M1+	Efficient real-time pose estimation
RAM	8GB DDR4	16GB Unified Memory	Smooth model loading and runtime
GPU	Integrated Intel UHD	Apple Integrated GPU (M-series)	Acceleration for pose detection
Storage	2GB free space	5GB SSD	Model cache and export reliability
Camera	Built-in FaceTime HD	External 1080p @ 60fps	Input smoothness and detail quality

Optimization Strategies:

Model Selection: Choose between Lightning (speed) and Thunder (accuracy) variants
Multi-threading: Separate threads for video input and processing
Efficient Matching: Ball Tree data structure for fast similarity searches
Memory Management: Optimized frame processing pipeline

Extending the System

Adding New Exercises

Create reference pose data in base_data/new_exercise/
Implement exercise-specific scoring function
Add exercise type to main configuration
Update similarity matching parameters if needed

Custom Scoring Algorithms

Implement new scoring functions following the pattern:

def update_custom_exercise_score(pose, best_score):
 # Analyze pose keypoints
 # Calculate exercise-specific metrics 
 # Return updated best score
 return new_score

Output Format

Visual Interface

The system provides real-time visual feedback with the following elements:

Main Video Display: Live camera feed with pose skeleton overlay (white lines, colored keypoints)
Performance Metrics: Exercise type, repetition count, current score, and processing FPS
Status Information: Frame counter, pose classification (up/down), and system status

Generated Files

Output Type	Format	Content	Size (10min session)
Video Output	MP4	Annotated video with pose overlay and metrics	200-800 MB
Session Data	JSON/CSV	Keypoint coordinates and confidence scores	1-5 MB
Exercise Log	TXT	Repetition counts, scores, and timestamps	<1 MB

Performance Examples

Metric	Squat	Pull-up	Video Mode	Live Mode
Repetitions	12 reps	8 reps	Higher accuracy	Real-time feedback
Best Score	94/100	87/100	91/100	82/100
Processing Speed	28 fps	31 fps	35 fps	25 fps
Detection Accuracy	95.2%	91.7%	96.8%	89.3%

Console Output

The system logs important events with timestamps:

[12:34:56.789] INFO: MoveNet model loaded successfully
[12:34:57.123] RESULT: Squat #8 completed, score: 87/100
[12:34:58.456] WARNING: Processing FPS dropped to 18

License

This project is licensed under the Mozilla Public License 2.0 - see the LICENSE file for details.

Acknowledgments

Google Research for the MoveNet pose estimation model
TensorFlow team for the model hub infrastructure
OpenCV community for computer vision tools

License

Magicherry/Pose_Estimation

Folders and files

Latest commit

History

Repository files navigation

Real-time Pose Estimation for Exercise Form Analysis

Overview

Features

Core Capabilities

Feature Comparison Matrix

Technical Architecture

System Architecture Diagram

Pose Estimation Pipeline

Human Pose Keypoints (MoveNet)

Components

Installation

Prerequisites

Dependencies

Setup

Usage

Real-time Camera Analysis

Configuration Options

Keyboard Controls

Camera Testing

Project Structure

Algorithm Details

Pose Normalization

Exercise-Specific Scoring Methodology

Squat Form Assessment

Pull-up Range of Motion Assessment

Algorithm Implementation

Repetition Detection State Machine

Similarity Matching Process

Performance Optimization

Model Performance Comparison

System Performance Metrics

Processing Pipeline Breakdown

General Hardware Requirements

macOS Hardware Compatibility

Extending the System

Adding New Exercises

Custom Scoring Algorithms

Output Format

Visual Interface

Generated Files

Performance Examples

Console Output

License

Acknowledgments

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Languages