Participant Profile
11 inputs requiredPerformance Profile Radar
Model Performance
LeaderboardSplit Stability Analysis
80/20 · 70/30 · 50/50Accuracy Across Train/Test Splits ALL 6 MODELS · 3 RATIOS
K-Fold Cross-Validation
5-Fold · GeneralisationCV Mean ± Std Dev 5-Fold · Full Dataset
| # | Model | CV Mean | ± Std Dev | Performance Bar |
|---|
Classification Models
Detailed AnalysisRandom Forest
Best overall robust bagging ensemble of 200 Decision Trees.Neural Network
Multi-Layer Perceptron (128, 64) modeling highly non-linear boundaries.Support Vector Machine
Hyperplane separation utilizing the RBF (Gaussian) Kernel.Logistic Regression
Linear multinomial classification optimized via L-BFGS.Decision Tree
Recursive partition splits maximizing node purity (Gini).K-Nearest Neighbors
Instance-based learner computing Minkowski distances (k=11).Confusion Matrix & Error Analysis
Random Forest · 70/30 SplitRF Performance by Class True vs Predicted
| Predicted Class | |||||
|---|---|---|---|---|---|
| A | B | C | D | ||
| True Class | A | 882 87.8% | 94 | 22 | 6 |
| B | 136 | 603 60.1% | 240 | 25 | |
| C | 35 | 175 | 660 65.7% | 135 | |
| D | 8 | 24 | 170 | 802 79.9% | |
Regression Models
Broad Jump PredictionRF Regressor
Ensemble averaging across 200 jump-predicting trees.Neural Network Regressor
MLP (128, 64) non-linear regression for jump prediction.SVR (RBF Kernel)
Support Vector Regression utilizing insensitive tube (ε=0.1).Linear Regression
Ordinary Least Squares (OLS) identifying linear variable associations.Feature Importance
Permutation ImportanceTop Predictors RF · Permutation
Lower Predictors Ranked 7–11
Dataset Overview
Body PerformanceML Pipeline
End-to-EndData Quality Audit
Physiological LawsBlood Pressure Constraint
Enforced the Systolic > Diastolic physiological law. Measurements where resting pressure exceeded beating pressure were flagged as illogical and removed to ensure data integrity.
Duplicate Rectification
Identified and purged exact row duplicates. This prevents "Data Leakage" where the model might "memorize" identical participants across training and testing splits, artificially inflating accuracy.
Column Definitions
Schema| Column | Type | Description | Valid Range | ML Role |
|---|---|---|---|---|
| age | INT | Participant age in years | 18 – 80 | Feature |
| gender | CAT | Biological sex — M or F | M / F | Feature (encoded) |
| height_cm | FLOAT | Standing height in centimetres | 100 – 220 | Feature |
| weight_kg | FLOAT | Body weight in kilograms | 20 – 250 | Feature |
| body fat_% | FLOAT | Body fat percentage | 3 – 65% | Feature |
| diastolic | INT | Diastolic blood pressure | 40 – 130 mmHg | Feature |
| systolic | INT | Systolic blood pressure | 70 – 200 mmHg | Feature |
| gripForce | FLOAT | Hand grip strength | 0 – 70 kg | Feature (high importance) |
| sit_and_bend_forward_cm | FLOAT | Flexibility: sit-and-reach test | −25 – 200 cm | Feature (top importance) |
| sit-ups counts | INT | Number of sit-ups completed | 0 – 80 | Feature (high importance) |
| broad jump_cm | FLOAT | Standing broad jump distance | 50 – 300 cm | Feature + Regression Target |
| class | CAT | Performance band — A (best) to D (worst) | A / B / C / D | Classification Target |
Executive Summary
Final ReportBody Performance
Final Analytics
The Gharieb Team from the Military Technical College presents the definitive body performance intelligence system. Our methodology follows a recursive 5-stage pipeline—Data Cleaning, EDA, Multi-Split Modeling, Cross-Validation, and Production Deployment—to classify fitness tiers (A–D) with optimized precision.
Through rigorous statistical auditing (Physiological BP Laws, IQR Outlier Capping, and Permutation Importance analysis), we have engineered a robust engine that captures the non-linear relationship between physical metrics and athletic performance grades.
Key System Insights
Feature AnalysisBody Composition Dominance
Body fat percentage is the primary driver of performance classification. Individuals in Class 'A' consistently exhibit significantly lower fat levels, making it the most reliable physiological metric for predicting peak fitness grades.
Strength-Agility Synergy
Our analytics confirm that broad jump distance, sit-ups, and grip force move in near-perfect lockstep. High scores in one usually signal high scores in others, representing a unified explosive power-endurance coefficient.
Gender Threshold Scaling
Distributions across Classes A-D are exceptionally balanced between genders. This indicates that our grading criteria effectively scale according to biological sex standards, ensuring fair and accurate classification for all participants.
Blood Pressure Indifference
While vital for health monitoring, systolic and diastolic blood pressures showed minimal correlation with raw performance classes. This confirms that cardiovascular health is a background constant rather than a direct performance driver.
Methodology & Results
PerformanceStrategic Roadmap
Future WorkEnsemble Stacking
Combine RF, SVM, and MLP into a single meta-model to minimize residual variance.
SHAP Integration
Implement Game Theory explainability to provide local reasons for every prediction.
Deep Regression
Utilize Keras/PyTorch ANN architectures to push Broad Jump R² beyond current thresholds.
Grid Search Pro
Execute exhaustive hyperparameter optimization across all split ratios simultaneously.