Skip to content

A data-driven student performance analysis project using UCI dataset (396 students, 33 features). Implements machine learning models (K-means, PCA, Decision Tree, Random Forest, Linear Regression) to analyze academic patterns and predict student scores based on lifestyle, health, and study habits.

Notifications You must be signed in to change notification settings

cycle-sync-ai/student-score-analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 

Repository files navigation

Student Score Analysis and Prediction

A comprehensive data analysis and machine learning project focused on analyzing student performance using the UCI student dataset containing 33 features across 396 instances.

Dataset Features

The analysis includes key student attributes:

  • Academic Performance (Grades)
  • Attendance (Absences)
  • Health Metrics
  • Lifestyle Factors:
    • Daily/Weekly Alcohol Consumption
    • Free Time Management
    • Internet Usage
  • Academic Factors:
    • Study Time
    • Travel Time to School

Technical Stack

Python Libraries

  • NumPy: Numerical computations and array operations
  • Pandas: Data manipulation and analysis
  • Seaborn: Statistical data visualization
  • Matplotlib: Creating static, animated, and interactive visualizations
  • Scikit-learn: Machine learning implementations
  • Pickle: Model serialization

Machine Learning Models

  • K-means Clustering: Student grouping analysis
  • Principal Component Analysis (PCA): Dimensionality reduction
  • Decision Tree: Classification and prediction
  • Random Forest: Ensemble learning for improved accuracy
  • Linear Regression: Score prediction

Analysis Workflow

  1. Data Loading and Preprocessing
  2. Exploratory Data Analysis
  3. Feature Engineering
  4. Model Training and Evaluation
  5. Performance Prediction

Goals

  • Analyze factors affecting student performance
  • Predict student scores based on various features
  • Identify key patterns in student behavior and academic performance
  • Generate actionable insights for educational improvement

Author

Discord Email

About

A data-driven student performance analysis project using UCI dataset (396 students, 33 features). Implements machine learning models (K-means, PCA, Decision Tree, Random Forest, Linear Regression) to analyze academic patterns and predict student scores based on lifestyle, health, and study habits.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published