Skip to content

SubashSK777/House-Price-Prediction

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

🏠 House Price Prediction Project

Predicting house prices using advanced regression techniques. We explore, clean, engineer, and model data from the famous Kaggle House Prices dataset.



Star-Struck Project Highlights

  • Robot Predictive Modeling: Deep dive into Linear, Ridge, and Lasso regression algorithms.
  • Scroll Advanced EDA: Comprehensive visual analysis of 79 features using Seaborn and Matplotlib.
  • Triangular Ruler Feature Engineering: Creating insightful features like TotalSF, HouseAge, and TotalBaths.
  • High Voltage Skewness Handling: Log-transforming target variables to optimize linear model performance.

File Folder Workflow Directory

Step description
Search EDA Understanding data distributions, correlations, and identifying missing values.
Clean Data Cleaning Handling NaN values with mode/median/none strategies for robustness.
Train Modeling Training 3 distinct regression models and hyperparameter tuning with Alpha.
Result Evaluation Comparing RMSE and R² scores to pick the champion model.

Rocket Quick Start

Launch your data science journey in minutes!

Clipboard Prerequisites


Setup Local Installation

Laptop

  1. Clone & Navigate:

    git clone https://github.com/SubashSK777/House-Price-Prediction.git
    cd House-Price-Prediction
  2. Install Libraries:

    pip install pandas numpy matplotlib seaborn scikit-learn jupyter
  3. Prepare Data: Place train.csv and test.csv in the root directory.

  4. Launch Notebook:

    jupyter notebook house_price.ipynb

Results Model Performance

Model RMSE (Log) R² Score
Linear Regression 0.1254 0.892
Ridge Regression (L2) 0.1248 0.895
Lasso Regression (L1) 0.1261 0.889

Note: Results may vary slightly based on feature selection and random seed.


Rocket

Built with ❤️ for Data Enthusiasts

About

Predicting house sale prices using Ridge & Lasso regression on 1,460 samples | EDA, feature engineering, scikit-learn

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors