Predicting house prices using advanced regression techniques. We explore, clean, engineer, and model data from the famous Kaggle House Prices dataset.
Predictive Modeling: Deep dive into Linear, Ridge, and Lasso regression algorithms.
Advanced EDA: Comprehensive visual analysis of 79 features using Seaborn and Matplotlib.
Feature Engineering: Creating insightful features like TotalSF,HouseAge, andTotalBaths.
Skewness Handling: Log-transforming target variables to optimize linear model performance.
Launch your data science journey in minutes!
- Python 3.8+
- Jupyter Notebook / VS Code
- Kaggle Dataset: Download Train & Test CSVs.
-
Clone & Navigate:
git clone https://github.com/SubashSK777/House-Price-Prediction.git cd House-Price-Prediction -
Install Libraries:
pip install pandas numpy matplotlib seaborn scikit-learn jupyter
-
Prepare Data: Place
train.csvandtest.csvin the root directory. -
Launch Notebook:
jupyter notebook house_price.ipynb
| Model | RMSE (Log) | R² Score |
|---|---|---|
| Linear Regression | 0.1254 | 0.892 |
| Ridge Regression (L2) | 0.1248 | 0.895 |
| Lasso Regression (L1) | 0.1261 | 0.889 |
Note: Results may vary slightly based on feature selection and random seed.

