Code for data preprocessing Datas from https://www.kaggle.com/c/house-prices-advanced-regression-techniques
numpy python 3.8 pandas matplotlib sklearn seaborn
Csv2Excel:transform csv file into excel Discretize:discretize the price(replace interval values with interval means.) Discretize_K-means:discretize the price(utilize K-means for discretization) Empty_del:remove the missing Norm:normalization Outlier handling for price:remove outliers for price Relate:Plotting a correlation heatmap Str2num:Convert categorical variables to numerical variables
result could be seen as follows: correlation_heatmap.pdf