I have this variables from kaggle : https://www.kaggle.com/harlfoxem/housesalesprediction and I made a liner regression model, but my when I look at my summary data I see that my values (like median) are too high, anyone knows what can be the problem?
Residuals:
Min 1Q Median 3Q Max
-1195019 -147916 -22050 101763 5530466
Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) -6.329e+07 4.185e+06 -15.124 < 2e-16 ***
bedrooms 7.093e+02 2.705e+03 0.262 0.793
bathrooms 1.268e+05 4.129e+03 30.721 < 2e-16 ***
floors 2.564e+04 4.751e+03 5.398 6.83e-08 ***
waterfront 5.498e+05 2.639e+04 20.830 < 2e-16 ***
view 7.823e+04 3.324e+03 23.533 < 2e-16 ***
condition 6.620e+04 3.453e+03 19.172 < 2e-16 ***
yr_renovated 8.425e+01 5.535e+00 15.221 < 2e-16 ***
zipcode 6.407e+02 4.264e+01 15.023 < 2e-16 ***
sqft_living15 2.188e+02 4.040e+00 54.172 < 2e-16 ***
---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
Residual standard error: 263600 on 15119 degrees of freedom
Multiple R-squared: 0.4959, Adjusted R-squared: 0.4956
F-statistic: 1652 on 9 and 15119 DF, p-value: < 2.2e-16