skip to main content
10.1145/3456126.3456139acmotherconferencesArticle/Chapter ViewAbstractPublication PagesasseConference Proceedingsconference-collections
Article

House-price Prediction Based on OLS Linear Regression and Random Forest

Published:29 June 2021Publication History

ABSTRACT

Based on a simple dataset with 20 features of houses, this paper builds two forecasting models to predict house prices. It primarily takes a series of data processes, including missing value process, outlier removal, new variables creation, and correlation analysis. Then this paper conducts the first theoretical model using OLS method. Next, it uses secondary cross-validation to optimize the parameters of random forest and gets a respectively realistic model. Finally based on the results, this paper analyzes and compares the advantages and disadvantages of the two models.

References

  1. Shekarian, E., & Fallahpour, A.. Predicting house price via gene expression programming. International Journal of Housing Markets and Analysis, 6(3) (2013), 250-268.Google ScholarGoogle ScholarCross RefCross Ref
  2. Thibodeau, T.G. Marking Single-Family Property Values to Market. Real Estate Economics, 2003, 31:1, 1-22.Google ScholarGoogle ScholarCross RefCross Ref
  3. Selim, H.. "Determinants of house prices in Turkey: hedonic regression versus artificial neural network", Expert Systems with Applications, Vol. 36 (2009) No. 2, pp. 2843-2852. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Bourassa, S. C., Cantoni, E., & Hoesli, M.. Predicting house prices with spatial dependence: A comparison of alternative methods. The Journal of Real Estate Research, 32(2) (2010), 139-160.Google ScholarGoogle ScholarCross RefCross Ref
  5. Anderson, S. T., & West, S. E. Open space, residential property values, and spatial context. Regional Science and Urban Economics, 36 (2006), 773–789.Google ScholarGoogle ScholarCross RefCross Ref
  6. Crompton, J. L. The impact of parks on property values: A review of the empirical evidence. Journal of Leisure Research, 33(1) (2001), 1–31.Google ScholarGoogle ScholarCross RefCross Ref
  7. Dehring, C., & Dunse, N. Housing density and the effect of proximity to public open space in Aberdeen, Scotland. Real Estate Economics, 34(4) (2006), 553–566.Google ScholarGoogle ScholarCross RefCross Ref
  8. Antipov, E. A., & Pokryshevskaya, E. B.. Mass appraisal of residential apartments: An application of Random forest for valuation and a CART-based approach for model diagnostics. Expert Systems with Applications, 39(2) (2012), 1772-1778. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Hong, J., Choi, H., & Kim, W. S.. A house price valuation based on the random forest approach: the mass appraisal of residential property in South Korea. International Journal of Strategic Property Management, 24(3) (2020), 140-152.Google ScholarGoogle ScholarCross RefCross Ref
  10. Yoo, Sanglim, Jungho Im, and John E. Wagner. "Variable Selection for Hedonic Model using Machine Learning Approaches: A Case Study in Onondaga County, NY." Landscape and Urban Planning 107, no. 3 (2012): 293-306.Google ScholarGoogle ScholarCross RefCross Ref
  11. Čeh, Marjan, Milan Kilibarda, Anka Lisec, and Branislav Bajat. "Estimating the Performance of Random Forest Versus Multiple Regression for Predicting Prices of the Apartments." ISPRS International Journal of Geo-Information 7, no. 5 (2018): 168.Google ScholarGoogle ScholarCross RefCross Ref
  12. Breiman, L. Random forests. Machine Learning, 45(1) (2001), 5–32. Google ScholarGoogle ScholarDigital LibraryDigital Library

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    ASSE '21: 2021 2nd Asia Service Sciences and Software Engineering Conference
    February 2021
    143 pages
    ISBN:9781450389082
    DOI:10.1145/3456126

    Copyright © 2021 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 29 June 2021

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • Article
    • Research
    • Refereed limited

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format