Description

Using gmplot, geopy, and Python data science tools we’ll discover realtor farms, and assess the characteristics of sales vs listing price. Real estate transactions tend to be geographically sparse and temporally rare. There is often both a listing and a selling agent in the representing a given property. The sales price is determined by a number of factor. While there has been considerable interest in building pricing models relying on physical parameters, there has been little work done in assessing the contribution of the realtor. The discovery of a ‘farm’ uses cluster identification methods. These farms can then be analyzed for imputed listing prices and the sales price, both of which are negotiated.

The problem: Most real estate analytics deal only with property description and location. Markets can swing quickly from buyer’s to seller’s advantage, so timing and days on market is important. Agent effects are not well understood and can be a significant factor in determining the actual price. Data source are examined . Python Modules utilized. Application of data science, e.g. modules pycluster, pyclustering, scikit-learn. (the talk is primarily application, not theory)