Dirty Data [Visualize This]

by Osman Parvez



This post is mostly for the Realtors in the audience but it should also concern those who track real estate markets in Boulder and beyond. 


Back in January, I wrote an open letter to BARA concerning an data error in their statistical analysis.   The chart below is a visualization of a problem with dirty data. 

  
IRES and Metrolist (METRO) provide MLS services for our region.   Throughout Boulder County, IRES dominates the market (blue).  In some locations, Broomfield most notably, Metrolist has a significant market share (grey). 


The data provided by the Boulder Realtor Association, some of which I use as a foundation for my analyses on this blog, does not contain Metrolist data.   Although IRES allows subscribers to view Metrolist listings, it’s not possible to download Metrolist data through the IRES interface.   


Sometimes agents will enter information into both MLS systems, creating a duplicate but at least one listing will show up in the data export.   The potential error shown above (red circle) is thus conservative.   The real error is likely slightly lower.  


In Boulder and Longmont, the error is so low that it’s essentially a non-issue. In Broomfield and Erie, not so much.   Always check the data source when you see an analysis.  I try to put the source on every one of my charts. 


When does this matter?
– If you’re a home buyer using an IDX listing alert system (virtually every agent’s website) that only draws from one MLS.  You’re potentially missing listings.    If you’re a serious buyer, make sure you get listing alerts from the MLS itself, not a third party website. 


– If you’re analyzing comps to determine the value of your home or a property you’re considering buying.   Double check that you have the full data set and be sure to pull comps from the tax assessor for unlisted transactions.


– If you’re tracking market trends in general, particularly in Broomfield and Erie.   Sales volume and inventory will be off.    Don’t trust statistics from these markets unless you personally aggregate and clean the data.   

Like this analysis?    Subscribe to my research.       Want to meet me in person?    Attend a Boulder Real Estate Meetup.    Ready to buy or sell?  Call me at 303.746.6896.  

Dirty Data [Visualize This]

by Osman Parvez



This post is mostly for the Realtors in the audience but it should also concern those who track real estate markets in Boulder and beyond. 


Back in January, I wrote an open letter to BARA concerning an data error in their statistical analysis.   The chart below is a visualization of a problem with dirty data. 

  
IRES and Metrolist (METRO) provide MLS services for our region.   Throughout Boulder County, IRES dominates the market (blue).  In some locations, Broomfield most notably, Metrolist has a significant market share (grey). 


The data provided by the Boulder Realtor Association, some of which I use as a foundation for my analyses on this blog, does not contain Metrolist data.   Although IRES allows subscribers to view Metrolist listings, it’s not possible to download Metrolist data through the IRES interface.   


Sometimes agents will enter information into both MLS systems, creating a duplicate but at least one listing will show up in the data export.   The potential error shown above (red circle) is thus conservative.   The real error is likely slightly lower.  


In Boulder and Longmont, the error is so low that it’s essentially a non-issue. In Broomfield and Erie, not so much.   Always check the data source when you see an analysis.  I try to put the source on every one of my charts. 


When does this matter?
– If you’re a home buyer using an IDX listing alert system (virtually every agent’s website) that only draws from one MLS.  You’re potentially missing listings.    If you’re a serious buyer, make sure you get listing alerts from the MLS itself, not a third party website. 


– If you’re analyzing comps to determine the value of your home or a property you’re considering buying.   Double check that you have the full data set and be sure to pull comps from the tax assessor for unlisted transactions.


– If you’re tracking market trends in general, particularly in Broomfield and Erie.   Sales volume and inventory will be off.    Don’t trust statistics from these markets unless you personally aggregate and clean the data.   

Like this analysis?    Subscribe to my research.       Want to meet me in person?    Attend a Boulder Real Estate Meetup.    Ready to buy or sell?  Call me at 303.746.6896.  

Share This Listing!

More about the author

Osman Parvez

Owner & Broker at House Einstein as well as primary author of the House Einstein blog with over 1,200 published articles about Boulder real estate. His work has appeared in the Wall Street Journal and Daily Camera.

Osman is the primary author of the House Einstein blog with over 1,200 published articles about Boulder real estate. His work has also appeared in many other blogs about Boulder as well as mainstream newspapers, including the Wall Street Journal and Daily Camera. Learn more about Osman.

Facebook | Twitter | Instagram | YouTube

Work with

House Einstein

Thinking about buying or selling and want professional advice?
Call us at 303.746.6896

Your referrals are deeply appreciated.

Like this content? Want more fresh listings? Subscribe to our newsletter!

This field is for validation purposes and should be left unchanged.