Market share helps companies demonstrate how prevelant their sales are in any given market. It’s a good thought exercise in sampling and data completeness. Some businesses can showcase their market share with pinpoint accuracy if they collect data from enough sources (paid or open source). But, when your industry is indirectly tied to another market, your share of the market is harder to calculate. The basic equation for market share is (your sales / all sales). The resulting fraction represents your percent of the pie. It helps showcase your competitiveness in any given market space.
This analysis demonstrates combining multiple sources of data to externally validate the unknown total sales in the market space. This was completed for a company adjacent to the real estate market. There’s no complete dataset for all home sales everywhere since there’s some home sales that don’t get added to a national or county database (paying with cash, for sale by owner, etc.). I used a combination of Zillow and data from FRED to normalize the home sales according to their Metropolitan Statistical Area (MSA). MSA’s help standardize geographic areas especially when sales could come from a major city and the surrounding towns. This helped us maintain an inferred total sales number when comparing our adjacent industry impact.
Much of the information has been annonimized, and there’s some parts I deliberately left out to maintain that annonymity.
Data Analysis
Import Data
Publicly available housing data for each MSA on Zillow was mapped to a table with county names per MSA from FRED to align with our sales data.
Cleaning Sales Data
I then created a function to clean our sales data according to what we considered qualifying sales based on features like warranties, type of sale, and payment received.
After we had confirmed the correct sub-sample of sales was complete, I combined them into a single dataframe, added the MSA column, and cleaned up the columns.
Importing Zillow Data
The Zillow dataset included many MSA’s we didnt’ operate in, so cleaning out those unnecessary to the analysis was the next step. Finally, to merge the two datasets, I reformatted the MSA data and cleaned up the column names.
Market Share Calculation
The calculation for market share once all the data was cleaned is a simple divsion if set up properly. The hard parts were getting the data to this spot across brands, market locations, and data sources. The final portion of the analysis supplied our team with an accurate representation of qualifying sales across 11 different locations for 4 of our biggest brands.