Grouping Stores based on past retail sales.

  • Context: Undergrad 
  • Thread starter Thread starter cmor
  • Start date Start date
  • Tags Tags
    Grouping
Click For Summary
SUMMARY

This discussion focuses on grouping retail stores based on their sales performance over the past 52 weeks. The goal is to create groups of 10-20 stores that exhibit similar sales trends, allowing for the development of location-specific distribution curves. The analysis emphasizes the importance of considering time series data, as stores with similar average sales may not necessarily follow the same sales trajectory. Key statistical measures such as variance and deviation from the mean are crucial for defining the criteria for grouping stores effectively.

PREREQUISITES
  • Understanding of time series analysis
  • Familiarity with statistical measures such as variance and mean deviation
  • Experience with data visualization tools for creating distribution curves
  • Knowledge of retail sales metrics and performance indicators
NEXT STEPS
  • Research advanced time series analysis techniques using Python libraries like Pandas and Statsmodels
  • Learn about clustering algorithms such as K-means and hierarchical clustering for grouping data
  • Explore statistical significance testing methods to assess group variance
  • Investigate data visualization tools like Tableau or Matplotlib for effective representation of sales trends
USEFUL FOR

Data analysts, retail managers, and business intelligence professionals looking to optimize sales strategies through effective grouping and analysis of retail performance data.

cmor
Messages
1
Reaction score
0
I have a set of retail data explaining how many sales per store occurred during each week of the past 52 weeks. I want to group these stores by which follow each other the best over the course of the year. Let's say they'll be 10-20 stores in each group, although this number will be refined based on the analysis. These groups will be used to then create location specific curves of distribution. By grouping I can smooth out anomalies while creating a bigger dataset to pull from. I also want to find those stores that have the biggest deviation from the mean.
It would be nice to also find variance within each group, what is statistically significant in these groupings.

I did a simple analysis by just creating curves for each store and comparing them to the mean. Then finding the percent difference for each week, and ranking each store by its average percent difference.

What is a better way to analyze each curve with respect to each other to create groups?
 
Physics news on Phys.org
It seems that in your analysis you are not taking into account that you are dealing with time series; comparing the distribution of sales of two stores to see which one follow each other the best is not a good idea since you might have S1 = {1,2,3,4,5,6} and S2 = {6,5,4,3,2,1} and they both will have the same distribution, yet S1 is clearly doing better and better and S2 worse and worse, and yet, I do not think you want to put them both int he same group, is that correct?

In fact, it is up to how you define 'follow each other the best', for example, imagine you have two stores following each other's ups and downs, but one having twice as much sells as the other, they definitely follow each other even if at different levels.

On the other hand you could have stores which behavior is totally independent and yet they might have the same average sales after a year... You see what I mean?

So to you question 'what is the best way to group' I think you really have to make sure what you want to have in each group and define with more precision the criterion 'follow each other the best'. Is it just the average sales you care about? you care about the variance? you care about he evolution over time? ...
 

Similar threads

  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 2 ·
Replies
2
Views
4K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 2 ·
Replies
2
Views
4K
  • · Replies 9 ·
Replies
9
Views
2K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 11 ·
Replies
11
Views
3K