Identifying the largest determining factor

  • Context: Undergrad 
  • Thread starter Thread starter SanDiegoMike
  • Start date Start date
Click For Summary
SUMMARY

The discussion focuses on identifying the largest determining factor affecting the value of coins based on various characteristics such as year, color, size, shape, and weight. Mike, an engineer with limited statistics knowledge, seeks a mathematical approach to analyze the correlation between these characteristics and coin value. Recommendations include utilizing scatter plot matrices for continuous variables and box-and-whisker plots for categorical variables, emphasizing the importance of data exploration in the analysis process. The suggested resources include a reference from Missouri State University and Sheskin's handbook for further guidance.

PREREQUISITES
  • Understanding of basic statistical concepts, particularly correlation analysis
  • Familiarity with data visualization techniques, including scatter plots and box-and-whisker plots
  • Knowledge of categorical and continuous variables in data analysis
  • Basic proficiency in data exploration methodologies
NEXT STEPS
  • Research "scatter plot matrix" for visualizing relationships between continuous variables
  • Learn about "box-and-whisker plots" for analyzing categorical data
  • Explore "Sheskin's Handbook of Statistics" for comprehensive statistical methodologies
  • Investigate "functional relationships in statistics" to understand variable conversion techniques
USEFUL FOR

This discussion is beneficial for data analysts, engineers, and statisticians interested in understanding the impact of various characteristics on value assessment, particularly in the context of categorical and continuous data analysis.

SanDiegoMike
Messages
4
Reaction score
0
Identifying the "largest determining factor"

Hello,

This is not strictly a statistics forum, but I'm hoping you guys may have sufficient background to help me out. I'm an engineer by trade, so my stats background is poor and I have had not had much luck searching for the answer or asking colleagues.

I have a database, which is similar to the following example in which we list a collection of 'coins' of various values. These coins all have different characteristics, ie: year, color, size, shape, and weight. I would like to determine to what degree each of those characteristics are most likely to determine the coin's value. Or at the very least, determine which of the characteristics is most predominant. My searching has led me to analysis which requires some form of functional relationship between say 'shape' and 'value' such that correlation can be determined, but I don't know how I would convert shape (ie: circle, square, octagonal) into a variable. My colleagues have suggested scatter plots to identify relationships, but my data sets are huge, and I would prefer something with a mathematical foundation.

If anyone could point me in the correct direction with regards to the appropriate analysis methodology, that would be fantastic.

thanks,
-mike.
 
Physics news on Phys.org


SanDiegoMike said:
My colleagues have suggested scatter plots to identify relationships, but my data sets are huge, and I would prefer something with a mathematical foundation.

Data exploration is arguably the most important step of any data analysis methodology so I wouldn't discount visual tools just yet. Maybe start with a scatter plot matrix for the continuous variables and box-and-whisker plots for the categorical variables.
 

Similar threads

  • · Replies 11 ·
Replies
11
Views
2K
  • · Replies 0 ·
Replies
0
Views
907
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 10 ·
Replies
10
Views
4K
  • · Replies 15 ·
Replies
15
Views
4K
  • · Replies 5 ·
Replies
5
Views
4K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 2 ·
Replies
2
Views
1K
  • · Replies 24 ·
Replies
24
Views
4K
  • · Replies 14 ·
Replies
14
Views
4K