Measures of Variability/Central Tendency For Certain Categories

  • Thread starter Thread starter Soaring Crane
  • Start date Start date
AI Thread Summary
The discussion focuses on identifying appropriate measures of central tendency and variability for different types of data: ordinal (response to light), continuous (HDL levels), and nominal (smoker status). It clarifies that for nominal data, measures like mean and median are not applicable, while for ordinal data, the median can be used. Continuous data allows for both mean and median as measures of central tendency, along with standard deviation and variance for variability. The term "highest-level measures" refers to the most suitable statistical methods based on the data type. Understanding the nature of the data is crucial for selecting the correct statistical measures.
Soaring Crane
Messages
461
Reaction score
0

Homework Statement



Suppose that I have the following groups/categories with how each is measured next to them:

Response to light – measured on a scale of 1 to 4
HDL levels – continuous value from 1 to 100
Smoker status - smoker or non-smoker

The question is as follows:

What are the highest-level measures of central tendency and variability that can be used?



Homework Equations



If descriptive, equations would not apply.

The Attempt at a Solution



I know that central tendency includes mean, mode, and median and variability includes variance and standard deviation, but could someone please tell me what is meant by “highest-level measures”? Do I include the descriptive term (for example, central tendency answer "mean") or an actual measurement?

Thank you.
 
Physics news on Phys.org
By “highest-level measures” I would assume that all the data is incorporated in the mean/variance estimate.

Solutions:

Answer 1: Use standard regression theory although the error terms (difference between fitted and observed) is not precisely normal (a mixture of dummy variables, categorical data and continuous). The Standard Error of Regression is the risk measure, the mean is given by the linear regression equation.

Answer 2: Use the error terms from answer 1 as an input to a Box-Cox Analysis of transformation. Use the proposed transformation on the y-variable to correct (or approximately adjust) for normality issues.

Answer 3: Use Least-Absolute deviations in place of a Least-Squares criteria. Apply risk measures (like average median deviation) and a median point estimate as discussed in the literature.
 
Last edited:
Soaring Crane said:

Homework Statement



Suppose that I have the following groups/categories with how each is measured next to them:

Response to light – measured on a scale of 1 to 4
HDL levels – continuous value from 1 to 100
Smoker status - smoker or non-smoker

The question is as follows:

What are the highest-level measures of central tendency and variability that can be used?



Homework Equations



If descriptive, equations would not apply.

The Attempt at a Solution



I know that central tendency includes mean, mode, and median and variability includes variance and standard deviation, but could someone please tell me what is meant by “highest-level measures”? Do I include the descriptive term (for example, central tendency answer "mean") or an actual measurement?

Thank you.

I'm not sure why the previous poster gave what he/she did - not relevant.
The question is essentially assessing your ability to determine the type of data that would be recorded. If the data is nominal (strictly categories) you wouldn't use mean or median for central tendency, nor would standard deviation qualify for variability. If the data is ordinal (can be arranged low to high) you could use median for central tendency. If the data were continuous (numerical) both mean and median would apply for central tendency, and standard deviation/variance for variability.
 
I tried to combine those 2 formulas but it didn't work. I tried using another case where there are 2 red balls and 2 blue balls only so when combining the formula I got ##\frac{(4-1)!}{2!2!}=\frac{3}{2}## which does not make sense. Is there any formula to calculate cyclic permutation of identical objects or I have to do it by listing all the possibilities? Thanks
Since ##px^9+q## is the factor, then ##x^9=\frac{-q}{p}## will be one of the roots. Let ##f(x)=27x^{18}+bx^9+70##, then: $$27\left(\frac{-q}{p}\right)^2+b\left(\frac{-q}{p}\right)+70=0$$ $$b=27 \frac{q}{p}+70 \frac{p}{q}$$ $$b=\frac{27q^2+70p^2}{pq}$$ From this expression, it looks like there is no greatest value of ##b## because increasing the value of ##p## and ##q## will also increase the value of ##b##. How to find the greatest value of ##b##? Thanks
Back
Top