Hypothesis test on transformed data

  • Context: Graduate 
  • Thread starter Thread starter A_B
  • Start date Start date
  • Tags Tags
    Data Test
Click For Summary
SUMMARY

This discussion addresses the calculation of the mean of the log-transformed data, lnA, derived from a sample A with mean μ. The key formula utilized is E[h(X)] = Integral (over some domain) h(x) pdf(x) dx for continuous variables, emphasizing the importance of adjusting the domain for the log function. If lnA follows a normal distribution, it is crucial to ensure that A is non-negative, as the domain of lnA is restricted to A >= 0. Additionally, transformation rules can be applied to derive the probability density function (PDF) of A from the normal distribution of lnA.

PREREQUISITES
  • Understanding of expectation in probability theory
  • Familiarity with log transformations in statistics
  • Knowledge of probability density functions (PDFs)
  • Basic concepts of normal distribution
NEXT STEPS
  • Study the properties of log-normal distributions
  • Learn about transformation rules for probability distributions
  • Explore the concept of cumulative distribution functions (CDFs)
  • Review introductory statistics texts focusing on expectation and transformations
USEFUL FOR

Statisticians, data analysts, and researchers working with log-transformed data or those interested in understanding the implications of transformations on statistical distributions.

A_B
Messages
87
Reaction score
1
Say I have a sample A with mean μ, and the log transformation of A, lnA. Is there any way of figuring out the mean of lnA? what if the distribution of lnA is normal?

thanks
Alex
 
Physics news on Phys.org
A_B said:
Say I have a sample A with mean μ, and the log transformation of A, lnA. Is there any way of figuring out the mean of lnA? what if the distribution of lnA is normal?

thanks
Alex

Hello A_B and welcome to the forums.

In this problem you just use the definition of E[h(X)] where h is a function and X is your random variable.

Basically you just have to use the formula that E[h(X)] = Integral (over some domain) h(x) pdf(x) dx for a continuous variable.

The key thing though is because you are using a log function, you need to adjust your domain to suit that. If for example your distribution A was normal, then you couldn't apply your transform over the whole of A (since the domain of a normal is the whole real line). So just be careful when you're defining the range so that ln(x) is valid for this domain.

If your distribution is discrete then instead of an integral, replace that with a summation. If you're confused about what I'm saying grab any introduction statistics book and look at the definition of expectation.

Also if ln(A) was normally distributed then A >= 0 for its domain, so that's something easy to check. One way to get the pdf of A if ln(A) was normal is to use transformation rules with PDF's. Since ln(A) has an inverse transformation (e^(x)), you should be able to use the transformation to get firstly a CDF and then a PDF (differentiating).
 

Similar threads

  • · Replies 6 ·
Replies
6
Views
5K
  • · Replies 7 ·
Replies
7
Views
3K
  • · Replies 7 ·
Replies
7
Views
3K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 11 ·
Replies
11
Views
4K
  • · Replies 9 ·
Replies
9
Views
2K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 31 ·
2
Replies
31
Views
5K
  • · Replies 4 ·
Replies
4
Views
2K
Replies
20
Views
3K