Confidence limits for the inverse of an estimated value |
| Jun26-12, 07:13 AM | #1 |
|
|
I am aware that, in statistics, things get difficult as soon as they get nonlinear. And taking the reciprocal of a quantity is a nonlinear operation.
I have some data that would form a nice-looking straight line, except for random error scattering the points around it. I have about fifty points in total. If I fit a regression line to the data, I can find an estimate of the slope of the line. In my particular case, the slope of the line (if I knew it precisely) would give me the coefficient λ in a first-order linear differential equation dC(t)/dt = -λC(t). Thus the regression analysis gives me an estimate for λ.
There is a standard formula for calculating confidence limits on the slope of a line computed via a regression analysis. This formula gives me the lower and upper confidence limits [itex]λ_{lower}[/itex] and [itex]λ_{upper}[/itex] on my estimate of λ.
The solution of the differential equation is C(t) = C(0) exp(-t/T), where the time constant T = 1/λ. It is the time constant T that is of real interest, because it tells me how long a system takes to settle following a disturbance.
Here is my question.
(A) What is the "best", in some appropriate sense, estimate for T? Is it simply 1/(my regression estimate for λ)?
(B) If so, what are the confidence limits for my estimate of T? Are they simply the inverses, [itex]1/λ_{lower}[/itex] and [itex]1/λ_{upper}[/itex], of my confidence limits on λ?
Thank you for any help. I assume it is a simple and straightforward question, but I have not succeeded in finding the answer or in working it out myself. |
| Jun26-12, 02:39 PM | #2 |
|
|
Hi Calvadosser,
|
| Jun27-12, 02:07 AM | #3 |
|
|
To follow on from what viraltux said, it ultimately depends on whether the specific transformation preserves the probabilistic information.
In some cases, if you are given say x and you need to find T = f(x), you can apply the transformation and the result keeps its probabilistic properties. In other cases it does not. One example is the highest posterior density interval in Bayesian analysis, which is not preserved under a nonlinear transformation. If you want to look into this problem in general, look for frameworks that deal with transformations of statistics, intervals, and other measures and that conserve the probabilistic information under transformation. |
| Jun27-12, 02:46 AM | #4 |
|
|
The average is 0.5, so the 'correct' time constant is 2. If we take 2 samples of X, X1 and X2, and calculate the time constant as the inverse of their mean, what is the expected value of the result?
[itex]\int_{x_1=0}^{1}\int_{x_2=0}^{1}\frac{2}{x_1+x_2}\,dx_2\,dx_1 = 2\int_{x_1=0}^{1}\left[\ln(x_1+x_2)\right]_{x_2=0}^{1}dx_1[/itex]
[itex]= 2\int_{x_1=0}^{1}\left(\ln(x_1+1)-\ln(x_1)\right)dx_1[/itex]
[itex]= 2\left[(x_1+1)\ln(x_1+1)-x_1-x_1\ln(x_1)+x_1\right]_{x_1=0}^{1}[/itex]
[itex]= 4\ln 2 \approx 2.77[/itex] |
| Jun27-12, 04:43 AM | #5 |
|
|
But anyway, it is true that inference is a whole world; you may get some seemingly crazy estimators once you squeeze a problem, but for the problem as presented by the OP, what he/she is doing is just OK. PS: If I ever go into space I want you to check the rocket engines; I know you won't let anything pass.
|
| Jun27-12, 05:07 PM | #6 |
|
|
viraltux, I believe I have correctly modelled the consequence of using OP's procedure. Try it in a spreadsheet. Generate 100 pairs of samples from U(0,1); for each pair take the average, then the inverse. You will almost always get a result > 2, often > 3.
Of course, this is fairly extreme. More samples in each set would give a smaller error. Calvadosser, can you take a look at the distribution of the lambdas? If we know more about that we might be able either to suggest a better procedure or to put bounds on the error. |
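The spreadsheet experiment described above can also be sketched in Python. This is only an illustration under stated assumptions: the function name `mean_inverse_of_average`, the seed, and the trial counts are made up for the example and are not from the thread. It draws sets of U(0,1) samples, inverts each set's average, and averages the results, once for sets of 2 and once for sets of 50 (roughly the size of the OP's data set).

```python
import math
import random


def mean_inverse_of_average(set_size, trials, rng):
    """Average of 1/mean(sample) over many independent U(0,1) sample sets."""
    total = 0.0
    for _ in range(trials):
        sample_mean = sum(rng.random() for _ in range(set_size)) / set_size
        total += 1.0 / sample_mean
    return total / trials


rng = random.Random(12345)  # arbitrary fixed seed so runs are repeatable
pairs = mean_inverse_of_average(set_size=2, trials=200_000, rng=rng)
fifties = mean_inverse_of_average(set_size=50, trials=20_000, rng=rng)

print(f"sets of 2 : {pairs:.3f}  (analytic value 4 ln 2 = {4 * math.log(2):.3f})")
print(f"sets of 50: {fifties:.3f}  (inverse of the true mean = 2)")
```

With pairs the result should land near 2.77; with sets of 50 it should sit only slightly above 2, in line with the remark that more samples per set give a smaller error.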
| Jun28-12, 03:32 AM | #7 |
|
|
The OP has a straight-line set of measurements in his model, and he estimates its slope via linear regression, obtaining the value [itex]\hat{λ}[/itex] and a standard error for the estimate based on a Gaussian distribution. He does not have 100 slopes to work with, which seems to be the way you are approaching the problem based on the example you posted, so even if he wanted to he could not take the inverse of every slope and see how the distribution of inverse slopes behaves (or the distribution of λ slopes, for that matter).
But even so, let's change the problem and imagine that he can actually measure a set of, let's say, n lambdas ([itex]λ_{1..n}[/itex]), and he wants to estimate [itex]T=1/λ[/itex]. What you suggest is [itex]\hat{T}= \frac{1/λ_1 + 1/λ_2 + ... +1/λ_n}{n} [/itex], but this is wrong; this approach allows the function that calculates T to bias its own estimate.
In these situations we apply what is called in inference the invariance principle, which basically states that if you have a function [itex]f[/itex] and you want to estimate [itex]f(θ)[/itex], then you take [itex]\widehat{f(θ)} = f(\hat{θ})[/itex], which in the OP's case would be [itex]\hat{T}=1/\hat{λ}[/itex]. By the way, this principle holds for whatever distribution θ, λ might have. |
| Jun28-12, 04:46 PM | #8 |
|
|
To find an unbiased estimator for a statistic s, the following is standard procedure:
- construct some candidate function [itex]f_s(\{x_i\})[/itex] of n observations
- compute [itex]E(f_s)[/itex] as a function of a presumed value s of the statistic
- see how [itex]E(f_s)[/itex] compares with s
E.g. with s being the mean, [itex]f_{mean}(\{x_i\}) = (\sum_i x_i)/n[/itex] gives [itex]E(f_{mean})[/itex] = mean, but with s being the variance, we get [itex]f_{var} = \sum_i \left(x_i - (\sum_j x_j)/n\right)^2/(n-1)[/itex].
If L is a linear function and [itex]f_s[/itex] is an unbiased estimator for s, then [itex]L(f_s)[/itex] is an unbiased estimator for L(s). But this does not work for nonlinear functions. E.g. for the standard deviation, [itex]\sqrt{f_{var}}[/itex] is not an unbiased estimator for [itex]\sqrt{var}[/itex]. (There are corrections that have been developed, but none are perfect.)
In my model for the OP's procedure, I took just two observations, computed the mean, and inverted. There are two ways we can try to assess this as an estimator for 1/mean. In my first post I assessed it analytically. Since you thought the analysis flawed, I then assessed it numerically. To find E(f) I had to generate lots of pairs and take the average result.
Now, as I said, my model was rather extreme. It took a uniform distribution 'close' to the origin (i.e. the std dev is a large fraction of the mean) and only used one pair of observations to compute the mean. Changing either of those would reduce the error. To get a bound on the error in the OP's procedure we need to know more about the distribution and the number of samples. |
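The point about nonlinear functions can be checked numerically. The following is a minimal sketch, assuming standard normal data (so the true variance and the true standard deviation are both 1) and samples of size 5; the seed, sample size, and trial count are illustrative choices, not values from the thread.

```python
import math
import random
import statistics

rng = random.Random(2012)  # arbitrary fixed seed for repeatability
n, trials = 5, 20_000

var_sum = 0.0
sd_sum = 0.0
for _ in range(trials):
    sample = [rng.gauss(0.0, 1.0) for _ in range(n)]
    f_var = statistics.variance(sample)  # sample variance, divides by n - 1
    var_sum += f_var
    sd_sum += math.sqrt(f_var)

mean_var = var_sum / trials
mean_sd = sd_sum / trials
print(f"average f_var       = {mean_var:.3f}  (true variance 1)")
print(f"average sqrt(f_var) = {mean_sd:.3f}  (true std dev 1)")
```

Under these assumptions the averaged variance estimate should come out close to 1, while the averaged square root should fall noticeably below 1, which is the bias introduced by the nonlinear step.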
| Jun28-12, 08:41 PM | #9 |
|
|
Second, the fact that an estimator is biased does not make it bad; for instance, the maximum likelihood estimator for the variance of a Gaussian distribution is biased, and yet it is the one preferred in many areas of multivariate analysis. So it seems that, in order to fix this (which we don't have to), you are asking the OP about the distribution of λ when actually there is no such thing; we only have one unique value for λ, which is estimated via MLE in a linear regression procedure. What we get is the estimate of λ from this method (or whatever method the OP used for the linear regression) and the error associated with it. That is all we have: no distributions, no number of points, nothing. In this scenario you apply the invariance principle to estimate T, which is what the OP is already doing. And when we calculate the confidence interval for T, we will see that the estimate of T is not centered in that interval, which accounts for the bias that seems to be the issue in this discussion. |
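The off-center interval for T can be seen with a short sketch. Because 1/x is decreasing for positive x, the statement [itex]λ_{lower} \le λ \le λ_{upper}[/itex] is the same statement as [itex]1/λ_{upper} \le T \le 1/λ_{lower}[/itex], so the limits carry over with their roles swapped. The function name `time_constant_interval` and the numbers below are hypothetical, chosen only for illustration, and the sketch assumes the interval for λ lies strictly above zero.

```python
def time_constant_interval(lam_hat, lam_lower, lam_upper):
    """Map a slope estimate and its confidence limits onto T = 1/lambda.

    Assumes 0 < lam_lower <= lam_hat <= lam_upper; if the interval for
    lambda reached zero, the interval for T would be unbounded.
    """
    if not 0 < lam_lower <= lam_hat <= lam_upper:
        raise ValueError("expected 0 < lam_lower <= lam_hat <= lam_upper")
    # 1/x is decreasing for x > 0, so the limits swap roles.
    return 1.0 / lam_hat, 1.0 / lam_upper, 1.0 / lam_lower


# Hypothetical numbers, chosen only to illustrate the asymmetry.
t_hat, t_lower, t_upper = time_constant_interval(0.5, 0.4, 0.6)
print(f"T = {t_hat:.3f}, limits ({t_lower:.3f}, {t_upper:.3f})")
print(f"distance below: {t_hat - t_lower:.3f}, above: {t_upper - t_hat:.3f}")
```

A symmetric interval for λ becomes an interval for T that extends further above the estimate than below it.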
| Jun28-12, 10:58 PM | #10 |
|
|
|
| Jun28-12, 11:06 PM | #11 |
|
|
I too am interested in the invariance principle, because although I know it exists for many probabilistic and statistical situations involving transformations, I've never actually checked it out, which is contributing to my ignorance.
Could you point out a reference or two for this, viraltux (maybe even a book that covers it)? |
| Jun29-12, 07:18 AM | #12 |
|
|
Hi chiro, I didn't find the paper I wanted, but this one proves that applying a function to an MLE returns the MLE of the function (which is the OP's case), and it has an example too: http://www.stats.ox.ac.uk/~dlunn/b8_02/b8pdf_6.pdf
OK, I am going to briefly (and dramatically) describe what happened back in the days when the "biased vs unbiased" lesson came up in my faculty.
- professor: "What is best, a biased or an unbiased estimator?"
- students: "Unbiased", "Of course", "I agree", "What kind of question is that?"
- professor: "OK, why?"
- students: ...
- professor: "Soooooooooooooo..."
- students: "Well, if you got it biased, and you know it is biased, and you can even calculate how biased it is, well, then you can take the bias away! Why would anyone want to use a calculation that is known to be consistently higher or lower?"
- professor: "That's right, why would anyone do that?"
- students: ...
I am sure this situation is a classic in every inference course. Also a classic is to take the estimation of the Gaussian variance as an example. Let's consider the following three estimators for the variance:
[itex]S_{unbiased} = \frac{1}{n-1} \sum_{i=1}^n\left(x_i - \overline{x} \right)^ 2[/itex]
[itex]S_{MLE} = \frac{1}{n} \sum_{i=1}^n\left(x_i - \overline{x} \right)^ 2[/itex]
[itex]S_{LSE} = \frac{1}{n+1} \sum_{i=1}^n\left(x_i - \overline{x} \right)^ 2[/itex]
Well, it turns out that among these three the unbiased estimator is the one with the highest error (in terms of mean squared error)! OK, this is hard to believe and very counterintuitive because, gee, we know it is biased, so fix it, right? Well, you do that, and you introduce error. You know, I had the mathematical proof in front of my eyes and I still had to run a simulation to believe it, but it is true!
Then one student (in this case it was me) asked: "But everyone in school, engineers, physicists... they all use the unbiased version to estimate the variance. Why on Earth don't we all use the LSE version with the lowest error!?" The professor answered "Not everything that shines is gold" and went on with the class.
Oh well, I had to make all kinds of guesses about my professor's statement on why people don't widely use the LSE version, but I will not go on with this here. For now, suffice it to say that what the OP is doing is OK, and that unbiased doesn't mean "best". |
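The comparison of the three variance estimators can be reproduced with a short simulation. This is a sketch under stated assumptions, not the simulation the poster ran: it uses standard normal data (true variance 1), samples of size n = 10, and an arbitrary seed and trial count. The same samples are fed to all three divisors so the comparison is paired.

```python
import random

rng = random.Random(42)  # arbitrary fixed seed for repeatability
n, trials = 10, 20_000
true_var = 1.0

# Accumulated squared error for each divisor of the sum of squares:
# n - 1 (unbiased), n (MLE), n + 1 (the "LSE" version in the post).
sq_err = {n - 1: 0.0, n: 0.0, n + 1: 0.0}
for _ in range(trials):
    x = [rng.gauss(0.0, 1.0) for _ in range(n)]
    x_bar = sum(x) / n
    ss = sum((xi - x_bar) ** 2 for xi in x)
    for divisor in sq_err:
        sq_err[divisor] += (ss / divisor - true_var) ** 2

mse = {divisor: total / trials for divisor, total in sq_err.items()}
for divisor, value in mse.items():
    print(f"divide by {divisor:2d}: mean squared error = {value:.4f}")
```

For Gaussian data with variance [itex]σ^2[/itex], the mean squared error of the estimator with divisor k works out to [itex]σ^4\,[2(n-1) + (n-1-k)^2]/k^2[/itex], which for n = 10 gives roughly 0.222, 0.190 and 0.182 for the divisors 9, 10 and 11, so the simulated values should fall in that order.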
| Jun29-12, 07:25 AM | #13 |
|
|
Thank you viraltux for that result, I'll have to remember that because it's going to be very useful. I'm sure it's probably in my introductory stats book, but it's definitely good to know that it holds.
|
| Jun29-12, 07:35 AM | #14 |
|
|
|
| Jun29-12, 03:38 PM | #15 |
|
|
My thanks for the replies - and for the discussion, some of which (but not all) is over my head. I had originally supposed that more or less the same question is asked very frequently.
You have very kindly:
- Reassured me that what I propose is, at the very least, not a stupid thing to do.
- Shown me that the question is deeper than I had imagined.
I'll add it to my ever-growing list of interesting things to look into, when other things don't take up all the available time. |
| Jun29-12, 04:13 PM | #16 |
|
|
(I'm wondering why you chose to use regression instead of fitting a curve to the C(t) data.) |
| Jun29-12, 05:57 PM | #17 |
|
|
I consider this to be the answer to your prof's challenge. Choice of estimator should depend on the cost function. However, in the absence of beliefs regarding cost function, I gather that the gold standard is not the estimator with the least MSE, but the unbiased estimator with least MSE (http://en.wikipedia.org/wiki/Mean_sq...Interpretation). OTOH, a cost function which grows faster than the square of the error might well indicate use of one of the biased estimators. Haven't tried to figure that out. |