View Full Version : Incomplete knowledge and statistics
Arnold Neumaier
Aug16-04, 12:55 PM
<jabberwocky><div class="vbmenu_control"><a href="jabberwocky:;" onClick="newWindow=window.open('','usenetCode','toolbar=no, location=no,scrollbars=yes,resizable=yes,status=no ,width=650,height=400'); newWindow.document.write('<HTML><HEAD><TITLE>Usenet ASCII</TITLE></HEAD><BODY topmargin=0 leftmargin=0 BGCOLOR=#F1F1F1><table border=0 width=625><td bgcolor=midnightblue><font color=#F1F1F1>This Usenet message\'s original ASCII form: </font></td></tr><tr><td width=449><br><br><font face=courier><UL><PRE>\n\n\n\nIgor Khavkine wrote on p.2 of\n\n> http://www.physics.utoronto.ca/~igor/densityop.pdf\n\n\'\'statistical mechanics may be applied to a system\nwith any number of degrees of freedom as long as\nthere is not enough information available to apply\nthe laws of the underlying precise mechanical theory.\'\'\n\n\n\nIt is offen erroneously assumed that incomplete knowledge can\nalways be described by statistics. But this is by no means the case.\n\nIf you know about a number x only that it is in [0,1], you cannot apply\nstatistics since you know nothing at all about the distribution\n(except for its support). In particular, it would be a mistake to assume\nthat the distribution is uniform (ignorance interpretation)\n-- the noninformative prior of the Bayesian school, which makes this\nassumption, may be seriously flawed, since it is perfectly consistent with\nthe knowledge that in fact always x=0.75, except that you don\'t know it,\nor that x oscillates regularly, or.... The ignorance is in this case\nsimply deterministic lack of information.\n\nIn general, all one can deduce from information that takes the form of\ndeterministic bounds on a vector x of variables and/or on expressions in x\nis bounds on derived quantities y=f(x) one would like to compute from it.\nThis leads to global optimization problems, where f(x) is minimized or\nmaximized subject to the known constraints. See\nhttp://www.mat.univie.ac.at/~neum/glopt/intro.html\n\nStochastic lack of knowledge is of a different kind - it assumes that\nthe _maximal_attainable_ knowledge about the system, at the given level\nof description, is a probability distribution, and that this probability\ndistribution is indeed known.\n\nThere are also combinations of both, where one knows the maximal knowledge\nabout a system should be stochastic, but one lacks complete information\non the distribution. This is handled by the emerging field of \'imprecise\nprobability\', although there is not yet a generally accepted way for\nanalyzing such situations, and different schools with quite different\nbasic approaches compete. See, e.g,\nhttp://class.ee.iastate.edu/berleant/home/ServeInfo/Interval/intprob.html\n\nTheoretical physics is always concerned about describing the maximal\nattainable knowledge about a system (at a given level of description),\nirrespective of what anyone actually knows about it. In this way, and only\nin this way, it is possible to get close to the objectivity that science\nalways is striving for.\n\n\nArnold Neumaier\n</UL></PRE></font></td></tr></table></BODY><HTML>');"> <IMG SRC=/images/buttons/ip.gif BORDER=0 ALIGN=CENTER ALT="View this Usenet post in original ASCII form"> View this Usenet post in original ASCII form </a></div><P></jabberwocky>Igor Khavkine wrote on p.2 of
> http://www.physics.utoronto.ca/~igor/densityop.pdf
''statistical mechanics may be applied to a system
with any number of degrees of freedom as long as
there is not enough information available to apply
the laws of the underlying precise mechanical theory.''
It is offen erroneously assumed that incomplete knowledge can
always be described by statistics. But this is by no means the case.
If you know about a number x only that it is in [0,1], you cannot apply
statistics since you know nothing at all about the distribution
(except for its support). In particular, it would be a mistake to assume
that the distribution is uniform (ignorance interpretation)
-- the noninformative prior of the Bayesian school, which makes this
assumption, may be seriously flawed, since it is perfectly consistent with
the knowledge that in fact always x=0.75, except that you don't know it,
or that x oscillates regularly, or.... The ignorance is in this case
simply deterministic lack of information.
In general, all one can deduce from information that takes the form of
deterministic bounds on a vector x of variables and/or on expressions in x
is bounds on derived quantities y=f(x) one would like to compute from it.
This leads to global optimization problems, where f(x) is minimized or
maximized subject to the known constraints. See
http://www.mat.univie.ac.at/~neum/glopt/intro.html
Stochastic lack of knowledge is of a different kind - it assumes that
the _maximal_attainable_ knowledge about the system, at the given level
of description, is a probability distribution, and that this probability
distribution is indeed known.
There are also combinations of both, where one knows the maximal knowledge
about a system should be stochastic, but one lacks complete information
on the distribution. This is handled by the emerging field of 'imprecise
probability', although there is not yet a generally accepted way for
analyzing such situations, and different schools with quite different
basic approaches compete. See, e.g,
http://class.ee.iastate.edu/berleant/home/ServeInfo/Interval/intprob.html
Theoretical physics is always concerned about describing the maximal
attainable knowledge about a system (at a given level of description),
irrespective of what anyone actually knows about it. In this way, and only
in this way, it is possible to get close to the objectivity that science
always is striving for.
Arnold Neumaier
Nick Maclaren
Aug16-04, 01:32 PM
<jabberwocky><div class="vbmenu_control"><a href="jabberwocky:;" onClick="newWindow=window.open('','usenetCode','toolbar=no, location=no,scrollbars=yes,resizable=yes,status=no ,width=650,height=400'); newWindow.document.write('<HTML><HEAD><TITLE>Usenet ASCII</TITLE></HEAD><BODY topmargin=0 leftmargin=0 BGCOLOR=#F1F1F1><table border=0 width=625><td bgcolor=midnightblue><font color=#F1F1F1>This Usenet message\'s original ASCII form: </font></td></tr><tr><td width=449><br><br><font face=courier><UL><PRE>\nIn article <411FA51F.9050202@univie.ac.at>,\nArnold Neumaier <Arnold.Neumaier@univie.ac.at> wrote:\n>\n>Igor Khavkine wrote on p.2 of\n>\n>\'\'statistical mechanics may be applied to a system\n>with any number of degrees of freedom as long as\n>there is not enough information available to apply\n>the laws of the underlying precise mechanical theory.\'\'\n>\n>It is offen erroneously assumed that incomplete knowledge can\n>always be described by statistics. But this is by no means the case.\n\nIt slightly depends on what you mean by statistics. Some people\nhave used mathematical models that are significantly more general\nthan the \'standard\' one, and referred to them as statistical.\nAs usual, the increase in generality has led to a reduction in\nthe number of results that can be proved.\n\n>If you know about a number x only that it is in [0,1], you cannot apply\n>statistics since you know nothing at all about the distribution\n>(except for its support). In particular, it would be a mistake to assume\n>that the distribution is uniform (ignorance interpretation)\n>-- the noninformative prior of the Bayesian school, which makes this\n>assumption, may be seriously flawed, since it is perfectly consistent with\n>the knowledge that in fact always x=0.75, except that you don\'t know it,\n>or that x oscillates regularly, or.... The ignorance is in this case\n>simply deterministic lack of information.\n\nAgain, you are mistaken. You ARE right in your diatribe against the\nnon-informative prior, but you are wrong that you can\'t apply any\nform of statistics (or even Bayesian statistics). If you get hold\nof a good book on the mathematical foundations of statistics (NOT\none written for laymen, like physicists), you will see that.\n\n>Stochastic lack of knowledge is of a different kind - it assumes that\n>the _maximal_attainable_ knowledge about the system, at the given level\n>of description, is a probability distribution, and that this probability\n>distribution is indeed known.\n\nGrrk. Well, maybe. "Stochastic lack of knowledge" isn\'t a term I\nam familiar with.\n\n>There are also combinations of both, where one knows the maximal knowledge\n>about a system should be stochastic, but one lacks complete information\n>on the distribution. This is handled by the emerging field of \'imprecise\n>probability\', although there is not yet a generally accepted way for\n>analyzing such situations, and different schools with quite different\n>basic approaches compete. See, e.g,\n\n"Emerging"? The subject has been studied extensively since the 1920s!\nYou are right that there is no generally accepted way of analysing\nsuch problems.\n\n\nRegards,\nNick Maclaren.\n</UL></PRE></font></td></tr></table></BODY><HTML>');"> <IMG SRC=/images/buttons/ip.gif BORDER=0 ALIGN=CENTER ALT="View this Usenet post in original ASCII form"> View this Usenet post in original ASCII form </a></div><P></jabberwocky>In article <411FA51F.9050202@univie.ac.at>,
Arnold Neumaier <Arnold.Neumaier@univie.ac.at> wrote:
>
>Igor Khavkine wrote on p.2 of
>
>''statistical mechanics may be applied to a system
>with any number of degrees of freedom as long as
>there is not enough information available to apply
>the laws of the underlying precise mechanical theory.''
>
>It is offen erroneously assumed that incomplete knowledge can
>always be described by statistics. But this is by no means the case.
It slightly depends on what you mean by statistics. Some people
have used mathematical models that are significantly more general
than the 'standard' one, and referred to them as statistical.
As usual, the increase in generality has led to a reduction in
the number of results that can be proved.
>If you know about a number x only that it is in [0,1], you cannot apply
>statistics since you know nothing at all about the distribution
>(except for its support). In particular, it would be a mistake to assume
>that the distribution is uniform (ignorance interpretation)
>-- the noninformative prior of the Bayesian school, which makes this
>assumption, may be seriously flawed, since it is perfectly consistent with
>the knowledge that in fact always x=0.75, except that you don't know it,
>or that x oscillates regularly, or.... The ignorance is in this case
>simply deterministic lack of information.
Again, you are mistaken. You ARE right in your diatribe against the
non-informative prior, but you are wrong that you can't apply any
form of statistics (or even Bayesian statistics). If you get hold
of a good book on the mathematical foundations of statistics (NOT
one written for laymen, like physicists), you will see that.
>Stochastic lack of knowledge is of a different kind - it assumes that
>the _maximal_attainable_ knowledge about the system, at the given level
>of description, is a probability distribution, and that this probability
>distribution is indeed known.
Grrk. Well, maybe. "Stochastic lack of knowledge" isn't a term I
am familiar with.
>There are also combinations of both, where one knows the maximal knowledge
>about a system should be stochastic, but one lacks complete information
>on the distribution. This is handled by the emerging field of 'imprecise
>probability', although there is not yet a generally accepted way for
>analyzing such situations, and different schools with quite different
>basic approaches compete. See, e.g,
"Emerging"? The subject has been studied extensively since the 1920s!
You are right that there is no generally accepted way of analysing
such problems.
Regards,
Nick Maclaren.
Arnold Neumaier
Aug17-04, 11:26 AM
<jabberwocky><div class="vbmenu_control"><a href="jabberwocky:;" onClick="newWindow=window.open('','usenetCode','toolbar=no, location=no,scrollbars=yes,resizable=yes,status=no ,width=650,height=400'); newWindow.document.write('<HTML><HEAD><TITLE>Usenet ASCII</TITLE></HEAD><BODY topmargin=0 leftmargin=0 BGCOLOR=#F1F1F1><table border=0 width=625><td bgcolor=midnightblue><font color=#F1F1F1>This Usenet message\'s original ASCII form: </font></td></tr><tr><td width=449><br><br><font face=courier><UL><PRE>\n\n\n\nNick Maclaren wrote:\n> In article <411FA51F.9050202@univie.ac.at>,\n> Arnold Neumaier <Arnold.Neumaier@univie.ac.at> wrote:\n>\n>>If you know about a number x only that it is in [0,1], you cannot apply\n>>statistics since you know nothing at all about the distribution\n>>(except for its support). In particular, it would be a mistake to assume\n>>that the distribution is uniform (ignorance interpretation)\n>>-- the noninformative prior of the Bayesian school, which makes this\n>>assumption, may be seriously flawed, since it is perfectly consistent with\n>>the knowledge that in fact always x=0.75, except that you don\'t know it,\n>>or that x oscillates regularly, or.... The ignorance is in this case\n>>simply deterministic lack of information.\n>\n> Again, you are mistaken. You ARE right in your diatribe against the\n> non-informative prior, but you are wrong that you can\'t apply any\n> form of statistics (or even Bayesian statistics). If you get hold\n> of a good book on the mathematical foundations of statistics (NOT\n> one written for laymen, like physicists), you will see that.\n\nIf I am wrong, please enlighten me and sketch how to apply any\nform of statistics (or even Bayesian statistics) to this case.\nI know many books on statistics, and processed many statistical\napplications, but never came across techniques for doing statistics\nabout a sequence of numbers x_l of which you only know they are in [0,1].\n\n\n>>Stochastic lack of knowledge is of a different kind - it assumes that\n>>the _maximal_attainable_ knowledge about the system, at the given level\n>>of description, is a probability distribution, and that this probability\n>>distribution is indeed known.\n>\n> Grrk. Well, maybe. "Stochastic lack of knowledge" isn\'t a term I\n> am familiar with.\n\nMaybe I used a poor formulation. I meant:\n\'The lack of knowledge that statistics can model\'\nwhich should be intelligible.\n\n\n>>There are also combinations of both, where one knows the maximal knowledge\n>>about a system should be stochastic, but one lacks complete information\n>>on the distribution. This is handled by the emerging field of \'imprecise\n>>probability\', although there is not yet a generally accepted way for\n>>analyzing such situations, and different schools with quite different\n>>basic approaches compete. See, e.g,\n>\n> "Emerging"? The subject has been studied extensively since the 1920s!\n\nOn a very theoretical level only. Computationally useful versions (for more\nthan the most elementary problems) have not been forthcoming till recently.\n\n\nArnold Neumaier\n\n\n</UL></PRE></font></td></tr></table></BODY><HTML>');"> <IMG SRC=/images/buttons/ip.gif BORDER=0 ALIGN=CENTER ALT="View this Usenet post in original ASCII form"> View this Usenet post in original ASCII form </a></div><P></jabberwocky>Nick Maclaren wrote:
> In article <411FA51F.9050202@univie.ac.at>,
> Arnold Neumaier <Arnold.Neumaier@univie.ac.at> wrote:
>
>>If you know about a number x only that it is in [0,1], you cannot apply
>>statistics since you know nothing at all about the distribution
>>(except for its support). In particular, it would be a mistake to assume
>>that the distribution is uniform (ignorance interpretation)
>>-- the noninformative prior of the Bayesian school, which makes this
>>assumption, may be seriously flawed, since it is perfectly consistent with
>>the knowledge that in fact always x=0.75, except that you don't know it,
>>or that x oscillates regularly, or.... The ignorance is in this case
>>simply deterministic lack of information.
>
> Again, you are mistaken. You ARE right in your diatribe against the
> non-informative prior, but you are wrong that you can't apply any
> form of statistics (or even Bayesian statistics). If you get hold
> of a good book on the mathematical foundations of statistics (NOT
> one written for laymen, like physicists), you will see that.
If I am wrong, please enlighten me and sketch how to apply any
form of statistics (or even Bayesian statistics) to this case.
I know many books on statistics, and processed many statistical
applications, but never came across techniques for doing statistics
about a sequence of numbers x_l of which you only know they are in [0,1].
>>Stochastic lack of knowledge is of a different kind - it assumes that
>>the _maximal_attainable_ knowledge about the system, at the given level
>>of description, is a probability distribution, and that this probability
>>distribution is indeed known.
>
> Grrk. Well, maybe. "Stochastic lack of knowledge" isn't a term I
> am familiar with.
Maybe I used a poor formulation. I meant:
'The lack of knowledge that statistics can model'
which should be intelligible.
>>There are also combinations of both, where one knows the maximal knowledge
>>about a system should be stochastic, but one lacks complete information
>>on the distribution. This is handled by the emerging field of 'imprecise
>>probability', although there is not yet a generally accepted way for
>>analyzing such situations, and different schools with quite different
>>basic approaches compete. See, e.g,
>
> "Emerging"? The subject has been studied extensively since the 1920s!
On a very theoretical level only. Computationally useful versions (for more
than the most elementary problems) have not been forthcoming till recently.
Arnold Neumaier
Nick Maclaren
Aug17-04, 01:28 PM
<jabberwocky><div class="vbmenu_control"><a href="jabberwocky:;" onClick="newWindow=window.open('','usenetCode','toolbar=no, location=no,scrollbars=yes,resizable=yes,status=no ,width=650,height=400'); newWindow.document.write('<HTML><HEAD><TITLE>Usenet ASCII</TITLE></HEAD><BODY topmargin=0 leftmargin=0 BGCOLOR=#F1F1F1><table border=0 width=625><td bgcolor=midnightblue><font color=#F1F1F1>This Usenet message\'s original ASCII form: </font></td></tr><tr><td width=449><br><br><font face=courier><UL><PRE>\nIn article <4121F9CC.6080106@univie.ac.at>,\nArnold Neumaier <Arnold.Neumaier@univie.ac.at> wrote:\n>\n>If I am wrong, please enlighten me and sketch how to apply any\n>form of statistics (or even Bayesian statistics) to this case.\n>I know many books on statistics, and processed many statistical\n>applications, but never came across techniques for doing statistics\n>about a sequence of numbers x_l of which you only know they are in [0,1].\n\nWhat you actually SAID was: "If you know about a number x only that\nit is in [0,1], you cannot apply statistics since you know nothing at\nall about the distribution (except for its support)."\n\nYou are therefore implying that there IS an underlying distribution.\nThat being so, you can estimate it by using the empirical PDF, you\nan reason about transformations and so on. You can test whether two\nsamples come from the same distribution, have the same median, and\nso on. That\'s all traditional probability and statistics.\n\nStart with non-parametric statistics, and follow references. Stuart\nand Kendall (as was) is a good place for find a summary - it is now\ncalled Kendall\'s Advanced Theory of Statistics, if I recall.\n\n>>>There are also combinations of both, where one knows the maximal knowledge\n>>>about a system should be stochastic, but one lacks complete information\n>>>on the distribution. This is handled by the emerging field of \'imprecise\n>>>probability\', although there is not yet a generally accepted way for\n>>>analyzing such situations, and different schools with quite different\n>>>basic approaches compete. See, e.g,\n>>\n>> "Emerging"? The subject has been studied extensively since the 1920s!\n>\n>On a very theoretical level only. Computationally useful versions (for more\n>than the most elementary problems) have not been forthcoming till recently.\n\nWell, I am a bit out of touch, so you might be right that something\nhas emerged, but I dabbled with that in the 1970s and most of the\nresults were 30+ years old then. If you have any good references,\nI should be interested to look - but please remember that I would\nwant to see ones to the underlying mathematics.\n\nI think that it is far more likely that some non-specialists have\njust rediscovered some old results.\n\n\nRegards,\nNick Maclaren.\n</UL></PRE></font></td></tr></table></BODY><HTML>');"> <IMG SRC=/images/buttons/ip.gif BORDER=0 ALIGN=CENTER ALT="View this Usenet post in original ASCII form"> View this Usenet post in original ASCII form </a></div><P></jabberwocky>In article <4121F9CC.6080106@univie.ac.at>,
Arnold Neumaier <Arnold.Neumaier@univie.ac.at> wrote:
>
>If I am wrong, please enlighten me and sketch how to apply any
>form of statistics (or even Bayesian statistics) to this case.
>I know many books on statistics, and processed many statistical
>applications, but never came across techniques for doing statistics
>about a sequence of numbers x_l of which you only know they are in [0,1].
What you actually SAID was: "If you know about a number x only that
it is in [0,1], you cannot apply statistics since you know nothing at
all about the distribution (except for its support)."
You are therefore implying that there IS an underlying distribution.
That being so, you can estimate it by using the empirical PDF, you
an reason about transformations and so on. You can test whether two
samples come from the same distribution, have the same median, and
so on. That's all traditional probability and statistics.
Start with non-parametric statistics, and follow references. Stuart
and Kendall (as was) is a good place for find a summary - it is now
called Kendall's Advanced Theory of Statistics, if I recall.
>>>There are also combinations of both, where one knows the maximal knowledge
>>>about a system should be stochastic, but one lacks complete information
>>>on the distribution. This is handled by the emerging field of 'imprecise
>>>probability', although there is not yet a generally accepted way for
>>>analyzing such situations, and different schools with quite different
>>>basic approaches compete. See, e.g,
>>
>> "Emerging"? The subject has been studied extensively since the 1920s!
>
>On a very theoretical level only. Computationally useful versions (for more
>than the most elementary problems) have not been forthcoming till recently.
Well, I am a bit out of touch, so you might be right that something
has emerged, but I dabbled with that in the 1970s and most of the
results were 30+ years old then. If you have any good references,
I should be interested to look - but please remember that I would
want to see ones to the underlying mathematics.
I think that it is far more likely that some non-specialists have
just rediscovered some old results.
Regards,
Nick Maclaren.
Arnold Neumaier
Aug19-04, 12:39 PM
<jabberwocky><div class="vbmenu_control"><a href="jabberwocky:;" onClick="newWindow=window.open('','usenetCode','toolbar=no, location=no,scrollbars=yes,resizable=yes,status=no ,width=650,height=400'); newWindow.document.write('<HTML><HEAD><TITLE>Usenet ASCII</TITLE></HEAD><BODY topmargin=0 leftmargin=0 BGCOLOR=#F1F1F1><table border=0 width=625><td bgcolor=midnightblue><font color=#F1F1F1>This Usenet message\'s original ASCII form: </font></td></tr><tr><td width=449><br><br><font face=courier><UL><PRE>Nick Maclaren wrote:\n> In article <4121F9CC.6080106@univie.ac.at>,\n> Arnold Neumaier <Arnold.Neumaier@univie.ac.at> wrote:\n>\n>>If I am wrong, please enlighten me and sketch how to apply any\n>>form of statistics (or even Bayesian statistics) to this case.\n>>I know many books on statistics, and processed many statistical\n>>applications, but never came across techniques for doing statistics\n>>about a sequence of numbers x_l of which you only know they are in [0,1].\n>\n> What you actually SAID was: "If you know about a number x only that\n> it is in [0,1], you cannot apply statistics since you know nothing at\n> all about the distribution (except for its support)."\n>\n> You are therefore implying that there IS an underlying distribution.\n\nYes. The distribution is a sum of delta function centered at the x_l.\n\n\n> That being so, you can estimate it by using the empirical PDF,\n\nHow do you compute this empirical PDF if all you know is that the\nx_l are in [0,1]? The only information you get is that the PDF is zero\nfor x not in [0,1]. Thus statistics is powerless.\nOn the other hand, if you want to calculate the value of y=x-x^2, say,\ndeterministic analysis gives you the information y in [0, 1/4].\nNo amount of statistical analysis can give you any information beyond this.\n\nThus, incomplete knowledge in terms of deterministic bounds requires\ntraditional analysis; incomplete knowledge in terms of distributions\nrequires probability theory; and incomplete knowledge in terms of\nan incomplete sample from the ensemble of interest requires statistical\nestimation.\n\n\n> and reason about transformations and so on. You can test whether two\n> samples come from the same distribution, have the same median, and\n> so on. That\'s all traditional probability and statistics.\n\nThis all requires that you know the values of the x_l, which is much more\nknowledge than was assumed in the beginning. Of course, knowing a sample\none can do sample statistics and estimate parameters of distributions that\nare highly compatible with the sample.\n\n\n>>>>There are also combinations of both, where one knows the maximal knowledge\n>>>>about a system should be stochastic, but one lacks complete information\n>>>>on the distribution. This is handled by the emerging field of \'imprecise\n>>>>probability\', although there is not yet a generally accepted way for\n>>>>analyzing such situations, and different schools with quite different\n>>>>basic approaches compete. See, e.g,\n>>>\n>>>"Emerging"? The subject has been studied extensively since the 1920s!\n>>\n>>On a very theoretical level only. Computationally useful versions (for more\n>>than the most elementary problems) have not been forthcoming till recently.\n>\n> Well, I am a bit out of touch, so you might be right that something\n> has emerged, but I dabbled with that in the 1970s and most of the\n> results were 30+ years old then. If you have any good references,\n> I should be interested to look - but please remember that I would\n> want to see ones to the underlying mathematics.\n\nI quoted already the site\nhttp://class.ee.iastate.edu/berleant/home/ServeInfo/Interval/intprob.html\nwhere you can find a lot of links to people and relevant sites; in\nparticular those in Sections 3 and 4. Books about the underlying\nmathematics are given in Section 5.\n\n\n> I think that it is far more likely that some non-specialists have\n> just rediscovered some old results.\n\nNo. These are developments of the old results with interesting new\nperspectives. Many of those working now on it have been fully aware\nof what existed in the past.\n\n\nArnold Neumaier\n\n</UL></PRE></font></td></tr></table></BODY><HTML>');"> <IMG SRC=/images/buttons/ip.gif BORDER=0 ALIGN=CENTER ALT="View this Usenet post in original ASCII form"> View this Usenet post in original ASCII form </a></div><P></jabberwocky>Nick Maclaren wrote:
> In article <4121F9CC.6080106@univie.ac.at>,
> Arnold Neumaier <Arnold.Neumaier@univie.ac.at> wrote:
>
>>If I am wrong, please enlighten me and sketch how to apply any
>>form of statistics (or even Bayesian statistics) to this case.
>>I know many books on statistics, and processed many statistical
>>applications, but never came across techniques for doing statistics
>>about a sequence of numbers x_l of which you only know they are in [0,1].
>
> What you actually SAID was: "If you know about a number x only that
> it is in [0,1], you cannot apply statistics since you know nothing at
> all about the distribution (except for its support)."
>
> You are therefore implying that there IS an underlying distribution.
Yes. The distribution is a sum of \delta function centered at the x_l.
> That being so, you can estimate it by using the empirical PDF,
How do you compute this empirical PDF if all you know is that the
x_l are in [0,1]? The only information you get is that the PDF is zero
for x not in [0,1]. Thus statistics is powerless.
On the other hand, if you want to calculate the value of y=x-x^2, say,
deterministic analysis gives you the information y in [0, 1/4].
No amount of statistical analysis can give you any information beyond this.
Thus, incomplete knowledge in terms of deterministic bounds requires
traditional analysis; incomplete knowledge in terms of distributions
requires probability theory; and incomplete knowledge in terms of
an incomplete sample from the ensemble of interest requires statistical
estimation.
> and reason about transformations and so on. You can test whether two
> samples come from the same distribution, have the same median, and
> so on. That's all traditional probability and statistics.
This all requires that you know the values of the x_l, which is much more
knowledge than was assumed in the beginning. Of course, knowing a sample
one can do sample statistics and estimate parameters of distributions that
are highly compatible with the sample.
>>>>There are also combinations of both, where one knows the maximal knowledge
>>>>about a system should be stochastic, but one lacks complete information
>>>>on the distribution. This is handled by the emerging field of 'imprecise
>>>>probability', although there is not yet a generally accepted way for
>>>>analyzing such situations, and different schools with quite different
>>>>basic approaches compete. See, e.g,
>>>
>>>"Emerging"? The subject has been studied extensively since the 1920s!
>>
>>On a very theoretical level only. Computationally useful versions (for more
>>than the most elementary problems) have not been forthcoming till recently.
>
> Well, I am a bit out of touch, so you might be right that something
> has emerged, but I dabbled with that in the 1970s and most of the
> results were 30+ years old then. If you have any good references,
> I should be interested to look - but please remember that I would
> want to see ones to the underlying mathematics.
I quoted already the site
http://class.ee.iastate.edu/berleant/home/ServeInfo/Interval/intprob.html
where you can find a lot of links to people and relevant sites; in
particular those in Sections 3 and 4. Books about the underlying
mathematics are given in Section 5.
> I think that it is far more likely that some non-specialists have
> just rediscovered some old results.
No. These are developments of the old results with interesting new
perspectives. Many of those working now on it have been fully aware
of what existed in the past.
Arnold Neumaier
vBulletin® v3.8.7, Copyright ©2000-2012, vBulletin Solutions, Inc.