Can a Beta Distribution Model Scores in the Interval [0,1] for Ranked Retrieval?

TheOldHag · Jun 26, 2014

I have a set of scored items with the scores in the interval [0,1]. Roughly speaking the distribution of scores is about 50% equal to 0 and then sloping steeply downward all the way toward one or near to one. I want to fit this data to a distribution and use that down the road in some calculation but I'm not sure how to proceed.

My guess is that since the data lay in the interval [0,1] it can be modeled as a beta distribution. So now I need to find the parameters alpha and beta. Is it easy as calculating the sample mean and the sample variance and working backwards from the equations for the mean and variance of a beta distribution or does that only work for normal distributions? Since these are sample do they approximate a normal distribution so that I should be fitting a normal distribution to the data despite the interval [0,1] (it would have very thin tails)? Comments appreciated.

Greg Bernhardt · Jul 1, 2014

I'm sorry you are not generating any responses at the moment. Is there any additional information you can share with us? Any new findings?

TheOldHag · Jul 2, 2014

I think this contains what I'm looking for but have not dug in yet since this problem has been set aside temporarily.

http://dare.uva.nl/document/125861

The general issue I'm having here surrounds ranked retrieval. I have rankings and they do work and I can present them in descending order to the user so that they can see more relevant items first. But it is useful for a variety of other applications to know what is the probability of relevance given a score (or non-relevance). This paper here seems to construct two distributions for the score given relevant and given non-relevant and then goes from there. Another thing I can do with this is project a possible curve of precision and recall as the user proceeds through the items in ranked order.

Can a Beta Distribution Model Scores in the Interval [0,1] for Ranked Retrieval?

Similar threads

Undergrad My basic understanding of set theory

Undergrad The problem of points

Graduate Expected numbers of cards of a last color remaining

Graduate Probability puzzle

Undergrad How does axiom of foundation prevent infinite sequence of elements?

Insights Revisiting the Velocity-Time Function

Insights Remote Operated Gate Control System

Insights AI Enriched Problem Solving

Insights Thinking Outside The Box Versus Knowing What’s In The Box

Insights Why Entangled Photon-Polarization Qubits Violate Bell’s Inequality

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect