Shannon's entropy definition

    Hi everyone. Today I was reading Wikipedia's article on information entropy,
    and I need some help understanding why the
    entropy of an event uses the term log2(1/p(xi)).
    The article says:
    "An intuitive understanding of information entropy relates to the amount of uncertainty about an event associated with a given probability distribution."
    If so, why is the first part of the equation, Sum p(xi), not enough on its own? Doesn't p(xi) already denote the amount of uncertainty of an event?
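To make the question concrete, here is a small Python sketch that computes both quantities for two example coins. The function names and the two distributions are made up for illustration and do not come from the article. It shows the plain sum of p(xi) next to the sum weighted by log2(1/p(xi)), so the two can be compared directly.

```python
import math


def sum_of_probabilities(probs):
    """Plain sum of p(xi), i.e. the formula without the log term."""
    return sum(probs)


def shannon_entropy(probs):
    """Sum of p(xi) * log2(1/p(xi)), in bits.

    Outcomes with p == 0 are skipped, following the usual
    convention that 0 * log2(1/0) is treated as 0.
    """
    return sum(p * math.log2(1.0 / p) for p in probs if p > 0)


# Hypothetical example distributions (assumed for illustration).
fair_coin = [0.5, 0.5]
biased_coin = [0.9, 0.1]

for name, dist in [("fair", fair_coin), ("biased", biased_coin)]:
    print(name, sum_of_probabilities(dist), shannon_entropy(dist))
```

For any valid distribution the plain sum comes out as 1, so it seems unable to tell the fair coin from the biased one, while the log-weighted sum gives 1 bit for the fair coin and less than 1 bit for the biased coin.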
