Help me understand skewness in QQ-plots, please

bremenfallturm
TL;DR Summary
I am trying to understand how QQ plots work, but I have a hard time understanding how to interpret skewness. Specifically, it is "the other way around" than I expect. See the post for an explanation.

Let me explain.

From what I understand, in a QQ plot we divide the normal distribution (typically ##N(0,1)##) and the dataset into ##n## quantiles (where ##n## is the number of datapoints). We sort the dataset and plot each datapoint against the corresponding normal quantile. For example, if we have 10 ordered datapoints ##a_1, a_2, \dots, a_{10}## and have created 10 normal quantiles ##n_1, n_2, \dots, n_{10}##, we would plot ##(n_1, a_1), (n_2, a_2)##, and so on, with the theoretical quantiles on the horizontal axis and the sample quantiles on the vertical axis.
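To make the construction concrete, here is a minimal Python sketch (my own illustration, not from the linked page; the plotting positions ##(i - 0.5)/n## are just one common convention among several):

```python
import numpy as np
import matplotlib.pyplot as plt
from scipy.stats import norm

rng = np.random.default_rng(0)
a = np.sort(rng.normal(size=10))       # ordered sample a_1 <= ... <= a_10

n = len(a)
p = (np.arange(1, n + 1) - 0.5) / n    # plotting positions (one common convention)
q = norm.ppf(p)                        # theoretical N(0,1) quantiles n_1, ..., n_10

plt.scatter(q, a)                      # plot (n_i, a_i): theoretical on x, sample on y
plt.axline((0, 0), slope=1, color="gray")  # y = x reference line for an N(0,1) comparison
plt.xlabel("Theoretical quantiles")
plt.ylabel("Sample quantiles")
plt.show()
```

With normal data the points scatter around the line; systematic departures at the ends are what signal skewness or heavy tails.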

Now, here is when I don't understand how we interpret the skewness.
Consider, for example, the left-skewed case from https://anasrana.github.io/ems-practicals/qq-plot.html:
[Figure: normal QQ plot of a left-skewed sample; the points at the left end fall below the reference line]

If I look at the plot, my first intuition is this: all the points below the line (those between about -4 and -1 on the normal distribution's quantile axis) are smaller than expected, precisely because they lie below the line. Therefore, the points would be drawn from a distribution where smaller values are more probable. Of course, looking at the actual distribution, we can see that it is the other way around.
My second idea is then this: if we have many large datapoints (as in the image above), the graph axes will be scaled so that the smaller values fall below the line, and thus we have a distribution with a tendency towards large datapoints. Does any of this make sense? Could you help me deepen my understanding?
 
bremenfallturm said:
If I look at the plot, my first intuition is this: all the points below the line (those between about -4 and -1 on the normal distribution's quantile axis) are smaller than expected, precisely because they lie below the line. Therefore, the points would be drawn from a distribution where smaller values are more probable.
True.
bremenfallturm said:
Of course, looking at the actual distribution, we can see that it is the other way around.
No, it isn't; it's exactly as expected. A left-skewed distribution has a long tail toward small values, so its extreme low quantiles really are smaller than the corresponding normal quantiles, and those points fall below the line.
bremenfallturm said:
My second idea is then this: if we have many large datapoints (as in the image above), the graph axes will be scaled so that the smaller values fall below the line, and thus we have a distribution with a tendency towards large datapoints. Does any of this make sense?
No, none of this makes sense. Which side of the line a point falls on is determined purely by whether the sample quantile is smaller or larger than the theoretical one; axis scaling cannot change that.
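To check the first point numerically, here is a small sketch (the left-skewed example, a negated exponential, is my own choice, not from the linked page): standardize a left-skewed sample and compare its low quantiles with the normal ones.

```python
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(1)
x = -rng.exponential(size=100_000)   # left-skewed: long tail toward small values
x = (x - x.mean()) / x.std()         # standardize so N(0,1) is the fair comparison

for p in (0.001, 0.01, 0.05):
    print(f"p={p}: sample quantile {np.quantile(x, p):+.2f}, "
          f"normal quantile {norm.ppf(p):+.2f}")
```

At small ##p## the sample quantile should come out well below the normal one, which is exactly the "points below the line at the left end" pattern: small values really are more probable in that tail.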
 