Q-test: Why limited to one point?

  • Thread starter Icskatingqn
  • Start date
  • Tags
    Point
In summary, when using Q-testing to identify outliers in data, it is important to keep in mind the purpose and context of the data. Outliers can skew the average and may need to be removed if they are not representative of the majority of the data or if they are data errors. However, it is important to carefully consider which data points are being removed and for what purpose, as this can affect the overall interpretation of the data.
  • #1
Icskatingqn
2
0
So I recently learned how to Q-test possible outlying data points. But I am a little confused-why can I only Q-test one point?

I understand that in Q-testing, you are deleting one point. But if that point was so significant that it shouldn't have been included in data, why should the deleting of it affect the data-I think it should be viewed as never having been in the data. Then, we can Q-test other possible points using N-1 instead of N for the Q-test.

I already took a test over this in my Chemistry class, luckily there was nothing on this reason, but I would still like to know!

Thank you for any replies :)
 
Physics news on Phys.org
  • #2
I seem to remember idea that it should be applied only once was one of the assumptions used when the test was designed. But I can be wrong.

I will move the thread to the statistics forum.
 
  • #3
Icskatingqn said:
So I recently learned how to Q-test possible outlying data points. But I am a little confused-why can I only Q-test one point?

I understand that in Q-testing, you are deleting one point. But if that point was so significant that it shouldn't have been included in data, why should the deleting of it affect the data-I think it should be viewed as never having been in the data. Then, we can Q-test other possible points using N-1 instead of N for the Q-test.

I already took a test over this in my Chemistry class, luckily there was nothing on this reason, but I would still like to know!

Thank you for any replies :)

Hey Icskatingqn and welcome to the forums.

It depends on what you are trying to do.

Lets consider trying to estimate the mean (which is the average value of all data points in a sample and for a theoretical distribution which you could think as being the representation of an infinite amount of data).

Now let's look at a familiar thing of measuring income.

We know that most people earn around a certain amount (like say 50,000 to 70,000) but there are of course a few billionaires that 'skew' the average upwards.

You could see the billionaire incomes as the outliers and if you wanted to get a better indication of the 'average' income, then the billionaire data points might skew the average too much to give an accurate representation for the majority of the population.

This is one reason why we might censor some of the values (i.e. remove them) because in the context of what we are trying to measure, these don't help us and in many cases are damaging which is why we remove them.

Also another important thing is to remove outliers if they are 'data errors'. If its known that a particular value is 'impossible' or 'unplausible' then its a good idea to remove it. You never see people 4 metres tall, and even though it might be 'possible', it's not 'plausible' so we remove it.

But having said the above you have to be very careful about what data is removed and for what purpose. It depends on the nature of the experiment, what you are trying to find out, what the data is, and what the underlying process is.

In other words you just don't just 'remove outliers' as a standard thing: there has to be a good reason for it.
 

1. Why is the Q-test limited to only one point?

The Q-test is limited to only one point because it is designed to test for outliers or extreme values in a data set. By focusing on only one point, it can determine if that point is significantly different from the rest of the data.

2. Can the Q-test be used for multiple points?

No, the Q-test is specifically designed to be used on only one point at a time. It would not be appropriate to use it for multiple points as it would not accurately reflect the presence of outliers in the data set.

3. How is the Q-test different from other outlier detection methods?

The Q-test differs from other outlier detection methods in that it compares the data point in question to the median of the data set, rather than the mean. This makes it more robust to outliers and extreme values.

4. Is the Q-test suitable for all types of data?

The Q-test is most suitable for normally distributed data sets. If the data is highly skewed or has a heavy-tailed distribution, the Q-test may not be the most appropriate method for detecting outliers.

5. Can the Q-test be used in conjunction with other statistical tests?

Yes, the Q-test can be used in conjunction with other statistical tests to further analyze and understand the data. However, it is important to note that the Q-test should be used as a preliminary check for outliers before conducting other tests.

Similar threads

  • Set Theory, Logic, Probability, Statistics
Replies
4
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
991
  • Set Theory, Logic, Probability, Statistics
Replies
5
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
5
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
4
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
2
Views
2K
Replies
16
Views
1K
  • Biology and Medical
Replies
6
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
9
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
2K
Back
Top