Is the observed result significantly different from the expected result?

  • Thread starter PleaseHelpMe
  • Start date
  • Tags
    Statistics
In summary: However, if you are trying to show that the subjects had some knowledge of Japanese or some other bias then you need to do some other statistical test.In summary, the conversation discusses an experiment where participants were asked to choose the correct English translation of Japanese words. The results showed that they were able to choose the correct answer at an above-chance level. A Chi-square test was used to determine the significance of the results and it was found to be massively significant. However, there may be bias in the results and further experiments may need to be conducted to determine the source of this bias.
  • #1
PleaseHelpMe
9
0
Anyone good with statistics??

I've conducted an experiment looking at phonetic symbolism in Japanese words.
Thirty participants listened to 40 Japanese words and had to choose out of a choice of two possible English translations which one they thought was the correct answer.

I'm trying to find out whether or not the participants could give the correct answer at an above-chance expectancy level.

Since they didn't know Japanese, we can assume that they had a 50% chance of giving the correct answer (since there were two possible answers for each word- a correct and incorrect one).

There were 705 correct answers out of 1200 possible correct answers (30 x 40).
This is more than half, but is it significantly more than half?


I wasn't sure of the correct statistical test to use, but I used a Chi-square test and followed the instructions on this page: visualstatistics.net/SPSS%20workbook/chi-square_goodness_of_fit.htm

Obviously they use a different example but I'm pretty sure it's the same thing: comparing observed frequencies with expected frequencies.


The results suggested that it was massively significant. However, even if they only got 614 correct and 586 incorrect, it still reports a significant difference (Chi square = .653, Asymp. Sig. = .419).


This doesn't seem right to me, it seems very low.
If I tossed a coin 1200 times and it landed heads 614 times, then that means the coin is weighted??

705/1200 is only 58.75%, and that's significant?

I'm surely doing something wrong..
 
Mathematics news on Phys.org
  • #2
I'm sorry it looks like I've posted this in the wrong forum.
 
  • #3
I am sure you will get a good response here as well. The way I look at it is that assuming 50/50 chance "on average" your test would yield 600 correct responses. You got 705, which is 17.5% better than average (that is ((705-600)/600))

If you had unlimited time I would suggest you try flipping a coin 1200 times and see if you could get 58.75% heads or tails...

I ran a simple simulation of rand head tails flips 1200 times. After 81 experiments I finally had one that had 701 tails and 499 heads.
 
  • #4
I am sure someone will respond with an exact number (with 1200 flips), but with 120 head tails flips, the probability of getting above 70 of either heads or tails is only 5.477%

With 1200 flips the probability of getting above 700 either heads or tails is less.
 
  • #5
Unfortunately, even if you figure out the correct way to analyze this to determine whether or not it's statistically significant (it probably is), there is still a problem. What conclusions can you really draw from this?

So you've determined that the people were able to do better than 50%, which is what you'd expect if they really had no indication as to which english word was correct. However, where did this indication come from? Was it that they did in fact know some Japanese? Was it that there is a psychological trend for languages to use similar sounds for similar words? Or was it something about the way you picked the words that the people were quizzed on? (For example, did you take into account the number of cognates and pseudo-cognates in your sample? Did both of the choices for the definition always seem viable?)

---

As an example, let's say that I give you this word and the following choices:
kasa
1: home
2: umbrella

If I were to quiz a completely random group of Spanish speaking people, I'd expect that the majority of people would answer "home". However, if I'd quiz a group of people who happen to know that there are very few similarities between romance languages and Japanese (and know at least a little Spanish), I'd expect that they'd be more likely to answer "umbrella" just because they'd think that the answer would probably not be "home".

The correct answer is of course umbrella.

The problem is that a "random" group of people is not necessarily unbiased. What you have determined is that your group of people is probably slightly biased, but you still don't know where this bias comes from.
 
Last edited:
  • #6
Have you only done the experiment once? If so it isn't really enough to reach a firm conclusion, I would do it a few more times. Do any of the japanese words sound remotely like their english translation?
 
  • #7
PleaseHelpMe said:
I wasn't sure of the correct statistical test to use, but I used a Chi-square test and followed the instructions on this page: visualstatistics.net/SPSS%20workbook/chi-square_goodness_of_fit.htm

Obviously they use a different example but I'm pretty sure it's the same thing: comparing observed frequencies with expected frequencies.


The results suggested that it was massively significant. However, even if they only got 614 correct and 586 incorrect, it still reports a significant difference (Chi square = .653, Asymp. Sig. = .419).


This doesn't seem right to me, it seems very low.
If I tossed a coin 1200 times and it landed heads 614 times, then that means the coin is weighted??

705/1200 is only 58.75%, and that's significant?

I'm surely doing something wrong..

The test is binomial (two outcomes). For large n the binomial distribution is very well approximated with a Normal distribution mean np and varience np(1-p).

Taking the null hypothesis that the results are guesses (50/50) then p=0.5 and so the mean is 600 and the varience 300 (stdev = 17.3).

So yes the result is massively significant with the actual number of correct responses about 6 standard deviations above what is expected by chance (Z=6).
 

Related to Is the observed result significantly different from the expected result?

1. What is the purpose of using statistics in scientific research?

Statistics is used in scientific research to analyze and interpret data, identify patterns and relationships, make predictions, and draw conclusions. It allows researchers to make objective and evidence-based decisions and verify the validity of their findings.

2. What are some common statistical techniques used in scientific research?

Some common statistical techniques used in scientific research include descriptive statistics (e.g. mean, median, mode), inferential statistics (e.g. t-tests, ANOVA), correlation and regression analyses, and data visualization techniques (e.g. graphs, charts).

3. How can statistics be used to ensure the reliability and validity of research findings?

Statistics can be used to ensure the reliability and validity of research findings by providing a framework for analyzing and interpreting data, identifying and controlling for confounding variables, and determining the significance of results. Additionally, using appropriate statistical methods and following best practices can reduce bias and increase the accuracy of research findings.

4. What are some common mistakes to avoid when using statistics in scientific research?

Some common mistakes to avoid when using statistics in scientific research include misinterpreting data, using inappropriate statistical methods, failing to account for confounding variables, and relying on small sample sizes. It is also important to clearly define research questions and hypotheses, properly collect and analyze data, and accurately report results.

5. How can someone improve their skills in statistics for scientific research?

Improving skills in statistics for scientific research can be done through formal education and training, such as taking courses in statistics or data analysis. Additionally, practicing with real-world data and seeking guidance from experienced statisticians can also help improve skills. It is also important to stay updated on new statistical methods and techniques through reading scientific literature and attending conferences or workshops.

Similar threads

  • Set Theory, Logic, Probability, Statistics
Replies
2
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
20
Views
3K
  • Math Proof Training and Practice
2
Replies
67
Views
10K
  • General Discussion
Replies
11
Views
25K
Back
Top