Why do certain letters appear more frequently in written language?

In summary, different languages have varying frequencies of letters appearing in their written text. The distribution of these frequencies is often connected to the sounds used in the language, with vowels being used to sound out consonants. This can also be seen in frequency analysis, a tool used in deciphering coded messages. The development of frequency analysis stems from the recognition of statistical distributions in written text. It is likely that the frequency of certain letters, such as 'e' and 'a', is related to the sounds used in human speech.
  • #1
Avichal
In English, 'e' is the most frequent letter. Other languages also have some letters of their alphabets that appear more frequently in text than others. Why is this? Why don't all letters appear with equal frequency?
Do humans speak vowels more comfortably? What exactly is the reason?
 
  • #2
Tone languages are languages (like Chinese, Thai, Yoruba, and Zulu) in which the pitch or “tone” of words and syllables makes a difference to word meaning. For example, in Chinese huār (with a high level pitch) means ‘flower’ and huàr (with a falling pitch) means ‘picture’. In non-tonal languages (like English or Spanish), pitch is only used at the sentence level, for emphasis and overall meanings like questioning. Roughly half the languages in the world are tonal and half are non-tonal, but they’re fairly unevenly distributed: tone languages are the norm in sub-Saharan Africa and are common in Southeast Asia and among Native American languages especially in parts of Central and South America. Non-tone languages are the norm in Europe and Central, South and West Asia, and among the aboriginal languages of Australia.
http://www.lel.ed.ac.uk/~bob/tonegenessummary.html

The World Atlas of Language Structures (WALS) is a large database of structural (phonological, grammatical, lexical) properties of languages gathered from descriptive materials (such as reference grammars) by a team of 55 authors (many of them the leading authorities on the subject).
http://wals.info/

Specifically, see this chapter:
Chapter 2: Vowel Quality Inventories
by Ian Maddieson
Some excerpts:

1. Introduction
This chapter discusses the number of vowel contrasts in the inventory of sounds in languages.

2. Establishing the values.
When vowel qualities are counted in this way in the sample of languages surveyed for this chapter, the average number of vowels in a language is just fractionally below 6. The smallest vowel quality inventory recorded is 2 and the largest 14.

3. Geographical distribution
There are strong areal patterns in the distribution of vowel quality inventories. Not surprisingly, languages with average inventory sizes are the most widely scattered. In just a few areas, southern Africa being one, they occur almost to the exclusion of the other two types.

http://wals.info/chapter/2
 
  • #3
Avichal said:
In English, 'e' is the most frequent letter. Other languages also have some letters of their alphabets that appear more frequently in text than others. Why is this? Why don't all letters appear with equal frequency?
Do humans speak vowels more comfortably? What exactly is the reason?

Why should letters appear with equal frequency? Do the sounds of a language occur with equal frequency?

Vowels are used to sound out the consonants, at least in Indo-European languages. The English alphabet has 5 vowel letters and 21 consonant letters. Other languages will have a slightly different mix.

Frequency analysis is one tool which can be used to attack ciphered messages. Other tools are needed along with FA to produce a complete decipherment.
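
To make the idea concrete, here is a minimal sketch of frequency analysis used against a cipher. It assumes the simplest possible case, a Caesar (fixed-shift) cipher, which is not mentioned in the thread and is chosen only for illustration. The function names and the frequency table are assumptions too: the percentages are approximate, commonly quoted values for English, and exact numbers vary by corpus. The sketch compares the letter counts of each candidate decryption with the expected English counts (a chi-squared score) and keeps the best match.

```python
from collections import Counter
import string

# Approximate letter frequencies in English text, in percent.
# Illustrative values only; exact figures depend on the corpus.
ENGLISH_FREQ = {
    "e": 12.7, "t": 9.1, "a": 8.2, "o": 7.5, "i": 7.0, "n": 6.7,
    "s": 6.3, "h": 6.1, "r": 6.0, "d": 4.3, "l": 4.0, "c": 2.8,
    "u": 2.8, "m": 2.4, "w": 2.4, "f": 2.2, "g": 2.0, "y": 2.0,
    "p": 1.9, "b": 1.5, "v": 1.0, "k": 0.8, "j": 0.15, "x": 0.15,
    "q": 0.1, "z": 0.07,
}


def letter_counts(text):
    """Count the letters a-z in text, ignoring case and other characters."""
    return Counter(ch for ch in text.lower() if ch in string.ascii_lowercase)


def caesar_shift(text, shift):
    """Shift every lowercase letter by `shift` places; leave the rest alone."""
    out = []
    for ch in text:
        if ch in string.ascii_lowercase:
            out.append(chr((ord(ch) - ord("a") + shift) % 26 + ord("a")))
        else:
            out.append(ch)
    return "".join(out)


def chi_squared(text):
    """Score how far the letter counts of text are from typical English."""
    counts = letter_counts(text)
    total = sum(counts.values())
    score = 0.0
    for letter, percent in ENGLISH_FREQ.items():
        expected = total * percent / 100
        score += (counts.get(letter, 0) - expected) ** 2 / expected
    return score


def guess_shift(ciphertext):
    """Try all 26 shifts and return the one whose decryption looks most English."""
    return min(range(26), key=lambda s: chi_squared(caesar_shift(ciphertext, -s)))


if __name__ == "__main__":
    plain = ("it does stand to reason that the occurrence of letters in "
             "written text like a lot of things would have some statistical "
             "distribution given enough samples of text written in the same language")
    cipher = caesar_shift(plain, 3)
    print(cipher)
    print(guess_shift(cipher))
```

Note that in this sample sentence 't' happens to occur more often than 'e', so simply assuming "the most common ciphertext letter is 'e'" would give the wrong shift. Comparing the whole distribution is more robust, and it still needs a reasonably long text to work.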
 
  • #4
SteamKing said:
Why should letters appear with equal frequency? Do the sounds of a language occur with equal frequency?

Vowels are used to sound out the consonants, at least in Indo-European languages. The English alphabet has 5 vowel letters and 21 consonant letters. Other languages will have a slightly different mix.

Frequency analysis is one tool which can be used to attack ciphered messages. Other tools are needed along with FA to produce a complete decipherment.

Yes, I know it's used to attack ciphered messages. In fact, this question arose from that very context: I was curious why some letters occur more frequently than others.
The question perhaps requires knowledge of how human speech works. Can you please explain why this works?
 
  • #5
All I can say about FA development comes from this article:

http://en.wikipedia.org/wiki/Frequency_analysis

See the section on History and Usage.

If you are really interested in cryptography and ciphers, I recommend the book by Kahn (in the References portion of the same article.)

However, it does stand to reason that the occurrence of letters in written text, like a lot of things, would have some statistical distribution, given enough samples of text written in the same language. Some clever person recognized this in the mists of time, before statistical analysis was ever thought of.
 
  • #6
SteamKing said:
All I can say about FA development comes from this article:

http://en.wikipedia.org/wiki/Frequency_analysis

See the section on History and Usage.

If you are really interested in cryptography and ciphers, I recommend the book by Kahn (in the References portion of the same article.)

However, it does stand to reason that the occurrence of letters in written text, like a lot of things, would have some statistical distribution, given enough samples of text written in the same language. Some clever person recognized this in the mists of time, before statistical analysis was ever thought of.

Yes, there has to be some statistical distribution, but I find the sounds of 'e' and 'a' to be the most common across many languages. I am trying to find the reason behind this.
 

1. What is frequency analysis?

Frequency analysis is a method used to study patterns or trends in a set of data by counting the number of times a particular value or category appears.

2. Why is frequency analysis important?

Frequency analysis allows scientists to identify relationships and patterns within data, which can help in making predictions and understanding complex systems.

3. How does frequency analysis work?

Frequency analysis works by organizing data into categories or bins and then counting the number of observations that fall into each category. This information is then used to create a visual representation, such as a histogram or bar chart, to show the distribution of the data.
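
As a rough illustration of the counting step described above, the following sketch treats each letter as a category, counts the occurrences, and draws a plain-text bar chart. The function names and the chart width are hypothetical choices made for this example.

```python
from collections import Counter


def letter_frequencies(text):
    """Return (letter, relative frequency) pairs, most frequent first."""
    letters = [ch for ch in text.lower() if "a" <= ch <= "z"]
    counts = Counter(letters)
    total = len(letters)
    return [(letter, n / total) for letter, n in counts.most_common()]


def text_bar_chart(freqs, width=40):
    """Render the frequencies as one line per letter with a bar of '#' marks."""
    lines = []
    for letter, share in freqs:
        bar = "#" * round(share * width)
        lines.append(f"{letter} {share:6.1%} {bar}")
    return "\n".join(lines)


if __name__ == "__main__":
    sample = "Why do certain letters appear more frequently in written language?"
    print(text_bar_chart(letter_frequencies(sample)))
```

A graphical histogram would show the same information; the text version is used here only to keep the sketch free of plotting dependencies.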

4. What types of data can be analyzed using frequency analysis?

Frequency analysis can be applied to any type of data, including numerical, categorical, and even textual data. It is commonly used in fields such as statistics, biology, and social sciences.

5. Are there any limitations to frequency analysis?

While frequency analysis is a useful tool, it does have limitations. The counts are only reliable when the sample is large enough; a short text can have letter frequencies that differ markedly from the typical distribution of its language. It also does not take into account the relationships between different categories, such as the order in which letters appear, so it may not provide a complete understanding of the data.
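
One way to see the point about relationships between categories: single-letter counts cannot tell anagrams apart, because they ignore the order of the letters. The sketch below uses two hypothetical helper functions. Counting adjacent letter pairs (bigrams) is one possible way to recover some of that lost information; it is an illustration, not something proposed in the thread.

```python
from collections import Counter


def unigram_counts(word):
    """Count single letters, ignoring case."""
    return Counter(word.lower())


def bigram_counts(word):
    """Count adjacent letter pairs, ignoring case."""
    w = word.lower()
    return Counter(w[i:i + 2] for i in range(len(w) - 1))


if __name__ == "__main__":
    # Same letters, different order: the single-letter counts are identical,
    # while the pair counts differ.
    print(unigram_counts("listen") == unigram_counts("silent"))
    print(bigram_counts("listen") == bigram_counts("silent"))
```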
