Maths needed for voice simulation

  • Context: Undergrad 
  • Thread starter Thread starter greeniekin
  • Start date Start date
  • Tags Tags
    Simulation
Click For Summary
SUMMARY

The discussion focuses on the development of a voice simulation program that specifically targets vowel sounds, rather than text-to-speech functionality. Key concepts include generating frequencies that mimic vibrating vocal cords within a modeled throat or mouth shape. Participants emphasize the importance of understanding formants, which are the multiple frequencies present in vowel sounds. A suggested approach involves using a microphone for sound capture and applying Fourier decomposition to analyze and recreate vowel sounds effectively.

PREREQUISITES
  • Understanding of formants in vowel sounds
  • Knowledge of Fourier decomposition techniques
  • Familiarity with sound synthesis and signal analysis
  • Basic programming skills for audio processing
NEXT STEPS
  • Research sound synthesis techniques for vowel generation
  • Learn about Fourier analysis and its applications in audio
  • Explore tools for audio signal processing, such as MATLAB or Python libraries
  • Investigate existing voice simulation projects for practical insights
USEFUL FOR

This discussion is beneficial for programmers, audio engineers, and researchers interested in voice simulation, sound synthesis, and acoustic modeling techniques.

greeniekin
Messages
1
Reaction score
0
I am a programmer and I was thinking about making a program to simulate a voice. To be clear I'm not looking to make a text-to-speech program, and I'm only really expecting to do vowels.

My idea is to basically to generate a frequency(vibrating vocal chords) in a pipe(throat mouth). Having a different shape of the pipe.

Unfortunately searching for stuff i come across speech recognition or text to speech(and the best text to speech don't generate sounds they use recordings).

I have no idea what i should be looking for. If anyone could point me in the right direction or provide any insight it would be much appreciated.

I know little things like vowels have many different frequencies sounding at once, and that they are referred to as formants. Though i don't know how these are determined.
 
Physics news on Phys.org
greeniekin said:
I am a programmer and I was thinking about making a program to simulate a voice. To be clear I'm not looking to make a text-to-speech program, and I'm only really expecting to do vowels.

My idea is to basically to generate a frequency(vibrating vocal chords) in a pipe(throat mouth). Having a different shape of the pipe.

Unfortunately searching for stuff i come across speech recognition or text to speech(and the best text to speech don't generate sounds they use recordings).

I have no idea what i should be looking for. If anyone could point me in the right direction or provide any insight it would be much appreciated.

I know little things like vowels have many different frequencies sounding at once, and that they are referred to as formants. Though i don't know how these are determined.

I really don't know anything about this but I'd assume the approach you're suggesting will go nowhere. Attempting to simulate the physics of the vocal chords is probably the backwards way to do it. A better approach would be to get a microphone and sound out letters and sounds into it and then do a Fourier decomposition of each sound, strip all but the most dominant modes and replay it; see how close it sounds to correct. In this way you could very quickly establish templates for mouth sounds. Obviously there are far more advanced steps you could take from there which would relate to signal analysis.
 

Similar threads

  • · Replies 12 ·
Replies
12
Views
3K
  • · Replies 6 ·
Replies
6
Views
2K
Replies
4
Views
3K
  • · Replies 24 ·
Replies
24
Views
2K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 92 ·
4
Replies
92
Views
7K
  • · Replies 7 ·
Replies
7
Views
3K
  • · Replies 2 ·
Replies
2
Views
1K
Replies
6
Views
2K
  • · Replies 2 ·
Replies
2
Views
2K