Voice modulator (the Bowtie of Detective Conan)

  • Thread starter: Med Jacer
SUMMARY

This discussion focuses on developing a voice identification system, akin to the fictional "bowtie" from Detective Conan. Tools mentioned include Apache Spark and Deeplearning4j; Spark and some deep learning tools were previously used in a student project on gunshot recognition. The original question stresses unique voice characteristics, such as tone, as the basis for identification and imitation. Wikipedia articles on speaker recognition, deep learning, and machine learning are recommended as a starting point.

PREREQUISITES
  • Understanding of voice identification techniques
  • Familiarity with Apache Spark for data processing
  • Knowledge of Deeplearning4j for implementing deep learning algorithms
  • Basic concepts of machine learning and neural networks
NEXT STEPS
  • Research Apache Spark for real-time data processing in voice recognition
  • Explore Deeplearning4j for building voice imitation models
  • Study machine learning algorithms applicable to voice identification
  • Watch 3Blue1Brown's videos on neural networks for a visual understanding of deep learning concepts
USEFUL FOR

Individuals interested in voice recognition technology, including software developers, data scientists, and researchers in artificial intelligence and machine learning.

Med Jacer
I'm looking for help studying voice identification and voice imitation in depth and, if possible, building Conan's bowtie or something similar to it.

How can I create a system (a program or an algorithm) that identifies a person by their voice, relying on its unique characteristics (tone, etc.)?
Is it possible to imitate that voice?

Any ideas or hints are welcome.
 
Here's a wiki article that can get you started in understanding the field:

https://en.wikipedia.org/wiki/Speaker_recognition
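To make the "identify a person by tone" idea concrete, here is a minimal sketch in plain Python. It is only an illustration under loud assumptions: the "recordings" are synthetic tones produced by a made-up helper (synth_voice), the only feature is the fundamental frequency (pitch) estimated from the autocorrelation peak, and the names alice and bob are hypothetical. Real speaker recognition systems rely on much richer features than pitch alone, so treat this as a toy for the enroll-then-compare workflow, not as a working identifier.

```python
import math

def synth_voice(f0, sample_rate=8000, n_samples=1600):
    """Hypothetical stand-in for a recording: a fundamental plus two harmonics."""
    return [
        math.sin(2 * math.pi * f0 * t / sample_rate)
        + 0.5 * math.sin(2 * math.pi * 2 * f0 * t / sample_rate)
        + 0.25 * math.sin(2 * math.pi * 3 * f0 * t / sample_rate)
        for t in range(n_samples)
    ]

def estimate_pitch(samples, sample_rate=8000, fmin=60.0, fmax=400.0):
    """Estimate the fundamental frequency (Hz) from the autocorrelation peak."""
    min_lag = int(sample_rate / fmax)   # shortest period we consider
    max_lag = int(sample_rate / fmin)   # longest period we consider
    best_lag, best_score = min_lag, float("-inf")
    for lag in range(min_lag, max_lag + 1):
        # Unnormalized autocorrelation: large when the signal repeats after `lag` samples.
        score = sum(samples[i] * samples[i + lag] for i in range(len(samples) - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    return sample_rate / best_lag

def identify(samples, enrolled):
    """Return the enrolled name whose stored pitch is closest to the sample's pitch."""
    pitch = estimate_pitch(samples)
    return min(enrolled, key=lambda name: abs(enrolled[name] - pitch))

# Enrollment: store one pitch value per (hypothetical) speaker.
enrolled = {
    "alice": estimate_pitch(synth_voice(200.0)),
    "bob": estimate_pitch(synth_voice(100.0)),
}

# Identification: compare an unknown sample against the enrolled speakers.
print(identify(synth_voice(160.0), enrolled))
```

The structure (extract features, enroll known speakers, pick the closest match) is the part worth keeping; swapping the single pitch value for a richer feature vector is where the real work would be.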

Last year we did a student project that used Apache Spark and some deep learning tools to build a gunshot recognition network, which has similar characteristics to your voice recognition project.

So Apache Spark and Deeplearning4j would be some software you could look at. There are other tools based on Python or MATLAB that may also be of interest. You can find more information by researching deep learning or machine learning topics on Google and Wikipedia.

https://en.wikipedia.org/wiki/Deep_learning

https://en.wikipedia.org/wiki/Machine_learning

https://en.wikipedia.org/wiki/Apache_Spark

https://en.wikipedia.org/wiki/Deeplearning4j

There are also videos on YouTube by 3Blue1Brown on how neural nets work that may be of interest.
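For a feel of what those videos cover, here is a minimal sketch of the smallest possible "network": a single sigmoid neuron trained by gradient descent, in plain Python. Everything in it is an assumption for illustration: the two input features (imagined as a normalized pitch and a normalized loudness), the four training points, and the labels (1 for the target speaker, 0 for anyone else) are made up, and this is not how Deeplearning4j or Spark would be used in practice.

```python
import math

def sigmoid(z):
    """Squash any real number into the range (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

def predict(w, b, x):
    """Probability that feature vector x belongs to class 1."""
    return sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)

def train_neuron(features, labels, epochs=500, lr=0.5):
    """Fit weights and bias with per-sample gradient descent on cross-entropy loss."""
    w = [0.0] * len(features[0])
    b = 0.0
    for _ in range(epochs):
        for x, y in zip(features, labels):
            err = predict(w, b, x) - y  # loss gradient w.r.t. the pre-activation
            w = [wi - lr * err * xi for wi, xi in zip(w, x)]
            b -= lr * err
    return w, b

# Hypothetical, hand-made feature vectors: [normalized pitch, normalized loudness].
features = [[0.9, 0.8], [0.8, 0.9], [0.1, 0.2], [0.2, 0.1]]
labels = [1, 1, 0, 0]  # 1 = target speaker, 0 = someone else

w, b = train_neuron(features, labels)
print(predict(w, b, [0.85, 0.85]) > 0.5)  # sample resembling the target speaker
print(predict(w, b, [0.15, 0.15]) > 0.5)  # sample resembling someone else
```

A real recognition network stacks many such units in layers and learns from far more data, but the loop above (predict, measure the error, nudge the weights) is the same basic idea.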

https://www.google.com/search?newwi...i131i20i264k1j0i20i264k1j0i10k1.0.CmMSmTvsb8M
 
