Download PDFOpen PDF in browserSpeech Emotion Recognition Using LSTM and MFCC features9 pages•Published: August 6, 2024AbstractSpeech is utilized in human-machine connection and serves as a signal of human involvement. The Speech Emotion Recognition (SER) system is a novel form of this interactive system. Sufficient intelligence is provided by the SER to facilitate effective human-computer interaction. Based on the speaker's words, the SER system classifies emotions into groups such as "neutral," "calm," "happy," "sad," "angry," "fearful," "disgust," and "surprise." Languages and machine learning models suitable for SER are defined in this paper. Deep learning is used by this system to effectively classify and learn from multidimensional data. Primary results for a system using the LSTM algorithm and MFCC feature tools are also presented in this work. For the simplicity of user engagement, we have then implemented this model as a website through the usage of a third party.Keyphrases: long short term memory, machine learning, mel frequency cepstrum coefficient, speech emotion recognition. In: Rajakumar G (editor). Proceedings of 6th International Conference on Smart Systems and Inventive Technology, vol 19, pages 172-180.
|