Deep Learning Model CNN With LSTM For Speaker Recognition

Title	Deep Learning Model CNN With LSTM For Speaker Recognition
Publication Type	Journal Article
Year of Publication	2022
Authors	Alkhatib, B, Madian, M, Eddin, K
Journal	Journal of Digital Information Management
Volume	20
Issue	4
Start Page	131
Pagination	131-147
Date Published	12/2022
Type of Article	Research
Abstract	Speech recognition is one of the most important research fields nowadays because of its necessity in our daily lives and to raise the fields of security to the highest level, Itâ€™s a task of speech processing, and our main scope in this paper is on speaker verification, which is to identify persons from their voices where the process depends on digitizing the sound waves into a form that allows the system to deal with it. The verification process is based on the characteristics of the speaker's voice (voice biometrics) and sends it to a further process to extract the features of that voice using the feature extraction method and using AI techniques to perform the task of identification. MFCC is used for the task of features extraction and obtains the spectrogram of a given voice signal where it represents a bank of information about the voice and sends it to the CNN model for further processing for training the model on that signal to verify if the voice belongs to a user in the system or itâ€™s a new enrollment.

Main Menu1