Deep Learning Model CNN With LSTM For Speaker Recognition

Title: Deep Learning Model CNN With LSTM For Speaker Recognition
Publication Type: Journal Article
Year of Publication: 2022
Authors: Alkhatib, B, Madian, M, Eddin, K
Journal: Journal of Digital Information Management
Volume: 20
Issue: 4
Start Page: 131
Pagination: 131-147
Date Published: 12/2022
Type of Article: Research
Abstract

Speech recognition is one of the most important research fields today, both because of its necessity in our daily lives and because it helps raise security to the highest level. It is a speech processing task, and the main scope of this paper is speaker verification, which identifies persons from their voices; the process depends on digitizing the sound waves into a form that the system can handle. Verification is based on the characteristics of the speaker's voice (voice biometrics): the voice is passed to a feature extraction stage, and AI techniques then perform the identification. MFCC is used for feature extraction and to obtain the spectrogram of a given voice signal, which represents a bank of information about the voice. The spectrogram is sent to the CNN model, which is trained on that signal to verify whether the voice belongs to a user already in the system or is a new enrollment.
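
The MFCC feature extraction step mentioned in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names (`mel_filterbank`, `mfcc`), the 16 kHz sample rate, the 25 ms frame with a 10 ms hop, the 26 mel filters, and the 13 coefficients are all assumptions chosen as common defaults, and the synthetic tone stands in for a real voice recording.

```python
import numpy as np

def hz_to_mel(hz):
    return 2595.0 * np.log10(1.0 + hz / 700.0)

def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters spaced evenly on the mel scale (assumed layout)."""
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0), n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_points) / sample_rate).astype(int)
    fbank = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(1, n_filters + 1):
        left, center, right = bins[i - 1], bins[i], bins[i + 1]
        for k in range(left, center):        # rising edge of the triangle
            fbank[i - 1, k] = (k - left) / max(center - left, 1)
        for k in range(center, right):       # falling edge of the triangle
            fbank[i - 1, k] = (right - k) / max(right - center, 1)
    return fbank

def mfcc(signal, sample_rate=16000, frame_len=400, hop=160,
         n_fft=512, n_filters=26, n_coeffs=13):
    """Return an array of shape (n_frames, n_coeffs). All defaults are assumptions."""
    # 1. Split the signal into overlapping frames and apply a Hamming window.
    n_frames = 1 + (len(signal) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = signal[idx] * np.hamming(frame_len)
    # 2. Power spectrum of each frame (the spectrogram).
    power = np.abs(np.fft.rfft(frames, n=n_fft, axis=1)) ** 2 / n_fft
    # 3. Mel filterbank energies, then log compression.
    mel_energy = power @ mel_filterbank(n_filters, n_fft, sample_rate).T
    log_mel = np.log(mel_energy + 1e-10)
    # 4. DCT-II across the filter axis, keeping the first n_coeffs coefficients.
    basis = np.cos(np.pi * np.outer(np.arange(n_coeffs),
                                    np.arange(n_filters) + 0.5) / n_filters)
    return log_mel @ basis.T

# Hypothetical input: one second of a 440 Hz tone in place of a voice sample.
t = np.arange(16000) / 16000.0
tone = np.sin(2 * np.pi * 440.0 * t)
features = mfcc(tone)
print(features.shape)
```

The result has one row per frame and one column per coefficient, so it can be treated as a two-dimensional, image-like input of the kind a CNN typically consumes. How the paper itself shapes the features for its model is not stated in this record.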

Collaborative Partner

Institute of Electronic and Information Technology (IEIT)