An overview of speaker recognition

Main Article Content

Junxia Liu
CL Philip Chen*
Tieshan Li*
Yi Zuo
Peichao He

Abstract

Speaker recognition has been studied for many years and has been a hot topic. This paper presents an overview of speaker recognition methods, which include the classical and the state-of-art methods. According to the modular components of speaker recognition system, we firstly introduced the fundamentals of speaker recognition, which are mainly divided into two parts: feature extraction and speaker modeling. The most commonly speech features used in speaker recognition were elaborated firstly. In particular, the recent progress of deep neural network proposes a new approach of feature extraction and has become the technology trend. Secondly, the classical approaches of speaker recognition model were introduced, and elaborated the recent progress of deep learning speaker recognition. This paper especially provides an in-depth analysis on end-to-end model which consists of a training component to extract features, an enrollment component to training the speaker model, and an evaluation component with appropriate loss function for optimization. The final part concludes the paper with discussion on future trends.

Downloads

Download data is not yet available.

Article Details

Liu, J., Chen, C. P., Li, T., Zuo, Y., & He, P. (2019). An overview of speaker recognition. Trends in Computer Science and Information Technology, 4(1), 001–012. https://doi.org/10.17352/tcsit.000009
Review Articles

Copyright (c) 2019 Liu J, et al.

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.