Efficient window for monolingual and crosslingual speaker identification using MFCC

@article{Nagaraja2013EfficientWF,
  title={Efficient window for monolingual and crosslingual speaker identification using MFCC},
  author={B. G. Nagaraja and Haradagere Siddaramaiah Jayanna},
  journal={2013 International Conference on Advanced Computing and Communication Systems},
  year={2013},
  pages={1-4},
  url={https://api.semanticscholar.org/CorpusID:16249341}
}
  • B. NagarajaH. S. Jayanna
  • Published in 1 December 2013
  • Computer Science
  • 2013 International Conference on Advanced Computing and Communication Systems
Speaker identification system based on various windowing techniques based on mel-frequency cepstral coefficient shown to have considerably improved performance over baseline Hamming window technique.

Figures and Tables from this paper

A Study of Various Speech Features and Classifiers used in Speaker Identification

Different speech features & extraction techniques such as MFCC, LPCC, LPC, GLFCC, PLPC etc and different features classification models such as VQ, GMM, DTW, HMM and ANN for speaker identification system have been discussed.

Development of High Accuracy Classifier for the Speaker Recognition System

A hybrid speaker identification model for consistent speech features and high recognition accuracy is made and features using Mel frequency spectrum coefficients (MFCC) have been improved by incorporating a pitch frequency coefficient from speech time domain analysis.

Text-Dependent Multilingual Speaker Identification using Learning Vector Quantization and PSO-GA Hybrid Model

A multilingual speaker identification system using Learning Vector Quantization (LVQ) artificial Neural Network classifiers and feature selection is done using hybrid model of particle swarm optimizatiom (PSO) and Genetic Algorithm (GA).

Investigating the use of multiple languages for crisp and fuzzy speaker identification

This research evaluates speaker identification systems on a multilingual setup using three widely spoken languages which are Portuguese, English, and Chinese and indicates certain robustness on multiple languages.

Phonemes: An Explanatory Study Applied to Identify a Speaker

This paper extracted phonemes from a speaker’s voice recording and investigated the associated frequencies and amplitudes to be assist in identifying the person who is speaking, demonstrating the importance of phoneme in both speech and voice recognition systems.

Text-Dependent Multilingual Speaker Identification using Back Propagation Neural Network and PSO-GA Hybrid Model

A multilingual speaker identification system using Back Propagation (BPNN) artificial Neural Network classifiers and feature selection is done using hybrid model of particle swarm optimizatiom (PSO) and Genetic Algorithm (GA).

The Impact of Low-Pass Filter in Speaker Identification

Experimental results show that the low-pass filter significantly improves the accuracy of sound detection for the high noised signal.

Phoneme analysis for multiple languages with fuzzy‐based speaker identification

This paper investigates the effects of languages on speaker identification systems and the phonetic impact on their performance and expands the research study of fuzzy models in the speaker recognition field, using a Fuzzy C‐Means and FuzzY k‐Nearest Neighbours and comparing them with k-Nearest neighbours and Support Vector Machines.

I Sense You by Breath: Speaker Recognition via Breath Biometrics

The observation reveals that breath is a unique fingerprint of human respiratory system which offers overwhelming results for Speaker Recognition, and foresee breath biometric as a viable security measure for practical realizations.

Speaker Recognition Using Wavelet Packet Entropy, I-Vector, and Cosine Distance Scoring

The results of the experiments show that the proposed model can obtain good performance in clear and noisy environment and be insensitive to the low-quality speech, but the time cost of the model is high.

A Novel Windowing Technique for Efficient Computation of MFCC for Speaker Recognition

A novel family of windowing technique to compute mel frequency cepstral coefficient (MFCC) for automatic speaker recognition from speech based on fundamental property of discrete time Fourier transform related to differentiation in frequency domain is proposed.

Low-Variance Multitaper MFCC Features: A Case Study in Robust Speaker Verification

This paper provides detailed statistical analysis of MFCC bias and variance using autoregressive process simulations on the TIMIT corpus and proposes the multitaper method for MFCC extraction with a practical focus.

Robust text-independent speaker identification using Gaussian mixture speaker models

The individual Gaussian components of a GMM are shown to represent some general speaker-dependent spectral shapes that are effective for modeling speaker identity and is shown to outperform the other speaker modeling techniques on an identical 16 speaker telephone speech task.

Multilingual Text-independent Speaker Identification

    G. Durou
    Computer Science, Linguistics
  • 1998
The results indicate how speaker identiica-tion performance might be aaected when speakers do not use the same language during the training and testing, or when the population is composed of non-native people.

Significance of Vowel Onset Point Information for Speaker Verification

It is demonstrated in this work that for clean and matched conditions, relatively less number of frames from vowel-like regions are sufficient for speaker modeling and testing, and for degraded and mismatched conditions, vowel-like regions provide better performance.

Multi-variability speech database for robust speaker recognition

The initial study exploring the impact of mismatch in training and test conditions with collected data finds that the mismatch in sensor, speaking style, and environment result in significant degradation in performance compared to the matched case whereas for language mismatch case the degradation is found to be relatively smaller.

Speaker Identification and Verification by Combining MFCC and Phase Information

A phase information extraction method that normalizes the change variation in the phase according to the frame position of the input speech and combines the phase information with MFCCs in text-independent speaker identification and verification methods.

Automatic recognition of speakers from their voices

The paper indudes a discussion of the speaker-dependent properties of the speech signal, methods for selecting an efficient set of speech measurements, results of experimental studies illustrating the performance of various methods of speaker recognition, and a comparision of theperformance of automatic methods with that of human listeners.

New efficient window function, replacement for the hamming window

A new simple window function is presented, which for the same window order (M), has a main-lobe width less than or equal to that of the Hamming window, while offering about 2-4.5-dB smaller peak side-lobes amplitude and is computationally efficient for signal spectrum analysis.