Efficient window for monolingual and crosslingual speaker identification using MFCC

B. Nagaraja; H. S. Jayanna

DOI:10.1109/ICACCS.2013.6938702
Corpus ID: 16249341

@article{Nagaraja2013EfficientWF,
  title={Efficient window for monolingual and crosslingual speaker identification using MFCC},
  author={B. G. Nagaraja and Haradagere Siddaramaiah Jayanna},
  journal={2013 International Conference on Advanced Computing and Communication Systems},
  year={2013},
  pages={1-4},
  url={https://api.semanticscholar.org/CorpusID:16249341}
}

Published in International Conference on… 1 December 2013
Computer Science
2013 International Conference on Advanced Computing and Communication Systems

A speaker identification system that uses mel-frequency cepstral coefficients (MFCC) computed with various windowing techniques is shown to perform considerably better than the baseline Hamming window technique.
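As a rough illustration of the windowing step mentioned above, the sketch below tapers one speech frame with the baseline Hamming window before spectral analysis. It is a minimal example under stated assumptions, not the authors' method: the summary does not name the alternative windows the paper evaluates, so the Hann window appears only as a stand-in for comparison, and the 8 kHz sampling rate, 20 ms frame length, and 440 Hz test tone are hypothetical values chosen for the demonstration.

```python
import math


def hamming(n_samples):
    """Hamming window: w[n] = 0.54 - 0.46 * cos(2*pi*n / (N - 1))."""
    if n_samples == 1:
        return [1.0]
    return [0.54 - 0.46 * math.cos(2 * math.pi * n / (n_samples - 1))
            for n in range(n_samples)]


def hann(n_samples):
    """Hann window, used here only as a stand-in alternative window."""
    if n_samples == 1:
        return [1.0]
    return [0.5 - 0.5 * math.cos(2 * math.pi * n / (n_samples - 1))
            for n in range(n_samples)]


def apply_window(frame, window):
    """Multiply a frame by a window, sample by sample."""
    if len(frame) != len(window):
        raise ValueError("frame and window must have the same length")
    return [x * w for x, w in zip(frame, window)]


# Assumed values for illustration only (not taken from the paper).
SAMPLE_RATE_HZ = 8000
FRAME_LENGTH = 160  # 20 ms at 8 kHz

# A synthetic 440 Hz tone stands in for one frame of speech.
frame = [math.sin(2 * math.pi * 440 * n / SAMPLE_RATE_HZ)
         for n in range(FRAME_LENGTH)]

# The windowed frame is what an MFCC front end would pass to the FFT.
windowed = apply_window(frame, hamming(FRAME_LENGTH))
```

Swapping `hamming` for another window function in the last line is the only change needed to compare windows inside an otherwise fixed MFCC pipeline.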

12 Citations

Topics

Mel-frequency Cepstral Coefficients, Speaker Identification, Universal Background Model, Gaussian Mixture Model, IITG-MV, Classifier

A Study of Various Speech Features and Classifiers used in Speaker Identification

Priyatosh Mishra, P. Mishra

Computer Science

2016

Different speech features and extraction techniques such as MFCC, LPCC, LPC, GLFCC, and PLPC, and different feature classification models such as VQ, GMM, DTW, HMM, and ANN, are discussed for speaker identification systems.

Development of High Accuracy Classifier for the Speaker Recognition System

Raghad Tariq Al-Hassani, D. Atilla, Ç. Aydin

Computer Science

Applied bionics and biomechanics

2021

A hybrid speaker identification model is built for consistent speech features and high recognition accuracy, and the Mel-frequency cepstral coefficient (MFCC) features are improved by incorporating a pitch frequency coefficient from time-domain speech analysis.

Text-Dependent Multilingual Speaker Identification using Learning Vector Quantization and PSO-GA Hybrid Model

Priyatosh Mishra, Dr. Pankaj Kumar Mishra

Computer Science

2016

A multilingual speaker identification system using Learning Vector Quantization (LVQ) artificial neural network classifiers, with feature selection done by a hybrid model of particle swarm optimization (PSO) and Genetic Algorithm (GA).

Investigating the use of multiple languages for crisp and fuzzy speaker identification

T. A. Lima, Márjory Da Costa-Abreu

Computer Science, Linguistics

11th International Conference of Pattern…

2021

This research evaluates speaker identification systems in a multilingual setup using three widely spoken languages (Portuguese, English, and Chinese), and the results indicate a degree of robustness across multiple languages.

Phonemes: An Explanatory Study Applied to Identify a Speaker

Saritha Kinkiri, Basel Barakat, S. Keates

Computer Science

MIND

2020

This paper extracted phonemes from a speaker’s voice recording and investigated the associated frequencies and amplitudes to assist in identifying the person who is speaking, demonstrating the importance of phonemes in both speech and voice recognition systems.

Text-Dependent Multilingual Speaker Identification using Back Propagation Neural Network and PSO-GA Hybrid Model

Priyatosh Mishra, P. Mishra

Computer Science

2016

A multilingual speaker identification system using Back Propagation Neural Network (BPNN) classifiers, with feature selection done by a hybrid model of particle swarm optimization (PSO) and Genetic Algorithm (GA).

The Impact of Low-Pass Filter in Speaker Identification

Rizky Ahmad, S. Suyanto

Computer Science

2019 International Seminar on Research of…

2019

Experimental results show that the low-pass filter significantly improves the accuracy of sound detection for highly noisy signals.

Phoneme analysis for multiple languages with fuzzy‐based speaker identification

Thales Aguiar de Lima, Márjory Cristiany Da‐Costa Abreu

Computer Science, Linguistics

IET Biometrics

2022

This paper investigates the effects of languages on speaker identification systems and the phonetic impact on their performance. It also expands the study of fuzzy models in the speaker recognition field, using Fuzzy C‐Means and Fuzzy k‐Nearest Neighbours and comparing them with k‐Nearest Neighbours and Support Vector Machines.

I Sense You by Breath: Speaker Recognition via Breath Biometrics

Li Lu, Lingshuang Liu, M. Hussain, Yongshuai Liu

Computer Science, Medicine

IEEE Transactions on Dependable and Secure…

2020

The observation reveals that breath is a unique fingerprint of the human respiratory system that offers overwhelming results for speaker recognition, and the authors foresee breath biometrics as a viable security measure for practical realizations.

Speaker Recognition Using Wavelet Packet Entropy, I-Vector, and Cosine Distance Scoring

Lei Lei, Kun She

Computer Science

J. Electr. Comput. Eng.

2017

The experimental results show that the proposed model performs well in both clear and noisy environments and is insensitive to low-quality speech, but its time cost is high.

15 References

A Novel Windowing Technique for Efficient Computation of MFCC for Speaker Recognition

Md. Sahidullah, G. Saha

Computer Science

IEEE Signal Processing Letters

2013

A novel family of windowing techniques for computing mel-frequency cepstral coefficients (MFCC) for automatic speaker recognition from speech is proposed, based on a fundamental property of the discrete-time Fourier transform related to differentiation in the frequency domain.

Multitaper MFCC and PLP features for speaker verification using i-vectors

Md. Jahangir Alam, T. Kinnunen, P. Kenny, P. Ouellet, D. O'Shaughnessy

Computer Science

Speech Commun.

2013

Low-Variance Multitaper MFCC Features: A Case Study in Robust Speaker Verification

T. Kinnunen, R. Saeidi, Haizhou Li

Computer Science

IEEE Transactions on Audio, Speech, and Language…

2012

This paper provides detailed statistical analysis of MFCC bias and variance using autoregressive process simulations on the TIMIT corpus and proposes the multitaper method for MFCC extraction with a practical focus.

Robust text-independent speaker identification using Gaussian mixture speaker models

D. Reynolds, R. Rose

Computer Science

IEEE Trans. Speech Audio Process.

1995

The individual Gaussian components of a GMM are shown to represent some general speaker-dependent spectral shapes that are effective for modeling speaker identity, and the GMM is shown to outperform the other speaker modeling techniques on an identical 16-speaker telephone speech task.

Multilingual Text-independent Speaker Identification

G. Durou

Computer Science, Linguistics

1998

The results indicate how speaker identification performance might be affected when speakers do not use the same language during training and testing, or when the population is composed of non-native people.

Significance of Vowel Onset Point Information for Speaker Verification

Gayadhar Pradhan, S. Prasanna

Computer Science

2011

This work demonstrates that for clean and matched conditions, a relatively small number of frames from vowel-like regions is sufficient for speaker modeling and testing, and that for degraded and mismatched conditions, vowel-like regions provide better performance.

Multi-variability speech database for robust speaker recognition

B. C. Haris, Gayadhar Pradhan, A. Misra, S. K. Shukla, Rohit Sinha, S. Prasanna

Computer Science, Engineering

2011 National Conference on Communications (NCC)

2011

The initial study, which explores the impact of mismatch between training and test conditions using the collected data, finds that mismatch in sensor, speaking style, and environment results in significant degradation in performance compared to the matched case, whereas for language mismatch the degradation is found to be relatively smaller.

Speaker Identification and Verification by Combining MFCC and Phase Information

S. Nakagawa, Longbiao Wang, Shinji Ohtsuka

Computer Science

IEEE Transactions on Audio, Speech, and Language…

2012

A phase information extraction method is proposed that normalizes the variation in phase according to the frame position of the input speech, and the phase information is combined with MFCCs in text-independent speaker identification and verification methods.

Automatic recognition of speakers from their voices

B. Atal

Computer Science

Proceedings of the IEEE

1976

The paper includes a discussion of the speaker-dependent properties of the speech signal, methods for selecting an efficient set of speech measurements, results of experimental studies illustrating the performance of various methods of speaker recognition, and a comparison of the performance of automatic methods with that of human listeners.

New efficient window function, replacement for the hamming window

M. Mottaghi-Kashtiban, M. Shayesteh

Engineering, Computer Science

2011

A new simple window function is presented which, for the same window order (M), has a main-lobe width less than or equal to that of the Hamming window while offering about 2-4.5 dB smaller peak side-lobe amplitude, and which is computationally efficient for signal spectrum analysis.
