Drugman, Thomas ; Université de Mons > Faculté Polytechnique > Information, Signal et Intelligence artificielle
Bozkurt, Baris
Dutoit, Thierry ; Université de Mons > Faculté Polytechnique > Information, Signal et Intelligence artificielle
Language :
English
Title :
Causal-anticausal Decomposition of Speech using Complex Cepstrum for Glottal Source Estimation
Publication date :
01 July 2011
Journal title :
Speech Communication
ISSN :
0167-6393
Publisher :
Elsevier, Netherlands
Volume :
53
Issue :
6
Pages :
855-866
Peer reviewed :
Peer Reviewed verified by ORBi
Research institute :
R300 - Institut de Recherche en Technologies de l'Information et Sciences de l'Informatique
R450 - Institut NUMEDIART pour les Technologies des Arts Numériques
Alku, P., Vilkman, E. 1994. Estimation of the glottal pulseform based on discrete all-pole modeling. In: Third International Conference on Spoken Language Processing, pp. 1619-1622.
Alku, P., Svec, J., Vilkman, E., Sram, F. 1992. Glottal wave analysis with pitch synchronous iterative adaptive inverse filtering. Speech Comm. 11 (2-3), 109-118.
Alku, P., Bäckström, T., Vilkman, E. 2002. Normalized amplitude quotient for parametrization of the glottal flow. J. Acoust. Soc. Amer. 112, 701-710.
Alku, P., Magi, C., Yrttiaho, S., Bäckström, T., Story, B. 2009. Closed phase covariance analysis based on constrained linear prediction for glottal inverse filtering. J. Acoust. Soc. Amer. 125 (5), 3289-3305.
Ananthapadmanabha, T., Fant, G. 1982. Calculation of true glottal flow and its components. Speech Comm., 167-184.
Bozkurt, B., Dutoit, T. 2003. Mixed-phase speech modeling and formant estimation, using differential phase spectrums. In: VOQUAL'03, pp. 21-24.
Bozkurt, B., Doval, B., D'Alessandro, C., Dutoit, T. 2004. A method for glottal formant frequency estimation. In: Proceedings of Interspeech.
Bozkurt, B., Doval, B., D'Alessandro, C., Dutoit, T. 2005. Zeros of Z-transform representation with application to source-filter separation in speech. IEEE Signal Process. Lett. 12 (4).
Bozkurt, B., Couvreur, L., Dutoit, T. 2007. Chirp group delay analysis of speech signals. Speech Comm. 49 (3), 159-176.
Childers, D. 1999. Speech Processing and Synthesis Toolboxes. Wiley and Sons, Inc.
Childers, D., Lee, C. 1991. Vocal quality factors: analysis, synthesis, and perception. J. Acoust. Soc. Amer. 90 (5), 2394-2410.
D'Alessandro, C., Bozkurt, B., Doval, B., Dutoit, T., Henrich, N., Tuan, V., Sturmel, N. 2008. Phase-based methods for voice source analysis. Adv. Nonlinear Speech Process. LNCS 4885, 1-27.
Deng, H., Ward, R., Beddoes, M., Hodgson, M. 2006. A new method for obtaining accurate estimates of vocal-tract filters and glottal waves from vowel sounds. IEEE Trans. ASSP 14, 445-455.
Doval, B., D'Alessandro, C. 2006. The spectrum of glottal flow models. Acta Acustica United with Acustica 92 (6), 1026-1046.
Doval, B., D'Alessandro, C., Henrich, N. 2003. The voice source as a causal/anticausal linear filter. In: Proceedings ISCA ITRW VOQUAL03, pp. 15-19.
Drugman, T., Dutoit, T. 2009. Glottal closure and opening instant detection from speech signals. Proc. Interspeech.
Drugman, T., Bozkurt, B., Dutoit, T. 2009. Complex Cepstrum-based Decomposition of speech for glottal source estimation. In: Proceedings of Interspeech.
Drugman, T., Bozkurt, B., Dutoit, T. 2009. Chirp decomposition of speech signals for glottal source estimation. In: ISCA Workshop on Non-Linear Speech Processing.
Fant, G. 1995. The LF-model revisited. Transformations and frequency domain analysis. STL-QPSR 36 (2-3), 119-156.
Fant, G., Liljencrants, J., Lin, Q. 1985. A four parameter model of glottal flow, STL-QPSR4, pp. 1-13.
Gardner, W., Rao, B. 1997. Noncausal all-pole modeling of voiced speech. IEEE Trans. Audio Speech Process. 5 (1), 1-10.
Hanson, H. 1995. Individual variations in glottal characteristics of female speakers. In: Proceedings of ICASSP, pp. 772-775.
Klatt, D., Klatt, L. 1990. Analysis, synthesis and perception of voice quality variations among female and male talkers. J. Acoust. Soc. Amer. 87, 820-857.
Naylor, P., Kounoudes, A., Gudnason, J., Brookes, M. 2007. Estimation of glottal closure instants in voiced speech using the DYPSA algorithm. IEEE Trans. Audio Speech Lang. Process. 15 (1), 34-43.
Nordin, F., Eriksson, T. 2001. A speech spectrum distortion measure with interframe memory. IEEE Int. Conf. Acoust. Speech Signal Process. 2, 717-720.
Oppenheim, A., Schafer, R. 1989. Discrete-Time Signal Processing. Prentice-Hall (Chapter 12).
Oppenheim, A., Willsky, A., Young, I. 1983. Signals and Systems. Prentice Hall International Editions.
Paliwal, K., Atal, B. 1993. Efficient vector quantization of LPC parameters at 24 bits/frame. IEEE Trans. Speech Audio Process. 1, 3-14.
Pedersen, C., Andersen, O., Dalsgaard, P. 2009. ZZT-domain immiscibility of the opening and closing phases of the LF GFM under frame length variations, Proc. Interspeech.
Plumpe, M., Quatieri, T., Reynolds, D. 1999. Modeling of the glottal flow derivative waveform with application to speaker identification. IEEE Trans. Speech Audio Process. 7, 569-586.
Quatieri, T. 1979. Minimum- and mixed-phase speech analysis/synthesis by adaptive homomorphic deconvolution. IEEE Trans. Acoustics, Speech Signal Process. ASSP 27 (4), 328-335.
Quatieri, T. 2002. Discrete-Time Speech Signal Processing. Prentice-Hall (Chapter 6).
Schroeder, M., Grice, M. 2003. Expressing vocal effort in concatenative synthesis. In: Proceedings of 15th International Conference of Phonetic Sciences, pp. 2589-2592.
Sitton, G., Burrus, C., Fox, J., Treitel, S. 2003. Factoring very-high degree polynomials. IEEE Signal Process. Mag., 27-42.
Steiglitz, K., Dickinson, B. 1977. Computation of the complex cepstrum by factorization of the z-transform. Proc. ICASSP 2, 723-726.
Steiglitz, K., Dickinson, B. 1982. Phase unwrapping by factorization. IEEE Trans. ASSP 30 (6), 984-991.
Sturmel, N., D'Alessandro, C., Doval, B. 2007. A comparative evaluation of the Zeros of the Z transform for voice source estimation. Proc. Interspeech.
The Snack Sound Toolkit, .
Titze, I., Sundberg, J. 1992. Vocal intensity in speakers and singers. J. Acoust. Soc. Amer. 91 (5), 2936-2946.
Tribolet, J., Quatieri, T., Oppenheim, A. 1977. Short-time homomorphic analysis. Proc. ICASSP77 2, 716-722.
Veeneman, D., BeMent, S. 1985. Automatic glottal inverse filtering from speech and electroglottographic signals. IEEE Trans. Signal Process. 33, 369-377.
Verhelst, W., Steenhaut, O. 1986. A new model for the short-time complex cepstrum of voiced speech. IEEE Trans. ASSP 34, 43-51.
Walker, J., Murphy, P. 2007. A review of glottal waveform analysis. Prog. Nonlinear Speech Process., 1-21.
Wangrae, J., Jongkuk, K., Myung Jin, B. 2005. A study on pitch detection in time-frequency hybrid domain. Lecture Notes in Computer Science. Springer, Berlin.