FROM ACOUSTICS TO VOCAL TRACT TIME FUNCTIONS


Mitra V., Oezbek I. Y., Nam H., Zhou X., Espy-Wilson C. Y.

IEEE International Conference on Acoustics, Speech and Signal Processing, Taipei, Tayvan, 19 - 24 Nisan 2009, ss.4497-4498 identifier identifier

  • Yayın Türü: Bildiri / Tam Metin Bildiri
  • Cilt numarası:
  • Doi Numarası: 10.1109/icassp.2009.4960629
  • Basıldığı Şehir: Taipei
  • Basıldığı Ülke: Tayvan
  • Sayfa Sayıları: ss.4497-4498
  • Anahtar Kelimeler: Speech inversion, Support Vector Regression, vocal tract time functions, Acoustic-to-articulatory inversion
  • Atatürk Üniversitesi Adresli: Evet

Özet

In this paper we present a technique for obtaining Vocal Tract (VT) time functions from the acoustic speech signal. Knowledge-based Acoustic Parameters (APs) are extracted from the speech signal and a pertinent subset is used to obtain the mapping between them and the VT time functions. Eight different vocal tract constriction variables consisting of live constriction degree variables,. lip aperture (LA), tongue body (TBCD), tongue tip (TTCD), velum (VEL), and glottis (GLO); and three constriction location variables, lip protrusion (LP), tongue tip (TTCL), tongue body (TBCL) were considered in this study. The TAsk Dynamics Application model (TADA [1]) is used to create a synthetic speech dataset along with its corresponding VT time functions. We explore Support Vector Regression (SVR) followed by Kalman smoothing to achieve mapping between the APs and the VT time functions.