Legacy: Articulatory Synthesis Bibliography
Abramson, A. S., Nye, P. W., Henderson, J. B., & Marshall, C. W. (1981). Vowel height and the perception of consonantal nasality. Journal of the Acoustical Society of America, Vol. 70, 329-339.
Abraham, R. H., and Shaw, C. D. (1982). Dynamics–The geometry of behavior. Santa Cruz, CA: Aerial Press.
Badin, P., Bailly, G., Raybaudi, M., & Segebarth, C. (2002). A three-dimensional linear articulatory model based on MRI data. Journal of Phonetics, 30, 533 – 553.
Birkholz, P., Jackel, D., and B. J. Kröger. (2006). Construction and control of a three-dimensional vocal tract model. Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006) (Toulouse, France), 873–876.
Browman, C. P., and Goldstein, L. (1986). Towards an articulatory phonology. Phonology Yearbook, 3, 219-252. (PDF)
Browman, C. P., and Goldstein, L. (1989). Articulatory gestures as phonological units. Phonology, 6, 201-251.
Browman, C. P., and Goldstein, L. (1990a). Gestural specification using dynamically-defined articulatory structures. Journal of Phonetics, 18, 299-320.
Browman, C. P., and Goldstein, L. (1990b). Representation and reality: Physical systems and phonological structure. Journal of Phonetics, 18, 411-424.
Browman, C. P., and Goldstein, L. (1990c). Tiers in articulatory phonology, with some implications for casual speech. In T. Kingston and M. E. Beckman (Eds.), Papers in Laboratory Phonology I: Between the Grammar and Physics of Speech (pp. 341-376). Cambridge University Press.
Browman, C. P., & Goldstein, L. (1992). Articulatory phonology: An overview. Phonetica, 49, 155-180. (PDF)
Browman, C. P., Goldstein, L., Kelso, J .A. S., Rubin, P., and Saltzman, E. (1984). Articulatory synthesis from underlying dynamics. Journal of the Acoustical Society, 75, S22-S23 (A).
Byrd, Dani, and Jelena Krivokapic. (2021). Cracking Prosody in Articulatory Phonology. Annual Review of Linguistics, 7:31–53. https://doi.org/10.1146/annurev-linguistics-030920-050033.
Clements, G. N. (1992). Phonological primes: Features or gestures? Phonetica, 49, 181-193.
Coker. C. H. (1968). Speech synthesis with a parametric articulatory model. Proceedings of the Speech Symposium, Kyoto, Japan, paper A-4.
Coker, C. H., and Fujimura, O. (1966). Model for the specification of the vocal tract area function. Journal of the Acoustical Society of America. 40 (5): 1271.
Engwall, O. (2003). Combining MRI, EMA & EPG measurements in a three-dimensional tongue model. Speech Communication, 41, 303–329.
Fant, C. Gunnar M. (1960). Acoustic theory of speech production. The Hague, Mouton.
Fowler, C. A., Rubin, P., Remez, R. E., and Turvey, M. T. (1980). Implications for speech production of a general theory of action. In B. Butterworth (Ed.), Language production. New York: Academic Press. (PDF)
Gerard, J. M., Wilhelms-Tricarico, R., Perrier, P., and Payan, Y. (2003). A 3D dynamical biomechanical tongue model to study speech motor control. Recent Research Developments in Biomechanics, 1, 49–64.
Goldstein, L. and Rubin, P. (2007). Speech: Dances of the Vocal Tract. Odyssey Magazine, Jan. 2007, 14-15.
Harshman, R., Ladefoged, P., and Goldstein, L. (1977). Factor analysis of tongue shapes. Journal of the Acoustical Society of America, 62, 693–707.
Heinz J. M. and Stevens, K. N. (1964). On the Derivation of Area Functions and Acoustic Spectra from Cinéradiographic Films of Speech. The Journal of the Acoustical Society of America, 36, 1037-1038.
Henke, W. L. (1966). Dynamic Articulatory Model of Speech Production Using Computer Simulation. Unpublished doctoral dissertation, MIT, Cambridge, MA.
Honda, Takashi, Seiichi Inoue, and Yasuo Ogawa. (1968). A hybrid control system of a human vocal tract simulator. Reports of the 6th International Congress on Acoustics, ed. by Y. Kohasi, pp. 175–8. Tokyo, International Council of Scientific Unions.
Ishizaka, K., and Flanagan. J. L. (1972). Synthesis of Voiced Sounds From a Two-Mass Model of the Vocal Cords. Bell System Technical Journal, 51, #6, 1233-1268.
Iskarous, K., Goldstein, L., Whalen, D. H., Tiede, M., and Rubin, P. (2003). CASY: The Haskins configurable articulatory synthesizer. International Congress of Phonetic Sciences, Barcelona, Spain, 185-188.
Kelly, John L., and Carol Lochbaum. (1962). Speech synthesis. Proceedings of the Speech Communications Seminar, paper F7. Stockholm, Speech Transmission Laboratory, Royal Institute of Technology.
Kelso, J. A. S., Saltzman, E. L., and Tuller, B. (1986). The dynamical perspective on speech production: data and theory. Journal of Phonetics, 14, 29-59.
Kent, R. D., and Minifie, F. D. (1977). Coarticulation in recent speech production models. Journal of Phonetics, 5, 115-133.
Ladefoged, P., Anthony, J. F. K., and Riley, C. (1971). Direct measurement of the vocal tract. UCLA Working Papers in Phonetics, 19, 4-13.
Liberman, A. M., Cooper, F. S., Shankweiler, D. P., and Studdert-Kennedy, M. (1967). Perception of the speech code. Psychological Review, 74, 431-461. (PDF)
Lloyd, John E., Stavness, Ian, and Fels, Sidney. (2012), ArtiSynth: A fast interactive biomechanical modeling toolkit combining multibody and finite element simulation. Soft Tissue Biomechanical Modeling for Computer Assisted Surgery, pp. 355-394, Springer, 2012.
Macchi, M. (1988). Labial articulation patterns associated with segmental features and syllable structure in English. Phonetica, 45, 109-121.
Maeda, S. (1988). Improved articulatory model. Journal of the Acoustical Society of America, 84, Sup. 1, S146.
McGowan, R. S. (1994). ‘Recovering articulatory movement from formant frequency trajectories using task dynamics and a genetic algorithm: Preliminary studies. Speech Communications 16, 49–66.
Mermelstein, P. (1973). Articulatory model for the study of speech production. Journal of the Acoustical Society of America 53, 1070–1082. (PDF)
Mermelstein, P., Maeda, S., and Fujimura, O. (1971). Description of Tongue and Lip Movement in a Jaw‐Based Coordinate System. Journal of the Acoustical Society of America 49, 104.
Öhman, S. E. G. (1966). Coarticulation in VCV utterances: Spectrographic measurements. Journal of the Acoustical Society, 39, 151-168.
Palo, Pertti. A Review of Articulatory Speech Synthesis. (2006). Master’s Thesis submitted in partial fulfillment of the requirements for the degree of Master of Science in Technology, Helsinki University of Technology, Espoo, June 5, 2006, 1-126.
Raphael, L. J., Bell-Berti, F., Collier, R., and Baer, T. (1979). Tongue position in rounded nd unrounded front vowel pairs. Language and Speech, Vol. 22 pp. 37-48.
Rosenberg, A. E. (1971). Effect of Glottal Pulse Shape on the Quality of Natural Vowels. The Journal of the Acoustical Society of America, 49, #2, 583, 590. (PDF)
Rubin, P. E., Baer, T., and Mermelstein, P. (1981) An articulatory synthesizer for perceptual research, Journal of the Acoustical Society, 70, 321-328. (PDF)
Rubin, P., Saltzman, E., Goldstein, L., McGowan, R, Tiede, M. & Browman, C. (1996). CASY and extensions to the task-dynamic model. Proceedings of the 4th Speech Production Seminar, Grenoble, France, 125-128.
Saltzman, E. (1986). Task dynamic coordination of the speech articulators: A preliminary model. In H. Heuer and C. Fromm (Eds.), Experimental Brain Research Series 15 (pp. 129-144). New York: Springer-Verlag. (PDF)
Saltzman, E. (1998). Dynamics and coordinate systems in skilled sensorimotor activity. In Port, R. and Van Gelder, T. (Eds.), Mind as motion. Cambridge, MA: MIT Press.
Saltzman, E., and Kelso, J. A. S. (1987). Skilled actions: A task dynamic approach. Psychological Review, 94, 84-106. (PDF)
Saltzman, E. L., and Munhall, K. G. (1989) A dynamical approach to gestural patterning in speech production. Ecological Psychology, 1, 333-382. (PDF)
Stevens, Kenneth N.; Kasowski, S.; Fant, C. Gunnar M. (1953). An electrical analog of the vocal tract. Journal of the Acoustical Society of America, 25 (4), 734–42.
Story, Brad H. (2005). A parametric model of the vocal tract area function for vowel and consonant simulation. The Journal of the Acoustical Society of America, 117, 3231–3254.
Story, Brad H. (2018). History of Speech Synthesis in Phonetics Research. An invited chapter for W. Katz and P. Assmann (Eds),The Routledge Handbook of Phonetics, Chapter X, pp., Routledge.
Sussman, H. M., MacNeilage, P. F., and Hanson, R. J. (1973). Labial and mandibular dynamics during the production of bilabial consonants: Preliminary observations. Journal of Speech and Hearing Research, 16, 397-420.
Turvey, M. T. (1977). Preliminaries to a theory of action with reference to vision. In R. Shaw and J. Bransford (Eds.), Perceiving, acting and knowing: Toward an ecological psychology. Hillsdale, NJ: LEA.
See, also:
• ASY Details
• The Glottal Source Model
• ASY Synthesis Tables
• CASY: Configurable Articulatory Synthesis
• The Gestural Computational Model
• TADA
• Bibliography
• Acknowledgments
| ASY DEMO | VOWELS | VOCAL TRACT | DYNAMIC SYNTHESIS | INFORMATION |
• Glottal Source Model
• ASY Synthesis Tables
• CASY: Configurable Articulatory Synthesis
• Gestural Computational Model
• TADA
| ASY DEMO | VOWELS | VOCAL TRACT | DYNAMIC SYNTHESIS | INFORMATION |