Publications

September 1, 1997

Mixture Input Transformations for Adaptation of Hybrid Connectionist Speech Recognizers

ByVictor Abrash

In this paper, we propose a new algorithm to train mixtures of transformation networks (MTNs) in the hybrid connectionist recognition framework. We apply the new algorithm to nonnative speaker adaptation,…

Publications, Speech & natural language publications
September 1, 1997

Structure and Performance of a Dependency Language Model

We present a maximum entropy language model that incorporates both syntax and semantics via a dependency grammar.

Publications, Speech & natural language publications
September 1, 1997

Speech: A Privileged Modality

In this article, we use our interaction model to demonstrate that during multimodal fusion, speech should be a privileged modality, driving the interpretation of a query, and that in certain…

Publications, Speech & natural language publications
September 1, 1997

HMM State Clustering Across Allophone Class Boundaries

ByHarry Bratt

We present a novel approach to hidden Markov model (HMM) state clustering based on the use of broad phone classes and an allophone class entropy measure. Our algorithm allows clustering…

Publications, Speech & natural language publications
September 1, 1997

A Study of Multilingual Speech Recognition

ByHarry Bratt

This paper describes our work in developing multilingual (Swedish and English) speech recognition systems in the ATIS domain. The acoustic component of the multilingual systems is realized through sharing Gaussian…

Publications, Speech & natural language publications
September 1, 1997

Automatic Pronunciation Scoring of Specific Phone Segments for Language Instruction

ByHoracio Franco

The aim of the work described in this paper is to develop methods for automatically assessing the pronunciation quality of specific phone segments uttered by students learning a foreign language.

Education & learning publications, Publications, Speech & natural language publications
September 1, 1997

A Prosody-Only Decision-Tree Model for Disfluency Detection

We have developed a disfluency detection method using decision tree classifiers that use only local and automatically extracted prosodic features. Because the model doesn't rely on lexical information, it is…

Publications, Speech & natural language publications
August 1, 1997

Multimodal Interfaces for Internet

In this paper, we present a Java-enabled application with a multimodal (pen and voice) interface over the web. Our implementation approach was to add Java to the set of languages…

Cyber & formal methods publications, Publications
June 1, 1997

Using Differential Constraints to Reconstruct Complex Surfaces from Stereo

Stereo reconstruction algorithms often fail to properly deal with complex surfaces, because there is not enough image information. We propose to guide the reconstruction process using a priori information about…

Artificial intelligence publications, Computer vision publications, Publications
April 1, 1997

Neural-Network Based Measures of Confidence for Word Recognition

This paper proposes a probabilstic framework to define and evaluate confidence measures for word recognition. We describe a novel method to combine different knowledge sources and estimate the confidence in…

Publications, Speech & natural language publications
April 1, 1997

A Collaborative Environment for Authoring Large Knowledge Bases

ByPeter Karp, Suzanne Paley

Collaborative knowledge base (KB) authoring environments are critical for the construction of high-performance KBs. In this paper, we present an environment that satisfies many of these goals.

Artificial intelligence publications, Bioinformatics publications, Publications
April 1, 1997

Handset-Dependent Background Models for Robust Text-Independent Speaker Recognition

This paper studies the effects of handset distortion on telephone-based speaker recognition performance. Results on the 1996 NIST Speaker Recognition Evaluation corpus show that using handset-matched background models reduces false…

Publications, Speech & natural language publications