Publications
-
Morphology-Based Language Modeling for Arabic Speech Recognition
In this paper we investigate the use of morphology-based language models at different stages in a speech recognition system for conversational Arabic.
-
From Switchboard to Meetings: Development of the 2004 ICSI-SRI-UW Meeting Recognition System
We describe the ICSI-SRI-UW team's entry in the Spring 2004 NIST Meeting Recognition Evaluation. The system was derived from SRI's 5xRT Conversational Telephone Speech (CTS) recognizer by adapting CTS acoustic…
-
The ICSI-SRI-UW Metadata Extraction System
We describe a state-of-the-art system for automatic detection of "metadata" in both broadcast news and spontaneous telephone conversations, developed as part of the DARPA EARS Rich Transcription program.
-
A Wizard of Oz framework for collecting spoken human-computer dialogs
This paper describes a data collection process aimed at gathering human-computer dialogs in high-stress or “busy” domains where the user is concentrating on tasks other than the conversation, for example,…
-
Assessment In The Palm Of Your Hand: Handheld Computers Transform The Assessment Process
-
Automatic Diacritization of Arabic for Acoustic Modeling in Speech Recognition
In this paper we investigate different procedures that enable us to use training data by automatically inserting the missing diacritics into the transcription.
-
Database Editing Metrics for Pattern Matching
This paper introduces a family of metrics to measure the degree of qualitative match between a database and a pattern, that is, an elastic constraint on database objects and their…
-
National Early Intervention Longitudinal Study: Birth History and Health Status of Children Entering Early Intervention. NEILS Data Report 5
This report describes the birth history and health status of the children participating in early intervention.
-
Document icons and page thumbnails: issues in construction of document thumbnails for page-image digital libraries
This paper documents some issues encountered in creating various kinds of renderings of page images for the UpLib digital library system, and suggests approaches for each, based on both problem…
-
The SPARK Agent Framework
We describe the SRI Procedural Agent Realization Kit (SPARK), a new BDI agent framework that combines scaleability and the clean semantic underpinning of more formal agent frameworks.
-
A Minimal Solution to the Generalized 3-Point Pose Problem
It is a well known classical result that given the image projections of three known world points it is possible to solve for the pose of a calibrated perspective camera…
-
Comparing and Combining Generative and Posterior Probability Models: Some Advances in Sentence Boundary Detection in Speech
We compare and contrast two different models for detecting sentence-like units in continuous speech. Both models combine lexical, syntactic, and prosodic information.