Publications
-
SVM Modeling of “SNERF-Grams” for Speaker Recognition
We describe a new approach to modeling idiosyncratic prosodic behavior for automatic speaker recognition. The approach computes prosodic features by syllable, and models the syllable-feature sequences using support vector machines…
-
Effective Acoustic Modeling for Rate-of-Speech Variation in Large Vocabulary Conversational Speech Recognition
We investigate several variants of speech-rate-dependent acoustic models for large-vocabulary conversational speech recognition, in the framework of combining rate-specific models in decoding to compensate for speech rate variation.
-
National Early Intervention Longitudinal Study (NEILS): Family Outcomes at the End of Early Intervention
The report has two primary aims: to describe the outcomes reported by families following their experience with early intervention programs, and to identify a subset of families who were less…
-
On Using MLP Features in LVCSR
One of the major research thrusts in the speech group at ICSI is to use Multi-Layer Perceptron (MLP) based features in automatic speech recognition (ASR). This paper presents a study…
-
Assessment In The Palm Of Your Hand: Handheld Computers Transform The Assessment Process
-
Database Editing Metrics for Pattern Matching
This paper introduces a family of metrics to measure the degree of qualitative match between a database and a pattern, that is, an elastic constraint on database objects and their…
-
National Early Intervention Longitudinal Study: Birth History and Health Status of Children Entering Early Intervention. NEILS Data Report 5
This report describes the birth history and health status of the children participating in early intervention.
-
Automatic Diacritization of Arabic for Acoustic Modeling in Speech Recognition
In this paper we investigate different procedures that enable us to use training data by automatically inserting the missing diacritics into the transcription.
-
Document icons and page thumbnails: issues in construction of document thumbnails for page-image digital libraries
This paper documents some issues encountered in creating various kinds of renderings of page images for the UpLib digital library system, and suggests approaches for each, based on both problem…
-
Managing uncertainty in dialogue information state for real time understanding of multi-human meeting dialogue
Our ultimate aim is to model human-human dialogue (to the extent that it is feasible) in real-time, providing useful services (e.g. relevant document retrieval) and answering queries about the dialogue…
-
The SPARK Agent Framework
We describe the SRI Procedural Agent Realization Kit (SPARK), a new BDI agent framework that combines scaleability and the clean semantic underpinning of more formal agent frameworks.
-
A Minimal Solution to the Generalized 3-Point Pose Problem
It is a well known classical result that given the image projections of three known world points it is possible to solve for the pose of a calibrated perspective camera…