Publications

September 1, 2005

Two Experiments Comparing Reading with Listening for Human Processing of Conversational Telephone Speech

We report on results of two experiments designed to compare subjects’ ability to extract information from audio recordings of conversational telephone speech (CTS) with their ability to extract information from…

Publications, Speech & natural language publications
September 1, 2005

Development of a Conversational Telephone Speech Recognizer for Levantine Arabic

By Dimitra Vergyri

In this paper, we describe the development of a large-vocabulary speech recognition system for Levantine Arabic, which was a new dialectal recognition task for our existing system. We discuss the…

Publications, Speech & natural language publications
September 1, 2005

Meeting Structure Annotation: Data and Tools

By John Niekrasz

We present a set of annotations of hierarchical topic segmentations and action item sub-dialogues collected over 65 meetings from the ICSI and ISL meeting corpora, designed to support automatic meeting…

Publications, Speech & natural language publications
September 1, 2005

Spoken Language Understanding

SLU systems contain an automatic speech recognition (ASR) component and must be robust to noise due to the spontaneous nature of spoken language and the errors introduced by ASR. SLU…

Publications, Speech & natural language publications
September 1, 2005

Does Active Learning Help Automatic Dialog Act Tagging in Meeting Data?

We ask if active learning with lexical cues can help for this task and this domain. To better address this question, we explore active learning for two different types of…

Publications, Speech & natural language publications
September 1, 2005

Pushing the Envelope — Aside

Despite successes, there are still significant limitations to speech recognition performance. For this reason, authors have proposed methods that incorporate different (and larger) analysis windows, which are described in this…

Information & computer science publications, Publications
September 1, 2005

Distinguishing Deceptive from Non-Deceptive Speech

By Andreas Kathol, Martin Graciarena

We present results from a study seeking to distinguish deceptive from non-deceptive speech using machine learning techniques on features extracted from a large corpus of deceptive and non-deceptive speech. We…

Publications, Speech & natural language publications
September 1, 2005

Improved Discriminative Training Using Phone Lattices

We present an efficient discriminative training procedure utilizing phone lattices. Different approaches to expediting lattice generation, statistics collection, and convergence were studied.

Publications, Speech & natural language publications
September 1, 2005

Comparing HMM, Maximum Entropy, and Conditional Random Fields for Disfluency Detection

We compare a generative hidden Markov model (HMM)-based approach and two conditional models — a maximum entropy (Maxent) model and a conditional random field (CRF) — for detecting disfluencies in…

Publications, Speech & natural language publications
September 1, 2005

Generation of fast interpreters for Huffman compressed bytecode

Our approach uses canonical Huffman codes to generate compact opcodes with custom-sized operand fields and with a virtual machine that directly executes this compact code. In effect, this automatically creates…

Cyber & formal methods publications, Publications
September 1, 2005

Speech Translation for Low-Resource Languages: The Case of Pashto

By Kristin Precoda, Dimitra Vergyri, Andreas Kathol

We present a number of challenges and solutions that have arisen in the development of a speech translation system for American English and Pashto, highlighting those specific to a very…

Publications, Speech & natural language publications
September 1, 2005

Leveraging Speaker-dependent Variation of Adaptation

This work introduces an automatic procedure for determining the size of regression class trees for individual speakers using an ensemble of speaker-level features to control the number of transformations, if…

Publications, Speech & natural language publications