Speech & natural language publications
-
Name-aware Speech Recognition for Interactive Question Answering
In this work we show how interactivity in a voice-enabled question answering application may improve speech recognition. We allow the user to provide a target named entity before asking the…
-
An Iterative Unsupervised Learning Method for Information Distillation
In this work, we propose an iterative unsupervised sentence extraction method to answer open-ended natural language queries about an event. The approach consists of finding the subset of sentences that…
-
Exploiting dialogue act tagging and prosodic information for action item identification
In this paper we investigate the use of dialogue act tagging to improve the identification of action item descriptions and prosodic information to improve action item agreements.
-
Extracting Question/Answer Pairs in Multi-party Meetings
In this paper we introduce a new task for multi-party meetings: extracting question/answer pairs. We propose a method based on discriminative classification of individual sentences as questions and answers via…
-
System combination using auxiliary information for speaker verification
We propose a modified linear logistic regression procedure that conditions combination weights on the auxiliary information. A regularization procedure is used to control the complexity of the extended model.
-
Nonparametric feature normalization for SVM-based speaker verification
We investigate several feature normalization and scaling approaches for use in speaker verification based on support vector machines.
-
Recognizing Arabic speakers with English phones
We investigate the question of whether phone recognition models trained on large English databases can be used for speaker recognition in another language.
-
Improving NER in Arabic using a morphological tagger
We discuss a named entity recognition system for Arabic, and show how we incorporated the information provided by MADA, a full morphological tagger which uses a morphological analyzer.
-
Automatic Annotation of Dialogue Structure from Simple User Interaction
We investigate, through the transformation of human annotations into hypothetical idealized user interactions, the relative utility of various modes of user interaction and techniques for their interpretation.
-
Meeting Adjourned: Off-line Learning Interfaces for Automatic Meeting Understanding
We explore interfaces for presenting this information to users after a meeting is completed, using two post-meeting interfaces that display information from topics and action items respectively.
-
Meeting Structure Annotation
We describe a generic set of tools for representing, annotating, and analysing multi-party discourse, including: an ontology of multimodal discourse, a programming interface for that ontology, and NOMOS – a flexible and…
-
Voice-Based Speaker Recognition Combining Acoustic and Stylistic Features
We present a survey of the state of the art in voice-based speaker identification research. We describe the general framework of a text-independent speaker verification system, and, as an example,…