Petra-Maria Strauß


2010

pdf bib
Evaluation of the PIT Corpus Or What a Difference a Face Makes?
Petra-Maria Strauß | Stefan Scherer | Georg Layher | Holger Hoffmann
Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)

This paper presents the evaluation of the PIT Corpus of multi-party dialogues recorded in a Wizard-of-Oz environment. An evaluation has been performed with two different foci: First, a usability evaluation was used to take a look at the overall ratings of the system. A shortened version of the SASSI questionnaire, namely the SASSISV, and the well established AttrakDiff questionnaire assessing the hedonistic and pragmatic dimension of computer systems have been analysed. In a second evaluation, the user's gaze direction was analysed in order to assess the difference in the user's (gazing) behaviour if interacting with the computer versus the other dialogue partner. Recordings have been performed in different setups of the system, e.g. with and without avatar. Thus, the presented evaluation further focuses on the difference in the interaction caused by deploying an avatar. The quantitative analysis of the gazing behaviour has resulted in several encouraging significant differences. As a possible interpretation it could be argued that users are more attentive towards systems with an avatar - the difference a face makes.

2008

pdf bib
The PIT Corpus of German Multi-Party Dialogues
Petra-Maria Strauß | Holger Hoffmann | Wolfgang Minker | Heiko Neumann | Günther Palm | Stefan Scherer | Harald Traue | Ulrich Weidenbacher
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)

The PIT corpus is a German multi-media corpus of multi-party dialogues recorded in a Wizard-of-Oz environment at the University of Ulm. The scenario involves two human dialogue partners interacting with a multi-modal dialogue system in the domain of restaurant selection. In this paper we present the characteristics of the data which was recorded in three sessions resulting in a total of 75 dialogues and about 14 hours of audio and video data. The corpus is available at http://www.uni-ulm.de/in/pit.

pdf bib
A Flexible Wizard of Oz Environment for Rapid Prototyping
Stefan Scherer | Petra-Maria Strauß
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)

This paper presents a freely-available, and flexible Wizard of Oz environment for rapid prototyping. The system is designed to investigate the required features of a dialog system using the commonly used Wizard of Oz approach. The idea is that the time consuming design of such a tool can be avoided by using the provided architecture. The developers can easily adapt the database and extend the tool to the individual needs of the targeted dialog system. The tool is designed as a client-server architecture and provides efficient input features and versatile output types including voice, or an avatar as visual output. Furthermore, a scenario, namely restaurant selection, is introduced in order to give an example application for a dialog system.

2006

pdf bib
Wizard-of-Oz Data Collection for Perception and Interaction in Multi-User Environments
Petra-Maria Strauß | Holger Hoffman | Wolfgang Minker | Heiko Neumann | Günther Palm | Stefan Scherer | Friedhelm Schwenker | Harald Traue | Welf Walter | Ulrich Weidenbacher
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)

In this paper we present the setup of an extensive Wizard-of-Oz environment used for the data collection and the development of a dialogue system. The envisioned Perception and Interaction Assistant will act as an independent dialogue partner. Passively observing the dialogue between the two human users with respect to a limited domain, the system should take the initiative and get meaningfully involved in the communication process when required by the conversational situation. The data collection described here involves audio and video data. We aim at building a rich multi-media data corpus to be used as a basis for our research which includes, inter alia, speech and gaze direction recognition, dialogue modelling and proactivity of the system. We further aspire to obtain data with emotional content to perfom research on emotion recognition, psychopysiological and usability analysis.