Multimodal User Interaction

Dr Fang Chen
Senior Principal Researcher/Project Leader
Multimodal User Interface Project
National ICT Australia (NICTA)

Tuesday 12th October 2004 at 11am

Abstract

Multimodal user interaction is a discipline that draws upon a diverse range of signal processing research and technologies to address a challenging long-term goal: human-computer interaction that is truly natural. This presentation will provide an overview of the state of the art in multimodal user interaction research, design and applications, and will cover some of the current issues that motivate future research in the area.

In this presentation, key issues for multimodal research and interface design will be raised. Robustness of modalities based upon audio, video and other real time sensor inputs is one concern that has engaged the signal processing research community for years, and will continue to be a critical factor in the adoption of multimodal technologies. Multimodal systems can offer redundancy of input information that can be exploited to improve robustness. Another issue facing the practical use of multimodal interaction is the need for higher-level semantic fusion of different modality inputs. When complementary information from two or more different types of input sensors arrives at or near the same point in time, the multimodal system needs to combine the information in a meaningful and consistent way. Additionally, the presentation of output information in a manner that is perceptually optimized for the user is a research challenge that must account for the availability of output devices and their constraints, user preferences, application metadata, and relevant content. We will also address the issue of standardization of a multimodal architecture, which would allow researchers to create generic components that could be re-used across a wide variety of application domains. Finally, we will give a brief introduction to the current multimodal activities at NICTA.

Short resume

Fang Chen holds her PhD in Communications and Electronic Systems and an MBA. Her main research areas are in speech processing and multimodal systems, ranging from speech synthesis algorithms, natural language dialogue, user centered studies to multimodal interaction systems for PC-based applications and handheld devices. She has many publications and patents. Dr. Fang Chen was the Deputy Director of the Institute of Information Science and then the Head of School of Electronic and Information Engineering in Beijing Jiaotong University, China. She started to explore her career in industry as senior researcher and Team Leader of Text-to-Speech in Intel China Research Centre. After she joined Motorola as a Principal Research Scientist, she led the Speech and Language Generation research team, and acted as the manager of business relationships for Motorola China Research Centre. She also chaired the Patent and Publication Committees while working for the Motorola Australian Research Centre. She currently leads the multimodal activities at National ICT Australia, and has received Conjoint professor and Honorary Associate positions from the University of New South Wales and University of Sydney respectively.

Back to HAIL Home Page