Workshop on Spoken Dialogue Systems for Cybernetic Avatars (SDS4CA)
Date: September 17, 2024 (Tuesday) Afternoon
Venue: International Science Innovation Building,
Kyoto University, Kyoto, Japan
Fee: free of charge
Workshop of SIGDIAL 2024
Sponsor: JST Moonshot R&D Program: Avatar-Symbiotic Society
Background:
Spoken dialogue systems (SDS) have made dramatic advances thanks to
improved speech technology and large language models. However, they
still have limitations; Purely autonomous systems might be out of
control and produce unexpected responses and actions. Meanwhile,
avatars have become prevailing in online communications since the
pandemic. They enable us to play a particular role (e.g., civil
servant or salesperson) while staying in distant places such as
home. A hybrid of avatars with AI and robotics, called cybernetic
avatars (CA), will further expand the possibility by breaking the
physical and temporal constraints and handicaps. A hybrid of avatars
and spoken dialogue systems is expected to complement each other and
provide human-level services to many people in parallel and
simultaneously. The interface can be either a robot or a CG-based
avatar. The workshop focuses on this new area of spoken dialogue
technology for cybernetic avatars, which is sponsored by the Moonshot
Research and Development Program in Japan.
Topic:
The workshop features the emergent framework of cybernetic avatars, a
hybrid of avatars with AI and robotics, focusing on spoken dialogue
technology. Specifically, it addresses the following topics:
- Spoken dialogue technology for humanoid robots and avatars
- Applications of conversational robots and avatars
- Ethical issues in avatars and AI-based dialogue services
Papers are not published in this workshop.
Keynote Speakers:
- Hiroshi Ishiguro (Osaka University)
- Yukiko Nakano (Seikei University)
- Nancy F. Chen (A-STAR, I2R)
Organizers:
- Tatsuya Kawahara (Kyoto University)
- Ryuichiro Higashinaka (Nagoya University)
- Kazunori Komatani (Osaka University)
Program (tentative):
13:15 Opening
13:30 Keynote 1: Hiroshi Ishiguro (Osaka University)
Toward Avatar-Symbiotic Society
14:00 Keynote 2: Yukiko Nakano (Seikei University)
Avatar Social Ethics Design
14:30 Project talk 1: Tatsuya Kawahara (Kyoto University)
Semi-autonomous Dialogue for Cybernetic Avatars
14:50 Project talk 2: Ryuichiro Higashinaka (Nagoya University)
Autonomous Dialogue Agents, Summarization, and Anomaly Detection for Cybernetic Avatars in Parallel Conversations
15:10 Poster Sessions 1 (& Coffee Break)
- 1. A Preliminary Examination of the Impact of Dialogue Partners Having the User's Own Face and Voice on Self-Disclosure
Taiga Natori, Changzeng Fu, Hiroshi Ishiguro, Yuichiro Yoshikawa
- 3. Multimodal Human-Agent Dialogue Dataset with Brain Signal
Shun Katada, Ryu Takeda, Kazunori Komatani
- 5. Embodied Autonomous Interview System with Attentive Listening Behavior
Zi Haur Pang, Yahui Fu, Divesh Lala, Mikey Elmers, Koji Inoue, and Tatsuya Kawahara
- 7. Modeling of Touch Gestures during Human Hugging Interactions and Implementing on a Huggable Robot
Takuto Akiyoshi, Hidenobu Sumioka, Junya Nakanishi, Hirokazu Kato, and Masahiro Shiomi
- 9. Wizard-of-Oz Dialogue Data Collection for a Mobile Guide Robot
Ao Guo, Shota Mochizuki, Sanae Yamashita, Saya Nikaido, Tomoko Isomura, and Ryuichiro Higashinaka
- 13. Android Avatars' Motion Generation for Distance Learning
Naoki Kodani, Takahisa Uchida, Nahoko Kameo, Kurima Sakai, Tomo Funayama, Takashi Minato, Akane Kikuchi, Hiroshi Ishiguro
15:40 Poster Sessions 2 (& Coffee Break)
- 2. Out-of-Vocabulary Word Detection in Spoken Dialogues Based on Joint Decoding with User Response Patterns
Miki Oshio, Ryu Takeda, Kazunori Komatani
- 4. An Android Avatar with Operator-like Conversational Abilities
Yuya Komai, Takahisa Uchida, Naoki Kodani, Hiroshi Ishiguro
- 6. Ownership Information Acquisition of Objects in the environment by Active Question Generation with Multimodal Large Language Models and Probabilistic Generative Models
Saki Hashimoto, Tomochika Ishikawa, Shoichi Hasegawa, Akira Taniguchi, Yoshinobu Hagiwara, Lotfi ElHafi, Tadahiro Taniguchi
- 8. Impact of Data Size and Recording Environments on the Performance of Anomaly Detection in Human-Robot Interaction
Shota Mochizuki, Sanae Yamashita, Tomonori Kubota, Kohei Ogawa, Ryuichiro Higashinaka
- 12. Real-Time Framework for Speech Extraction Based on Independent Low-Rank Matrix Analysis with Spatial Regularization and Rank-Constrained Spatial Covariance Matrix Estimation
Yuto Ishikawa, Tomohiko Nakamura, Norihiro Takamune, Hiroshi Saruwatari
16:10 Keynote 3: Nancy F. Chen (A-STAR, I2R)
Multimodal, Multilingual Generative AI for Education
16:40 Project talk 3: Kazunori Komatani (Osaka University)
Detecting Non-Linguistic Situations in Dialogues
17:00 Oral session
- Investigating the Impact of Gender Stereotypes in Authority on Avatar Robots
Yuan-Chia Chang, Daniel J. Rea, Takayuki Kanda
- Inducing the Perception that a Chatbot is Operated by an Android
Seiya Mitsuno, Ryunosuke Kawashima, Midori Ban, Takahisa Uchida, Hiroshi Ishiguro, Yuichiro Yoshikawa
- Analysis and Detection of Differences in Spoken User Behaviors between Autonomous and Wizard-of-Oz Systems
Mikey Elmers, Koji Inoue, Divesh Lala, Keiko Ochi, and Tatsuya Kawahara
17:45 Panel discussions
18:30 Reception
Contact:
Tatsuya Kawahara (Kyoto University)
E-mail: kawahara@i.kyoto-u.ac.jp