第32回人工知能学会 AI チャレンジ研究会

第32回人工知能学会 AIチャレンジ研究会(SIG-Challenge) 予稿集

日時：
開催場所：
テーマ：
担当幹事：

B002-1 pp.1-2
[基調講演] 歌声合成技術VOCALOIDとその組み込み機器への応用可能性
　　　剣持秀紀（(株)ヤマハ）
B002-2 pp.3-8
低サイドローブ設計64ch球形マイクロホンアレイの開発
佐々木洋子, 椛澤光隆, 加賀美聡(産総研), 尾路京一(関西電力)
- 方位角・仰角の全方位に高感度な特性を持つ64chマイクロホンアレイの設計と移動ロボットによる音源定位実験について述べる。
B002-3 pp.9-15
言語・非言語情報を統合した指示パターンに対応するロボットの行動則獲得
岡田　将吾，伊豆蔵拓也，名渕　博人，高橋　徹，西田　豊明（京都大学）
- ユーザの言語・非言語を統合した指示パターンと、そのパターンに対応するロボットの行動パターンの対を、音声認識を利用した教師無し学習に基づき獲得する手法を提案する。
B002-4 pp.16-23
Robust Speech Recognition Using Optimized Wavelet Filtering in Reverberant Conditions
Randy Gomez and Tatsuya Kawahara (Kyoto Univ.)
- We present an optimization method of the wavelet parameters for dereverberation in automatic speech recognition (ASR). By tuning the wavelet parameters to improve the acoustic model likelihood, wavelet-based dereverberation methods become more effective in the ASR application. We evaluate several existing wavelet-based methods and optimize them, based on our proposed scheme. Experimental evaluations through ASR experiments demonstrate significant improvement for all methods with the proposed optimization.
B002-5 pp.24-29
Programming by Playing and Approaches for Expressive Robot Performances
Angelica Lim, Takeshi Mizumoto, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno(Kyoto Univ.)
- This paper extends our work with a theremin-playing robot accompanist. Here, we consider that a good accompanist should play with "expression": small deviations in volume, pitch and timing. We propose a Programming by Playing approach that allows a human flutist to transfer a performance to a robot thereminist, keeping these expressive changes intact. We also examine precisely what makes music robots play more or less “robotically”, and survey the field of musical expression in search of a good model to make robots play more like humans.
B002-6 pp.30-35
Joint use of distributed microphone array and laser range finders for speaker identification in meeting
Jani Even，Panikos Heracleous，石井　Carlos 寿憲，萩田　紀博（ATR）
- This paper presents a text-independent speaker identification system for meetings. During the meeting, each of the meeting participants carry a microphone while a human tracker monitors their movements. The human tracker is based on scanning laser range finder and gives the positions of all the participants at any time. The position information is used to track the geometry of the distributed microphone array formed by of all the microphones. Using the geometry of the distributed array it is possible to cancel interfering speeches and noises from the audio stream assigned to each of the participants. Then, using these processed audio stream, the participants are identified by means of Gaussian mixture models (GMM) that were trained before hand. The proposed system is able to perform identification of simultaneously speaking participants and is thus a good candidate system for meeting diarization task. In particular, the use of laser range finders is a novel approach that makes the position estimation immune to acoustic noise and reverberation. An experiment conducted with three subjects reproducing a meeting configuration demonstrates the performance of the system for identification.
B002-7 pp.36-41
ロボットの実環境におけるピッチ抽出に関する考察
石井カルロス寿憲，梁棟，石黒浩，萩田紀博（ATR）
- マイクロホンアレイを利用した音源定位、音源分離の技術およびＣＡＳＡによるピッチ抽出技術を組み合わせ、ロボットの実環境において複数話者のピッチ抽出を評価した。
B002-8 pp.42-47
ＩＣＡに基づく音声対話ロボット雑音抑圧における確率統計モデルを用いたパーミュテーション解決法
平田将久，八田俊之，脇坂龍，猿渡洋，鹿野清宏（奈良先端大）
- ＩＣＡに基づく音声対話ロボット雑音抑圧において，拡散性雑音に対応するため，統計モデルを用いたパーミュテーション解決法を提案する．
B002-9 pp.48-53
能動人工耳介
公文誠，野田佳孝，魚住守治（熊本大学）
- 形状を変化させることが可能で，指向性などを操作可能とすることを目指し，柔軟構造を持つ壁面と，腱駆動による人工耳介を開発したので報告する．
公知日

2010年11月26日

リンク

人工知能学会 AI チャレンジ研究会
Copyright (c) 2010, 人工知能学会 AI チャレンジ研究会.

第32回人工知能学会 AI チャレンジ研究会

第32回 人工知能学会 AIチャレンジ研究会(SIG-Challenge) 予稿集

公知日

リンク

第32回人工知能学会 AIチャレンジ研究会(SIG-Challenge) 予稿集