Signal Processing Research Group


Publications

2010

Journal papers

  1. T. Yoshioka, T. Nakatani, M. Miyoshi, and H. G. Okuno, “New method for blind separation and dereverberation of highly reverberant mixtures,” accepted for publication in IEEE Transactions on Audio, Speech, and Language Processing, now available on IEEE Xplore, January 2010.
  2. T. Oba, T. Hori, and A. Nakamura, “Improved Sequential Dependency Analysis Integrating Labeling-based Sentence Boundary Detection,” IEICE, Vol. E93-D, No. 5, pp. -, May 2010.
  3. J. Muramatsu and S. Miyake, “Hash property and coding theorems for sparse matrices and maximum-likelihood coding,” IEEE Transactions on Information Theory, vol. IT-56, no. 5, pp. 2143-2167, May 2010.
  4. J. Muramatsu and S. Miyake, “Hash property and fixed-rate universal coding theorems,” IEEE Transactions on Information Theory, vol. IT-56, no. 6, pp. 2688-2698, Jun. 2010.
  5. J. Muramatsu and S. Miyake, “Construction of broadcast channel code based on hash property,” in Proceedings of the 2010 IEEE International Symposium on Information Theory, pp. 575-579, 2010.
  6. H. Sawada, S. Araki and S. Makino, “Underdetermined Convolutive Blind Source Separation via Frequency Bin-wise Clustering and Permutation Alignment,” IEEE Trans. Audio, Speech, and Language Processing (conditionally accepted).
  7. K. Ishizuka, S. Araki, and T. Kawahara, “Speech activity detection for multi-party conversation analyses based on likelihood ratio test on spatial magnitude,” IEEE Transactions on Audio, Speech, and Language Processing (in press).
  8. K. Ishizuka, T. Nakatani, M. Fujimoto, and N. Miyazaki, “Noise robust voice activity detection based on periodic to aperiodic component ratio,” Speech Communication, Vol.52, No.1, pp. 41-60, 2010.
  9. S. Araki, H. Sawada, and S. Makino, “Blind Speech Separation in a Meeting Situation with Maximum SNR Beamformers,” IEEE Trans. Audio, Speech, and Language Processing (submitted).
  10. S. Watanabe and A. Nakamura, “Predictor-Corrector Adaptation based on a Macroscopic Time Evolution System,” IEEE Transactions on Audio, Speech, and Language Processing, vol. 18, issue 2, pp. 395-406, 2010.
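  11. K. Nishiki, Y. Izumi, S. Watanabe, T. Nishimoto, N. Ono, and S. Sagayama, “Stereo-input speech recognition using sparseness-based blind source separation,” IEICE Transactions D-II (Japanese edition), vol. J93-D, no. 3, pp. 303-311, 2010 (in Japanese).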

Books and review articles

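  1. Y. Itoh and T. Hori, “Recent trends and future prospects of speech recognition research: 7. Application systems of speech recognition: new developments in spoken document retrieval, spoken dialogue, and speech translation,” Journal of the Acoustical Society of Japan, vol. 66, January 2010 (in Japanese).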
  2. T. Yoshioka, T. Nakatani, K. Kinoshita, and M. Miyoshi, “Speech dereverberation and denoising based on time varying speech model and autoregressive reverberation model,” to appear in Speech Processing in Modern Communication: Challenges and Perspectives, Israel Cohen, Jacob Benesty, and Sharon Gannot (eds.), Springer, pp. 151-182, February 2010.
  3. M. Fujimoto, K. Takeda, and S. Nakamura, “Chapter 4.4.2: An evaluation database for in-car speech recognition and its common evaluation framework,” in "Resources and Standards of Spoken Language Systems - Advances in Oriental Spoken Language Processing," World Scientific Publishing Co., March 2010.
  4. M. Miyoshi, M. Delcroix, K. Kinoshita, T. Yoshioka, T. Nakatani, and T. Hikichi, “Inverse-filtering for speech dereverberation without the use of room acoustics information,” to appear in Speech Dereverberation, Patrick A. Naylor and Nikolay Gaubitch (eds.), Springer.
  5. M. Fujimoto, “Chapter 1: Integration of statistical model-based voice activity detection and noise suppression for noise robust speech recognition,” in "Advances in Robust Speech Recognition Technology," Bentham Publishing Services (in press).
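  6. S. Watanabe, “Trends in acoustic model research for speech recognition,” Journal of the Acoustical Society of Japan, vol. 66, no. 1, pp. 599-604, 2010 (in Japanese).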
  7. K. Shirai (ed.), “Trends in Spoken Language Processing,” Corona Publishing, contributed Section 4.3 (published, in Japanese).

International conference proceedings

  1. T. Yoshioka, T. Nakatani, and H. G. Okuno, “Noisy speech enhancement based on prior knowledge about spectral envelope and harmonic structure,” in Proceedings of the 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), pp. 4270-4273, March 2010.
  2. N. Yasuraoka, T. Yoshioka, T. Nakatani, A. Nakamura, and H. G. Okuno, “Music dereverberation using harmonic structure source model and Wiener filtering,” in Proceedings of the 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), pp. 53-56, March 2010.
  3. T. Hori, S. Watanabe, and A. Nakamura, “Search Error Risk Minimization in Viterbi Beam Search for Speech Recognition,” in Proc. ICASSP2010, pp. 4934-4937, March 2010.
  4. T. Oba, T. Hori and A. Nakamura, “A Comparative Study on Methods of Weighted Language Model Training for Reranking LVCSR N-best Hypotheses,” in Proc. ICASSP2010, pp. 5126-5129, March 2010.
  5. S. Watanabe, T. Hori, E. McDermott, and A. Nakamura, “A Discriminative Model for Continuous Speech Recognition Based on Weighted Finite State Transducers,” in Proc. ICASSP2010, pp. 4922-4925, March 2010.
  6. A. Ogawa and A. Nakamura, “Discriminative confidence and error cause estimation for extended speech recognition function,” Proc. ICASSP, pp. 4454-4457, March 2010.
  7. A. Ogawa and A. Nakamura, “A novel confidence measure based on marginalization of jointly estimated error cause probabilities,” Proc. Interspeech, September 2010.
  8. J. Muramatsu, K. Yoshimura, and P. Davis, “Information theoretic security based on bounded observability,” Proceedings of the 4th International Conference on Information Theoretic Security, Lecture Notes in Computer Science (LNCS), vol. 5973, pp. 128-139, Springer (in press).
  9. D. Cournapeau, S. Watanabe, A. Nakamura, and T. Kawahara, “Using Online Model Comparison In The Variational Bayes Framework For Online Unsupervised Voice Activity Detection,” ICASSP 2010, pp. 4462-4465, 2010.
  10. E. McDermott, S. Watanabe, and A. Nakamura, “Discriminative Training Based On An Integrated View Of MPE And MMI In Margin And Error Space,” ICASSP 2010, pp. 4894-4897, 2010.
  11. H. Watanabe, S. Katagiri, K. Yamada, E. McDermott, A. Nakamura, S. Watanabe, and M. Ohsaki, “Minimum Error Classification With Geometric Margin Control,” ICASSP 2010, pp. 2170-2173, 2010.
  12. K. Aoyama, S. Watanabe, H. Sawada, Y. Minami, N. Ueda, and K. Saito, “Fast Similarity Search On A Large Speech Data Set With Neighborhood Graph Indexing,” ICASSP 2010, pp. 5358-5361, 2010.
  13. S. Araki, T. Nakatani and H. Sawada, “Simultaneous clustering of mixing and spectral model parameters for blind sparse source separation,” ICASSP2010, 2010.
  14. T. Nakatani and S. Araki, “Single channel source separation based on sparse source observation model with harmonic constraint,” ICASSP2010, 2010.
  15. Y. Ansai, S. Araki, S. Makino, T. Nakatani, T. Yamada, A. Nakamura and N. Kitawaki, “Cepstral Smoothing of Separated Signals for Underdetermined Speech Separation,” ISCAS2010 (to appear).

Other conference proceedings

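  1. Y. Kubo, S. Watanabe, A. Nakamura, and T. Kobayashi, “Introducing lattice-based hypothesis representation and a parallelizable optimization method into minimum relative entropy discriminative training,” IPSJ SIG Technical Report, Vol. 2010-SLP-80, No. 8, February 2010 (in Japanese).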
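  2. T. Yoshioka, T. Nakatani, and H. G. Okuno, “Speech enhancement combining prior training of spectral envelopes with a harmonic structure model,” Proceedings of the Acoustical Society of Japan 2010 Spring Meeting, 3-5-8, pp. 773-776, March 2010 (in Japanese).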
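  3. N. Yasuraoka, T. Yoshioka, T. Nakatani, A. Nakamura, and H. G. Okuno, “Dereverberation of music audio signals based on a harmonic GMM and Wiener filtering,” Proceedings of the 72nd National Convention of IPSJ, 5T-4, vol. 2, pp. 181-182, March 2010 (in Japanese).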
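  4. M. Fujimoto, S. Watanabe, and T. Nakatani, “Evaluation and discussion of a voice activity detection method using a Dirichlet prior,” Acoustical Society of Japan 2010 Spring Meeting, 1-6-5, pp. 13-17, March 2010 (in Japanese).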
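  5. Tashiro (AS Labs), Araki, Kimura, and Nakamura, “Proposal of an optical access scheme that enables upstream voice communication during power outages,” IEICE General Conference 2010, March 2010 (in Japanese).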
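  6. S. Watanabe, T. Hori, E. McDermott, and A. Nakamura, “A discriminative model for continuous speech recognition using weighted finite-state transducers,” Proceedings of the Acoustical Society of Japan Meeting, 1-6-13, March 2010 (in Japanese).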
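  7. T. Hori, S. Watanabe, and A. Nakamura, “Improving Viterbi beam search based on search error risk minimization,” Proceedings of the Acoustical Society of Japan Meeting, 2-6-7, March 2010 (in Japanese).
  8. R. Masumura, T. Oba, A. Ito, and S. Makino, “Acoustic models based on linear classifiers,” Proceedings of the Acoustical Society of Japan Meeting, pp. 29-30, March 2010 (in Japanese).
  9. T. Oba and Y. Minami, “An interpretation of classification problems via reinforcement learning,” IPSJ National Convention, Vol. 2, pp. 93-94, March 2010 (in Japanese).
  10. A. Ogawa and A. Nakamura, “A study of discriminative models for estimating confidence and error causes,” Proceedings of the Acoustical Society of Japan Meeting, 1-Q-6, March 2010 (in Japanese).
  11. A. Ogawa and A. Nakamura, “A confidence measure based on marginalization of jointly estimated error cause probabilities,” Proceedings of the Acoustical Society of Japan Meeting, 1-Q-19, September 2010 (in Japanese).
  12. Ansai (Univ. of Tsukuba), Araki, Makino, Nakatani, Yamada, Nakamura, and Kitawaki, “Cepstral smoothing of separated speech for underdetermined source separation,” Acoustical Society of Japan 2010 Spring Meeting, 2010 (in Japanese).
  13. Araki, Nakatani, and Sawada, “Sparse source separation based on simultaneous clustering of inter-microphone phase differences and spectral envelopes,” Acoustical Society of Japan 2010 Spring Meeting, 2010 (in Japanese).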