信号処理研究グループ
コンテンツ
トップページ
研究トピックス
メンバーリスト
発表文献
組織
NTTコミュニケーション科学基礎研究所
メディア情報研究部
メディア認識研究グループ
Signal Processing Research Group
コミュニケーション環境研究
協創情報研究部
人間情報研究部
守谷特別研究室
リンク
先端技術総合研究所
NTT

| 2011 | 2010 | 2009 | 2008 | 2007 | 2006 | 2005 | 2004 | 2003 | 2002 | 2001 | 2000 |

発表文献

2004

論文

  1. R. Mukai, S. Araki, H. Sawada, S. Makino, “Evaluation of Separation and Dereverberation Performance in Frequency Domain Blind Source Separation,” Acoustical Science and Technology, Vol.25, No.2, pp.119-126, March. 2004.
  2. H. Sawada, R. Mukai, S. Araki, S. Makino, “Convolutive Blind Source Separation for more than Two Sources in the Frequency Domain,” Acoustical Science and Technology, the Acoustical Society of Japan, vol.25, no.4, pp. 296-298, July 2004.
  3. R. Mukai, H. Sawada, S. Araki, S. Makino, “Blind Source Separation for Moving Speech Signals using Blockwise ICA and Residual Crosstalk Subtraction,” IEICE Trans. Fundamentals, Special Section on Digital Signal Processing, vol.E87-A, no.8, pp.1941-1948, August, 2004.
  4. H. Sawada, R. Mukai, S. Araki, S. Makino, “A Robust and Precise Method for Solving the Permutation Problem of Frequency-Domain Blind Source Separation,” IEEE Trans. Speech and Audio Processing, vol.12, no.5, pp.530-538, September 2004.
  5. S. Watanabe and A. Nakamura, “Acoustic model adaptation based on coarse/fine training of transfer vectors,” (in Japanese), 情報科学技術レターズ.
  6. S. Watanabe, Y. Minami, A. Nakamura and N. Ueda, “Variational Bayesian Estimation and Clustering for Speech Recognition,” IEEE Transactions on Speech and Audio Processing, vol. 12, pp. 365-381, 2004.

書籍, 解説記事

  1. 牧野昭二 荒木章子, 向井良, 澤田宏, “畳込み混合のブラインド音源分離,” システム/制御/情報, vol.48, no.10, pp.401-408, 2004.
  2. 堀 貴明, 塚田 元, “音声情報処理の最先端「重み付き有限状態トランスデューサによる音声認識」,” 情報処理学会誌「情報処理」45巻10号, pp.1020--1026, October 2004.

国際会議予稿

  1. S. Araki, S. Makino, A. Blin, R. Mukai, and H. Sawada, “Underdetermined Blind Separation for Speech in Real Environments with Sparseness and ICA,” ICASSP2004, vol. III, pp. 881-884, May 2004 (invited).
  2. A. Blin, S. Araki and S. Makino, “A Sparseness-Mixing Matrix Estimation (SMME) Solving the Underdetermined BSS for Convolutive Mixtures,” ICASSP2004, vol. IV, pp. 85-88, May 2004.
  3. R. Mukai, H. Sawada, S. Araki, S. Makino, “Near-Field Frequency Domain Blind Source Separation for Convolutive Mixtures,” ICASSP2004, vol. IV, pp. 49-52, May 2004.
  4. H. Sawada, R. Mukai, S. Araki, S. Makino, “Convolutive Blind Source Separation for more than Two Sources in the Frequency Domain,” ICASSP2004, vol. III, pp. 885-888, May 2004 (invited).
  5. S. Makino, S. Araki, R. Mukai, and H. Sawada, “Audio source separation based on independent component analysis,” in Proc. ISCAS2004 (International Symposium on Circuits and Systems), vol. V, pp. 668-671, May 2004 (invited).
  6. R. Mukai, H. Sawada, S. Araki and S. Makino, “Frequency Domain Blind Source Separation using Small and Large Spacing Sensor Pairs,” ISCAS2004, vol. V, pp. 1-4, May 2004.
  7. S. Araki, S. Makino, H. Sawada and R. Mukai, “Underdetermined Blind Speech Separation with Directivity Pattern based Continuous Mask and ICA,” EUSIPCO2004, pp.1991-1994, September 2004.
  8. S. Araki, S. Makino, H. Sawada and R. Mukai, “Underdetermined Blind Separation of Convolutive Mixtures of Speech with Directivity Pattern based Mask and ICA,” ICA2004, pp.898-905, September 2004.
  9. H. Sawada, S. Winter, S. Araki, R. Mukai, S. Makino, “Estimating the Number of Sources for Frequency-Domain Blind Source Separation,” ICA2004 (5th International Conference on Independent Component Analysis and Blind Signal Separation), pp.610-617, September 2004.
  10. S. Winter, H. Sawada, S. Araki, S. Makino, “Overcomplete BSS for convolutive mixtures based on hierarchical clustering,” ICA2004, pp.652-660, September 2004.
  11. R. Mukai, H. Sawada, S. Araki, S. Makino, “Frequency Domain Blind Source Separation for Many Speech Signals,” ICA2004, pp.461-469, September 2004.
  12. S. Winter, H. Sawada, S. Araki, S. Makino, “Hierarchical Clustering Applied to Overcomplete BSS for Convolutive Mixtures,” SAPA2004 (ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing), Session I-3, October 2004.
  13. A. Blin, S. Araki, and S. Makino, “Underdetermined blind source separation for convolutive mixtures exploiting a sparseness-mixing matrix estimation (SMME),” in Proc. ICA2004 (International Congress on Acoustics), vol. IV, pp. 3139-3142, 2004.
  14. H. Sawada, R. Mukai, S. Araki, S. Makino, “Solving the Permutation and the Circularity Problem of Frequency-Domain Blind Source Separation,” ICA2004 (International Congress on Acoustics), vol. I, pp. 89-92, 2004 (invited).
  15. K. Ishizuka and N. Miyazaki, “Speech feature extraction method representing periodicity and aperiodicity in sub bands for robust speech recognition," Proceedings of the 29th International Conference on Acoustics, Speech, and Signal Processing (ICASSP2004), Vol.1, pp.141-144, 2004.
  16. K. Ishizuka and N. Miyazaki, “Speech feature extraction method representing periodicity and aperiodicity in sub bands for robust speech recognition," The 2nd NTT Workshop on Communication Scene Analysis (CSA2004) Poster presentation, 2004.
  17. K. Ishizuka, N. Miyazaki, T. Nakatani and Y. Minami, “mprovement in robustness of speech feature extraction method using sub-band based periodicity and aperiodicity decomposition," Proceedings of the 8th International Conference on Spoken Language Processing (Interspeech2004 - ICSLP), Vol.2, pp.937-940, 2004.
  18. P. Zolfaghari, H. Kato, S. Watanabe and S. Katagiri, “Speech Spectral Modelling using Mixture of Gaussians,” Proc. SWIM , 2004
  19. P. Zolfaghari, S. Watanabe, A. Nakamura and S. Katagiri, “Bayesian Modelling of the Speech Spectrum Using Mixture of Gaussians,” Proc. ICASSP'04, vol. 1, pp. 553-556, 2004.
  20. R. Mukai, H. Sawada, S. Araki, S. Makino, “A Solution for the Permutation Problem in Frequency Domain BSS using Near- and Far-field Models,” ICA2004 (International Congress on Acoustics), vol. IV, pp. 3135-3138, 2004.
  21. S. Araki, S. Makino, A. Blin, R. Mukai, and H. Sawada, “Underdetermined blind separation of convolutive mixtures of speech by combining time-frequency masks and ICA,” in Proc. ICA2004 (International Congress on Acoustics), vol. I, pp.321-324, 2004.
  22. S. Watanabe and A. Nakamura, “Acoustic model adaptation based on coarse-fine training of transfer vectors and its application to speaker adaptation task,” Proc. ICSLP'04 , vol. 4, 2933-2936, 2004.
  23. S. Watanabe and A. Nakamura, “Robustness of acoustic model topology determined by Variational Bayesian Estimation and Clustering for speech recognition for different speech data sets,” Proc. Workshop on statistical modeling approach for speech recognition - Beyond HMM, pp. 55-60, 2004.
  24. S. Watanabe, A. Sako (Ryukoku Univ.) and A. Nakamura, “Automatic Determination of Acoustic Model Topology using Variational Bayesian Estimation and Clustering,” Proc. ICASSP'04, vol. 1, pp. 813-816, 2004.
  25. T. Hori, C. Hori, and Y. Minami, “Fast on-the-fly composition for weighted finite-state transducers in 1.8 million-word vocabulary continuous-speech recognition,” in Proc. ICSLP2004, Vol. 1, pp. 289-292, 2004.

その他会議予稿

  1. 渡部 晋治, 佐古 淳 (龍谷大学), 中村 篤, “ベイズ的音声認識VBECを用いた音響モデル構造の自動決定,” 音響学会講演論文集, 1-8-6, pp. 11-12, March 2004.
  2. 渡部 晋治, 堀 貴明, Erik McDermott, 南 泰浩, 中村 篤, “音声認識システムSOLONの日本語話し言葉コーパスにおける評価,” 音響学会講演論文集, 2-8-7, pp. 73-74, March 2004.
  3. 木下慶介, 中谷智広, 三好正人, “調波構造を用いた残響除去法の明瞭性と認識率による音声品質評価,” 日本音響学会春季研究発表会, pp.611-612, March 2004.
  4. 堀 貴明, 南 泰浩, “有限状態トランスデューサ型デコーダの性能改善,” 日本音響学会講演論文集, 3-8-5, March 2004.
  5. 渡部 晋治, 中村 篤, “移動ベクトルのコース/ファイン学習にもとづく音響モデルの教師付き適応,” 音響学会講演論文集, 2-4-11, pp. 107-108, September 2004.
  6. 堀 貴明, 堀 智織, 南 泰浩, “WFSTの高速 on-the-fly合成による超大語彙連続音声認識", 日本音響学会講演論文集, 3-1-25, September 2004.
  7. Mike Schuster, 堀 貴明, “Evaluation of beyond triphone order context-dependent models on spontaneous Japanese,” 日本音響学会講演論文集, 3-1-26 September 2004.
  8. M. Schuster and T. Hori, “Efficient generation of high-order context-dependent weighted finite state transducers for speech recognition,” 第6回 音声言語シンポジウム, December 2004.
  9. H. Sawada, R. Mukai, S. Araki, S. Makino, “Blind Source Separation for Convolutive Mixtures in the Frequency Domain,” CSA2004.
  10. K. Kinoshita, T. Nakatani and M. Miyoshi, “Improving automatic speech recognition performance and speech intelligibility with harmonicity based dereverberation,” Proc. Of Interspeech, 2004
  11. K. Kinoshita, T. Nakatani and M. Miyoshi, “Speech dereverberation based on harmonic structure using a single microphone,” Poster presentation at 2004 NTT Workshop on Communication Scene Analysis, 2004
  12. R. Mukai, H. Sawada, S. Araki, S. Makino, “A Solution for the Permutation Problem in Frequency Domain BSS using Near- and Far-field Models,” CSA2004.
  13. S. Araki, S. Makino, H. Sawada and R. Mukai, “Blind Separation of More Speech than Sensors using Time-frequency Masks and ICA,” Proceedings of 2004 NTT Workshop on Communication Scene Analysis (CSA2004), (invited)
  14. S. Winter, H. Sawada,S. Araki, S. Makino, “Underdetermined Blind Source Separation for Convolutive Mixtures of Sparse Signals,” CSA2004
  15. 向井, 澤田, 荒木, 牧野, “狭間隔・広間隔の複数マイクロホン対を用いた周波数領域ブラインド音源分離,” 日本音響学会2004年春季研究発表会講演論文集, pp. 627-628, 2004.
  16. 石塚健太郎, 宮崎昇, 中谷智広, 南泰浩, “音声特徴抽出法SPADEにおける歪補正法の効果,” 日本音響学会講演論文集, 3-1-4, pp.117-118, 秋季, 2004.
  17. 渡部 晋治, “[チュートリアル講演] ベイズ法を用いた音声認識,” 電子情報通信学会技術研究報告, SP2004-74, pp. 13-20, 2004.
  18. 堀 貴明, 渡部 晋治, Erik McDermott, 南 泰浩, 中村 篤, “音声認識システムSOLONの日本語話し言葉コーパスによる評価,” 話し言葉の科学と工学ワークショップ講演予稿集, pp.85-92, 2004.
  19. 澤田, 向井, 荒木, 牧野, “独立成分分析を用いた音源数推定法,” 日本音響学会2004年秋季研究発表会講演論文集, pp. 753-754, 2004.
english   japanese