Signal Processing Research Group
Contents
Top
Research Topics
People
Publications
Organization
NTT Communication Science Laboratories
Media Information Laboratory
Recognition Research Group
Signal Processing Research Group
Communication environment research group
Innovative Communication Laboratory
Human and Information Science@Laboratory
Moriya Research Laboratory
NTT Science and Core Technology Laboratory Group
Nippon Telegraph and Telephone Corporation (NTT)

| 2011 | 2010 | 2009 | 2008 | 2007 | 2006 | 2005 | 2004 | 2003 | 2002 | 2001 | 2000 |

Publications

2005

Journal Papers

  1. A. Blin, S. Araki, and S. Makino, “Underdetermined blind separation of convolutive mixtures of speech using time-frequency mask and mixing matrix estimation,” IEICE Trans. Fundamentals, Vol.E88-A, No.7, pp.1693-1700, 2005.
  2. H. Sawada, R. Mukai, S. Araki, and S. Makino, “Estimating the number of sources using independent component analysis,” Acoustical Science and Technology, vol. 26, no. 5, pp.450-452, 2005.
  3. K. Kinoshita, T. Nakatani and M. Miyoshi, “Harmonicity based dereverberation for improving automatic speech recognition performance and speech intelligibility” IEICE,2005.
  4. S. Araki, S. Makino, R. Aichner(Univ. Erlangen-Nuremberg), T. Nishikawa(NAIST) and H. Saruwatari(NAIST), “Subband-based Blind Separation for Convolutive Mixtures of Speech,” IEICE Trans. Fundamentals, E88-A(12), pp. 3593-3603, 2005.
  5. S. Makino, H. Sawada, R. Mukai, and S. Araki, “Blind source separation of convolutive mixtures of speech in frequency domain,” IEICE Trans. Fundamentals, Vol.E88-A, No.7, pp.1640-1655, 2005. (invited)

Book Chapter, Tutorial Papers

  1. S. Araki, S. Makino, “Subband Based Blind Source Separation,” In J. Benesty, S. Makino, and J. Chen, editors, Speech Enhancement, pp. 329-352, Springer, March 2005.
  2. H. Sawada, R. Mukai, S. Araki and S. Makino, “Frequency-domain blind source separation,” In J. Benesty, S. Makino, and J. Chen, editors, Speech Enhancement, pp.299-327, Springer, March 2005.
  3. R. Mukai, H. Sawada, S. Araki and S. Makino, “Real-time blind source separation for moving speech signals,” In J. Benesty, S. Makino, and J. Chen, editors, Speech Enhancement, pp.353-369, Springer, March 2005.

Peer-reviewed Conference Papers

  1. S. Araki, S. Makino, H. Sawada and R. Mukai, “Reducing musical noise by a fine-shift overlap-add method applied to source separation using a time-frequency mask,” ICASSP2005, vol. III, pp. 81-84, March 2005.
  2. S. Araki, S. Makino, H. Sawada, and R. Mukai, “Source extraction from speech mixtures with null-directivity pattern based mask,” Proc. of Joint Workshop on Hands-Free Speech Communication and Microphone Arrays (HSCMA 2005), pp. d1-d2, March 2005.
  3. H. Sawada, S. Araki, R. Mukai, S. Makino, “Blind Extraction of a Dominant Source Signal from Mixtures of Many Sources,” ICASSP2005, vol. III, pp. 61-64, March 2005.
  4. H. Sawada, R. Mukai, S. Araki, and S. Makino, “Frequency-domain blind source separation without array geometry information,” Proc. of Joint Workshop on Hands-Free Speech Communication and Microphone Arrays (HSCMA 2005), pp.d13-d14, March 2005.
  5. R. Mukai, H. Sawada, S. Araki, and S. Makino, “Blind source separation and {DOA} estimation using small 3-D microphone array,” Proc. of Joint Workshop on Hands-Free Speech Communication and Microphone Arrays (HSCMA 2005), pp. d9-d10, March 2005.
  6. M. Schuster and T. Hori, “Efficient generation of high-order context-dependent weighted finite state transducers for speech recognition,” in Proc. ICASSP2005, Vol I, pp. 201-204, March 2005.
  7. T. Yoshioka, T. Hikichi, M. Miyoshi, and H. G. Okuno, “Blind estimation of room resonances using popular, classical, and ‚Šazz Music,” in Proceedings of the 118th Audio Engineering Society Convention (AES 118), article ID 6632, May. 2005.
  8. H. Sawada, S. Araki, R. Mukai, and S. Makino, “Blind extraction of a dominant source from mixtures of many sources using ICA and time-frequency masking,” Proc. of 2005 IEEE International Symposium on Circuits and Systems (ISCAS 2005), pp. 5882-5885, May 2005.
  9. H. Sawada, R. Mukai, S. Araki, and S. Makino, “Multiple source localization using independent component analysis,” Proc. of 2005 IEEE AP-S International Symposium and USNC/URSI National Radio Science Meeting, July 2005.
  10. H. Kato, Y. Nagahara (Meiji Univ.), S. Araki, and H. Sawada, “Pearson distribution system applied to blind speech separation,” 25th European Meeting of Statisticians (EMS2005), p.394, July 2005.
  11. T. Hori and A. Nakamura, “Generalized fast on-the-fly composition algorithm for WFST-based speech recognition,” in Proc. Interspeech2005-Eurospeech, pp. 557-560, September 2005.
  12. M. Schuster, T. Hori, and A. Nakamura, “Experiments with Probabilistic Principal Component Analysis in LVCSR,” in Proc. Interspeech2005-Eurospeech, pp. 1685-1688, September 2005.
  13. R. Mukai, H. Sawada, S. Araki, and S. Makino, “Blind Source Separation of 3-D Located Many Speech Signals,” in Proc. WASPAA2005, pp. 9-12, October 2005.
  14. T. Oba, T. Hori, and A. Nakamura, “Dependency modeling for integrated spontaneous speech processing,” in Proc. ASRU2005, pp. 284-289, November 2005.
  15. M. Schuster, T. Hori, “Construction of weighted finite state transducers for very wide context-dependent acoustic models,” in Proc. ASRU2005, pp. 162-167, November 2005.
  16. T. Oba, T. Hori, A. Nakamura, “Sequential Dependency Analysis for Spontaneous Speech Understanding,” ASRU2005, pp. 284-289, November 2005.
  17. F. Flego, S. Araki, H. Sawada, T. Nakatani, and S. Makino, “Underdetermined blind separation for speech in real environments with F0 adaptive comb filtering,” IWAENC2005, pp. 93-96, 2005.
  18. H. Sawada, R. Mukai, S. Araki, and S. Makino, “Real-time blind extraction of dominant target sources from many background interferences,” IWAENC2005, pp. 73-76, 2005.
  19. K. Ishizuka and T. Nakatani, “Robust speech feature extraction using subband based periodicity and aperiodicity decomposition in the frequency domain,” Proceedings of the Joint Workshop on Hands-Free Speech Communication and Microphone Arrays (HSCMA2005), pp.a13-a14, 2005.
  20. K. Ishizuka, H. Kato, and T. Nakatani, “Speech signal analysis with exponential autoregressive model,” Proceedings of the 30th International Conference on Acoustics, Speech, and Signal Processing (ICASSP2005), Vol.1, pp.225-228, 2005.
  21. K. Ishizuka, R. Mugitani, H. Kato, and S. Amano, “A longitudinal analysis of the spectral peaks of vowels for a Japanese infant,” Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech2005 - Eurospeech) pp.1169-1172, 2005.
  22. R. Mugitani, K. Ishizuka, and S. Amano, “Longitudinal development of mora-timed rhythmic structure in Japanese,” The 30th Boston University Conference on Language Development BUCLD30, p.52, 2005.
  23. R. Mukai, H. Sawada, S. Araki, and S. Makino, “Real-Time Blind Source Separation and DOA Estimation Using Small 3-D Microphone Array,” IWAENC2005, pp. 45-48, 2005.
  24. S. Araki, H. Sawada, R. Mukai and S. Makino, “A novel blind source separation method with observation vector clustering,” IWAENC2005, pp.117-120, 2005.
  25. S. Watanabe and A. Nakamura, “Effects of Bayesian predictive classification using variational Bayesian posteriors for sparse training data in speech recognition,” Proc. Interspeech '2005 Eurospeech, pp. 1105-1108, 2005.

Other Conference Papers

  1. K. Kinoshita, T. Nakatani and M. Miyoshi, “ Fast estimation of a precise dereverberation filter based on speech harmonicity,” Proc. Of International Conference on Acoustics, Speech, and Signal Processing(ICASSP), 2005.
  2. K. Kinoshita, T. Nakatani and M. Miyoshi, “Efficient blind dereverberation framework for automatic speech recognition,” Proc. of Interspeech, 2005.