Signal Processing Research Group
Contents
Top
Research Topics
People
Publications
Organization
NTT Communication Science Laboratories
Media Information Laboratory
Recognition Research Group
Signal Processing Research Group
Communication environment research group
Innovative Communication Laboratory
Human and Information Science Laboratory
Moriya Research Laboratory
NTT Science and Core Technology Laboratory Group
Nippon Telegraph and Telephone Corporation (NTT)

| 2011 | 2010 | 2009 | 2008 | 2007 | 2006 | 2005 | 2004 | 2003 | 2002 | 2001 | 2000 |

Publications

2007

Journal Papers

  1. S. Araki, H. Sawada, R. Mukai and S. Makino, “Underdetermined Blind Sparse Source Separation for Arbitrarily Arranged Multiple Sensors,” Signal Processing, vol. 87, pp. 1833-1847, February. 2007. doi:10.1016/j.sigpro.2007.02.003.
  2. M. Knaak (Technical University Berlin), S. Araki and S. Makino, “Geometrically Constrained Independent Component Analysis,” IEEE Trans. Audio, Speech and Language Processing, vol. 15, No. 2, pp. 715-726, February, 2007.
  3. T. Yamamoto, I. Oowada, H. Yip, A. Uchida, S. Yoshimori, K. Yoshimura, J. Muramatsu, S. Goto, and P. Davis, “Common-chaotic-signal induced synchronization in semiconductor lasers,” Opt. Express, vol.15, no.7, pp.3974-3980, April 2007.
  4. Hiroko Kato Solvang, Kentaro Ishizuka, and Masakiyo Fujimoto, “A voice activity detection based on an AR-GARCH model,” IEICE Transaction on Information Systems, Vol.J90-D, No.12, pp.3210-3220, 2007 (in Japanese).
  5. H. Sawada, S. Araki, R. Mukai and S. Makino, “Grouping Separated Frequency Components with Estimating Propagation Model Parameters in Frequency-Domain Blind Source Separation,” IEEE Trans. Audio, Speech & Language Processing, vol. 15, no. 5, pp. 1592-1604, July 2007.
  6. K. Ishizuka, R. Mugitani, H. Kato, and S. Amano, “Longitudinal developmental changes in spectral peaks of vowels produced by Japanese infants,” The Journal of the Acoustical Society of America, Vol.121, No.11, pp.2272-2282, 2007.
  7. K. Kinoshita, T. Nakatani and M. Miyoshi, “Fast estimation of a precise dereverberation filter based on the harmonic structure of speech,” Acoustical Science and Technology (AST)
  8. T. Yoshioka, T. Hikichi, and M. Miyoshi, “Dereverberation by using time-variant nature of speech production system,” EURASIP Journal on Advances in Signal Processing, vol. 2007, article ID 65698, doi:10.1155/2007/65698, 2007.
  9. T. Hori, C. Hori, Y. Minami, and A. Nakamura, “Efficient WFST-based one-pass decoding with on-the-fly hypothesis rescoring in extremely large vocabulary continuous speech recognition,” IEEE Trans., Audio, Speech and Language Processing, Vol. 15, pp. 1352-1365, 2007.

Book Chapter, Tutorial Papers

  1. H. Sawada, S. Araki, and S. Makino, “Frequency-Domain Blind Source Separation,” in Blind Speech Separation, S. Makino T.-W. Lee and H. Sawada, Eds., Springer, 2007.
  2. S. Araki, H. Sawada and S. Makino, “K-means based Underdetermined Blind Speech Separation,” in Blind Speech Separation, S. Makino T.-W. Lee and H. Sawada, Eds., Springer, 2007.

Peer-reviewed Conference Papers

  1. T. Nakatani, B.-H. Juang, T. Hikichi, T. Yoshioka, K. Kinoshita, M. Delcroix, and M. Miyoshi, “Study on speech dereverberation with autocorrelation codebook,” in Proceedings of the 2007 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), vol. 1, pp. 193-196, April 2007.
  2. M. Fujimoto, K. Ishizuka, and H. Kato, “Noise Robust Voice Activity Detection Based on Statistical Model and Parallel Non-linear Kalman filtering,” Proc. ICASSP '07, Vol. IV, pp. 797-800, April 2007.
  3. S. Araki, H. Sawada, and S. Makino, “Blind speech separation in a meeting situation with maximum SNR beamformers,” ICASSP2007, vol. 1, pp. 41-44, April 2007.
  4. J. Cermak, S. Araki, H. Sawada and S. Makino, “Blind Source Separation Based on Beamformer Array and Time Frequency Binary Masking,” in Proc. ICASSP2007, vol. I, pp. 145 -148, April 2007.
  5. J. E. Rubio, K. Ishizuka, H. Sawada, S. Araki, T. Nakatani and M. Fujimoto, “Two-Microphone Voice Activity Detection Based on the Homogeneity of the Direction of Arrival Estimates,” in Proc. ICASSP2007, vol.4, pp. 385-388, April 2007.
  6. T. Nakatani, T. Hikichi, K. Kinoshita, T. Yoshioka, M. Delcroix, M. Miyoshi, and Biing-Hwang Juang, “Robust blind dereverberation of speech signals based on characteristics of short-time speech segments,” in Proceedings of the 2007 IEEE International Symposium on Circuits and Systems (ISCAS 2007), pp. 2986-2989, May. 2007.
  7. H. Sawada, S. Araki and S. Makino, “Measuring Dependence of Bin-wise Separated Signals for Permutation Alignment in Frequency-domain BSS,” in Proc. ISCAS2007, pp. 3247 - 3250, May 2007.
  8. M. Fujimoto and K. Ishizuka, “Noise Robust Voice Activity Detection Based on Switching Kalman Filtering,” Proc. Eurospeech '07, pp. 2933-2936, August 2007.
  9. T. Oba, T. Hori, and A. Nakamura, “A Study of Efficient Discriminative Word Sequences for Reranking of Recognition Results based on N-gram Counts,” Interspeech2007, pp. 1753-1756, August 2007.
  10. T. Yoshioka, T. Nakatani, T. Hikichi, and M. Miyoshi, “Overfitting-resistant speech dereverberation,” in Proceedings of the 2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2007), pp. 163-166, October 2007.
  11. T. Nakatani, B.-H. Juang, T. Yoshioka, K. Kinoshita, and M. Miyoshi, “Importance of energy and spectral features in Gaussian source model for speech dereverberation,” in Proceedings of the 2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2007), pp. 299-302, October 2007.
  12. Y. Minami, M. Sawaki, K. Dohsaka, R. Higashinaka, K. Ishizuka, H. Isozaki, T. Matsubayashi, M. Miyoshi, A. Nakamura, T. Oba, H. Sawada, T. Yamada, and E. Maeda, “The world of Mushrooms: Human-computer interaction prototype systems for ambient intelligence,” Proceedings of the 9th International Conference on Multimodal Interfaces (ICMI2007), 2007.
  13. I. Oowada, Y. Yamamoto, H. Yip, H. Arizumi, A. Uchida, S. Yoshimori, K. Yoshimura, J. Muramatsu, S. Goto, and P. Davis, “Synchronization in semiconductor lasers subject to a common a common chaotic drive signal,” Proceedings of the 15th IEEE International Workshop on Nonlinear Dynamics of Electronic Systems Tokushima, Japan, pp.149-152, 2007.
  14. H. Sawada, S. Araki, and S. Makino, “A two-stage frequency-domain blind source separation method for underdetermined convolutive mixtures,” WASPAA2007.
  15. H. Sawada, S. Araki, and S. Makino, “MLSP 2007 data analysis competition: Frequency-domain blind source separation for convolutive mixtures of speech/and audio,” MLSP2007, 2007.
  16. J. E. Rubio, K. Ishizuka, H. Sawada, S. Araki, T. Nakatani, and M. Fujimoto, “Two-microphone voice activity detection based on the homogeneity of the direction of arrival estimate,” Proceedings of the 32nd International Conference on Acoustics, Speech, and Signal Processing (ICASSP2007), Vol.4, pp.385-388, 2007.
  17. J. Muramatsu, “Effect of random permutation of symbols in a sequence,” Proceedings of the 2007 IEEE International Symposium on Information Theory, Nice, France, pp.1486-1490, 2007.
  18. K. Ishizuka, T. Nakatani, M. Fujimoto, and N. Miyazaki, “Noise robust front-end processing with voice activity detection based on periodic to aperiodic component ratio,” Proceedings of the 10th European Conference on Speech Communication and Technology (Interspeech2007 - Eurospeech), pp.230-233, 2007.
  19. M. Fujimoto and K. Ishizuka, “Noise robust voice activity detection based on switching Kalman filter,” Proceedings of the 10th European Conference on Speech Communication and Technology (Interspeech2007 - Eurospeech), pp.2933-2936, 2007.
  20. M. Fujimoto, K. Ishizuka, and H. Kato, “Noise robust voice activity detection based on statistical model and parallel non-linear Kalman filtering,” Proceedings of the 32nd International Conference on Acoustics, Speech, and Signal Processing (ICASSP2007), Vol.4, pp.797-800, 2007.
  21. R. Mugitani, T. Kobayashi, and K. Ishizuka, “Perceptual development of phonemic categories for Japanese single/geminate obstruents,” The 32nd Boston University Conference on Language Development (BUCLD32), 2007.
  22. S. Miyake, and J. Muramatsu, “Constructions of a lossy source code using LDPC matrices,” Proceedings of the 2007 IEEE International Symposium on Information Theory, Nice, France, pp.1106-1110, 2007.
  23. S. Watanabe and A. Nakamura, “Incremental adaptation based on a macroscopic time evolution system,” Proc. ICASSP 2007, vol. 4, pp. 769-772, 2007.
  24. Y. Minami, M. Sawaki, K. Dohsaka, R. Higashinaka, K. Ishizuka, H. Isozaki, T. Matsubayashi, M. Miyoshi, A. Nakamura, T. Oba, H. Sawada, T. Yamada, and E. Maeda, “The world of Mushrooms: Human-computer interaction prototype systems for ambient intelligence,” Proceedings of the 9th International Conference on Multimodal Interfaces (ICMI2007), pp.366-373, 2007.

Other Conference Papers

  1. K. Kinoshita, M. Delcroix, T. Nakatani and M. Miyoshi, “Dereverberation of real recordings using linear prediction-based microphone array,” Audio Engineering Society (AES) 13th Regional Convention, Tokyo, 2007
  2. K. Kinoshita, M. Delcroix, T. Nakatani and M. Miyoshi, “Multi-step linear prediction based speech enhancement in noisy reverberant environment,” Proc. of Interspeech, pp.854-857, 2007