Signal Processing Research Group
Contents
Top
Research Topics
People
Publications
Organization
NTT Communication Science Laboratories
Media Information Laboratory
Recognition Research Group
Signal Processing Research Group
Communication environment research group
Innovative Communication Laboratory
Human and Information Science Laboratory
Moriya Research Laboratory
NTT Science and Core Technology Laboratory Group
Nippon Telegraph and Telephone Corporation (NTT)

| 2011 | 2010 | 2009 | 2008 | 2007 | 2006 | 2005 | 2004 | 2003 | 2002 | 2001 | 2000 |

Publications

2008

Journal Papers

  1. J. Muramatsu, “Effect of random permutation of symbols in a sequence,” IEEE Transactions on Information Theory, vol.IT-54, no.1, pp.78-86, January. 2008.
  2. J. Muramatsu, K. Yoshimura, K. Arai, and P. Davis, “Some results on secret key agreement using correlated sources,” NTT Technical Review, vol.6, No.2, February. 2008.
  3. M. Fujimoto and K. Ishizuka, “Noise Robust Voice Activity Detection Based on Switching Kalman Filter,” IEICE Transactions on Information and Systems, Vol. E91-D, No. 3, pp. 467-477, March. 2008.
  4. S. Miyake, and J. Muramatsu, “A construction of lossy source code using LDPC matrices, IEICE Transactions on Fundamentals,” vol.E91-A, no.6, pp.1488-1501, June 2008.
  5. T. Oba, T. Hori, and A. Nakamura, “Sequential Dependency Analysis for Online Spontaneous Speech Processing,” Speech Communication, Volume 50, Issue 7, pp. 616-625, July 2008.
  6. T. Nakatani, B.-H. Juang, T. Yoshioka, K. Kinoshita, M. Delcroix, and M. Miyoshi, “Speech dereverberation based on maximum likelihood estimation with time-varying Gaussian source model,” IEEE Transactions on Audio, Speech and Language Processing, vol. 16, no. 8, pp. 1512-1527, November 2008.
  7. K. Yoshimura, J. Muramatsu, and P. Davis, “Conditions for common-noise-induced synchronization in time-delay systems,” Physica D, vol. 237, no. 23, pp.3146-3152, December. 2008.
  8. H. K. Solvang, K. Ishizuka, and M. Fujimoto, “Voice activity detection based on adjustable linear prediction and GARCH models,” Speech Communication, Vol.50, No.6, pp.476-486, 2008.
  9. T. Nakatani, S. Amano, T. Irino, K. Ishizuka, and T. Kondo, “A method for fundamental frequency estimation and voicing decision: Application to infant utterances recorded in real acoustical environments,” Speech Communication, Vol.50, No.3, pp.203-214, 2008.

Book Chapter, Tutorial Papers

  1. S. Makino, S. Araki, and H. Sawada, “Underdetermined Blind Source Separation using Acoustic Arrays,” in Handbook on Array Processing and Sensor Networks, S. Haykin and K.J. Ray Liu, Eds, Wiley, 2008.

Peer-reviewed Conference Papers

  1. T. Yoshioka, T. Nakatani, T. Hikichi, and M. Miyoshi, “Maximum likelihood approach to speech enhancement for noisy reverberant signals,” in Proceedings of the 2008 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), pp. 4585-4588, March 2008.
  2. T. Yoshioka and M. Miyoshi, “Adaptive suppression of non-stationary noise by using variational Bayesian method,” in Proceedings of the 2008 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), pp. 4889-4892, March 2008.
  3. T. Nakatani, T. Yoshioka, K. Kinoshita, M. Miyoshi, and B.-H., Juang, “Blind speech dereverberation with multi-channel linear prediction based on short time Fourier transform representation,” in Proceedings of the 2008 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), pp. 85-88, March 2008.
  4. M. Fujimoto and K. Ishizuka, and T. Nakatani, “A Voice Activity Detection Based on the Adaptive Integration of Multiple Speech Features and a Signal Decision Scheme,” Proc. ICASSP '08, pp. 4441-4444, March 2008.
  5. T. Oba, T. Hori, and A. Nakamura, “Efficient Discriminative Training of Error Corrective Models Using High-WER Competitors,” Asian Workshop on Speech Science and Technology, IEICE Technical Report SP2007-185-214, pp. 99-104, March 2008.
  6. A. Ogawa and S. Takahashi, “Weighted distance measures for efficient reduction of Gaussian mixture components in HMM-based acoustic model,” Proc. ICASSP, pp. 4173-4176, March 2008.
  7. T. Nakatani, T. Yoshioka, K. Kinoshita, M. Miyoshi, and B.-H., Juang, “Speech dereverberation in short time Fourier transform domain with cross band effect compensation,” in Proceedings of the 2008 Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA 2008), pp. 220-223, May 2008.
  8. T. Yoshioka, T. Nakatani, and M. Miyoshi, “An integrated method for blind separation and dereverberation of convolutive audio mixtures,” in Proceedings of the 16th European Signal Processing Conference (EUSIPCO 2008), CD-ROM Proceedings, August 2008.
  9. T. Yoshioka, T. Nakatani, and M. Miyoshi, “Enhancement of noisy reverberant speech by linear filtering followed by nonlinear noise suppression,” in Proceedings of the 2008 International Workshop on Acoustic Echo and Noise Control (IWAENC 2008), CD-ROM Proceedings, September 2008.
  10. T. Nakatani, T. Yoshioka, K. Kinoshita, M. Miyoshi, and B.-H. Juang, “Incremental estimation of reverberation with uncertainty using prior knowledge of room acoustics for speech dereverberation,” in Proceedings of the 2008 International Workshop on Acoustic Echo and Noise Control (IWAENC 2008), CD-ROM Proceedings, September 2008.
  11. M. Fujimoto, K. Ishizuka, and T. Nakatani, “Study of Integration of Statistical Model-Based Voice Activity Detection and Noise Suppression,” Proc. Interspeech '08, September 2008.
  12. M. Miyoshi, K. Kinoshita, T. Nakatani, and T. Yoshioka, “Principles and applications of dereverberation for noisy and reverberant audio signals,” in Proceedings of the 2008 Asilomar Conference on Signals, Systems, and Computers, CD-ROM Proceedings, October 2008.
  13. S. Miyake, and J. Muramatsu, “A construction of channel code, joint source-channel code, and universal code for arbitrary stationary memoryless channels using sparse matrices,” Proceedings of the 2008 IEEE International Symposium on Information Theory, Toronto, Canada, pp.1193-1197, 2008.
  14. D. Kolossa (TU Berlin), S. Araki , M. Delcroix, T. Nakatani, R. Orglmeister (TU Berlin), S. Makino, “Missing Feature Speech Recognition in a Meeting Situation with Maximum SNR Beamforming,” ISCAS2008, pp. 3218 -3221, 2008.
  15. J. Muramatsu, and S. Miyake, “Hash property and multi-terminal source coding theorems for sparse matrices and maximal-likelihood coding,” Proceedings of the 2008 IEEE International Symposium on Information Theory, Toronto, Canada, pp.424-428, 2008.
  16. J. Muramatsu, and S. Miyake, “Lossy source coding algorithm using lossless multi-terminal source codes,” Proceedings of the 2008 International Symposium on Information Theory and its Applications, Auckland, New Zealand, pp.606-611, 2008.
  17. K. Ishizuka, S. Araki, and T. Kawahara, “Statistical speech activity detection based on spatial power distribution for analyses of poster presentations,” Proceedings of the 10th International Conference on Spoken Language Processing (Interspeech2008 - ICSLP), pp.99-102, 2008.
  18. K. Ishizuka, S. Araki, T. Kawahara, “Statistical Speech Activity Detection based on Spatial Power Distribution for Analyses of Poster Presentations,” Interspeech2008, pp.99-102, 2008.
  19. K. Otsuka, S. Araki, K. Ishizuka, M. Fujimoto, M. Heinrich, J. Yamato, “A Realtime Multimodal System for Analyzing Group Meetings by Combining Face Pose Tracking and Speaker Diarization,” ICMI2008, pp. 257-264, 2008.
  20. K. Otsuka, S. Araki, K. Ishizuka, M. Fujimoto, M. Hinrich, and J. Yamato, “A realtime multimodal system for analyzing group meetings by combining face pose tracking and speaker diarization,” Proceedings of the 10th International Conference on Multimodal Interfaces (ICMI2008), pp. 257-264, 2008.
  21. M. Delcroix, T. Nakatani, and S. Watanabe, “Combined static and dynamic variance adaptation for efficient interconnection of speech enhancement pre-processor with speech recognizer,” Proc. ICASSP 2008 pp. 4073-4076, 2008.
  22. M. Fujimoto, K. Ishizuka, and T. Nakatani, “A voice activity detection based on the adaptive integration of multiple speech features and a signal decision scheme,” Proceedings of the 33rd International Conference on Acoustics, Speech and Signal Processing (ICASSP2008), pp.4441-4444, 2008.
  23. M. Fujimoto, K. Ishizuka, and T. Nakatani, “Study of integration of statistical model-based voice activity detection and noise suppression,” Proceedings of the 10th International Conference on Spoken Language Processing (Interspeech2008 - ICSLP), pp.2008-2011, 2008.
  24. S. Araki, M. Fujimoto, K. Ishizuka, H. Sawada, and S. Makino, “A DOA based speaker diarization system for real meetings,” HSCMA2008, pp.29-32, 2008.
  25. S. Araki, M. Fujimoto, K. Ishizuka, H. Sawada, and S. Makino, “A DOA based speaker diarization system for real meetings,” Proceedings of the Joint Workshop on Hands-Free Speech Communication and Microphone Arrays (HSCMA2008), pp.29-32, 2008.
  26. S. Araki, M. Fujimoto, K. Ishizuka, H. Sawada, and S. Makino, “Speaker indexing and speech enhancement in real meetings / conversations,” Proceedings of the 33rd International Conference on Acoustics, Speech and Signal Processing (ICASSP2008), pp.93-96, 2008.
  27. S. Watanabe and A. Nakamura, “A unified interpretation of adaptation techniques based on a macroscopic time evolution system with indirect/direct approaches,” Proc. ICASSP 2008 pp. 4285-4286, 2008.
  28. T. Hager, S. Araki, K. Ishizuka, M. Fujimoto, T. Nakatani, and S. Makino, “Handling speaker position changes in a meeting diarization system by combining DOA clustering and speaker identification,” Proceedings of the 11th International Workshop on Acoustic Echo and Noise Control (IWAENC2008), 2008.
  29. T. Hager, S. Araki, K. Ishizuka, M. Fujimoto, T. Nakatani, S. Makino, “Handling speaker position changes in a meeting diarization system by combining DOA clustering and speaker identification,” IWAENC2008 CD-ROM proceedings, 2008.
  30. T. Kawahara, H. Setoguchi, K. Takanashi, K. Ishizuka, and S. Araki, “Multi-modal recording, analysis and indexing of poster sessions,” Proceedings of the 10th International Conference on Spoken Language Processing (Interspeech2008 - ICSLP), pp.1622-1625, 2008.
  31. T. Kawahara, H. Setoguchi, K. Takanashi, K. Ishizuka, S. Araki, “Multi-Modal Recording, Analysis and Indexing of Poster Sessions,” Interspeech2008, pp. 1622-1625, 2008.

Other Conference Papers

  1. K. Kinoshita, T. Nakatani, M. Miyoshi and T. Kubota, “A new audio post-production tool for speech dereverberation,” Audio Engineering Society (AES) 125th Convention, San Francisco, 2008.