Signal Processing Research Group
Contents
Top
Research Topics
People
Publications
Organization
NTT Communication Science Laboratories
Media Information Laboratory
Recognition Research Group
Signal Processing Research Group
Communication environment research group
Innovative Communication Laboratory
Human and Information Science Laboratory
Moriya Research Laboratory
NTT Science and Core Technology Laboratory Group
Nippon Telegraph and Telephone Corporation (NTT)

| 2011 | 2010 | 2009 | 2008 | 2007 | 2006 | 2005 | 2004 | 2003 | 2002 | 2001 | 2000 |

Publications

2010

Journal Papers

  1. T. Yoshioka, T. Nakatani, M. Miyoshi, and H. G. Okuno, “New method for blind separation and dereverberation of highly reverberant mixtures,” accepted for publication in IEEE Transactions on Audio, Speech, and Language Processing, now available on IEEE Xplore, January 2010.
  2. T. Oba, T. Hori, and A. Nakamura, “Improved Sequential Dependency Analysis Integrating Labeling-based Sentence Boundary Detection,” IEICE, Vol.E93-D, No.5, pp.-, May. 2010.
  3. J. Muramatsu, and S. Miyake “Hash property and coding theorems for sparce matrices and maximal-likelihood coding,” IEEE Transactions on Information Theory, vol. IT-56, no. 5, pp. 2143-2167, May 2010.
  4. J. Muramatsu, and S. Miyake “Hash property and fixed-rate universal coding theorems,” IEEE Transactions on Information Theory, vol. IT-56, no. 6, pp. 2688-2698, Jun. 2010.
  5. J. Muramatsu, and S. Miyake, “Construction of broadcast channel code based on hash property,” in Proceedings of the 2010 IEEE International Symposium on Information Theory, pp. 575-579, 2010.
  6. K. Ishizuka, S. Araki, and T. Kawahara, “Speech activity detection for muti-party conversation analyses based on likelihood ratio test on spatial magnitude,” IEEE Transaction on Audio, Speech, and Language Processing (in press).
  7. K. Ishizuka, T. Nakatani, M. Fujimoto, and N. Miyazaki, “Noise robust voice activity detection based on periodic to aperiodic component ratio,” Speech Communication, Vol.52, No.1, pp. 41-60, 2010.
  8. S. Araki, H. Sawada, and S. Makino, “Blind Speech Separation in a Meeting Situation with Maximum SNR Beamformers,” IEEE Trans. Audio, Speech, and Language Processing, (submitting)
  9. S. Watanabe and A. Nakamura, “Predictor-Corrector Adaptation based on a Macroscopic Time Evolution System,” IEEE Transactions on Audio, Speech, and Language Processing, vol. 18, issue 2, pp. 395-406, 2010.

Book Chapter, Tutorial Papers

  1. T. Yoshioka, T. Nakatani, K. Kinoshita, and M. Miyoshi, “Speech dereverberation and denoising based on time varying speech model and autoregressive reverberation model,” to appear in Speech Processing in Modern Communication: Challenges and Perspectives, Israel Cohen, Jacob Benesty, and Sharon Gannot (eds.), Springer, pp. 151-182, February. 2010.
  2. M. Fujimoto, K. Takeda, and S. Nakamura, “Chapter 4.4.2: An evaluation database for in-car speech recognition and its common evaluation framework,” in “ Resources and Standards of Spoken Language Systems - Advances in Oriental Spoken Language Processing,” World Scientific Publishing Co., March 2010.
  3. M. Miyoshi, M. Delcroix, K. Kinoshita, T. Yoshioka, T. Nakatani, and T. Hikichi, “Inverse-filtering for speech dereverberation without the use of room acoustics information,” to appear in Speech Dereverberation, Patrik A. Naylor and Nikolay Gaubitch (eds.), Springer.
  4. M. Fujimoto, “Chapter 1: Integration of statistical model-based voice activity detection and noise suppression for noise robust speech recognition,” in “ Advances in Robust Speech Recognition Technology,” Bentham Publishing Services. (in publishing)

Peer-reviewed Conference Papers

  1. T. Yoshioka, T. Nakatani, and H. G. Okuno, “Noisy speech enhancement based on prior knowledge about spectral envelope and harmonic structure,” in Proceedings of the 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), pp. 4270-4273, March 2010.
  2. N. Yasuraoka, T. Yoshioka, T. Nakatani, A. Nakamura, and Hiroshi G. Okuno, “Music dereverberation using harmonic structure source model and Wiener filtering,” in Proceedings of the 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), pp. 53-56, March 2010.
  3. T. Hori, S. Watanabe, and A. Nakamura, “Search Error Risk Minimization in Viterbi Beam Search for Speech Recognition,” in Proc. ICASSP2010, pp. 4934-4937, March 2010.
  4. T. Oba, T. Hori and A. Nakamura, “A Comparative Study on Methods of Weighted Language Model Training for Reranking LVCSR N-best Hypotheses,” in Proc. ICASSP2010, pp. 5126-5129, March 2010.
  5. S. Watanabe, T. Hori, E. McDermott, and A. Nakamura, “A Discriminative Model for Continuous Speech Recognition Based on Weighted Finite State Transducers,” in Proc. ICASSP2010, pp. 4922-4925, March 2010.
  6. A. Ogawa and A. Nakamura, “Discriminative confidence and error cause estimation for extended speech recognition function,” Proc. ICASSP, pp. 4454-4457, March 2010.
  7. A. Ogawa and A. Nakamura, “A novel confidence measure based on marginalization of jointly estimated error cause probabilities,” Proc. Interspeech, September. 2010.
  8. J. Muramatsu, K. Yoshimura and P. Davis, “Information theoretic security based on bounded observability,” Proceedings of the 4th International Conference on Information Theoretic Security, Lecture Notes on Computer Science (LNCS), vol.5973, pp.128-139, Splinger (in press).
  9. D. Cournapeau, S. Watanabe, A. Nakamura, and T. Kawahara, “Using Online Model Comparison In The Variational Bayes Framework For Online Unsupervised Voice Activity Detection,” ICASSP 2010, pp. 4462-4465, 2010.
  10. E. McDermott, S. Watanabe, and A. Nakamura, “Discriminative Training Based On An Integrated View Of MPE And MMI In Margin And Error Space,” ICASSP 2010, pp. 4894-4897, 2010.
  11. H. Watanabe, S. Katagiri, K. Yamada, E. McDermott, A. Nakamura, S. Watanabe, and M. Ohsaki, “Minimum Error Classification With Geometric Margin Control,” ICASSP 2010, pp. 2170-2173, 2010.
  12. K. Aoyama, S. Watanabe, H. Sawada, Y. Minami, N. Ueda, and K. Saito, “Fast Similarity Search On A Large Speech Data Set With Neighborhood Graph Indexing,” ICASSP 2010, pp. 5358-5361, 2010.
  13. S. Araki, T. Nakatani and H. Sawada, “Simultaneous clustering of mixing and spectral model parameters for blind sparse source separation,” ICASSP2010, 2010.
  14. T. Hori, S. Watanabe, and A. Nakamura, “Search Error Risk Minimization In Viterbi Beam Search For Speech Recognition,” ICASSP 2010, pp. 4934-4937, 2010.
  15. T. Nakatani and S. Araki, “SINGLE CHANNEL SOURCE SEPARATION BASED ON SPARSE SOURCE OBSERVATION MODEL WITH HARMONIC CONSTRAINT,” ICASSP2010, 2010.
  16. Y. Ansai, S. Araki, S. Makino, T. Nakatani, T. Yamada, A. Nakamura and N. Kitawaki, “Cepstral Smoothing of Separated Signals for Underdetermined Speech Separation,” ISCAS2010, (to appear)

Other Conference Papers