Publications

For the full list of publications, please see my Google Scholar page.

  Selected conference papers

    K. Kinoshita, M. Delcroix, S. Araki, and T. Nakatani,
    "Tackling real noisy reverberant meetings with all-neural source separation, counting, and diarization system,''
    Proc. of International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020. (to appear)
    [Paper], [Demo]
    K. Kinoshita, T. Ochiai, M. Delcroix, and T. Nakatani,
    "Improving noise robust automatic speech recognition with single-channel time-domain enhancement network,''
    Proc. of International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020. (to appear)
    [Paper]
    T. von Neumann, K. Kinoshita, M. Delcroix, S. Araki, T. Nakatani and R. Haeb-Umbach,
    "All-neural online source separation, counting, and diarization for meeting analysis,''
    Proc. of International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019.
    [Paper], [Demo]
    K. Kinoshita, L. Drude, M. Delcroix and T. Nakatani,
    "Listening to each speaker one by one with recurrent selective hearing networks,''
    Proc. of International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp.5064-5068, 2018.
    [Paper]
    K. Kinoshita, H. Kwon, T. Mori, M. Delcroix and T. Nakatani,
    "Neural Network-Based Spectrum Estimation for Online WPE Dereverberation,''
    Proc. of Interspeech, pp.384-388, 2017.
    [Paper] [Presented slides] 
    K. Kinoshita, M. Delcroix, A. Ogawa, T. Higuchi and T. Nakatani,
    "Deep mixture density network for statistical model-based feature enhancement,''
    Proc. of International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp.251-255, 2017.
    [Paper] [Presented poster] 
    K. Kinoshita, M. Delcroix, A. Ogawa and T. Nakatani,
    "Text-informed speech enhancement with deep neural networks,''
    Proc. of Interspeech, pp.1760-1764, 2015.
    [Paper] [Presented poster] 
    K. Kinoshita and T. Nakatani,
    "Modeling inter-node acoustic dependencies with Restricted Boltzmann Machine for distributed microphone array based BSS,''
    Proc. of International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2015.
    K. Kinoshita and T. Nakatani,
    "Blind source separation using spatially distributed microphones based on microphone-location dependent source activities,''
    Proc. of Interspeech, 2014.
    K. Kinoshita, M. Delcroix, T. Yoshioka, T. Nakatani, E. Habets, R. Haeb-Umbach, V. Leutnant, A. Sehr, W. Kellermann, R. Maas, S. Gannot and B. Raj,
    "The REVERB Challenge: A Common Evaluation Framework for Dereverberation and Recognition of Reverberant Speech,''
    Proc. of Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2013.
    K. Kinoshita and T. Nakatani,
    "Blind source separation using spatially distributed microphones based on microphone-location dependent source activities,''
    Proc. of Interspeech, pp.49-52, 2013.
    K. Kinoshita, M. Delcroix, M. Souden and T. Nakatani,
    "Example-based speech enhancement with joint utilization of spatial, spectral & temporal cues of speech and noise,''
    Proc. of Interspeech, 2012.
    K. Kinoshita, M. Souden, M. Delcroix and T. Nakatani,
    "Single channel dereverberation using example-based speech enhancement with uncertainty decoding technique,''
    Proc. of Interspeech, pp.197-200, 2011.
    [Sound demonstration]
    K. Kinoshita, T. Nakatani and M. Miyoshi,
    "Blind upmix of stereo music signal using multi-step linear prediction based reverberation extraction,''
    Proc. of International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp.49-52, 2009.
    K. Kinoshita, T. Nakatani, M. Miyoshi,
    "Upmixing stereo music signals based on dereverberation mechanism,''
    Audio Engineering Society (AES) Japan conference, 2008
    M. Miyoshi, K. Kinoshita, T. Nakatani, T. Yoshioka, [Invited paper]
    "Principles and applications of dereverberation for noisy and reverberant audio signals,''
    in Proc. of 2008 Asilomar Conference on Signals, Systems, and Computers, pp.793-796, 2008
    K. Kinoshita, M. Delcroix, T. Nakatani and M. Miyoshi,
    "Multi-step linear prediction based speech enhancement in noisy reverberant environment,'' Proc. of Interspeech, pp.854-857, 2007
    K. Kinoshita, M. Delcroix, T. Nakatani and M. Miyoshi,
    "Dereverberation of real recordings using linear prediction-based microphone array,'' Proc. of Audio Engineering Society (AES) 13th Regional Convention, 2007.
    K. Kinoshita, T. Nakatani and M. Miyoshi,
    "Spectral subtraction steered by multi-step forward linear prediction for single channel speech dereverberation,'' Proc. of International Conference on Acoustics, Speech, and Signal Processing (ICASSP), I, pp.817-820, 2006. [Sound demonstration]
    K. Kinoshita, T. Nakatani and M. Miyoshi,
    "Fast estimation of a precise dereverberation filter based on speech harmonicity,'' Proc. of 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Philadelphia, 2005.

    K. Kinoshita, T. Nakatani and M. Miyoshi,
    "Improving automatic speech recognition performance and speech intelligibility with harmonicity based dereverberation,'' Proc. of International Conf. on Spoken Language Processing (ICSLP), Jeju, 2004. 
    T. Nakatani, M. Miyoshi and K. Kinoshita,
    "One microphone blind dereverberation based on quasi-periodicity of speech signals,'' Advances in Neural Information Processing Systems 16 (NIPS16), MIT Press, 2003.
    K. Kinoshita, D. Behne and T. Arai,
    "Duration and F0 as perceptual cues to Japanese vowel quantity,'' Proc. of the International Conf. on Spoken Language Processing (ICSLP), pp. 757-760, Denver, 2002.
    T. Arai, K. Kinoshita, N. Hodoshima, A. Kusumoto and T. Kitamura,
    "Effects of suppressing steady-state portions of speech on intelligibility in reverberant environments,'' Acoustical Science and Technology, Vol. 23, No. 4, pp. 229-232, 2002.
    T. Kitamura, K. Kinoshita, T. Arai, A. Kusumoto and Y. Murahara
    "Designing modulation filters for improving speech intelligibility in reverberant environments,'' Proc. of the International Conf. on Spoken Language Processing (ICSLP), Vol. 3, pp. 586-589, Beijing, 2000.
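    K. Kinoshita, M. Souden, M. Delcroix and T. Nakatani,
    "Example-model-based single-channel dereverberation based on the uncertainty decoding technique,'' Acoustical Society of Japan (ASJ) Autumn Meeting, pp.597-600, 2011. (in Japanese)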
    K. Kinoshita, T. Yoshioka and T. Nakatani, [Invited talk]
    "Blind dereverberation of speech signals: recent research trends,''
    IEICE Technical Report, vol. 110, no. 56, pp. 25-30, May 2010. (in Japanese)
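    K. Kinoshita, T. Nakatani and M. Miyoshi,
    "Subjective evaluation of surround playback of stereo music generated based on the dereverberation principle,'' Acoustical Society of Japan (ASJ) Autumn Meeting, 2009. (in Japanese)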
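    K. Kinoshita and T. Kubota, [Invited talk]
    "Dereverberation based on multi-step linear prediction and the world's first dereverberation software applying it,'' AES Japan Section meeting, March 2009. (in Japanese)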
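    K. Kinoshita, T. Nakatani and M. Miyoshi,
    "Surround conversion of stereo music signals based on the dereverberation principle,'' Acoustical Society of Japan (ASJ) Autumn Meeting, 2008. (in Japanese)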
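    K. Kinoshita, T. Nakatani, H. Sawada, S. Araki and M. Miyoshi,
    "Effect of multi-step linear prediction in reverberant environments with multiple sound sources,'' Acoustical Society of Japan (ASJ) Autumn Meeting, 2007. (in Japanese)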
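    K. Kinoshita, M. Delcroix, T. Nakatani and M. Miyoshi, [Invited paper]
    "Speech-recognition-based evaluation of the noise robustness of a dereverberation method based on multi-step linear prediction,'' IEICE General Conference, pp.71-72, 2007. (in Japanese)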
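    K. Kinoshita, T. Nakatani and M. Miyoshi, [Invited talk]
    "On a speech dereverberation method using multi-step linear prediction,'' Symposium at the Joint Convention of Electrical and Related Societies, 2007. (in Japanese)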
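    K. Kinoshita, M. Delcroix, T. Nakatani and M. Miyoshi,
    "Evaluation of the dereverberation method based on multi-step linear prediction using speech recorded in a real sound field,'' Acoustical Society of Japan (ASJ) Autumn Meeting, pp.421-422, 2006. (in Japanese)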
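    K. Kinoshita, T. Nakatani and M. Miyoshi,
    "A study of a single-channel dereverberation method using multi-step linear prediction,'' Acoustical Society of Japan (ASJ) Spring Meeting, pp.511-512, 2006. (in Japanese)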
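    K. Kinoshita, T. Nakatani and M. Miyoshi,
    "Speech quality evaluation of a harmonic-structure-based dereverberation method in terms of intelligibility and recognition rate,'' Acoustical Society of Japan (ASJ) Spring Meeting, pp.611-612, 2004. (in Japanese)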

  Selected journal papers

    K. Kinoshita, M. Delcroix, S. Gannot, E. A. P. Habets, R. Haeb-Umbach, W. Kellermann, V. Leutnant, R. Maas, T. Nakatani, B. Raj, A. Sehr and T. Yoshioka,
    ``A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research,'' EURASIP Journal on Advances in Signal Processing, DOI 10.1186/s13634-016-0306-6, 2016
    [PDF]
    K. Kinoshita, M. Delcroix, T. Nakatani and M. Miyoshi,
    ``Suppression of late reverberation effect on speech signal using long-term multiple-step linear prediction,'' IEEE Transactions on Audio, Speech, and Language Processing, 17(4), pp.534-545, 2009
    M. Miyoshi, M. Delcroix, and K. Kinoshita [Invited paper],
    ``Calculating Inverse filters for speech dereverberation,'' IEICE Trans. Fundamentals, E91-A(6), pp.1303-1309, 2008
    K. Kinoshita, T. Nakatani and M. Miyoshi,
    ``Fast estimation of a precise dereverberation filter based on the harmonic structure of speech,'' Acoustical Science and Technology (AST), 28(2), pp.105-114, 2007
    K. Kinoshita, T. Nakatani and M. Miyoshi,
    ``Harmonicity based dereverberation for improving automatic speech recognition performance and speech intelligibility,'' IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, E88-A(7), pp.1724-1731, 2005.
    K. Kinoshita, T. Nakatani and T. Yoshioka, [Invited paper]
    ``Blind dereverberation of speech: recent research trends,'' IEICE Fundamentals Review, 4(4), pp.301-310, April 2011. (in Japanese)
    [PDF] 

Book chapters

    T. Yoshioka, T. Nakatani, K. Kinoshita, and M. Miyoshi,
    "Speech dereverberation and denoising based on time varying speech model and autoregressive reverberation model,'' in Speech Processing in Modern Communication: Challenges and Perspectives, Springer, pp.151-182, 2010
    M. Miyoshi, M. Delcroix, K. Kinoshita, T. Yoshioka, T. Nakatani and T. Hikichi,
    "Inverse Filtering for Speech Dereverberation Without the Use of Room Acoustics Information,'' in Speech Dereverberation, Springer, pp.271-310, 2010.
    T. Nakatani, M. Miyoshi, and K. Kinoshita,
    "Single-Microphone Blind Dereverberation,'' in Speech Enhancement, Springer, pp.247-270, 2005.
     

Keisuke Kinoshita at lab.ntt.co.jp