My research
I'm interested in...
Blind Signal Separation of convolutive mixtures, Speech Signal Separation (under reverberation), Speech extraction, Speech diarization, Communication Scene Analysis, Auditory Scene Analysis
at NTT CS labs.
Journal Papers, Letters (1st author)
- S. Araki, R. Mukai, S. Makino, T. Nishikawa(NAIST) and H. Saruwatari(NAIST),
``The Fundamental Limitation of Frequency Domain Blind Source Separation for Convolutive Mixtures of Speech,'' IEEE Trans. Speech Audio Processing, Vol. 11, No. 2, pp. 109-116, 2003. [pdf]
- S. Araki, S. Makino, Y. Hinamoto, R. Mukai, T. Nishikawa(NAIST) and H. Saruwatari(NAIST),
``Equivalence between Frequency Domain Blind Source Separation and Frequency Domain Adaptive Beamforming for Convolutive Mixtures'', EURASIP Journal on Applied Signal Processing, vol. 2003, no. 11, pp. 1157--1166, 2003. [pdf]
- S. Araki, S. Makino, R. Aichner(Univ. Erlangen-Nuremberg), T. Nishikawa(NAIST) and H. Saruwatari(NAIST), ``Subband-based Blind Separation for Convolutive Mixtures of Speech,'' IEICE Trans. Fundamentals, E88-A(12), pp. 3593--3603, 2005. [pdf]
- S. Araki, H. Sawada, R. Mukai and S. Makino, ``Underdetermined Blind Sparse Source Separation for Arbitrarily Arranged Multiple Sensors,'' Signal Processing, doi:10.1016/j.sigpro.2007.02.003, 2007 (available online at http://www.sciencedirect.com and http://dx.doi.org/10.1016/j.sigpro.2007.02.003).
- S. Araki, H. Sawada, R. Mukai and S. Makino, "DOA Estimation for Multiple Sparse Sources with Arbitrarily Arranged Multiple Sensors," Journal of Signal Processing Systems, doi:10.1007/s11265-009-0413-9, 2009 (available online at http://www.springerlink.com/content/8w54h51v31086776/)
- S. Araki, T. Nakatani, and H. Sawada, "Sparse source separation based on simultaneous clustering of source locational and spectral features", Acoustical Science and Technology, Acoustic Letter, vol. 32, no. 4, July, 2011.
Journal Papers, Letters (co-author)
[2001-2010]
- H. Sawada, R. Mukai, S. Araki, S. Makino, "Polar Coordinate based Nonlinear Function for Frequency Domain Blind Source Separation," IEICE Trans. Fundamentals, vol.E86-A, no.3, pp. 590-596, March 2003.
- R. Mukai, S. Araki, H. Sawada, S. Makino,
``Evaluation of Separation and Dereverberation Performance in Frequency Domain Blind Source Separation,'' Acoustical Science and Technology, Vol.25, No.2, Mar. 2004, pp.119-126.
- H. Sawada, R. Mukai, S. Araki, S. Makino, ``Convolutive Blind Source Separation for more than Two Sources in the Frequency Domain,'' Acoustical Science and Technology, the Acoustical Society of Japan, vol.25, no.4, pp. 296-298, July 2004.
- H. Sawada, R. Mukai, S. Araki, S. Makino,
``A Robust and Precise Method for Solving the Permutation Problem of Frequency-Domain Blind Source Separation,''
IEEE Trans. Speech and Audio Processing, vol.12, no.5, pp.530--538, Sept. 2004.
- R. Mukai, H. Sawada, S. Araki, S. Makino,
``Blind Source Separation for Moving Speech Signals using Blockwise ICA and Residual Crosstalk Subtraction,'' IEICE Trans. Fundamentals, Special Section on Digital Signal Processing, vol.E87-A, no.8, pp.1941--1948, Aug, 2004.
- M. Knaak (Technical University Berlin), S. Araki and S. Makino, ``Geometrically Constrained Independent Component Analysis,'' IEEE Trans. Audio, Speech, and Language Processing, vol. 15, no. 2, pp.715--726, 2007.
- A. Blin, S. Araki, and S. Makino,``Underdetermined blind separation of convolutive mixtures of speech using time-frequency mask and mixing matrix estimation,'' IEICE Trans. Fundamentals, Vol.E88-A, No.7, pp.1693-1700, 2005
- H. Sawada, R. Mukai, S. Araki, and S. Makino, ``Estimating the number of sources using independent component analysis,'' Acoustical Science and Technology, vol. 26, no. 5, pp.450--452, 2005.
- S. Makino, H. Sawada, R. Mukai, and S. Araki, ``Blind source separation of convolutive mixtures of speech in frequency domain,'' IEICE Trans. Fundamentals, Vol.E88-A, No.7, pp.1640-1655, 2005 (invited)
- R. Mukai, H. Sawada, S. Araki, S. Makino, ''Frequency Domain Blind Source Separation of Many Speech Signals Using Near-field and Far-field Models,'' EURASIP Journal on Applied Signal Processing, vol. 2006, Article ID 83683, 13 pages, 2006. doi:10.1155/ASP/2006/83683.
- H. Sawada, S. Araki, R. Mukai, S. Makino, ''Blind extraction of dominant target sources using ICA and time-frequency masking,'' IEEE Trans. Audio, Speech, and Language Processing, vol.14, no.6, pp.2165-2173, Nov. 2006.
- H. Sawada, S. Araki, R. Mukai and S. Makino, ''Grouping Separated Frequency Components with Estimating Propagation Model Parameters in Frequency-Domain Blind Source Separation,'' IEEE Trans. Audio, Speech & Language Processing, vol. 15, no. 5, pp. 1592-1604, July 2007.
- H. Kato, Y. Nagahara, S. Araki, H. Sawada and S. Makino, "Frequency-Domain Pearson Distribution Approach for Independent Component Analysis (FD-Pearson-ICA) in Blind Source Separation," IEEE Trans. Audio, Speech and Language Processing, vol. 17, no. 4, pp. 639-649, May 2009.
- K. Ishizuka, S. Araki, and T. Kawahara, "Speech activity detection for multi-party conversation analyses based on likelihood ratio test on spatial magnitude," IEEE Trans. Audio, Speech, and Language Processing, Vol.18, No.2, pp. 1354--1365, 2010.
[2011-]
- H. Sawada, S. Araki and S. Makino, "Underdetermined Convolutive Blind Source Separation via Frequency Bin-wise Clustering and Permutation Alignment," IEEE Trans. Audio, Speech, and Language Processing, vol.19, no.3, pp.516-527, March 2011.
- K. Ishiguro, T. Yamada, S. Araki, T. Nakatani, and H. Sawada, "Probabilistic Speaker Clustering for DOA-based Diarization", IEEE Trans. ASLP, Vol. 20, No. 2, pp. 447-460, 2012.
- T. Hori, S. Araki, T. Yoshioka, M. Fujimoto, S. Watanabe, T. Oba, A. Ogawa, K. Otsuka, D. Mikami, K. Kinoshita, T. Nakatani, A. Nakamura, and J. Yamato, "Low-latency Real-time Meeting Recognition and Understanding Using Distant Microphones and Omni-directional Camera," IEEE Trans. ASLP, Vol. 20, No. 2, pp. 499-513, 2012.
- E. Vincent, S. Araki, F. Theis, G. Nolte, P. Bofill, H. Sawada, A. Ozerov, V. Gowreesunker, D. Lutter, and N. Q. K. Duong,"The Signal Separation Evaluation Campaign (2007-2010): Achievements and Remaining Challenges," Signal Processing 92, pp. 1928--1936, 2012.
- Marc Delcroix, Keisuke Kinoshita, Tomohiro Nakatani, Shoko Araki, Atsunori Ogawa, Takaaki Hori, Shinji Watanabe, Masakiyo Fujimoto, Takuya Yoshioka, Takanobu Oba, Yotaro Kubo, Mehrez Souden, Seong-Jun Hahm, and Atsushi Nakamura, ``Speech recognition in living rooms: Integrated speech enhancement and recognition system based on spatial, spectral and temporal modeling of sounds,'' Computer Speech and Language, vol. 27, pp. 851-873, May 2013.
- H. Sawada, H. Kameoka, S. Araki and N. Ueda, "Multichannel Extensions of Nonnegative Matrix Factorization with Complex-valued Data," IEEE Trans. Audio, Speech and Language Processing,
- M. Souden, S. Araki, K. Kinoshita, T. Nakatani and H. Sawada, "A Multichannel MMSE-Based Framework for Speech Sources Separation and Noise Reduction," IEEE Trans. Audio, Speech and Language Processing, vol. 21, no. 9, pp. 1913-1928, 2013.
- T. Nakatani, S. Araki, T. Yoshioka, M. Delcroix, and M. Fujimoto, "Dominance Based Integration of Spatial and Spectral Features for Speech Enhancement," IEEE Trans. ASLP., vol. 21, No. 12, pp.2516-2531, Dec. 2013.
- N. Ito, E. Vincent, T. Nakatani, N. Ono, S. Araki, and S. Sagayama,
"Blind Suppression of Nonstationary Diffuse Noise Based on Spatial Covariance Matrix Decomposition," Springer Journal of Signal Processing Systems. (invited)
- M. Delcroix, T. Yoshioka, A. Ogawa, Y. Kubo, M. Fujimoto, N. Ito, K. Kinoshita, M. Espi, S. Araki, T. Hori, and T. Nakatani, "Strategies for Distant Speech Recognition in Reverberant Environments," EURASIP Journal on Advances in Signal Processing.
- T. Higuchi, N. Ito, S. Araki, T. Yoshioka, M. Delcroix, and T. Nakatani, "Online MVDR Beamformer Based on Complex Gaussian Mixture Model with Spatial Prior for Noise Robust ASR," IEEE Trans on TASLP, 2017.
- T. Kawase, K. Niwa, M. Fujimoto, K. Kobayashi, S. Araki, and T. Nakatani, "Integration of Spatial Cue-based Noise Reduction and Speech Model-based Source Restoration for Real Time Speech Enhancement," IEICE Trans. Fundamentals, vol. E100-A, no. 5, pp. 1127-1136, 2017.
- N. Ito, S. Araki, and T. Nakatani, "FASTFCA: A JOINT DIAGONALIZATION BASED FAST ALGORITHM FOR AUDIO SOURCE SEPARATION USING A FULL-RANK SPATIAL COVARIANCE MODEL," Arxiv, 2018.
- S. Emura, S. Araki, T. Nakatani, and N. Harada, "Distortionless beamforming optimized with l1 norm minimization," IEEE signal processing letters, 2019.
- K. Yamamoto, T. Irino, T. Matsui, S. Araki, K. Kinoshita, and T. Nakatani, "Speech intelligibility prediction with the dynamic compressive gammachirp filterbank and modulation power spectrum," Acoustical Science and Technology, vol. 40, no. 2, pp. 84--92, 2019.
- K. Yamamoto, T. Irino, S. Araki, K. Kinoshita, and T. Nakatani, "GEDI: Gammachirp Envelope Distortion Index for Predicting Intelligibility of Enhanced Speech," Speech Communication, vol. 123, pp. 43--58, Oct. 2020.
- S. Emura, H. Sawada, S. Araki, and N. Harada, "Multi-delay Sparse Approach to Residual Crosstalk Reduction for Blind Source Separation," IEEE Signal Processing Letters, vol. 27, pp. 1630-1634, Sept. 2020.
- R. Ikeshita, T. Nakatani, and S. Araki, "Block Coordinate Descent Algorithms for Auxiliary-Function-Based Independent Vector Extraction," IEEE Transactions on Signal Processing, vol. 69, pp. 3252--3267, 2021.
Book Chapter
- S. Araki, S. Makino, Subband Based Blind Source Separation, In J. Benesty, S. Makino, and J. Chen, editors, Speech Enhancement, pp. 329--352, Springer, March 2005.
- H. Sawada, R. Mukai, S. Araki and S. Makino, Frequency-domain blind source separation, In J. Benesty, S. Makino, and J. Chen, editors, Speech Enhancement, pp.299--327, Springer, March 2005.
- R. Mukai, H. Sawada, S. Araki and S. Makino, Real-time blind source separation for moving speech signals, In J. Benesty, S. Makino, and J. Chen, editors, Speech Enhancement, pp.353--369, Springer, March 2005.
- S. Makino, H. Sawada, R. Mukai, and S. Araki, ''Blind source separation of convolutive mixtures of audio signals in frequency domain, '' in Topics in Acoustic Echo and Noise Control, E. Haensler and G. Schmidt, Eds., Springer, 2006.
- S. Araki, H. Sawada and S. Makino, ''K-means based Underdetermined Blind Speech Separation,'' in Blind Speech Separation, S. Makino, T.-W. Lee and H. Sawada, Eds., Springer, 2007.
- H. Sawada, S. Araki, and S. Makino, ''Frequency-Domain Blind Source Separation,'' in Blind Speech Separation, S. Makino, T.-W. Lee and H. Sawada, Eds., Springer, 2007.
- S. Makino, S. Araki, and H. Sawada, "Underdetermined Blind Source Separation using Acoustic Arrays", in Handbook on Array Processing and Sensor Networks, S. Haykin and K.J. Ray Liu, Eds, Wiley, 2008.
- S. Makino, S. Araki, S. Winter, H. Sawada, "Underdetermined Blind Source Separation using Acoustic Arrays," Handbook on Array Processing and Sensor Networks, S. Haykin, and K. J. R. Liu Eds., Wiley, 2009.
- N. Ito, S. Araki, and T. Nakatani, "Multi-channel audio source separation by modelling audio directional statistics," in Audio Source Separation, S. Makino Ed., Springer, 2017.
- M. I. Mandel, S. Araki, and T. Nakatani, "Multichannel classification and clustering approaches," in Audio Source Separation and Speech Enhancement, E. Vincent, T. Virtanen, and S. Gannot, Eds., John Wiley & Sons, Oct., 2018.
International Conferences (1st author)
[2001]
- S. Araki, S. Makino, T. Nishikawa, and H. Saruwatari, ``Limitation of Frequency Domain Blind Source Separation for Convolutive Mixture of Speech," International Workshop on Hands-Free Speech Communication, Apr. 2001.
- S. Araki, S. Makino, T. Nishikawa, and H. Saruwatari, ``Fundamental Limitation of Frequency Domain Blind Source Separation for Convolutive Mixture of Speech," IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP2001), pp.2737--2740, May, 2001.
- S. Araki, S. Makino, R. Mukai, and H. Saruwatari, ``Equivalence between Frequency Domain Blind Source Separation and Frequency Domain Adaptive Beamformers," Consistent & Reliable Acoustic Cues for Sound Analysis (CRAC), Sept. 2001.
- S. Araki, S. Makino, R. Mukai, and H. Saruwatari, ``Equivalence between Frequency Domain Blind Source Separation and Frequency Domain Adaptive Null Beamformers," 7th European Conference on Speech Communication and Technology (Eurospeech2001), vol.4, pp 2595-2598, Sept. 2001.
- S. Araki, S. Makino, R. Mukai, T. Nishikawa, and H. Saruwatari, ``Fundamental limitation of frequency domain Blind Source Separation for convolved mixture of speech," 3rd International Conference on INDEPENDENT COMPONENT ANALYSIS and BLIND SIGNAL SEPARATION (ICA2001) pp.132-137, Dec. 2001.
[2002]
- S. Araki, S. Makino, R. Mukai, Y. Hinamoto, T. Nishikawa and H. Saruwatari, ``Equivalence between Frequency Domain Blind Source Separation and Frequency Domain Adaptive Beamforming," ICASSP2002, vol. II, pp. 1785-1788, May 2002.
- S. Araki, S. Makino, R. Aichner, T. Nishikawa(NAIST), and H. Saruwatari(NAIST), ``Blind Source Separation for Convolutive Mixtures of speech using subband processing,'' SMMSP2002(Second International Workshop on Spectral Methods and Multirate Signal Processing), pp.195-202, Sept. 2002.
[2003]
- S. Araki, S. Makino, R. Aichner, T. Nishikawa(NAIST), and H. Saruwatari(NAIST), ``Subband Based Blind Source Separation with Appropriate Processing for Each Frequency Band,'' ICA2003, pp. 499--504, 2003 .
- S. Araki, S. Makino, R. Aichner, T. Nishikawa(NAIST), and H. Saruwatari(NAIST), ``Subband Based Blind Source Separation for Convolutive Mixtures of Speech,'' ICASSP2003, Vol. V, pp. 509--512, 2003.
- S. Araki, S. Makino, A. Blin, R. Mukai and H. Sawada, ``Blind Separation of More Speech than Sensors with Less Distortion by Combining Sparseness and ICA,'' IWAENC2003, pp.271--274, 2003, [pdf], -->sound demos.
- S. Araki, S. Makino, H. Sawada, A. Blin and R. Mukai,
``Underdetermined Blind Separation of Convolutive Mixtures of Speech
with Binary Masks and ICA,''
NIPS 2003 workshop on ICA: Sparse Representations in Signal Processing,
Dec., 2003. (No proceedings were published for this workshop.)
[2004]
- S. Araki, S. Makino, A. Blin, R. Mukai, and H. Sawada, ''Underdetermined blind separation of convolutive mixtures of speech by combining time-frequency masks and ICA, '' in Proc. ICA2004 (International Congress on Acoustics), vol. I, pp.321--324, 2004.
- S. Araki, S. Makino, A. Blin, R. Mukai, and H. Sawada, ``Underdetermined Blind Separation for Speech in Real Environments with Sparseness and ICA,'' ICASSP2004, vol. III, pp. 881-884, May 2004 (invited), [pdf].
- S. Araki, S. Makino, H. Sawada and R. Mukai,
``Underdetermined Blind Speech Separation with Directivity Pattern based Continuous Mask and ICA,'' EUSIPCO2004, pp.1991--1994, Sept. 2004. [pdf],-->sound demos.
- S. Araki, S. Makino, H. Sawada and R. Mukai,
``Underdetermined Blind Separation of Convolutive Mixtures of Speech with Directivity Pattern based Mask and ICA,'' ICA2004, pp.898--905, Sept. 2004. -->sound demos.
[2005]
- S. Araki, S. Makino, H. Sawada and R. Mukai, ``Reducing musical noise by a fine-shift overlap-add method applied to source separation using a time-frequency mask,'' ICASSP2005, vol. III, pp. 81-84, March 2005. [pdf], -->sound demos.
- S. Araki, S. Makino, H. Sawada, and R. Mukai, ``Source extraction from speech mixtures with null-directivity pattern based mask,'' Proc. of Joint Workshop on Hands-Free Speech Communication and Microphone Arrays (HSCMA 2005), pp. d1-d2, March 2005.
- S. Araki, H. Sawada, R. Mukai and S. Makino, ``A novel blind source separation method with observation vector clustering,'' IWAENC2005, pp.117--120, 2005. [pdf], -->sound demos.
[2006]
- S. Araki, H. Sawada, R. Mukai and S. Makino, ``DOA estimation for multiple sparse sources with normalized observation vector clustering,'' ICASSP2006, Vol. 5, pp.33--36, 2006. [pdf]
- S. Araki, H. Sawada, R. Mukai and S. Makino, ``Underdetermined Sparse Source Separation of Convolutive Mixtures with Observation Vector Clustering,'' ISCAS2006, pp. 3594--3597, 2006.
- S. Araki, H. Sawada, R. Mukai and S. Makino, ``Normalized Observation Vector Clustering Approach for Sparse Source Separation,'' EUSIPCO2006, (invited).
- S. Araki, H. Sawada, R. Mukai and S. Makino, "Performance evaluation of sparse source separation and DOA estimation with observation vector clustering in reverberant environments," IWAENC2006, 2006.
- S. Araki, H. Sawada, R. Mukai and S. Makino, "Blind sparse source separation with spatially smoothed time-frequency masking," IWAENC2006, 2006.
[2007]
- S. Araki, H. Sawada, and S. Makino, "Blind speech separation in a meeting situation with maximum SNR beamformers," ICASSP2007, vol. 1, pp. 41--44, Apr. 2007. [pdf]
[2008]
- S. Araki, M. Fujimoto, K. Ishizuka, H. Sawada, and S. Makino, "Speaker indexing and speech enhancement in real meetings / conversations," ICASSP2008, pp.93--96, 2008. [pdf]
- S. Araki, M. Fujimoto, K. Ishizuka, H. Sawada, and S. Makino, "A DOA based speaker diarization system for real meetings," HSCMA2008, pp.29--32, 2008 (invited).[pdf]
[2009]
- S. Araki, T. Nakatani, H. Sawada, and S. Makino, "Blind sparse source separation for unknown number of sources using Gaussian mixture model fitting with Dirichlet prior," ICASSP2009, pp.33-36, 2009. [pdf]
- S. Araki, T. Nakatani, H. Sawada, and S. Makino, "Stereo source separation and source counting with MAP estimation with Dirichlet prior considering spatial aliasing problem," ICA2009, pp. 742--750, 2009. [pdf]
[2010]
- S. Araki, T. Nakatani and H. Sawada, "Simultaneous clustering of mixing and spectral model parameters for blind sparse source separation," ICASSP2010, 2010.
- S. Araki, A. Ozerov, V. Gowreesunker, H. Sawada, F. Theis, G. Nolte, D. Lutter, N. Duong, "The 2010 Signal Separation Evaluation Campaign (SiSEC2010): - Audio source separation - ," in Proc of LVA/ICA2010, 2010.
- S. Araki, F. Theis, G. Nolte, D. Lutter, A. Ozerov, V. Gowreesunker, H. Sawada, N. Duong, "The 2010 Signal Separation Evaluation Campaign (SiSEC2010): - Biomedical source separation - ," in Proc of LVA/ICA2010, 2010.
- S. Araki, T. Hori, M. Fujimoto, S. Watanabe, T. Yoshioka, T. Nakatani, "Online meeting recognizer with multichannel speaker diarization", Asilomar 2010. (invited)
[2011]
- S. Araki and T. Nakatani, "Hybrid Approach for Multichannel Source Separation Combining Time-frequency Mask with Multi-channel Wiener Filter," ICASSP2011, 2011.
- S. Araki, T. Hori, T. Yoshioka, M. Fujimoto, S. Watanabe, T. Oba, A. Ogawa, K. Otsuka, D. Mikami, M. Delcroix, K. Kinoshita, T. Nakatani, A. Nakamura, and J. Yamato, "Demonstration on low-latency meeting recognition and understanding using distant microphones," HSCMA2011, 2011.
[2012]
- S. Araki and T. Nakatani,"Sparse vector factorization for underdetermined BSS using wrapped-phase GMM and source log-spectral prior," ICASSP2012, 2012.
- S. Araki, F. Nesta, E. Vincent, Z. Koldovsky, G. Nolte, A. Ziehe, and A. Benichoux, "SiSEC2011 Overview: Audio source separation," in Proc. LVA/ICA2012, pp. 414--422, Mar. 2012.
[2015]
- S. Araki, T. Hayashi, M. Delcroix, M. Fujimoto, K. Takeda and T. Nakatani, "Exploring multi-channel features for denoising-autoencoder-based speech enhancement," ICASSP2015, 2015.
[2016]
- S. Araki, M. Okada, T. Higuchi, A. Ogawa and T. Nakatani, "SPATIAL CORRELATION MODEL BASED OBSERVATION VECTOR CLUSTERING AND MVDR BEAMFORMING FOR MEETING RECOGNITION," ICASSP2016, 2016.
[2017]
- S. Araki, N. Ito, M. Delcroix, A. Ogawa, K. Kinoshita, T. Higuchi, T. Yoshioka, D. Tran, S. Karita, and T. Nakatani, "Online Meeting Recognition in Noisy Environments with Time-Frequency Mask Based MVDR Beamforming," Proc. HSCMA, Mar. 2017.
- S. Araki, N. Ono, K. Kinoshita and M. Delcroix, "MEETING RECOGNITION WITH ASYNCHRONOUS DISTRIBUTED MICROPHONE ARRAY," ASRU2017, 2017.
[2018]
- S. Araki, N. Ono, K. Kinoshita, and M. Delcroix, "MEETING RECOGNITION WITH ASYNCHRONOUS DISTRIBUTED MICROPHONE ARRAY USING BLOCK-WISE REFINEMENT OF MASK-BASED MVDR BEAMFORMER," ICASSP2018, 2018.
- S. Araki, N. Ono, K. Kinoshita, and M. Delcroix,"Comparison of reference microphone selection algorithms for distributed microphone array based speech enhancement in meeting recognition scenarios," IWAENC2018, 2018.
[2019]
- S. Araki, N. Ono, K. Kinoshita, and M. Delcroix, "Estimation of sampling frequency mismatch between distributed asynchronous microphones under existence of source movements with stationary time periods detection," ICASSP2019, 2019.
- S. Araki, N. Ono, K. Kinoshita, and M. Delcroix, "PROJECTION BACK ONTO FILTERED OBSERVATIONS FOR SPEECH SEPARATION WITH DISTRIBUTED MICROPHONE ARRAY," CAMSAP2019, 2019.
International Conferences (co-author)
[2001]
- R. Mukai, S. Araki and S. Makino, ``Separation and Dereverberation Performance of Frequency Domain Blind Source Separation for Speech in a Reverberant Environment'', Eurospeech 2001, pp. 2599--2603, Sept. 2001.
- R. Mukai, S. Araki and S. Makino, ``Separation and Dereverberation Performance of Frequency Domain Blind Source Separation in a Reverberant Environment'', IWAENC 2001, pp. 127--130, Sept. 2001.
- R. Mukai, S. Araki and S. Makino, ``Separation and Dereverberation Performance of Frequency Domain Blind Source Separation,'' ICA2001, pp. 230-235, Dec. 2001.
- H. Sawada, R. Mukai, S. Araki, S. Makino, ``A Polar-Coordinate based Activation Function for Frequency Domain Blind Source Separation,'' ICA2001, pp. 663-668, Dec. 2001.
[2002]
- Y. Hinamoto(NAIST), T. Nishikawa(NAIST), H. Saruwatari(NAIST), S. Araki , S. Makino, and R. Mukai, ``Equivalence between Frequency Domain Blind Source Separation and Adaptive Beamforming,'' Proc. ICFS2002 (The International Conference on Fundamentals of Electronics, Communications and Computer Sciences), R-1, pp. 13-18, Mar. 2002.
- R. Aichner, S. Araki, S. Makino, T. Nishikawa(NAIST), and H. Saruwatari(NAIST), ``Time domain Blind Source Separation of non-stationary convolved signals by utilizing geometric beamforming,'' NNSP2002, pp. 445-454, 2002.
- H. Sawada, S. Araki, R. Mukai, S. Makino, ``Blind Source Separation with Different Sensor Spacing and Filter Length for Each Frequency Range,'' NNSP2002, pp. 465-474, 2002.
- R. Mukai, S. Araki, H. Sawada, S. Makino, ``Removal of Residual Cross-talk Components in Blind Source Separation using LMS Filters,'' NNSP2002, pp. 435-444, 2002.
- R. Mukai, S. Araki, H. Sawada, S. Makino, ``Removal of Residual Cross-talk Components in Blind Source Separation using Time-delayed Spectral Subtraction,''ICASSP2002, vol. II, pp.1789-1792, May 2002.
- H. Sawada, R. Mukai, S. Araki, S. Makino,``Polar Coordinate based Nonlinear Function for Frequency-Domain Blind Source Separation,''ICASSP2002, vol. I, pp. 1001-1004, May 2002.
- S. Makino, S. Araki, R. Mukai, H. Sawada, H. Saruwatari (NAIST), ``ICA-Based Source Separation of Sounds,'' Proc. of 2002 China-Japan Joint Conference on Acoustics, Vol.21, pp. 83--86, 2002.
[2003]
- M. Knaak, S. Araki, S. Makino, ``Geometrically Constraint ICA for a Robust Separation of Sound Mixtures,'' ICA2003, pp. 951--956, 2003.
- R. Aichner, H. Buchner, S. Araki, S. Makino, ``On-line Time-domain Blind Source Separation of Nonstationary Convolved Signals,'' ICA2003, pp. 987--992, 2003.
- T. Nishikawa, H. Saruwatari, K. Shikano, S. Araki, S. Makino, ``Multistage ICA for Blind Source Separation of Real Acoustic Convolutive Mixture,'' ICA2003, pp. 523--528, 2003.
- R. Mukai, H. Sawada, S. Araki, S. Makino, ``Real-Time Blind Source Separation for Moving Speakers using Blockwise ICA and Residual Crosstalk Subtraction,'' ICA2003, pp. 975-980, Apr. 2003.
- H. Sawada, R. Mukai, S. Araki, S. Makino, "A Robust and Precise Method for Solving the Permutation Problem of Frequency-Domain Blind Source Separation, " ICA 2003, pp. 505-510, Apr. 2003.
- M. Knaak, S. Araki, S. Makino, ``Geometrically Constraint ICA for a Convolutive Mixtures of Sound,'' ICASSP2003, Vol. II, pp. 725--728, 2003.
- R. Mukai, H. Sawada, S. Araki, S. Makino, ``Robust Real-Time Blind Source Separation for Moving Speakers in a Room,'' ICASSP2003, pp. 469-472, Apr. 2003.
- H. Sawada, R. Mukai, S. Araki, S. Makino, "A Robust Approach to the Permutation Problem of Frequency-Domain Blind Source Separation," ICASSP 2003, pp. 381-384, Apr. 2003.
- A. Blin, S. Araki and S. Makino, ``Blind Source Separation when Speech Signals Outnumber Sensors using a Sparseness-Mixing Matrix Combination,'' IWAENC2003, pp. 211-214, 2003.
- R. Mukai, H. Sawada, S. de la Kethulle, S. Araki and S. Makino,``Array Geometry Arrangement for Frequency Domain Blind Source Separation,'' IWAENC2003, pp.219-222, 2003.
- H. Sawada, R. Mukai, S. de la Kethulle, S. Araki and S. Makino,``Spectral Smoothing for Frequency-Domain Blind Source Separation,'' IWAENC2003, pp.311-314, 2003.
[2004]
- A. Blin, S. Araki, and S. Makino, ''Underdetermined blind source separation for convolutive mixtures exploiting a sparseness-mixing matrix estimation (SMME), '' in Proc. ICA2004 (International Congress on Acoustics), vol. IV, pp. 3139--3142, 2004.
- R. Mukai, H. Sawada, S. Araki, S. Makino,``A Solution for the Permutation Problem in Frequency Domain BSS using Near- and Far-field Models,''ICA2004 (International Congress on Acoustics), vol. IV, pp. 3135--3138, 2004.
- H. Sawada, R. Mukai, S. Araki, S. Makino,``Solving the Permutation and the Circularity Problem of Frequency-Domain Blind Source Separation,''ICA2004 (International Congress on Acoustics), vol. I, pp. 89--92, 2004 (invited).
- A. Blin, S. Araki and S. Makino,``A Sparseness-Mixing Matrix Estimation (SMME) Solving the Underdetermined BSS for Convolutive Mixtures,'' ICASSP2004, vol. IV, pp. 85-88, May 2004.
- R. Mukai, H. Sawada, S. Araki, S. Makino,``Near-Field Frequency Domain Blind Source Separation for Convolutive Mixtures,'' ICASSP2004, vol. IV, pp. 49-52, May 2004.
- H. Sawada, R. Mukai, S. Araki, S. Makino,``Convolutive Blind Source Separation for more than Two Sources in the Frequency Domain,'' ICASSP2004, vol. III, pp. 885-888, May 2004 (invited).
- S. Makino, S. Araki, R. Mukai, and H. Sawada, ``Audio source separation based on independent component analysis, ''in Proc. ISCAS2004 (International Symposium on Circuits and Systems), vol. V, pp. 668-671, May 2004 (invited).
- R. Mukai, H. Sawada, S. Araki and S. Makino,``Frequency Domain Blind Source Separation using Small and Large Spacing Sensor Pairs,'' ISCAS2004, vol. V, pp. 1-4, May 2004.
- H. Sawada, S. Winter, S. Araki, R. Mukai, S. Makino,``Estimating the Number of Sources for Frequency-Domain Blind Source Separation,'' ICA2004 (5th International Conference on Independent Component Analysis and Blind Signal Separation), pp.610--617, Sept. 2004.
- S. Winter, H. Sawada, S. Araki, S. Makino,``Overcomplete BSS for convolutive mixtures based on hierarchical clustering,'' ICA2004, pp.652--660, Sept. 2004.
- R. Mukai, H. Sawada, S. Araki, S. Makino,``Frequency Domain Blind Source Separation for Many Speech Signals,'' ICA2004, pp.461--469, Sept. 2004.
- S. Winter, H. Sawada, S. Araki, S. Makino,``Hierarchical Clustering Applied to Overcomplete BSS for Convolutive Mixtures,'' SAPA2004 (ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing), Session I-3, Oct. 2004.
[2005]
- H. Sawada, S. Araki, R. Mukai, S. Makino,``Blind Extraction of a Dominant Source Signal from Mixtures of Many Sources,'' ICASSP2005, vol. III, pp. 61-64, March 2005.
- H. Sawada, R. Mukai, S. Araki, and S. Makino, ``Frequency-domain blind source separation without array geometry information,'' Proc. of Joint Workshop on Hands-Free Speech Communication and Microphone Arrays (HSCMA 2005), pp.d13-d14, March 2005.
- R. Mukai, H. Sawada, S. Araki, and S. Makino, ``Blind source separation and DOA estimation using small 3-D microphone array,'' Proc. of Joint Workshop on Hands-Free Speech Communication and Microphone Arrays (HSCMA 2005), pp. d9-d10, March 2005.
- H. Sawada, S. Araki, R. Mukai, and S. Makino, ``Blind extraction of a dominant source from mixtures of many sources using ICA and time-frequency masking,'' Proc. of 2005 IEEE International Symposium on Circuits and Systems (ISCAS 2005), pp. 5882-5885, May 2005.
- H. Sawada, R. Mukai, S. Araki, and S. Makino, ``Multiple source localization using independent component analysis,'' Proc. of 2005 IEEE AP-S International Symposium and USNC/URSI National Radio Science Meeting, July 2005.
- H. Kato, Y. Nagahara (Meiji Univ.), S. Araki, and H. Sawada, ``Pearson distribution system applied to blind speech separation,'' 25th European Meeting of Statisticians (EMS2005), p.394, July 2005.
- F. Flego, S. Araki, H. Sawada, T. Nakatani, and S. Makino, ``Underdetermined blind separation for speech in real environments with F0 adaptive comb filtering,'' IWAENC2005, pp. 93--96, 2005.
- H. Sawada, R. Mukai, S. Araki, and S. Makino,``Real-time blind extraction of dominant target sources from many background interferences,'' IWAENC2005, pp. 73--76, 2005.
- R. Mukai, H. Sawada, S. Araki, and S. Makino, ``Real-Time Blind Source Separation and DOA Estimation Using Small 3-D Microphone Array,'' IWAENC2005, pp. 45--48, 2005.
- R. Mukai, H. Sawada, S. Araki, and S. Makino, ``Blind Source Separation of 3-D Located Many Speech Signals,'' in Proc. WASPAA2005, pp. 9-12, Oct., 2005.
[2006]
- H. Sawada, S. Araki, R. Mukai and S. Makino,''On Calculating the Inverse of Separation Matrix in Frequency-Domain BSS,'' ICA2006, pp. 691--699, 2006.
- H. Sawada, S. Araki, R. Mukai and S. Makino,''Solving the permutation problem of frequency-domain BSS when spatial aliasing occurs with wide sensor spacing,'' ICASSP2006, vol. V, pp. 77-80, Mar. 2006.
- R. Mukai, H. Sawada, S. Araki, S. Makino, "Blind Source Separation of Many Signals in the Frequency Domain," ICASSP2006, vol.5, pp.969--972, 2006.
- H. Kato, Y. Nagahara, S. Araki, H. Sawada and S. Makino, "Parametric Pearson Approach based Independent Component Analysis for Frequency Domain Blind Speech Separation," EUSIPCO2006, 2006.
- J. Cermak, S. Araki, H. Sawada and S. Makino, "Blind Speech Separation by Combining Beamformers and a Time Frequency Binary Mask," IWAENC2006, 2006.
- J. Cermak, S. Araki, H. Sawada and S. Makino, "Musical Noise Reduction in Time-frequency-binary-masking-based Blind Source Separation Systems," 16th Czech-German Workshop, 2006.
- R. Mukai, H. Sawada, S. Araki and S. Makino,
"Frequency Domain Blind Source Separation in a Noisy Environment,"
Joint meeting of ASA and ASJ 2006, Nov. 2006, (invited).
- H. Sawada, S. Araki, R. Mukai and S. Makino, ''Blind separation and localization of speeches in a meeting situation'', Asilomar 2006, pp. 1407-1411, Oct. 2006.
[2007]
- J. Cermak, S. Araki, H. Sawada and S. Makino, "Blind Source Separation Based on Beamformer Array and Time Frequency Binary Masking," in Proc. ICASSP2007, vol. I, pp. 145--148, Apr. 2007.
- J. E. Rubio, K. Ishizuka, H. Sawada, S. Araki, T. Nakatani and M. Fujimoto, "Two-Microphone Voice Activity Detection Based on the Homogeneity of the Direction of Arrival Estimates," in Proc. ICASSP2007, vol.4, pp. 385-388, Apr. 2007.
- H. Sawada, S. Araki and S. Makino, "Measuring Dependence of Bin-wise Separated Signals for Permutation Alignment in Frequency-domain BSS," in Proc. ISCAS2007, pp. 3247 - 3250, May 2007.
- H. Sawada, S. Araki, and S. Makino, "MLSP 2007 data analysis competition: Frequency-domain blind source separation for convolutive mixtures of speech/and audio," MLSP2007, 2007.
- H. Sawada, S. Araki, and S. Makino, "A two-stage frequency-domain blind source separation method for underdetermined convolutive mixtures", WASPAA2007.
[2008]
- D. Kolossa (TU Berlin), S. Araki, M. Delcroix, T. Nakatani, R. Orglmeister (TU Berlin), S. Makino, "Missing Feature Speech Recognition in a Meeting Situation with Maximum SNR Beamforming," ISCAS2008.
- T. Hager, S. Araki, K. Ishizuka, M. Fujimoto, T. Nakatani, S. Makino, "Handling speaker position changes in a meeting diarization system by combining DOA clustering and speaker identification," IWAENC2008, 2008.
- K. Ishizuka, S. Araki, T. Kawahara, "Statistical Speech Activity Detection based on Spatial Power Distribution for Analyses of Poster Presentations," Interspeech2008, pp.99-102, 2008.
- T. Kawahara, H. Setoguchi, K. Takanashi, K. Ishizuka, S. Araki, "Multi-Modal Recording, Analysis and Indexing of Poster Sessions," Interspeech2008, pp. 1622-1625, 2008.
- K. Otsuka, S. Araki, K. Ishizuka, M. Fujimoto, M. Heinrich, J. Yamato, "A Realtime Multimodal System for Analyzing Group Meetings by Combining Face Pose Tracking and Speaker Diarization," ICMI2008, pp. 257--264, 2008.
[2009]
- E. Vincent, S. Araki, and P. Bofill, "The 2008 Signal Separation Evaluation Campaign: A Community-Based Approach to Large-Scale Evaluation," ICA2009, pp. 734--741, 2009.
- K. Ishiguro, T. Yamada, S. Araki and T. Nakatani, "A Probabilistic Speaker Clustering for DOA-based Diarization,"
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2009), pp. 241-244, 2009. [pdf]
- K. Ishizuka, S. Araki, K. Otsuka, T. Nakatani, and M. Fujimoto, "A speaker diarization method based on the probabilistic fusion of audio-visual location information," Proceedings of the 11th International Conference on Multimodal Interfaces and Workshop on Machine Learning for Multi-modal Interaction (ICMI-MLMI2009), pp.55-62, 2009.
- K. Otsuka, S. Araki, D. Mikami, K. Ishizuka, M. Fujimoto, and J. Yamato: "Realtime Meeting Analysis and 3D Meeting Viewer Based on Omnidirectional Multimodal Sensors", Proc. ICMI-MLMI2009, 2009.
- T. Tashiro, S. Araki, Y. Nakanishi, H. Kimura, K. Kumozaki and M. Miyoshi, "Optical Access System with Emergency Voice Communication Using Blind Speech Separation for Demultiplexing Randomly Mixed Signals," GLOBECOM, 2009.
[2010]
- T. Nakatani and S. Araki, "SINGLE CHANNEL SOURCE SEPARATION BASED ON SPARSE SOURCE OBSERVATION MODEL WITH HARMONIC CONSTRAINT," ICASSP2010, 2010.
- Y. Ansai, S. Araki, S. Makino, T. Nakatani, T. Yamada, A. Nakamura and N. Kitawaki, "Cepstral Smoothing of Separated Signals for Underdetermined Speech Separation," ISCAS2010, 2010.
- T. Nakatani, S. Araki, T. Yoshioka, M. Fujimoto, "Multichannel Source Separation Based on Source Location Cue with Log-Spectral Shaping by Hidden Markov Source Model," Interspeech2010, 2010.
- T. Hori, S. Araki, T. Yoshioka, M. Fujimoto, S. Watanabe, T. Oba, A. Ogawa, K. Otsuka, D. Mikami, K. Kinoshita, T. Nakatani, A. Nakamura, J. Yamato, "Real-time Meeting Recognition and Understanding Using Distant Microphones and Omni-directional Camera," in Proc of SLT2010, 2010.
[2011]
- T. Nakatani, S. Araki, T. Yoshioka, and M. Fujimoto, "Joint unsupervised learning of hidden Markov source models and source location models for multichannel source separation," ICASSP2011, 2011.
- H. Sawada, H. Kameoka, S. Araki, and N. Ueda, "FORMULATIONS AND ALGORITHMS FOR MULTICHANNEL COMPLEX NMF," ICASSP2011, 2011.
- K. Iso, S. Araki, S. Makino, T. Nakatani, H. Sawada, T. Yamada, and A. Nakamura, "BLIND SOURCE SEPARATION OF MIXED SPEECH IN A HIGH REVERBERATION ENVIRONMENT," HSCMA2011, 2011.
[2012]
- T. Maruyama, Shoko Araki, T. Nakatani, S. Miyabe, T. Yamada, S. Makino and A. Nakamura, "NEW ANALYTICAL UPDATE RULE FOR TDOA INFERENCE FOR UNDERDETERMINED BSS IN NOISY ENVIRONMENTS," ICASSP2012.
- T. Nakatani, T. Yoshioka, S. Araki, M. Delcroix and M. Fujimoto, "LOGMAX OBSERVATION MODEL WITH MFCC-BASED SPECTRAL PRIOR FOR REDUCTION OF HIGHLY NONSTATIONARY AMBIENT NOISE," ICASSP2012, 2012.
- M. Souden, S. Araki, K. Kinoshita, T. Nakatani and H. Sawada, "A MULTICHANNEL MMSE-BASED FRAMEWORK FOR JOINT BLIND SOURCE SEPARATION AND NOISE REDUCTION," ICASSP2012.
- H. Sawada, H. Kameoka, S. Araki and Naonori Ueda, "EFFICIENT ALGORITHMS FOR MULTICHANNEL EXTENSIONS OF ITAKURA-SAITO NONNEGATIVE MATRIX FACTORIZATION," ICASSP2012.
- G. Nolte, D. Lutter, A. Ziehe, F. Nesta, E. Vincent, Z. Koldovsky, A. Benichoux and S. Araki, "SiSEC2011 Overview: Biomedical Data Analysis," in Proc. LVA/ICA2012, pp. 423--429, Mar. 2012.
- Takaaki Hori, Keisuke Kinoshita, Shoko Araki, Atsunori Ogawa, Takuya Yoshioka, Masakiyo Fujimoto, Takanobu Oba, Marc Delcroix, Mehrez Souden, Yotaro
Kubo, Seong-Jun Hahm, Dan Mikami, Kazuhiro Otsuka, Tomohiro Nakatani, Atsushi Nakamura, Junji Yamato, "Real-time audio-visual meeting recognition and understanding using distant microphone array," ICASSP2012, Show & Tell, Mar. 2012.
- T. Maruyama, S. Araki, T. Nakatani, S. Miyabe, T. Yamada, S. Makino and A. Nakamura, "New analytical calculation and estimation for TDOA inference for underdetermined BSS in noisy environments," Proc. on Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA2012), 2012.
[2013]
- Nobutaka Ito, Shoko Araki, and Tomohiro Nakatani, "Permutation-free
Convolutive Blind Source Separation via Full-band Clustering Based on
Frequency-independent Source Presence Priors," Proc. International
Conference on Acoustics, Speech, and Signal Processing (ICASSP),
Vancouver, Canada, May 2013.
- Tomohiro Nakatani, Mehrez Souden, Shoko Araki, Takuya Yoshioka,
Takaaki Hori, Atsunori Ogawa, "Coupling beamforming with spatial and
spectral feature based spectral enhancement and its application to
meeting recognition," Proc. International Conference on Acoustics,
Speech, and Signal Processing (ICASSP), Vancouver, Canada, May 2013.
- J. Ingrid, N. Ito, M. Souden, S. Araki and T. Nakatani, "Source number estimation based on clustering of speech activity sequences for microphone array processing," MLSP2013, 2013.
[2014]
- N. Ito, S. Araki, and T. Nakatani, "Probabilistic Integration of Diffuse Noise Suppression and Dereverberation," Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (IEEE ICASSP), May 2014.
- N. Ito, S. Araki, T. Nakatani and T. Yoshioka, "Relaxed disjointness based clustering for joint blind source separation and dereverberation," Proc. IWAENC2014, 2014.
- M. Delcroix, T. Yoshioka, A. Ogawa, Y. Kubo, M. Fujimoto, N. Ito, K. Kinoshita, M. Espi, S. Araki, T. Hori, and T. Nakatani, "Defeating reverberation: Advanced dereverberation and recognition techniques for hands-free speech recognition," Proc. GlobalSIP, 2014.
[2015]
- N. Ito, S. Araki, and T. Nakatani, "Permutation-free Clustering of Relative Transfer Function Features for Blind Source Separation," Proc. EUSIPCO 2015, 2015.
- T. Yoshioka, N. Ito, M. Delcroix, A. Ogawa, K. Kinoshita, M. Fujimoto, C. Yu, W. J. Fabian, M. Espi, T. Higuchi, S. Araki, and T. Nakatani, "NTT CHIME-3 SYSTEM: ADVANCES IN SPEECH ENHANCEMENT AND RECOGNITION FOR MOBILE MULTI-MICROPHONE DEVICES," ASRU2015, Dec., 2015. (Best paper award honorable mention)
[2016]
- H. Meutzner, S. Araki, M. Fujimoto and T. Nakatani, "A Generative-Discriminative Hybrid Approach to Multi-Channel Noise Reduction for Robust Automatic Speech Recognition," ICASSP2016, 2016.
- N. Ito, S. Araki and T. Nakatani, "MODELING AUDIO DIRECTIONAL STATISTICS USING A COMPLEX BINGHAM MIXTURE MODEL FOR BLIND SOURCE EXTRACTION FROM DIFFUSE NOISE," ICASSP2016, 2016.
- T. Kawase, K. Niwa, M. Fujimoto, N. Kamado, K. Kobayashi, S. Araki, and T. Nakatani, "Real-time integration of statistical model-based speech enhancement with unsupervised noise psd estimation using microphone array," ICASSP2016, 2016.
- N. Ito, S. Araki, T. Nakatani, "Complex Angular Central Gaussian Mixture Model for Directional Statistics in Mask-based Microphone Array Signal Processing," EUSIPCO2016, 2016.
- N. Murata, H. Kameoka, K. Kinoshita, S. Araki, T. Nakatani, S. Koyama and H. Saruwatari, "Reverberation-robust underdetermined source separation with non-negative tensor double deconvolution," EUSIPCO2016, 2016.
- K. Yamamoto, T. Irino, T. Matsui, S. Araki, K. Kinoshita, and T. Nakatani, "Speech intelligibility prediction based on the envelope power spectrum model with the dynamic compressive gammachirp auditory filterbank," Interspeech2016, 2016.
- M. Fakhry, N. Ito, S. Araki and T. Nakatani, "Modeling audio directional statistics using a probabilistic spatial dictionary for speaker diarization in real meetings," IWAENC2016, 2016.
[2017]
- N. Ito, S. Araki, M. Delcroix, and T. Nakatani, "PROBABILISTIC SPATIAL DICTIONARY BASED ONLINE ADAPTIVE BEAMFORMING FOR MEETING RECOGNITION IN NOISY AND REVERBERANT ENVIRONMENTS," ICASSP2017, 2017.
- T. Nakatani, N. Ito, T. Higuchi, S. Araki and K. Kinoshita, "INTEGRATING DNN-BASED AND SPATIAL CLUSTERING-BASED MASK ESTIMATION FOR ROBUST MVDR BEAMFORMING," ICASSP2017, 2017.
- K. Yamamoto, T. Irino, T. Matsui, S. Araki, K. Kinoshita and T. Nakatani, "Analysis of acoustic features for speech intelligibility prediction models," 5th ASA/ASJ Joint meeting, 2016.
- K. Yamamoto, T. Irino, T. Matsui, S. Araki, K. Kinoshita and T. Nakatani, "Predicting Speech Intelligibility Using Gammachirp Envelope Distortion Analysis Method Based on the Signal-to-Distortion Ratio", Interspeech2017, 2017.
- N. Ito, S. Araki, and T. Nakatani, "Data-Driven and Physical Model-Based Designs of Probabilistic Spatial Dictionary for Online Meeting Diarization and Adaptive Beamforming," Proc. EUSIPCO, 2017.
- R. Higashinaka, Sakai, A. Sugiyama, Narimatsu, Arimoto, Fukutomi, Matsui, Imoto, Ijima, Itoh, S. Araki, Yoshikawa, Ishiguro and Matsuo, "Demonstration of an argumentative dialogue system based on argumentation structures," SEMDIAL2017, 2017.
[2018]
- N. Ito, T. Makino, S. Araki, T. Nakatani, "Maximum-Likelihood Online Speaker Diarization in Noisy Meetings Based on Categorical Mixture Model and Probabilistic Spatial Dictionary," Proc. ICASSP, May 2018.
- J. Azcarreta, N. Ito, S. Araki, and T. Nakatani, "Permutation-Free cGMM: Complex Gaussian Mixture Model with Inverse Wishart Mixture Model Based Spatial Prior for Permutation-Free Source Separation and Source Counting," Proc. ICASSP, May 2018.
- N. Ito, S. Araki, and T. Nakatani, "FastFCA: Joint Diagonalization Based Acceleration of Audio Source Separation Using a Full-Rank Spatial Covariance Model," Proc. EUSIPCO, Sep. 2018.
- N. Ito, C. Schymura, S. Araki, and T. Nakatani, "Noisy cGMM: Complex Gaussian Mixture Model with Non-Sparse Noise Model for Joint Source Separation and Denoising," Proc. EUSIPCO, Sep. 2018.
- K. Yamamoto, T. Irino, N. Ohashi, S. Araki, K. Kinoshita and T. Nakatani, "Multi-resolution Gammachirp Envelope Distortion Index for Intelligibility Prediction of Noisy Speech," Interspeech2018, 2018.
- Y. Matsui, T. Nakatani, M. Delcroix, K. Kinoshita, N. Ito, S. Araki and S. Makino, "Online integration of DNN-based and spatial clustering-based mask estimation for robust MVDR beamforming," IWAENC2018, 2018.
- K. Yamamoto, T. Irino, S. Araki, K. Kinoshita and T. Nakatani, "Speech intelligibility prediction using a multi-resolution gammachirp envelope distortion index with common parameters for different noise conditions," UAC2018, 2018.
[2019]
- T. von Neumann, K. Kinoshita, M. Delcroix, S. Araki, T. Nakatani and R. Haeb-Umbach, "All-neural online source separation, counting, and diarization for meeting analysis," ICASSP2019, 2019.
- M. Delcroix, K. Zmolikova, T. Ochiai, K. Kinoshita, S. Araki, T. Nakatani, "Compact network for SpeakerBeam target speaker extraction," ICASSP2019, 2019.
- Y. Kubo, T. Nakatani, M. Delcroix, K. Kinoshita, S. Araki, "Mask-based MVDR beamformer for noisy multisource environments: introduction of time-varying spatial covariance model," ICASSP2019, 2019.
- Kenichi Arai, S. Araki, Atsunori Ogawa, Keisuke Kinoshita, Tomohiro Nakatani, Katsuhiko Yamamoto, and Toshio Irino, "Predicting speech intelligibility of enhanced speech using phone accuracy of DNN-based ASR system," Interspeech 2019, Sep. 2019.
- Tomohiro Nakatani, Keisuke Kinoshita, Rintaro Ikeshita, Hiroshi Sawada, and S. Araki, "Simultaneous denoising, dereverberation, and source separation using a unified convolutional beamformer," Proc. WASPAA 2019, 2019.
[2020]
- C. Schymura, T. Ochiai, M. Delcroix, K. Kinoshita, T. Nakatani, S. Araki and D. Kolossa, "A DYNAMIC STREAM WEIGHT BACKPROP KALMAN FILTER FOR AUDIOVISUAL SPEAKER TRACKING," ICASSP2020, 2020.
- M. Delcroix, T. Ochiai, K. Zmolikova, K. Kinoshita, N. Tawada, T. Nakatani and S. Araki, "Improving speaker discrimination of target speech extraction with time-domain SpeakerBeam," ICASSP2020, 2020.
- R. Ikeshita, T. Nakatani, S. Araki, "Overdetermined independent vector analysis," ICASSP2020, 2020.
- T. Nakatani, R. Takahashi, T. Ochiai, K. Kinoshita, M. Delcroix, R. Ikeshita and S. Araki, "DNN-SUPPORTED MASK-BASED CONVOLUTIONAL BEAMFORMING FOR SIMULTANEOUS DENOISING, DEREVERBERATION, AND SOURCE SEPARATION," ICASSP2020, 2020.
- K. Kinoshita, M. Delcroix, S. Araki and T. Nakatani, "Tackling real noisy reverberant meetings with all-neural source separation, counting, and diarization system," ICASSP2020, 2020.
- T. Ochiai, M. Delcroix, R. Ikeshita, K. Kinoshita, T. Nakatani and S. Araki, "BEAM-TASNET: TIME-DOMAIN AUDIO SEPARATION NETWORK MEETS FREQUENCY-DOMAIN BEAMFORMER," ICASSP2020, 2020.
- S. Emura, S. Araki, H. Sawada and N. Harada, "A FREQUENCY-DOMAIN BSS METHOD BASED ON L1 NORM, UNITARY CONSTRAINT, AND CAYLEY TRANSFORM," ICASSP2020, 2020.
- C. Schymura, T. Ochiai, M. Delcroix, K. Kinoshita, T. Nakatani, S. Araki and D. Kolossa, "Exploiting Attention-based Sequence-to-Sequence Architectures for Sound Event Localization," EUSIPCO2020, 2020.
- K. Arai, S. Araki, A. Ogawa, K. Kinoshita, T. Nakatani, and T. Irino, "Predicting intelligibility of enhanced speech using posteriors derived from DNN-based ASR system," Interspeech2020, 2020.
- T. Nakatani, R. Ikeshita, K. Kinoshita, H. Sawada and S. Araki, "Computationally efficient and versatile framework for joint optimization of blind speech separation and dereverberation," Interspeech2020, 2020.
- T. Ochiai, M. Delcroix, Y. Koizumi, H. Ito, K. Kinoshita, S. Araki, "Listen to What You Want: Neural Network-based Universal Sound Selector," Interspeech2020, 2020.
- A. Aroudi, M. Delcroix, T. Nakatani, K. Kinoshita, S. Araki, and S. Doclo, "COGNITIVE-DRIVEN CONVOLUTIONAL BEAMFORMING USING EEG-BASED AUDITORY ATTENTION DECODING," MLSP2020, 2020.
[2021]
- T. Ueda, T. Nakatani, R. Ikeshita, K. Kinoshita, S. Araki, and S. Makino, "LOW LATENCY ONLINE BLIND SOURCE SEPARATION BASED ON JOINT OPTIMIZATION WITH BLIND DEREVERBERATION," ICASSP 2021, 2021.
- T. Nakatani, R. Ikeshita, K. Kinoshita, H. Sawada and S. Araki, "BLIND AND NEURAL NETWORK-GUIDED CONVOLUTIONAL BEAMFORMER FOR JOINT DENOISING, DEREVERBERATION, AND SOURCE SEPARATION," ICASSP 2021, 2021.
- T. Ochiai, M. Delcroix, T. Nakatani, R. Ikeshita, K. Kinoshita, and S. Araki, "NEURAL NETWORK-BASED VIRTUAL MICROPHONE ESTIMATOR," ICASSP 2021, 2021.
- J. Wissing, B. Boenninghoff, D. Kolossa, T. Ochiai, M. Delcroix, K. Kinoshita, T. Nakatani, S. Araki, C. Schymura, "DATA FUSION FOR AUDIOVISUAL SPEAKER LOCALIZATION: EXTENDING DYNAMIC STREAM WEIGHTS TO THE SPATIAL DOMAIN," ICASSP 2021, 2021.
- H. Sato, T. Ochiai, K. Kinoshita, M. Delcroix, T. Nakatani, and S. Araki, "Multimodal Attention Fusion for Target Speaker Extraction," SLT2021, 2021.
- T. Nakatani, R. Ikeshita, N. Kamo, K. Kinoshita, S. Araki, H. Sawada, "Switching Convolutional Beamformer," EUSIPCO2021, 2021.
- T. Ueda, T. Nakatani, R. Ikeshita, K. Kinoshita, S. Araki, S. Makino, "Low Latency Online Source Separation and Noise Reduction Based on Joint Optimization with Dereverberation," EUSIPCO2021, 2021.
- Christopher Schymura, Benedikt Boenninghoff, Tsubasa Ochiai, Marc Delcroix, Keisuke Kinoshita, Tomohiro Nakatani, Shoko Araki, Dorothea Kolossa, "PILOT: Introducing Transformers for Probabilistic Sound Event Localization," in Proceedings of Interspeech, 2021.
- Marc Delcroix, Jorge Bennasar Vazquez, Tsubasa Ochiai, Keisuke Kinoshita, Shoko Araki, "Few-Shot Learning of New Sound Classes for Target Sound Extraction," in Proceedings of Interspeech, 2021.
- Ayako Yamamoto, Toshio Irino, Kenichi Arai, Shoko Araki, Atsunori Ogawa, Keisuke Kinoshita and Tomohiro Nakatani, "Comparison of Remote Experiments Using Crowdsourcing and Laboratory Experiments on Speech Intelligibility," Interspeech2021, 2021.
Ph.D. thesis
- Convolutive Blind Speech Separation with Independent Component Analysis and Sparse Component Analysis, Hokkaido University, Mar. 2007.
Award
- 19th Awaya Kiyoshi Science Promotion Award from Acoustical Society of Japan (2001)
- Best Paper Award from International Workshop on Acoustic Echo and Noise Control (2003.9)
- 19th TELECOM System Technology Award from the Telecommunications Advancement Foundation (2004.3)
- The Academic Encouragement Prize from IEICE (the Institute of Electronics, Information, and Communication Engineers) (2006.3)
- The Itakura Prize Innovative Young Researcher Award from Acoustical Society of Japan (2008.3)
- The Commendation for Science and Technology by the Minister of Education,
Culture, Sports, Science and Technology, The Young Scientists' Prize, Apr. 2014.
Academic Activities
- ICA2003: Organizing committee member
- IWAENC2003: Finance chair
- EUSIPCO2006: Technical Program Committee Member, Special session co-organizer (on Underdetermined Sparse Audio Source Separation)
- WASPAA2007: Registration co-chairs
- ISCAS2008: Special session co-organizer (on Blind Separation and Dereverberation of Speech and Audio Signals)
- SiSEC2008(Signal Separation Evaluation Campaign): Evaluation chairs
- SiSEC2010(Signal Separation Evaluation Campaign): Evaluation chairs
- IEEE Audio & Acoustic Signal Processing Technical Committee Member, Jan. 2014 -- Dec. 2019.
- IEEE WIE (Women in Engineering), Kansai Section, Vice Chair, Feb. 2014 -- Dec. 2015.
- IEEE WIE (Women in Engineering), Kansai Section, Chair, Jan. 2016 -- Dec. 2017.
- IEEE Signal Processing Society HSCMA (Hands-free Speech Communication and Microphone Arrays) 2017, Technical Program Chair, Mar. 2016 -- Mar. 2017.
- IEEE WASPAA (Workshop on Applications of Signal Processing to Audio and Acoustics) 2017, Far East Liaison, Sept. 2016 -- Nov. 2017.
- IEEE IWAENC (International Workshop on Acoustic Signal Enhancement) 2018, Publications Chair, July 2017 -- Sept. 2018.
- The Acoustical Society of Japan, Executive council member, May 2017 -- May 2021.
- IEEE Japan Council, History Committee, Secretary, Jan. 2019 -- Dec. 2020.
- The Acoustical Society of Japan, Vice president, May 2021 --
- IEEE WASPAA (Workshop on Applications of Signal Processing to Audio and Acoustics) 2021, Technical Program Chair, Jan. 2021 -- Oct. 2021.
- Part-time lecturer at Graduate School of Information Science and Technology, the University of Tokyo, Apr. 2004.
- Invited lecturer at Winter School on Neuroinformatics, Sogang University, Seoul, January 29-30, 2009.
- Part-time lecturer at Faculty of Science and Engineering, Doshisha University, 2nd semester, 2009.
- Part-time lecturer at Graduate School of Information Science, Nara Institute of Science and Technology, 2012.
Master thesis
Master thesis at Ando Lab. @Univ. of Tokyo :
Theory of effective sound transfer to basilar membrane and its application to an acoustic sensor
Objective: to understand the cochlear system and to improve the sensitivity of an acoustic sensor that mimics the basilar membrane
- Investigate how to make the input impedance of the fishbone sensor mimicking the basilar membrane
- Match the sensor impedance and the impedance of the air with an exponential horn
- Confirm the increase of the output signal with the circuit
- S. Ando, S. Araki, N. Ono, A. Kimachi, M. Harada and N. Ikeuchi,
"Fishbone Acoustic Sensor with Digital PWM Controlled Frequency Characteristics," Technical Digest of the 17th Sensor Symposium, pp.359--362, Kawasaki, May 2000
Bachelor thesis
Bachelor thesis at Fujimura Lab. @Univ. of Tokyo : A Study on the Adaptive Category Analysis
--- The estimation of the spectrum and the proportion of each component from multi-spectral image data
- Senya Kiyasu, S. Araki, Hironori Takeuchi and Sadao Fujimura,
``Adaptive Spectral Unmixing for Estimation of Component Proportion,''
Proc. of the 1998 International Symposium on Noise Reduction for
Imaging and Communication Systems (ISNIC'98), pp.239-244 (1998)