研究内容
興味のあるところ
(残響下での)音声信号分離、抽出、話者セグメンテーション(diarization)、聴覚情景解析
at NTT CS labs.
論文、レター(1st author)
- S. Araki, R. Mukai, S. Makino, T. Nishikawa(NAIST) and H. Saruwatari(NAIST),
``The Fundamental Limitation of Frequency Domain Blind Source Separation for Convolutive Mixtures of Speech,'' IEEE Trans. Speech Audio Processing, Vol. 11, No. 2, pp. 109-116, 2003. [pdf]
- S. Araki, S. Makino, Y. Hinamoto, R. Mukai, T. Nishikawa(NAIST) and H. Saruwatari(NAIST),
``Equivalence between Frequency Domain Blind Source Separation and Frequency Domain Adaptive Beamforming for Convolutive Mixtures'', EURASIP Journal on Applied Signal Processing, vol. 2003, no. 11, pp. 1157--1166, 2003. [pdf]
- S. Araki, S. Makino, R. Aichner(Univ. Erlangen-Nuremberg), T. Nishikawa(NAIST) and H. Saruwatari(NAIST), ``Subband-based Blind Separation for Convolutive Mixtures of Speech,'' IEICE Trans. Fundamentals, E88-A(12), pp. 3593--3603, 2005. [pdf]
- S. Araki, H. Sawada, R. Mukai and S. Makino, ``Underdetermined Blind Sparse Source Separation for Arbitrarily Arranged Multiple Sensors,'' Signal Processing, doi:10.1016/j.sigpro.2007.02.003, 2007 (available online at http://www.sciencedirect.com and http://dx.doi.org/10.1016/j.sigpro.2007.02.003).
- S. Araki, H. Sawada, R. Mukai and S. Makino, "DOA Estimation for Multiple Sparse Sources with Arbitrarily Arranged Multiple Sensors," Journal of Signal Processing Systems, doi:10.1007/s11265-009-0413-9, 2009 (available online at http://www.springerlink.com/content/8w54h51v31086776/)
- S. Araki, T. Nakatani, and H. Sawada, "Sparse source separation based on simultaneous clustering of source locational and spectral features", Acoustical Science and Technology, Acoustic Letter, vol. 32, no. 4, July, 2011.
論文、レター (co-author)
[2001-2010]
- H. Sawada, R. Mukai, S. Araki, S. Makino, "Polar Coordinate based Nonlinear Function for Frequency Domain Blind Source Separation," IEICE Trans. Fundamentals, vol.E86-A, no.3, pp. 590-596, March 2003.
- R. Mukai, S. Araki, H. Sawada, S. Makino,
``Evaluation of Separation and Dereverberation Performance in Frequency Domain Blind Source Separation,'' Acoustical Science and Technology, Vol.25, No.2, Mar. 2004, pp.119-126.
- H. Sawada, R. Mukai, S. Araki, S. Makino, ``Convolutive Blind Source Separation for more than Two Sources in the Frequency Domain,'' Acoustical Science and Technology, the Acoustical Society of Japan, vol.25, no.4, pp. 296-298, July 2004.
- H. Sawada, R. Mukai, S. Araki, S. Makino,
``A Robust and Precise Method for Solving the Permutation Problem of Frequency-Domain Blind Source Separation,''
IEEE Trans. Speech and Audio Processing, vol.12, no.5, pp.530--538, Sept. 2004.
- R. Mukai, H. Sawada, S. Araki, S. Makino,
``Blind Source Separation for Moving Speech Signals using Blockwise ICA and Residual Crosstalk Subtraction,'' IEICE Trans. Fundamentals, Special Section on Digital Signal Processing, vol.E87-A, no.8, pp.1941--1948, Aug, 2004.
- M. Knaak (Technical University Berlin), S. Araki and S. Makino,``Geometrically Constrained Independent Component Analysis,'' IEEE Trans. Speech and Audio Processing, vol. 15, no. 2, pp.715--726, 2007.
- A. Blin, S. Araki, and S. Makino,``Underdetermined blind separation of convolutive mixtures of speech using time-frequency mask and mixing matrix estimation,'' IEICE Trans. Fundamentals, Vol.E88-A, No.7, pp.1693-1700, 2005
- H. Sawada, R. Mukai, S. Araki, and S. Makino, ``Estimating the number of sources using independent component analysis,'' Acoustical Science and Technology, nol. 26, no. 5, pp.450--452, 2005.
- S. Makino, H. Sawada, R. Mukai, and S. Araki, ``Blind source separation of convolutive mixtures of speech in frequency domain,'' IEICE Trans. Fundamentals, Vol.E88-A, No.7, pp.1640-1655, 2005 (invited)
- R. Mukai, H. Sawada, S. Araki, S. Makino, ''Frequency Domain Blind Source Separation of Many Speech Signals Using Near-field and Far-field Models,'' EURASIP Journal on Applied Signal Processing, vol. 2006, Article ID 83683, 13 pages, 2006. doi:10.1155/ASP/2006/83683.
- H. Sawada, S. Araki, R. Mukai, S. Makino, ''Blind extraction of dominant target sources using ICA and time-frequency masking,'' IEEE Trans. Audio, Speech, and Language Processing, vol.14, no.6, pp.2165-2173, Nov. 2006.
- H. Sawada, S. Araki, R. Mukai and S. Makino ,''Grouping Separated Frequency Components with Estimating Propagation Model Parameters in Frequency-Domain Blind Source Separation,'' IEEE Trans. Audio, Speech & Language Processing, vol. 15, no. 5, pp. 1592-1604, July 2007./li>
- H. Kato, Y. Nagahara, S. Araki, H. Sawada and S. Makino, "Frequency-Domain Pearson Distribution Approach for Independent Component Analysis (FD-Pearson-ICA) in Blind Source Separation," IEEE Trans. Audio, Speech and Language Processing, vol. 17, no. 4, pp. 639-649, May 2009.
- K. Ishizuka, S. Araki, and T. Kawahara, "Speech activity detection for muti-party conversation analyses based on likelihood ratio test on spatial magnitude," IEEE Transaction on Audio, Speech, and Language Processing, Vol.18, No.2, pp. 1354--1365, 2010.
[2011-]
- H. Sawada, S. Araki and S. Makino, "Underdetermined Convolutive Blind Source Separation via Frequency Bin-wise Clustering and Permutation Alignment," IEEE Trans. Audio, Speech, and Language Processing, vol.19, no.3, pp.516-527, March 2011.
- K. Ishiguro, T. Yamada, S Araki, T. Nakatani, and H. Sawada, "Probabilistic Speaker Clustering for DOA-based Diarization", IEEE Trans. ASLP, Vol. 20, No. 2, pp. 447-460, 2012.
- T. Hori, S. Araki, T. Yoshioka, M. Fujimoto, S. Watanabe, T. Oba, A. Ogawa, K. Otsuka, D. Mikami, K. Kinoshita, T. Nakatani, A. Nakamura, and J. Yamato, "Low-latency Real-time Meeting Recognition and Understanding Using Distant Microphones and Omni-directional Camera," IEEE Trans. ASLP, Vol. 20, No. 2, pp. 499-513, 2012.
- 安齊祐美, 荒木章子, 牧野昭二, 中谷智広, 山田武志, 中村篤, 北脇信彦, "劣決定音源分離のための分離音声のケプストラムスムージング", 日本音響学会論文誌, 第68巻2号, pp. 74--85, 2011.
- E. Vincent, S. Araki, F. Theis, G. Nolte, P. Bofill, H. Sawada, A. Ozerov, V. Gowreesunker, D. Lutter, and N. Q. K. Duong,"The Signal Separation Evaluation Campaign (2007-2010): Achievements and Remaining Challenges," Signal Processing 92, pp. 1928--1936, 2012.
- Marc Delcroix, Keisuke Kinoshita, Tomohiro Nakatani, Shoko Araki,
Atsunori Ogawa, Takaaki Hori, Shinji Watanabe, Masakiyo Fujimoto, Takuya
Yoshioka, Takanobu Oba, Yotaro Kubo, Mehrez Souden, Seong-Jun Hahm, and
Atsushi Nakamura, ``Speech recognition in living rooms: Integrated
speech enhancement and recognition system based on spatial, spectral and
temporal modeling of sounds,'' Computer Speech and Language, vol. 27,
pp. 851-873, May 2013.
- H. Sawada, H. Kameoka, S. Araki and N. Ueda, "Multichannel Extensions of Nonnegative Matrix Factorization with Complex-valued Data," IEEE Trans. Audio, Speech and Language Processing,
- M. Souden, S. Araki, K. Kinoshita, T. Nakatani and H. Sawada, "A Multichannel MMSE-Based Framework for Speech Sources Separation and Noise Reduction," IEEE Trans. Audio, Speech and Language Processing, no.9, vol.11, pp. 1913-1928, 2013.
- T. Nakatani, S. Araki, T. Yoshioka, M. Delcroix, and M. Fujimoto, "Dominance Based Integration of Spatial and Spectral Features for Speech Enhancement," Submitted to IEEE Trans. ASLP., vol. 21, No. 12, pp.2516-2531, Dec. 2013.
- 丸山卓郎, 荒木章子, 中谷智広, 宮部滋樹, 山田武志, 牧野昭二, 中村篤, "周波数依存到来時間差推定に基づく劣決定ブラインド音源分離の高速化,"日本音響学会論文誌, 2014
- 伊藤、荒木、木下、中谷、「音源位置情報に基づく劣決定ブラインド音源分離のためのパーミュテーションフリークラスタリング法」 電子通信学会論文誌A、Vol. J97-A, No.4, pp.234-246, 2014.
- N. Ito, E. Vincent, T. Nakatani, N. Ono, S. Araki, and S. Sagayama,
“Blind Suppression of Nonstationary Diffuse Noise Based on Spatial
Covariance Matrix Decomposition,” Springer Journal of Signal Processing Systems. (invited)
- M. Delcroix, T. Yoshioka, A. Ogawa, Y. Kubo, M. Fujimoto, N. Ito, K. Kinoshita, M. Espi, S. Araki, T. Hori, and T. Nakatani, “Strategies for Distant Speech Recognition in Reverberant Environments,” EURASIP Journal on Advances in Signal Processing
.
- T. Higuchi, N. Ito, S. Araki, T. Yoshioka, M. Delcroix, and T. Nakatani, "Online MVDR Beamformer Based on Complex Gaussian Mixture Model with Spatial Prior for Noise Robust ASR," IEEE Trans on TASLP, 2017.
- T. Kawase, K. Niwa, M. Fujimoto, K. Kobayashi, S. Araki, and T. Nakatani, "Integration of Spatial Cue-based Noise Reduction and Speech Model-based Source Restoration for Real Time Speech Enhancement," Trans IEICE., 2017.
- N. Ito, S. Araki, and T. Nakatani, "FASTFCA: A JOINT DIAGONALIZATION BASED FAST ALGORITHM FOR AUDIO SOURCE SEPARATION USING A FULL-RANK SPATIAL COVARIANCE MODEL," Arxiv, 2018.
- S. Emura, S. Araki, T. Nakatani, and N. Harada, "Distortionless beamforming optimized with l1 norm minimization," IEEE signal processing letters, 2019.
- K. Yamamoto, T. Irino, T. Matsui, S. Araki, K. Kinoshita, and T. Nakatani, "Speech intelligibility prediction with the dynamic compressive gammachirp filterbank and modulation power spectrum, "Acoustical Science and Technology, vol. 40, no. 2, pp. 84--92, 2019.
Book Chapter
- S. Araki, S. Makino, Subband Based Blind Source Separation, In J. Benesty, S. Makino, and J. Chen, editors, Speech Enhancement, pp. 329--352, Springer, March 2005.
- H. Sawada, R. Mukai, S. Araki and S. Makino, Frequency-domain blind source separation, In J. Benesty, S. Makino, and J. Chen, editors, Speech Enhancement, pp.299--327, Springer, March 2005.
- R. Mukai, H. Sawada, S. Araki and S. Makino, Real-time blind source separation for moving speech signals, In J. Benesty, S. Makino, and J. Chen, editors, Speech Enhancement, pp.353--369, Springer, March 2005.
- S. Makino, H. Sawada, R. Mukai, and S. Araki, ''Blind source separation of convolutive mixtures of audio signals in frequency domain, '' in Topics in Acoustic Echo and Noise Control, E. Haensler and G. Schmidt, Eds., Springer, 2006.
- S. Araki, H. Sawada and S. Makino, ''K-means based Underdetermined Blind Speech Separation,'' in Blind Speech Separation, S. Makino T.-W. Lee and H. Sawada, Eds., Springer, 2007.
- H. Sawada, S. Araki, and S. Makino, ''Frequency-Domain Blind Source Separation,'' in Blind Speech Separation, S. Makino T.-W. Lee and H. Sawada, Eds., Springer, 2007.
- S. Makino, S. Araki, S. Winter, H. Sawada, "Underdetermined Blind Source Separation using Acoustic Arrays," Handbook on Array Processing and Sensor Networks, S. Haykin, and K. J. R. Liu Eds., Wiley, 2009.
- N. Ito, S. Araki, and T. Nakatani, "Multi-channel audio source separation by modelling audio directional statistics," in Audio Source Separation, S. Makino Ed., Springer, 2017.
- M. I. Mandel, S. Araki, and T.Nakatani, "Multichannel classification and clustering approaches," in Audio Source Separation and Speech Enhancement, E.Vincent, T.Virtanen, and S.Gannot, Eds., John Wiley & Sons, Oct., 2018 (coming soon).
国際会議 (1st author)
[2001]
- S. Araki, S. Makino, T. Nishikawa, and
H. Saruwatari, ``Limitation of Frequency Domain Blind Source Separation for Convolutive Mixture of Speech," International Workshop on Hands-Free Speech Communication, Apr. 2001.
- S. Araki, S. Makino, T. Nishikawa, and H. Saruwatari, ``Fundamental Limitation of Frequency Domain Blind Source Separation for Convolutive Mixture of Speech," IEEE International Conference on Acoustics, Speech, and Signal (ICASSP2001), pp.2737--2740, May, 2001.
- S. Araki, S. Makino, R. Mukai, and H. Saruwatari, ``Equivalence between Frequency Domain Blind Source Separation and Frequency Domain Adaptive Beamformers," Consistent & Reliable Acoustic Cues for Sound Analysis (CRAC), Sept. 2001.
- S. Araki, S. Makino, R. Mukai, and H. Saruwatari, ``Equivalence between Frequency Domain Blind Source Separation and Frequency Domain Adaptive Null Beamformers," 7th European Conference on Speech Communication and Technology (Eurospeech2001), vol.4, pp 2595-2598, Sept. 2001.
- S. Araki, S. Makino, R. Mukai, T. Nishikawa, and H. Saruwatari, ``Fundamental limitation of frequency domain Blind Source Separation for convolved mixture of speech," 3rd International Conference on INDEPENDENT COMPONENT ANALYSIS and BLIND SIGNAL SEPARATION (ICA2001) pp.132-137, Dec. 2001.
[2002]
- S. Araki, S. Makino, R. Mukai, Y. Hinamoto, T. Nishikawa and H. Saruwatari, ``Equivalence between Frequency Domain Blind Source Separation and Frequency Domain Adaptive Beamforming," ICASSP2002, vol. II, pp. 1785-1788, May 2002.
- S. Araki, S. Makino, R. Aichner, T. Nishikawa(NAIST), and H. Saruwatari(NAIST), ``Blind Source Separation for Convolutive Mixtures of speech using subband processing,'' SMMSP2002(Second International Workshop on Spectral Methods and Multirate Signal Processing), pp.195-202, Sept. 2002.
[2003]
- S. Araki, S. Makino, R. Aichner, T. Nishikawa(NAIST), and H. Saruwatari(NAIST), ``Subband Based Blind Source Separation with Appropriate Processing for Each Frequency Band,'' ICA2003, pp. 499--504, 2003 .
- S. Araki, S. Makino, R. Aichner, T. Nishikawa(NAIST), and H. Saruwatari(NAIST), ``Subband Based Blind Source Separation for Convolutive Mixtures of Speech,'' ICASSP2003, Vol. V, pp. 509--512, 2003.
- S. Araki, S. Makino, A. Blin, R. Mukai and H. Sawada, ``Blind Separation of More Speech than Sensors with Less Distortion by Combining Sparseness and ICA,'' IWAENC2003, pp.271--274, 2003, [pdf], -->sound demos.
- S. Araki, S. Makino, H. Sawada, A. Blin and R. Mukai,
``Underdetermined Blind Separation of Convolutive Mixtures of Speech
with Binary Masks and ICA,''
NIPS 2003 workshop on ICA: Sparse Representations in Signal Processing,
Dec., 2003. (We did not have the proceedings in the workshop).
[2004]
- S. Araki, S. Makino, A. Blin, R. Mukai, and H. Sawada, ''Underdetermined blind separation of convolutive mixtures of speech by combining time-frequency masks and ICA, '' in Proc. ICA2004 (International Congress on Acoustics), vol. I, pp.321--324, 2004.
- S. Araki, S. Makino, A. Blin, R. Mukai, and H. Sawada, ``Underdetermined Blind Separation for Speech in Real Environments with Sparseness and ICA,'' ICASSP2004, vol. III, pp. 881-884, May 2004 (invited), [pdf].
- S. Araki, S. Makino, H. Sawada and R. Mukai,
``Underdetermined Blind Speech Separation with Directivity Pattern based Continuous Mask and ICA,'' EUSIPCO2004, pp.1991--1994, Sept. 2004. [pdf],-->sound demos.
- S. Araki, S. Makino, H. Sawada and R. Mukai,
``Underdetermined Blind Separation of Convolutive Mixtures of Speech with Directivity Pattern based Mask and ICA,'' ICA2004, pp.898--905, Sept. 2004. -->sound demos.
[2005]
- S. Araki, S. Makino, H. Sawada and R. Mukai, ``Reducing musical noise by a fine-shift overlap-add method applied to source separation using a time-frequency mask,'' ICASSP2005, vol. III, pp. 81-84, March 2005. [pdf], -->sound demos.
- S. Araki, S. Makino, H. Sawada, and R. Mukai, ``Source extraction from speech mixtures with null-directivity pattern based mask,'' Proc. of Joint Workshop on Hands-Free Speech Communication and Microphone Arrays (HSCMA 2005), pp. d1-d2, March 2005.
- S. Araki, H. Sawada, R. Mukai and S. Makino,``A novel blind source separation method with observation vector clustering,'' , IWAENC2005, pp.117--120, 2005. [pdf], -->sound demos.
[2006]
- S. Araki, H. Sawada, R. Mukai and S. Makino,``DOA estimation for multiple sparse sourceswith normalized observation vector clustering,'', ICASSP2006, Vol. 5, pp.33--36, 2006. [pdf]
- S. Araki, H. Sawada, R. Mukai and S. Makino,``Underdetermined Sparse Source Separation of Convolutive Mixtures with Observation Vector Clustering,'', ISCAS2006, pp. 3594--3597, 2006.
- S. Araki, H. Sawada, R. Mukai and S. Makino,``Normalized Observation Vector Clustering Approach for Sparse Source Separation,'', EUSIPCO2006, (invited).
- S. Araki, H. Sawada, R. Mukai and S. Makino, "Performance evaluation of sparse source separation and DOA estimation with observation vector clustering in reverberant environments," IWAENC2006, 2006.
- S. Araki, H. Sawada, R. Mukai and S. Makino, " Blind sparse source separation with spatially smoothed time-frequency masking," IWAENC2006, 2006.
[2007]
- S. Araki, H. Sawada, and S. Makino, "Blind speech separation in a meeting situation with maximum SNR beamformers," ICASSP2007, vol. 1, pp. 41--44, Apr. 2007. [pdf]
[2008]
- S. Araki, M. Fujimoto, K. Ishizuka, H. Sawada, and S. Makino, "Speaker indexing and speech ehnancement in real meetings / conversations," ICASSP2008, pp.93--96, 2008. [pdf]
- S. Araki, M. Fujimoto, K. Ishizuka, H. Sawada, and S. Makino, "A DOA based speaker diarization system for real meetings," HSCMA2008, pp.29--32, 2008 (invited).[pdf]
[2009]
- S. Araki, T. Nakatani, H. Sawada, and S. Makino, "Blind sparse source separation for unknown number of sources using Gaussian mixture model fitting with Dirichlet prior," ICASSP2009, pp.33-36, 2009. [pdf]
- S. Araki, T. Nakatani, H. Sawada, and S. Makino, "Stereo source separation and source counting with MAP estimation with Dirichlet prior considering spatial aliasing problem," ICA2009, pp. 742--750, 2009. [pdf]
[2010]
- S. Araki, T. Nakatani and H. Sawada, "Simultaneous clustering of mixing and spectral model parameters for blind sparse source separation," ICASSP2010, 2010.
- S. Araki, A. Ozerov, V. Gowreesunker, H. Sawada, F. Theis, G. Nolte, D. Lutter, N. Duong, "The 2010 Signal Separation Evaluation Campaign (SiSEC2010): - Audio source separation - ," in Proc of LVA/ICA2010, 2010.
- S. Araki, F. Theis, G. Nolte, D. Lutter, A. Ozerov, V. Gowreesunker, H. Sawada, N. Duong, "The 2010 Signal Separation Evaluation Campaign (SiSEC2010): - Biomedical source separation - ," in Proc of LVA/ICA2010, 2010.
- S. Araki, T. Hori, M. Fujimoto, S. Watanabe, T. Yoshioka, T. Nakatani, "Online meeting recognizer with multichannel speaker diarization", Asilomar 2010. (invited)
[2011]
- S. Araki and T. Nakatani, "Hybrid Approach for Multichannel Source Separation Combining Time-frequency Mask with Multi-channel Wiener Filter," ICASSP2011, 2011.
- S. Araki, T. Hori, T. Yoshioka, M. Fujimoto, S. Watanabe, T. Oba, A. Ogawa, K. Otsuka, D. Mikami, M. Delcroix, K. Kinoshita, T. Nakatani, A. Nakamura, and J. Yamato, "Demonstration on low-latency meeting recognition and understanding using distant microphones," HSCMA2011, 2011.
[2012]
- S. Araki and T. Nakatani,"Sparse vector factorization for underdetermined BSS using wrapped-phase GMM and source log-spectral prior," ICASSP2012, 2012.
- S. Araki, F. Nesta, E. Vincent, Z. Koldovsky, G. Nolte, A. Ziehe, and A. Benichoux, "SiSEC2011 Overview: Audio source separation," in Proc. LVA/ICA2012, pp. 414--422, Mar. 2012.
[2015]
- S. Araki and T. Hayashi, M. Delcroix, M. Fujimoto, K. Takeda and T. Nakatani,"Exploring multi-channel features for denoising-autoencoder-based speech enhancement," ICASSP2015, 2015.
[2016]
- S. Araki, M. Okada, T. Higuchi, A. Ogawa and T. Nakatani, "SPATIAL CORRELATION MODEL BASED OBSERVATION VECTOR CLUSTERING AND MVDR BEAMFORMING FOR MEETING RECOGNITION," ICASSP2016, 2016.
[2017]
- S. Araki, N. Ito, D. Marc, A. Ogawa, K. Kinoshita, T. Higuchi, T. Yoshioka, D. Tran, S. Karita, and T. Nakatani, "Online Meeting Recognition in Noisy Environments with Time-Frequency Mask Based MVDR Beamforming," Proc. HSCMA, Mar. 2017.
- S. Araki, N. Ono, K. Kinoshita and M.Delcroix, "MEETING RECOGNITION WITH ASYNCHRONOUS DISTRIBUTED MICROPHONE ARRAY, " ASRU2017, 2017
[2018]
- S. Araki, N. Ono, K. Kinoshita, and M. Delcroix, "MEETING RECOGNITION WITH ASYNCHRONOUS DISTRIBUTED MICROPHONE ARRAY USING BLOCK-WISE REFINEMENT OF MASK-BASED MVDR BEAMFORMER," ICASSP2018, 2018.
- S. Araki, N. Ono, K. Kinoshita, and M. Delcroix,"Comparison of reference microphone selection algorithms for distributed microphone array based speech enhancement in meeting recognition scenarios," IWAENC2018, 2018 (to appear).
- S. Araki, N. Ono, K. Kinoshita, and M. Delcroix, "Estimation of sampling frequency mismatch between distributed asynchronous microphones under existence of source movements with stationary time periods detection," ICASSP2019, 2019
- S. Araki, N. Ono, K. Kinoshita, and M. Delcroix, "PROJECTION BACK ONTO FILTERED OBSERVATIONS FOR SPEECH SEPARATION .WITH DISTRIBUTED MICROPHONE ARRAY," CAMSAP2019, 2019
国際会議 (co-author)
[2001]
- R. Mukai, S. Araki and S. Makino, ``Separation and Dereverberation Performance of Frequency Domain Blind Source Separation for Speech in a Reverberant Environment'', Eurospeech 2001, pp. 2599--2603, Sept. 2001.
- R. Mukai, S. Araki and S. Makino, ``Separation and Dereverberation Performance of Frequency Domain Blind Source Separation in a Reverberant Environment'', IWAENC 2001, pp. 127--130, Sept. 2001.
- R. Mukai, S. Araki and S. Makino, ``Separation and Dereverberation Performance of Frequency Domain Blind Source Separation,'' ICA2001, pp. 230-235, Dec. 2001.
- H. Sawada, R. Mukai, S. Araki, S. Makino, ``A Polar-Coordinate based Activation Function for Frequency Domain Blind Source Separation,'' ICA2001, pp. 663-668, Dec. 2001.
[2002]
- Y. Hinamoto(NAIST), T. Nishikawa(NAIST), H. Saruwatari(NAIST), S. Araki , S. Makino, and R. Mukai, ``Equivalence between Frequency Domain Blind Source Separation and Adaptive Beamforming,'' Proc. ICFS2002 (The International Conference on Fundamentals of Electronics, Communications and Computer Sciences), R-1, pp. 13-18, Mar. 2002.
- R. Aichner, S. Araki, S. Makino, T. Nishikawa(NAIST), and H. Saruwatari(NAIST), ``Time domain Blind Source Separation of non-stationary convolved signals by utilizing geometric beamforming,'' NNSP2002, pp. 445-454, 2002.
- H. Sawada, S. Araki, R. Mukai, S. Makino, ``Blind Source Separation with Different Sensor Spacing and Filter Length for Each Frequency Range,'' NNSP2002, pp. 465-474, 2002.
- R. Mukai, S. Araki, H. Sawada, S. Makino, ``Removal of Residual Cross-talk Components in Blind Source Separation using LMS Filters,'' NNSP2002, pp. 435-444, 2002.
- R. Mukai, S. Araki, H. Sawada, S. Makino, ``Removal of Residual Cross-talk Components in Blind Source Separation using Time-delayed Spectral Subtraction,''ICASSP2002, vol. II, pp.1789-1792, May 2002.
- H. Sawada, R. Mukai, S. Araki, S. Makino,``Polar Coordinate based Nonlinear Function for Frequency-Domain Blind Source Separation,''ICASSP2002, vol. I, pp. 1001-1004, May 2002.
[2003]
- S. Makino, S. Araki, R. Mukai, H. Sawada, H. Saruwatari (NAIST),`` ICA-Based Source Separation of Sounds,'' Proc. of 2002 China-Japan Joint Conference on Acoustics, Vol.21, pp. 83--86, 2002.
- M. Knaak, S. Araki, S. Makino, ``Geometrically Constraint ICA for a Robust Separation of Sound Mixtures,'', ICA2003, pp. 951--956, 2003.
- R. Aichner, H. Buchner, S. Araki, S. Makino, ``On-line Time-domain Blind Source Separation of Nonstationary Convoluved Signals,'' ICA2003, pp. 987--992, 2003.
- T. Nishikawa, H. Saruwatari, K. Shikano, S. Araki , S. Makino, ``Multistage ICA for Blind Source Separation of Real Acoustic Convolutive Mixture,'' ICA2003, pp. 523--528, 2003
- R. Mukai, H. Sawada, S. Araki, S. Makino, ``Real-Time Blind Source Separation for Moving Speakers using Blockwise ICA and Residual Crosstalk Subtraction,'' ICA2003, pp. 975-980, Apr. 2003.
- H. Sawada, R. Mukai, S. Araki, S. Makino, "A Robust and Precise Method for Solving the Permutation Problem of Frequency-Domain Blind Source Separation, " ICA 2003, pp. 505-510, Apr. 2003.
- M. Knaak, S. Araki , S. Makino, ``Geometrically Constraint ICA for a Convolutive Mixtures of Sound,'', ICASSP2003, Vol. II, pp. 725--728, 2003.
- R. Mukai, H. Sawada, S. Araki, S. Makino, ``Robust Real-Time Blind Source Separation for Moving Speakers in a Room,'' ICASSP2003, pp.
469-472, Apr. 2003.
- H. Sawada, R. Mukai, S. Araki, S. Makino, "A Robust Approach to the Permutation Problem of Frequency-Domain Blind Source Separation," ICASSP 2003, pp. 381-384, Apr. 2003.
- A. Blin, S. Araki and S. Makino,``Blind Source Separation when Speech Signals Outnumber Sensors using a Sparseness-Mixing Matrix Combination,'', IWAENC2003, pp. 211-214, 2003.
- R. Mukai, H. Sawada, S. de la Kethulle, S. Araki and S. Makino,``Array Geometry Arrangement for Frequency Domain Blind Source Separation,'' IWAENC2003, pp.219-222, 2003.
- H. Sawada, R. Mukai, S. de la Kethulle, S. Araki and S. Makino,``Spectral Smoothing for Frequency-Domain Blind Source Separation,'' IWAENC2003, pp.311-314, 2003.
[2004]
- A. Blin, S. Araki, and S. Makino, ''Underdetermined blind source separation for convolutive mixtures exploiting a sparseness-mixing matrix estimation (SMME), '' in Proc. ICA2004 (International Congress on Acoustics), vol. IV, pp. 3139--3142, 2004.
- R. Mukai, H. Sawada, S. Araki, S. Makino,``A Solution for the Permutation Problem in Frequency Domain BSS using Near- and Far-field Models,''ICA2004 (International Congress on Acoustics), vol. IV, pp. 3135--3138, 2004.
- H. Sawada, R. Mukai, S. Araki, S. Makino,``Solving the Permutation and the Circularity Problem of Frequency-Domain Blind Source Separation,''ICA2004 (International Congress on Acoustics), vol. I, pp. 89--92, 2004 (invited).
- A. Blin, S. Araki and S. Makino,``A Sparseness-Mixing Matrix Estimation (SMME) Solving the Underdetermined BSS for Convolutive Mixtures,'' ICASSP2004, vol. IV, pp. 85-88, May 2004.
- R. Mukai, H. Sawada, S. Araki, S. Makino,``Near-Field Frequency Domain Blind Source Separation for Convolutive Mixtures,'' ICASSP2004, vol. IV, pp. 49-52, May 2004.
- H. Sawada, R. Mukai, S. Araki, S. Makino,``Convolutive Blind Source Separation for more than Two Sources in the Frequency Domain,'' ICASSP2004, vol. III, pp. 885-888, May 2004 (invited).
- S. Makino, S. Araki, R. Mukai, and H. Sawada, ``Audio source separation based on independent component analysis, ''in Proc. ISCAS2004 (International Symposium on Circuits and Systems), vol. V, pp. 668-671, May 2004 (invited).
- R. Mukai, H. Sawada, S. Araki and S. Makino,``Frequency Domain Blind Source Separation using Small and Large Spacing Sensor Pairs,'' ISCAS2004, vol. V, pp. 1-4, May 2004.
- H. Sawada, S. Winter, S. Araki, R. Mukai, S. Makino,``Estimating the Number of Sources for Frequency-Domain Blind Source Separation,'' ICA2004 (5th International Conference on Independent Component Analysis and Blind Signal Separation), pp.610--617, Sept. 2004.
- S. Winter, H. Sawada, S. Araki, S. Makino,``Overcomplete BSS for convolutive mixtures based on hierarchical clustering,'' ICA2004, pp.652--660, Sept. 2004.
- R. Mukai, H. Sawada, S. Araki, S. Makino,``Frequency Domain Blind Source Separation for Many Speech Signals,'' ICA2004, pp.461--469, Sept. 2004.
- S. Winter, H. Sawada, S. Araki, S. Makino,``Hierarchical Clustering Applied to Overcomplete BSS for Convolutive Mixtures,'' SAPA2004 (ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing), Session I-3, Oct. 2004.
[2005]
- H. Sawada, S. Araki, R. Mukai, S. Makino,``Blind Extraction of a Dominant Source Signal from Mixtures of Many Sources,'' ICASSP2005, vol. III, pp. 61-64, March 2005.
- H. Sawada, R. Mukai, S. Araki, and S. Makino, ``Frequency-domain blind source separation without array geometry information,'' Proc. of Joint Workshop on Hands-Free Speech Communication and Microphone Arrays (HSCMA 2005), pp.d13-d14, March 2005.
- R. Mukai, H. Sawada, S. Araki, and S. Makino, ``Blind source separation and {DOA} estimation using small 3-D microphone array,'' Proc. of Joint Workshop on Hands-Free Speech Communication and Microphone Arrays (HSCMA 2005), pp. d9-d10, March 2005.
- H. Sawada, S. Araki, R. Mukai, and S. Makino,``Blind extraction of a dominant source from mixtures of many sources using ICAand time-frequency masking,'' Proc. of 2005 IEEE International Symposium on Circuits and Systems (ISCAS 2005), pp. 5882-5885, May 2005.
- H. Sawada, R. Mukai, S. Araki, and S. Makino, ``Multiple source localization using independent component analysis,'' Proc. of 2005 IEEE AP-S International Symposium and USNC/URSI National Radio Science Meeting, July 2005.
- H. Kato, Y. Nagahara (Meiji Univ.), S. Araki, and H. Sawada,``Pearson distribution system applied to blind speech separation,'' 25th European Meeting of Statsiticians (EMS2005), p.394, July 2005.
- F. Flego, S. Araki, H. Sawada, T. Nakatani, and S. Makino, ``Underdetermined blind separation for speech in real environments with F0 adaptive comb filtering,'' IWAENC2005, pp. 93--96, 2005.
- H. Sawada, R. Mukai, S. Araki, and S. Makino,``Real-time blind extraction of dominant target sources from many background interferences,'' IWAENC2005, pp. 73--76, 2005.
- R. Mukai, H. Sawada, S. Araki, and S. Makino, ``Real-Time Blind Source Separation and DOA Estimation Using Small 3-D Microphone Array,'' IWAENC2005, pp. 45--48, 2005.
- R. Mukai, H. Sawada, S. Araki, and S. Makino, ``Blind Source Separation of 3-D Located Many Speech Signals,'' in Proc. WASPAA2005, pp. 9-12, Oct., 2005.
[2006]
- H. Sawada, S. Araki, R. Mukai and S. Makino,''On Calculating the Inverse of Separation Matrix in Frequency-Domain BSS,'' ICA2006, pp. 691--699, 2006.
- H. Sawada, S. Araki, R. Mukai and S. Makino,''Solving the permutation problem of frequency-domain BSS when spatial aliasing occurs with wide sensor spacing,'' ICASSP2006, vol. V, pp. 77-80, Mar. 2006.
- R. Mukai, H. Sawada, S. Araki, S. Makino, "Blind Source Separation of Many Signals in the Frequency Domain," ICASSP2006, vol.5, pp.969--972, 2006.
- H. Kato, Y. Nagahara, S. Araki, H. Sawada and S. Makino, "Parametric Pearson Approach based Independent Component Analysis for Frequency Domain Blind Speech Separation," EUSIPCO2006, 2006.
- J. Cermak, S. Araki, H. Sawada and S. Makino, "Blind Speech Separation by Combining Beamformers and a Time Frequency Binary Mask," IWAENC2006, 2006.
- J. Cermak, S. Araki, H. Sawada and S. Makino, "Musical Noise Reduction in Time-frequency-binary-masking-based Blind Source Separation Systems," 16th Czech-German Workshop, 2006.
- R. Mukai, H. Sawada, S. Araki and S. Makino,
"Frequency Domain Blind Source Separation in a Noisy Environment,"
Joint meeting of ASA and ASJ 2006, Nov. 2006, (invited).
- H. Sawada, S. Araki, R. Mukai and S. Makino, ''Blind separation and localization of speeches in a meeting situation'', Asilomar 2006, pp. 1407-1411, Oct. 2006.
[2007]
- J. Cermak, S. Araki, H. Sawada and S. Makino"Blind Source Separation Based on Beamformer Array and Time Frequency Binary Masking," in Proc. ICASSP2007, vol. I, pp. 145 --148, Apr. 2007.
- J. E. Rubio, K. Ishizuka, H. Sawada, S. Araki, T. Nakatani and M. Fujimoto, "Two-Microphone Voice Activity Detection Based on the Homogeneity of the Direction of Arrival Estimates," in Proc. ICASSP2007, vol.4, pp. 385-388, Apr. 2007.
- H. Sawada, S. Araki and S. Makino, "Measuring Dependence of Bin-wise Separated Signals for Permutation Alignment in Frequency-domain BSS," in Proc. ISCAS2007, pp. 3247 - 3250, May 2007.
- H. Sawada, S. Araki, and S. Makino, "MLSP 2007 data analysis competition: Frequency-domain blind source separation for convolutive mixtures of speech/and audio," MLSP2007, 2007.
- H. Sawada, S. Araki, and S. Makino, "A two-stage frequency-domain blind source separation method for underdetermined convolutive mixtures", WASPAA2007.
[2008]
- D. Kolossa (TU Berlin), S. Araki , M. Delcroix, T. Nakatani, R. Orglmeister (TU Berlin), S. Makino, "Missing Feature Speech Recognition in a Meeting Situation with Maximum SNR Beamforming," ISCAS2008.
- T. Hager, S. Araki, K. Ishizuka, M. Fujimoto, T. Nakatani, S. Makino, "Handling speaker position changes in a meeting diarization system by combining DOA clustering and speaker identification," IWAENC2008, 2008.
- K. Ishizuka, S. Araki, T. Kawahara, "Statistical Speech Activity Detection based on Spatial Power Distribution for Analyses of Poster Presentations," Interspeech2008, pp.99-102, 2008.
- T. Kawahara, H. Setoguchi, K. Takanashi, K. Ishizuka, S. Araki, "Multi-Modal Recording, Analysis and Indexing of Poster Sessions," Interspeech2008, pp. 1622-1625, 2008.
- K. Otsuka, S. Araki, K. Ishizuka, M. Fujimoto, M. Heinrich, J. Yamato, "A Realtime Multimodal System for Analyzing Group Meetings by Combining Face Pose Tracking and Speaker Diarization," ICMI2008, pp. 257--264, 2008.
[2009]
- E. Vincent, S. Araki, and P. Bofill, "The 2008 Signal Separation Evaluation Campaign: A Community-Based Approach to Large-Scale Evaluation," ICA2009, pp. 734--741, 2009.
- K. Ishiguro, T. Yamada, S. Araki and T. Nakatani, "A Probabilistic Speaker Clustering for DOA-based Diarization,"
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2009), pp. 241-244, 2009. [pdf]
- K. Ishizuka, S. Araki, K. Otsuka, T. Nakatani, and M. Fujimoto, "A speaker diarization method based on the probabilistic fusion of audio-visual location information," Proceedings of the 11th International Conference on Multimodal Interfaces and Workshop on Machine Learning for Multi-modal Interaction (ICMI-MLMI2009), pp.55-62, 2009.
- K. Otsuka, S. Araki, D. Mikami, K. Ishizuka, M. Fujimoto, and J. Yamato: "Realtime Meeting Analysis and 3D Meeting Viewer Based on Omnidirectional Multimodal Sensors", Proc. ICMI-MLMI2009, 2009.
- T. Tashiro, S. Araki, Y. Nakanishi, H. Kimura, K. Kumozaki and M. Miyoshi, "Optical Access System with Emergency Voice Communication Using Blind Speech Separation for Demultiplexing Randomly Mixed Signals," GLOBECOM, 2009.
[2010]
- T. Nakatani and S. Araki,"SINGLE CHANNEL SOURCE SEPARATION BASED ON SPARSE SOURCE OBSERVATION MODEL WITH HARMONIC CONSTRAINT," ICASSP2010, 2010.
- Y. Ansai, S. Araki, S. Makino, T. Nakatani, T. Yamada, A. Nakamura and N. Kitawaki, "Cepstral Smoothing of Separated Signals for Underdetermined Speech Separation," ISCAS2010, 2010.
- T. Nakatani, S. Araki, T. Yoshioka, M. Fujimoto, "Multichannel Source Separation Based on Source Location Cue with Log-Spectral Shaping by Hidden Markov Source Model," Interspeech2010, 2010.
- T. Hori, S. Araki, T. Yoshioka, M. Fujimoto, S. Watanabe, T. Oba, A. Ogawa, K. Otsuka, D. Mikami, K. Kinoshita, T. Nakatani, A. Nakamura, J. Yamato, "Real-time Meeting Recognition and Understanding Using Distant Microphones and Omni-directional Camera," in Proc of SLT2010, 2010.
[2011]
- T. Nakatani, S. Araki, T. Yoshioka, and M. Fujimoto, "Joint unsupervised learning of hidden Markov source models and source location models for multichannel source separation," ICASSP2011, 2011.
- H. Sawada, H. Kameoka, S. Araki, and N. Ueda, "FORMULATIONS AND ALGORITHMS FOR MULTICHANNEL COMPLEX NMF,"ICASSP2011, 2011.
- K. Iso, S. Araki, S. Makino, T. Nakatani, H. Sawada, T. Yamada, and A. Nakamura, "BLIND SOURCE SEPARATION OF MIXED SPEECH IN A HIGH REBERBERATION ENVIRONMENT," HSCMA2011, 2011.
[2012]
- T. Maruyama, Shoko Araki, T. Nakatani, S. Miyabe, T. Yamada, S. Makino and A. Nakamura, "NEW ANALYTICAL UPDATE RULE FOR TDOA INFERENCE FOR UNDERDETERMINED BSS IN NOISY ENVIRONMENTS," ICASSP2012.
- T. Nakatani T. Yoshioka S. Araki M. Delcroix and M. Fujimoto, "LOGMAX OBSERVATION MODEL WITH MFCC-BASED SPECTRAL PRIOR FOR REDUCTION OF HIGHLY NONSTATIONARY AMBIENT NOISE," ICASSP2012, 2012.
- M. Souden, S. Araki, K. Kinoshita, T. Nakatani and H. Sawada, "A MULTICHANNEL MMSE-BASED FRAMEWORK FOR JOINT BLIND SOURCE SEPARATION AND NOISE REDUCTION," ICASSP2012.
- H. Sawada, H. Kameoka, S. Araki and Naonori Ueda, "EFFICIENT ALGORITHMS FOR MULTICHANNEL EXTENSIONS OF ITAKURA-SAITO NONNEGATIVE MATRIX FACTORIZATION," ICASSP2012.
- G. Nolte, D. Lutter, A. Ziehe, F. Nesta, E. Vincent, Z. Koldovsky, A. Benichoux and S. Araki, "SiSEC2011 Overview: Biomedical Data Analysis," in Proc. LVA/ICA2012, pp. 423--429, Mar. 2012.
- Takaaki Hori, Keisuke Kinoshita, Shoko Araki, Atsunori Ogawa, Takuya Yoshioka, Masakiyo Fujimoto, Takanobu Oba, Marc Delcroix, Mehrez Souden, Yotaro
Kubo, Seong-Jun Hahm, Dan Mikami, Kazuhiro Otsuka, Tomohiro Nakatani, Atsushi Nakamura, Junji Yamato, "Real-time audio-visual meeting recognition and understanding using distant microphone array," ICASSP2012, Show & Tell, Mar. 2012.
- T. Maruyama, S. Araki, T. Nakatani, S. Miyabe, T. Yamada, S. Makino and A. Nakamura, "New analytical calculation and estimation for TDOA inference for underdetermined BSS in noisy environments," Proc. on Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA2012), 2012.
[2013]
- Nobutaka Ito, Shoko Araki, and Tomohiro Nakatani, “Permutation-free
Convolutive Blind Source Separation via Full-band Clustering Based on
Frequency-independent Source Presence Priors,” Proc. International
Conference on Acoustics, Speech, and Signal Processing (ICASSP),
Vancouver, Canada, May 2013.
- Tomohiro Nakatani, Mehrez Souden, Shoko Araki, Takuya Yoshioka,
Takaaki Hori, Atsunori Ogawa, "Coupling beamforming with spatial and
spectral feature based spectral enhancement and its application to
meeting recognition," Proc. International Conference on Acoustics,
Speech, and Signal Processing (ICASSP), Vancouver, Canada, May 2013.
- J. Ingrid, N. Ito, M. Souden, S. Araki and T. Nakatani, "Source number estimation based on clustering of speech activity sequences for microphone array processing" MLSP2013, 2013
[2014]
- N. Ito, S. Araki, and T. Nakatani, “Probabilistic Integration of Diffuse Noise Suppression and Dereverberation, ” Proc. ICASSP2014, May 2014.
- N. Ito, S. Araki, T. Nakatani and T. Yoshioka, “Relaxed disjointness based clustering for joint blind source separation and dereverberation, ” Proc. IWAENC2014, 2014.
- M. Delcroix, T. Yoshioka, A. Ogawa, Y. Kubo, M. Fujimoto, N. Ito, K. Kinoshita, M. Espi,S. Araki, T. Hori, and T. Nakatani, "Defeating reverberation: Advanced dereverberation and recognition techniques for hands-free speech recognition," Proc. GlobalSIP, 2014.
[2015]
- N. Ito, S. Araki, and T. Nakatani, “Permutation-free Clustering of Relative Transfer Function Features for Blind Source Separation,” Proc. EUSIPCO 2015 (to appear).
- T. Yoshioka, N. Ito, M. Delcroix, A. Ogawa, K. Kinoshita, M. Fujimoto, C. Yu, W. J. Fabian, M. Espi, T. Higuchi, S. Araki, and T. Nakatani, "NTT CHIME-3 SYSTEM: ADVANCES IN SPEECH ENHANCEMENT AND RECOGNITION FOR MOBILE MULTI-MICROPHONE DEVICES," ASRU2015, Dec., 2015. (Best paper award honorable mention)
[2016]
- H. Meutzner, S. Araki, M. Fujimoto and T. Nakatani, "A Generative-Discriminative Hybrid Approach to Multi-Channel Noise Reduction for Robust Automatic Speech Recognition," ICASSP2016, 2016.
- N. Ito, S. Araki and T. Nakatani, "MODELING AUDIO DIRECTIONAL STATISTICS USING A COMPLEX BINGHAM MIXTURE MODEL FOR BLIND SOURCE EXTRACTION FROM DIFFUSE NOISE," ICASSP2016, 2016.
- T. Kawase, K. Niwa, M. Fujimoto, N. Kamado, K. Kobayashi, S. Araki, and T. Nakatani, "Real-time integration of statistical model-based speech enhancement with unsupervised noise psd estimation using microphone array," ICASSP2016, 2016.
- N. Ito, S. Araki, T. Nakatani, "Complex Angular Central Gaussian Mixture Model for Directional Statistics in Mask-based Microphone Array Signal Processing, " EUSIPCO2016, 2016.
- N. Murata, H. Kameoka, K. Kinoshita, S. Araki, T. Nakatani, S. Koyama and H. Saruwatari, "Reverberation-robust underdetermined source separation with non-negative tensor double deconvolution," EUSIPCO2016, 2016.
- K. Yamamoto, T. Irino, T. Matsui, S. Araki, K. Kinoshita, and T. Nakatani, "Speech intelligibility prediction based on the envelope power spectrum model with the dynamic compressive gammachirp auditory filterbank," Interspeech2016, 2016.
- M. Fakhry, N. Ito, S. Araki and T. Nakatani, "Modeling audio directional statistics using a probabilistic spatial dictionary for speaker diarization in real meetings," IWAENC2016, 2016.
[2017]
- N. Ito, S. Araki, M. Delcroix, and T. Nakatani, "PROBABILISTIC SPATIAL DICTIONARY BASED ONLINE ADAPTIVE BEAMFORMING
FOR MEETING RECOGNITION IN NOISY AND REVERBERANT ENVIRONMENTS," ICASSP2017, 2017.
- T. Nakatani, N. Ito, T. Higuchi, S. Araki and K. Kinoshita, "INTEGRATING DNN-BASED AND SPATIAL CLUSTERING-BASED MASK ESTIMATION FOR ROBUST MVDR BEAMFORMING ," ICASSP2017, 2017.
- K. Yamamoto, T. Irino, T. Matsui, S. Araki, K. Kinoshita and T. Nakatani, "Analysis of acoustic features for speech intelligibility prediction models," 5th ASA/ASJ Joint meeting, 2016
- K. Yamamoto, T. Irino, T. Matsui, S. Araki, K. Kinoshita and T. Nakatani, "Predicting Speech Intelligibility Using Gammachirp Envelope Distortion Analysis Method Based on the Signal-to-Distortion Ratio", Interspeech2017, 2017.
- N. Ito, S. Araki, and T. Nakatani, "Data-Driven and Physical Model-Based Designs of Probabilistic Spatial Dictionary for Online Meeting Diarization and Adaptive Beamforming," Proc. EUSIPCO, 2017.
- R. Higashinaka, Sakai, A. Sugiyama, Narimatsu, Arimoto, Fukutomi, Matsui, Imoto, Ijima, Itoh, S. Araki, Yoshikawa, Ishiguro and Matsuo, "Demonstration of an argumentative dialogue system based on argumentation structures," SEMDIAL2017, 2017.
[2018]
- N. Ito, T. Makino, S. Araki, T. Nakatani, "Maximum-Likelihood Online Speaker Diarization in Noisy Meetings Based on Categorical Mixture Model and Probabilistic Spatial Dictionary," Proc. ICASSP, May 2018.
- J. Azcarreta, N. Ito, S. Araki, and T. Nakatani, "Permutation-Free cGMM: Complex Gaussian Mixture Model with Inverse Wishart Mixture Model Based Spatial Prior for Permutation-Free Source Separation and Source Counting," Proc. ICASSP, May 2018.
- N. Ito, S. Araki, and T. Nakatani, "FastFCA: Joint Diagonalization Based Acceleration of Audio Source Separation Using a Full-Rank Spatial Covariance Model," Proc. EUSIPCO, Sep. 2018.
- N. Ito, C. Schymura, S. Araki, and T. Nakatani, "Noisy cGMM: Complex Gaussian Mixture Model with Non-Sparse Noise Model for Joint Source Separation and Denoising," Proc. EUSIPCO, Sep. 2018.
- K. Yamamoto, T. Irino, N. Ohashi, S. Araki, K. Kinoshita and T. Nakatani,"Multi-resolution Gammachirp Envelope Distortion Index for Intelligibility Prediction of Noisy Speech," Interspeech2018, 2018.
- Y. Matsui, T. Nakatani, M. Delcroix, K. Kinoshita1, N. Ito, S. Araki and S. Makino, "Online integration of DNN-based and spatial clustering-based mask estimation for robust MVDR beamforming," IWAENC2018, 2018.
- K. Yamamoto, T. Irino, S. Araki, K. Kinoshita and T. Nakatani,"Speech intelligibility prediction using a multi-resolution gammachirp envelope distortion index with common parameters for different noise conditions," UAC2018, 2018.
[2019]
- T. von Neumann, K. Kinoshita, M. Delcroix, S. Araki, T. Nakatani and R. Haeb-Umbach, "All-neural online source separation, counting, and diarization for meeting analysis," ICASSP2019, 2019.
- M Delcroix, K. Zmolikova, T. Ochiai, K Kinoshita, S. Araki, T. Nakatani, "Compact network for SpeakerBeam target speaker extraction," ICASSP2019, 2019.
- Y. Kubo, T. Nakatani, M. Delcroix, K. Kinoshita, S. Araki, "Mask-based MVDR beamformer for noisy multisource environments: introduction of time-varying spatial covariance model," ICASSP2019, 2019.
- Kenichi Arai, Shoko Araki, Atsunori Ogawa, Keisuke Kinoshita, Tomohiro Nakatani, Katsuhiko Yamamoto, and Toshio Irino, "Predicting speech intelligibility of enhanced speech using phone accuracy of DNN-based ASR system," Interspeech 2019, 9 2019.
- Tomohiro Nakatani, Kesuke Kinoshita, Rintaro Ikeshita, Hiroshi Sawada, and, ,S. Araki, "Simultaneous denoising dereverberation, and source separation using a unified convolutional beamformer," Proc. WASPAA, 2019, 2019.
国内研究報告
[2001]
- 荒木章子, 牧野昭二, 西川剛樹, 猿渡洋, "実環境での混合音声に対する周波数領域ブラインド音源分離手法の性能限界," 日本音響学会2001年春季研究発表会 (2001.3)
- 西川剛樹, 荒木章子, 牧野昭二, 猿渡洋, "帯域分割型ICAを用いたBlind Source Separationにおける帯域分割数の最適化," 日本音響学会2001年春季研究発表会(2001.3)
- 向井良, 荒木章子, 牧野昭二, "実環境におけるブラインド音源分離と残響除去性能に関する検討," 日本音響学会2001年春季研究発表会(2001.3)
- 西川剛樹, 荒木章子, 牧野昭二, 猿渡洋, "周波数領域Blind Source Separationにおける帯域分割数の最適化," 電子情報通信学会技術報告 Vol.EA2000-95, pp.53--59 (2001.1)
- 西川剛樹, 荒木章子, 牧野昭二, 猿渡洋, "周波数領域ブラインド音源分離における帯域分割数の最適化," 日本音響学会関西支部第3回若手研究者交流会(2000.12)
- 猿渡洋(NAIST), 西川剛樹(NAIST), 荒木章子, 牧野昭二, ``時間領域ICAと周波数領域ICAを併用した多段ICAによるブラインド音源分離,'' 日本神経回路学会 第11回全国大会 講演論文集, pp.99-100, Sept. 2001.
- 澤田宏, 向井良, 荒木章子, 牧野昭二 ``複素数に対する独立成分分析のための極座標表示に基づく活性化関数,'' 日本神経回路学会第11回全国大会 講演 論文集, pp. 97-98, Sept. 2001.
- 荒木章子, 牧野昭二, 西川剛樹, 猿渡洋, "周波数領域ブラインド音源分離と周波数領域適応ビームフォーマの関係について," 日本音響学会2001年秋季研究発表会 (2001.10)
- 向井良, 荒木章子, 澤田宏, 牧野昭二, ``非定常スペクトルサブトラクションによる音源分離後の残留雑音除去,'' 日本音響学会2001年秋季研究発表会講演論文集, pp. 617-618, Oct. 2001.
- 澤田宏,向井良,荒木章子,牧野昭二, ``周波数領域ブラインド音源分離のための極座標表示に基づく活性化関数,'' 日本音響学会2001年秋季研究発表会講演論文集, pp. 615-616, Oct. 2001.
- 雛元洋一(NAIST), 西川剛樹(NAIST), 猿渡洋(NAIST), 荒木章子, 牧野昭二, 向井良, ``周波数領域ブラインド音源分離と適応ビ?ムフォ?マの等価性について ,'' 電子情報通信学会技術研究報, Vol.EA2001-84,pp.75--82 (2001年11月).
[2002]
- 荒木章子, 牧野昭二, Robert Aichner, 西川剛樹(NAIST), 猿渡洋(NAIST), ``サブバンド処理によるブラインド音源分離に関する検討 ,'' 日本音響学会2002年春季研究発表会講演論文集, pp.619-620, 2002.
- 澤田宏, 荒木章子, 向井良, 牧野昭二, ``間隔の異なる複数のマイクペアによるブラインド音源分離,'' 日本音響学会2002年春季研究発表会講演論文集, pp. 621-622, 2002.
- 向井良, 荒木章子, 澤田宏, 牧野昭二 ,``周波数領域ICAと時間遅れスペクトル減算による残響下での実時間ブラインド音源分離,'' 日本音響学会2002年春季研究発表会講演論文集, pp. 673-674, 2002.
- 荒木章子, 牧野昭二, Robert Aichner, 西川剛樹(NAIST), 猿渡洋(NAIST), ``死角型ビームフォーマを初期値に用いる時間領域ブラインド音源分離,'' 日本音響学会2002年秋季研究発表会講演論文集,pp. 543-544, 2002.
- 西川剛樹(NAIST)、高谷智哉(NAIST)、猿渡洋(NAIST)、鹿野清宏(NAIST)、荒木章子, 牧野昭二, "KL情報量最小化に基づく時間領域ICAと非定常信号の同時無相関化に基づく時間領域ICAの比較," 日本音響学会2002年秋季研究発表会講演論文集, pp. 545-546, 2002.
- 澤田宏, 向井良, 荒木章子, 牧野昭二, "周波数領域ブラインド音源分離におけるpermutation問題の解法," 日本音響学会2002年秋季研究発表会講演論文集, pp. 541-542, 2002.
- 向井良, 澤田宏, 荒木章子, 牧野昭二, "ブラインド音源分離後の残留スペクトルの推定と除去," 日本音響学会2002年秋季研究発表会講演論文集, pp. 539-540, 2002.
[2003]
- 荒木章子, 牧野昭二, Robert Aichner, 西川剛樹(NAIST), 猿渡洋(NAIST), ``帯域に適した分離手法を用いるサブバンド領域ブラインド音源分離,'' 日本音響学会2003年春季研究発表会講演論文集, pp. 781-782, 2003.
- 向井良, 澤田宏, 荒木章子, 牧野昭二, ``移動音源の低遅延実時間ブラインド分離,''日本音響学会2003年春季研究発表会講演論文集, pp.779-780, 2003
- 澤田宏,向井良,荒木章子,牧野昭二,``周波数領域ブラインド音源分離におけるpermutation問題の頑健な解法,''日本音響学会2003年春季研究発表会講演論文集, pp.777-778, 2003
- 荒木章子, 向井良, 澤田宏, 牧野昭二, ``時間周波数マスキングとICAの併用による音源数 > マイク数の場合のブラインド音源分離,'' 日本音響学会2003年秋季研究発表会講演論文集, pp.587-588, 2003.
- 荒木章子, Audrey Blin, 牧野昭二, ``Blind Separation of More Speech Signals than Sensors using Time-frequency Masking and Mixing Matrix Estimation,'' 日本音響学会2003年秋季研究発表会講演論文集, pp.585-586, 2003.
- 向井良, 澤田宏, 荒木章子, 牧野昭二, ``周波数領域BSSにおける近距離場モデルを用いたパーミュテーションの解法,'' 日本音響学会2003年秋季研究発表会講演論文集, pp.589-590, 2003.
- 澤田宏, 向井良, 荒木章子, 牧野昭二, ``実環境における3音源以上のブラインド分離,'' 日本音響学会2003年秋季研究発表会講演論文集, pp.547-548, 2003.
[2004]
- 向井, 澤田, 荒木, 牧野, ``狭間隔・広間隔の複数マイクロホン対を用いた周波数領域ブラインド音源分離,''日本音響学会2004年春季研究発表会講演論文集, pp. 627--628, 2004.
- S. Araki, S. Makino, H. Sawada and R. Mukai,
``Blind Separation of More Speech than Sensors using Time-frequency Masks and ICA,''Proceedings of 2004 NTT Workshop on Communication Scene Analysis (CSA2004), (invited)
- S. Winter, H. Sawada,S. Araki, S. Makino,``Underdetermined Blind Source Separation for Convolutive Mixtures of Sparse Signals,'' CSA2004
- H. Sawada, R. Mukai, S. Araki, S. Makino,``Blind Source Separation for Convolutive Mixtures in the Frequency Domain,'' CSA2004.
- R. Mukai, H. Sawada, S. Araki, S. Makino,``A Solution for the Permutation Problem in Frequency Domain BSS using Near- and Far-field Models,'' CSA2004.
- 澤田, 向井, 荒木, 牧野,``独立成分分析を用いた音源数推定法,'' 日本音響学会2004年秋季研究発表会講演論文集, pp. 753--754, 2004.
[2005]
- 荒木, 澤田, 向井, 牧野,``観測ベクトルのクラスタリングによるブラインド音源分離,'' 信学会ソサイエティ大会, p. 208, 2005.
- 澤田, 荒木, 向井, 牧野,``多くの背景音からの主要音源のブラインド抽出,'' 信学会ソサイエティ大会, p. 210, 2005.
- 向井, 澤田, 荒木, 牧野,``3次元マイクロホンアレイを用いた多音源ブラインド分離,'' 信学会ソサイエティ大会, p. 209, 2005.
- 荒木, 澤田, 向井, 牧野,``観測信号ベクトル正規化とクラスタリングによる音源分離手法とその評価,'' 日本音響学会2005年秋季研究発表会, pp. 591--592, 2005. [pdf].
- 加藤, 永原, 荒木, 澤田, 牧野,``パラメトリックピアソン分布を用いた周波数領域ブラインド音源分離,'' 日本音響学会2005年秋季研究発表会, pp, 593--594, 2005.
[2006]
- 荒木, 澤田, 向井, 牧野,``観測信号ベクトルのクラスタリングに基づくスパース信号の到来方向推定,'' 日本音響学会2006年春季研究発表会, pp. 615--616, 2006, [pdf].
- 加藤, 永原, 荒木, 澤田, 牧野,``パラメトリックピアソン分布を用いた周波数領域ブラインド音源分離,'' 日本音響学会2006年春季研究発表会, pp, 549--550, 2006.
[2007]
-
荒木, 澤田, 牧野, ''話者分類とSN比最大化ビームフォーマに基づく会議音声強調,'' 日本音響学会2007年春季研究発表会, pp. 571--572, Mar. 2007. [pdf]
-
澤田, 荒木, 大塚, 藤本, 石塚, ''多人数多マイクでの発話区間検出?ピンマイクでの事例?,'' 日本音響学会2007年春季研究発表会, pp. 679--680, Mar. 2007.
- 石塚, J.E.Rubio, 澤田, 荒木, 中谷, 藤本, "信号到来方向推定の偏在性を用いた耐雑音音声区間検出法," 日本音響学会2007年秋季研究発表会, pp. 163--166, Sept. 2007.
- 木下, 中谷, 澤田, 荒木, 三好, "複数音源が存在する残響環境でのマルチステップ線形予測の効果," 日本音響学会2007年秋季研究発表会, Sept. 2007.
- 石塚, 荒木, 藤本, 瀬戸口(京大), 高梨(京大), 河原(京大), "ポスター会話に対する発話区間検出と話者識別の検討," 情報処理学会研究報告, pp. 217--222, Dec. 2007.
[2008]
- 荒木, 藤本, 石塚, 澤田, 牧野, "音声区間検出と方向情報を用いた会議音声話者識別システムとその評価," 日本音響学会2008年春季研究発表会, Mar. 2008. [pdf]
- 荒木, 澤田, 牧野, "音声のスパース性を用いたUnderdetermined音源分離," 電子情報通信学会2008年総合大会, Mar. 2008.
- 荒木, 伊藤, 澤田, 小野, 牧野, 嵯峨山, "周波数領域ICAにおける初期値の短時間データからの学習," 電子情報通信学会2008年総合大会, Mar. 2008.
- 荒木, 藤本, 石塚, 中谷, 澤田, 牧野, "音声区間推定と時間周波数領域方向推定の統合による会議音声話者識別," 電子情報通信学会技術研究報告, Vol.EA2008-40, pp 19--24, 2008.
- 大塚,荒木,石塚,藤本,大和,「多人数会話シーン分析に向けた実時間マルチモーダルシステムの構築 ? マルチモーダル全方位センサを用いた顔方向追跡と話者ダイアリゼーションの統合」,電子情報通信学会マルチメディア・仮想環境基礎研究会 (MVE), 信学技報, vol. 108, no. 328, MVE2008-68, pp. 55-62, 2008.
[2009]
- 石黒,山田,荒木,中谷,「ノンパラメトリックベイズを用いた会議音声話者識別のための話者クラスタリング法」,日本音響学会2009年春季研究発表会, pp.107--110, 2009.
- 小笠原(名大),石塚,荒木,藤本,中谷,大塚,「SN 比最大化ビームフォーマを用いたオンライン会議音声強調」,日本音響学会2009年春季研究発表会, 2009. [pdf]
- 石塚,荒木,大塚,中谷,藤本,「音響情報と映像情報から得られる位置情報の統合による話者ダイアライゼーション」,日本音響学会2009年春季研究発表会, 2009.
- 荒木, 中谷, 澤田, "ディリクレ事前分布を用いた音声のスパース性に基づく音源数推定と音源分離," 日本音響学会2009年秋季研究発表会, 2009. [pdf]
[2010]
- 荒木,中谷,澤田、「マイク間位相差とスペクトル包絡の同時クラスタリングに基づくスパース音源分離」,日本音響学会2010年春季研究発表会,2010.
- 安齊, 荒木, 牧野, 中谷, 山田, 中村, 北脇, 「劣決定音源分離のための分離音声のケプストラムスムージング」,日本音響学会2010年春季研究発表会,2010.
- 田代, 荒木, 木村, 中村, 「停電時上り音声通信を実現する光アクセス方式の提案」, 電子情報通信学会2010年総合大会, Mar. 2010.
- 田代, 荒木, 木村, 中村, "停電時上り音声通信光アクセス方式の実現技術の検討," 信学会ソサイエティ大会, Sept, 2010.
- 堀,荒木,吉岡,大庭,藤本,渡部,小川,大塚,三上,木下,中谷,中村,大和,"いつ誰が何を話したかを即座に認識するオンライン会話分析システム - (1)コンセプトとデザイン- ," 日本音響学会2010年秋季研究発表会,2010.
- 藤本, 荒木, 吉岡,木下,中谷,中村,"いつ誰が何を話したかを即座認識するオンライン会話分析システム -(2) 複数話者遠隔発話音声認識のための音声強調技術-," 日本音響学会2010年秋季研究発表会,2010.
- 中谷, 荒木, 吉岡, "DOA クラスタリングと音声の対数スペクトルHMM に基づく音源分離", 日本音響学会2010年秋季研究発表会,2010.
[2011]
- 荒木, 中谷, "時間周波数マスクと多chウィーナフィルタによるハイブリッド音源分離アプローチ," 日本音響学会2011年春季研究発表会,2011.
- 中谷, 荒木, 吉岡, 藤本, "音源スペクトルHMMと音源方向モデルの教師無し同時学習に基づく多チャンネル音源分離 ," 日本音響学会2011年春季研究発表会,2011.
- 礒, 荒木, 牧野, 中谷, 澤田, 山田, 中村, "高残響下で混合された音声の音源分離に関する研究," 日本音響学会2011年春季研究発表会,2011.
- 武田, 亀岡,澤田,荒木,山田,牧野, "音源のW-DO性を仮定した多チャンネル複素NMF による劣決定BSS," 日本音響学会2011年春季研究発表会, 1-Q-19, pp. 801-804, Mar. 2011.
- 中谷, 荒木,デルクロア, 吉岡, 藤本, "非定常雑音に頑健な統合的音声認識アプローチ: 音源方向GMMと対数スペクトルGMMに基づく統計モデルベース音声強調," 日本音響学会2011年秋季研究発表会, 2011.
- デルクロア, 木下, 中谷, 荒木, 小川, 堀, 渡部, 藤本, 吉岡, 大庭, 久保, ソウデン, ハム, 中村, "非定常雑音に頑健な統合的音声認識アプローチ:静的・動的モデル適応とシステムコンビネーションに基づく音声強調・認識の統合," 日本音響学会2011年秋季研究発表会, 2011.
- 丸山,荒木,中谷,宮部,山田,牧野,中村,"周波数依存の時間差モデルによる劣決定BSS," 信学技報, vol. 111, no. 306, EA2011-86, pp. 25-30, 2011年11月.
[2012]
- 礒,荒木,牧野,中谷,澤田、山田,宮部,中村,"フルランク空間相関行列モデルに基づく拡散性雑音除去," 電子情報通信学会総合大会, A-10-9, 2012年3月
- M. Souden,S. Araki,K. Kinoshita,T. Nakatani,H. Sawada, "A Multichannel MMSE-Based Approach for Speech Source Separation and Noise Reduction," 日本音響学会2012年春季研究発表会, 1-Q-15, 2012.
- 中谷,吉岡,荒木,デルクロア,藤本 "音声と雑音のメル周波数ケプストラム係数GMM に基づくモノラル/マルチチャンネル雑音抑圧," 日本音響学会2012年春季研究発表会, 1-Q-14, 2012.
- 堀,荒木,小川,ソウデン,デルクロア,吉岡,大庭,藤本,木下,久保,咸,渡部 ,中谷,中村, "会話分析タスクにおける複数人自由会話の遠隔発話音声認識の評価," 日本音響学会2012年春季研究発表会, 3-P-5, 2012.
- 武田,亀岡,澤田,荒木, "混合DOA モデルに基づく多チャンネル複素NMF による劣決定BSS", 日本音響学会2012年春季研究発表会, 2-1-9, 2012.
- 堀、荒木、大塚、中谷、中村、大和, "複数人会話シーン分析の研究と今後の展望",
信学技報, vol. 112, no. 141, SP2012-52, pp. 13-18, 2012年7月. (招待講演)
- 堀、小川、藤本、大庭、久保、ハム、荒木、ソウデン、デルクロア、吉岡、木下、中谷、中村、"会話分析タスクにおける複数人自由会話音声認識の改善," 日本音響学会2012年秋季研究発表会, 2012.
- 丹羽、日岡、荒木、古家、羽田, "最大SN比法への拡散センシングの適用," 日本音響学会2012年秋季研究発表会, 2012
- 澤田、亀岡、上田、荒木,"非負値行列因子分解NMFの多チャンネル拡張," 電子情報通信学会第27回信号処理シンポジウム, B5-4, 2012.
[2013]
- 伊藤信貴, 荒木章子, 中谷智広, “時変混合重みに基づくパーミュテーション問題のないクラスタリングベース音源分離,” 電気情報通信学会技術報告, May 2013.
- 伊藤、荒木、中谷, "時変混合重みに基づくパーミュテーション問題のないクラスタリングベース音源分離," 信学技報, vol. 113, no. 27, EA2013-2, pp. 7-12, 2013年5月.
- 伊藤、J. Ingrid, 荒木、中谷, "音源アクティビティ系列のクラスタリングに基づく高残響・劣決定下音源数推定法," 信学技報, vol. 113, no. 242, EA2013-66, pp. 17-21, 2013年10月.
- 堀、久保、小川、荒木、中村、「会話シーン分析の複数人自由会話音声認識におけるディープラーニングの効果」 音響学会2013年度秋季研究発表会, 2013.
[2014]
- 伊藤信貴,荒木章子,中谷智広,”確率的モデル統合に基づく拡散性雑音と残響の同時ブラインド抑圧,” 日本音響学会2014年春季研究発表会, 2014.
[2015]
- 荒木、林、デルクロア、藤本、武田、中谷、"マルチチャネル特徴を用いた denoising autoencoder による音声強調, " 日本音響学会講演論文集,March 2015.
- 伊藤信貴,荒木章子,中谷智広,“クラスタリングに基づく音源分離と線形予測に基づく残響除去の確率論的モデル統合, ” 日本音響学会講演論文集, Mar. 2015.
- 山本克彦,入野俊夫,荒木章子,木下慶介,中谷智広,“動的圧縮型ガンマチャープフィルタバンクを用いた強調音声の明瞭度予測法の提案,” 日本音響学会講演論文集, Sept., 2015.
- 伊藤信貴,荒木章子,中谷智広,“パーミュテーションフリークラスタリングに基づくマルチチャネル雑音除去 ,” 日本音響学会講演論文集, Sept., 2015.
- K. Yamamoto, T. Irino, S. Araki, K. Kinoshita and T. Nakatani, "Study on predicting speech intelligibility of enhanced speech sounds using the dynamic compressive gammachirp auditory filterbank and modulation filterbank," 日本音響学会聴覚研究会
- 山本、入野、松井、荒木、木下、中谷、“強調音声の明瞭度 −計算機は人の聞こえを予測できる?−,”日本音響学会関西支部 若手研究者交流研究発表会 2015/12/13
[2016]
- 荒木、岡田、樋口、小川、中谷、“時間周波数マスク推定に基づくMVDRビームフォーミングの会議音声認識への適用,”日本音響学会春季研究発表会 2016.
- 伊藤、荒木、中谷、“混合複素ビンガム分布を用いた方向統計量モデルとブラインド拡散性雑音除去,”日本音響学会春季研究発表会 2016.
- 吉岡、デルクロア、小川、Yu、伊藤、木下、藤本、Fabian, Espi, 樋口、荒木、中谷、“NTT CHiME-3 音声認識システム:全体構成とバックエンド,”日本音響学会春季研究発表会 2016.
- 中谷、伊藤、樋口、荒木、吉岡、藤本、木下、“NTT CHiME-3 音声認識システム:耐雑音フロントエンド,”日本音響学会春季研究発表会 2016.
- 川瀬、丹羽、藤本、鎌土、小林、荒木、中谷、“マイクロホンアレーによる実時間雑音PSD推定を用いたモデルベースの音声強調処理技術, ”日本音響学会春季研究発表会 2016.
- 山本、入野、松井、荒木、木下、中谷、“強調音声のための明瞭度予測法の検証:聴取実験結果との比較,”日本音響学会春季研究発表会 2016.
- 山本、入野、松井、荒木、木下、中谷、“動的圧縮型ガンマチャープフィルタバンクを用いた音声明瞭度予測法の改良,”日本音響学会聴覚研究会 2016.
- 村田、亀岡、木下、荒木、中谷、小山、猿渡、“非負値テンソル二重逆畳み込みによる残響環境下の劣決定音源分離,” 日本音響学会春季研究発表会 2016.
- 山本、入野、松井、荒木、木下、中谷, “動的圧縮型ガンマチャープフィルタバンクを用いた音声明瞭度予測法:強調音声を対象とした比較検討,” 音楽情報科学研究会, 2016.
- 荒木, 木下, 伊藤, 小川, デルクロア, 樋口, 吉岡, チャン, 中谷, “雑音のある環境での複数人会話音声認識,”日本音響学会 秋季研究発表会, 2016.【招待講演】
- 山本、入野、松井、荒木、木下、中谷, “音声明瞭度予測法dcGC-sEPSM の諸検討:評価用雑音の特性と予測精度への影響,”日本音響学会 秋季研究発表会, 2016.
- 伊藤、荒木、Fakhri、中谷, “統計的空間辞書を用いた方向統計量モデルに基づく複数人会話における話者識別,”日本音響学会 秋季研究発表会 2016.
[2017]
- 荒木, 小野, 木下, デルクロア, "非同期分散マイクロホンアレイを用いた実環境複数人会話音声認識に関する初期検討," 日本音響学会 秋季研究発表会 2017
- 伊藤, 荒木, デルクロア, 中谷, "統計的空間辞書に基づくオンライン話者識別と適応ビームフォーミングによる複数人会話音声認識のための音声強調," 日本音響学会 秋季研究発表会 2017
- 伊藤, 荒木, 中谷, "混合複素角度中心ガウス分布を用いた方向統計量モデルに基づくブラインド音源分離, "日本音響学会 秋季研究発表会 2017
- 大橋、山本、入野、荒木、木下、中谷, "雑音抑圧で音声は聴き取りやすくなる?−バブルvsピンク、お邪魔対決—,"日本音響学会関西支部 若手研究者交流研究発表会, 2017.
[2018]
- 荒木、小野、木下、デルクロア, "非同期分散マイクロホンアレイを用いた実環境複数人会話音声認識: 音声強調フィルタの逐次修正の効果," 日本音響学会春季研究発表会, 2018.
- 伊藤、荒木、中谷, "FastFCA:空間共分散行列の同時対角化に基づく時変複素ガウス分布を用いた音源分離法の高速化," 日本音響学会春季研究発表会 , 2018.
- 山本、大橋、入野、荒木、木下、中谷, "振幅包絡歪み指標に基づくバブル雑音下の音声明瞭度予測," 日本音響学会春季研究発表会, 2018.
- 伊藤、荒木、中谷, "一般化固有値問題に基づく同時対角化の新しい応用:音源分離の高速化," 日本応用数理学会, 2018.
- 大橋、余村、山本、荒木、木下、中谷、入野,"バブル雑音重畳と強調処理された音声の模擬難聴下における了解度,"信学会 応用音響研究会 2018.
- 山本、入野、荒木、木下、中谷, "複数の雑音条件下における共通パラメータを用いた音声了解度予測," 日本音響学会秋季研究発表会 2018. (to appear)
[2019]
- 荒木、小野、木下、デルクロア, "非同期分散マイクロホンアレイにおける音源の移動に頑健なサンプリング周波数ミスマッチ推定," 日本音響学会春季研究発表会, 2019.
- M. Delcroix, K. Zmolikova, T. Ochiai, K. Kinoshita, S. Araki, T. Nakatani, "Evaluation of SpeakerBeam target speech extraction in real noisy and reverberant conditions," 日本音響学会春季研究発表会, 2019.
- 荒木、小野、木下、デルクロア, "音源移動条件下での非同期分散マイクロホンアレイの同期処理とそれに基づく音源分離," 日本音響学会秋季研究発表会, 2019.
- 新井, 荒木, 小川, 木下, 中谷、山本, 入野 ,"DNN 音声認識システムによる単語了解度
予測," .日本音響学会秋季研究発表会, 2019.
- 木下、von Neumann, Delcroix, 荒木, 中谷, Haeb-Umbach, "オンライン音源分離・音源数推定・ダイアリゼーションの同時最適実現のための深層学習モデル," 日本音響学会秋季研究発表会, 2019.
学位論文
- Convolutive Blind Speech Separation with Independent Component Analysis and Sparse Component Analysis, 北海道大学, 平成19年3月.
解説記事
- 牧野昭二, 荒木章子, 向井良, 澤田宏, ''畳込み混合のブラインド音源分離, '' システム/制御/情報, vol.48, no.10, pp.401-408, 2004.
- 澤田 宏,荒木章子,牧野 昭二, "音源分離技術の最新動向", 電子情報通信学会学会誌, 91(4), 292-296, 2008年4月.
- 伊藤信貴, 荒木章子, 中谷智広, “どんな環境でも聞きたい音を聞き分ける,” 日本音響学会誌, vol. 71, no. 3, pp. 136-142, Mar. 2015.(招待論文)
招待講演
-
荒木, ``残響下でのブラインド音源分離 -マイクロホンアレイ技術との関連とその利用-,''
音響学会関西支部2002年度若手研究者交流研究発表会 招待講演I-1, 2002.
- 三好, 中谷, 向井, 澤田, 引地, 荒木, 木下, ``ブラインド信号処理技術の研究動向,''信学技報, vol. 104, No. 143, EA-2004-21, pp. 23--30, 2004
- 荒木, "変動する環境における音声のブラインド音源分離," 東京工業大学 男女共同参画推進パネルディスカッション, Nov. 2010.
- 澤田宏,荒木章子, "時間周波数マスクによる実環境でのブラインド音源分離", 電子情報通信学会 技術研究報告,vol. 110, no. 331, EA2010-104, pp. 43-48, 2010年12月.
- 荒木章子,藤本雅清,吉岡拓也,堀貴明,中谷智広, "複数人会話シーン分析におけるマイクロホンアレイ音声処理", 電子情報通信学会 技術研究報告,vol. 111, no. 28, pp. 83-88, 2011年5月.
- 荒木章子, "いつ誰が話したか?を即座に分析!−複数人対話のリアルタイムシーン分析ー", 千葉工業大学、第5回CIT音響フォーラム、2011年10月.
- 荒木章子「音声インタフェースを支える音響信号処理技術〜コミュニケーションシーン分析を例題に〜」 MathWorks Day ユーザー講演、2014年7月.
- 荒木章子, 堀 貴明, 中谷智広, "会話シーン分析の複数人自由会話音声認識における音声強調," 信学技報, vol. 114, no. 274, EA2014-25, pp. 9-14, 2014年10月.
その他の文章
- 荒木章子,"私のすすめるこの一冊", 日本音響学会誌, vol. 62, no. 2, p. 142--143, 2006.
- 荒木章子,"NTTにおける男女共同参画への取組み(男女共同参画のページ)", 電子情報通信学会誌, vol.91, no.1, pp.72-73, 2008.
- 荒木章子,"国際会議報告 ICASSP2009", 電子情報通信学会誌, vol.92, no. 12, p. 1040, 2009.
受賞
- 第19回粟屋潔学術奨励賞 (日本音響学会2001年秋季)
- Best Paper Award (International Workshop on Acoustic Echo and Noise Control) (2003.9)
- 電気通信普及財団第19回テレコムシステム技術賞 (2004.3)
- 日本音響学会 第45回佐藤論文賞(2005.3)(共著)
- 平成16年度電子情報通信学会論文賞(2005.5)(共著)
- 電子情報通信学会 平成17年度 学術奨励賞 (2005年電子情報通信学会ソサイエティ大会) (2006.3.25)
- MLSP 2007 Data Analysis Competition Award (2007.8.27)(共著)
- 第3回 日本音響学会独創研究奨励賞 板倉記念(2008.3.18)
- 第4回MVE賞,電子情報通信学会 マルチメディア・仮想環境基礎研究会,(共著)(2008.11.27)
- 平成26年度 科学技術分野の文部科学大臣表彰 若手科学者賞, "音響信号のブラインド音源分離とその応用に関する先駆的研究," (2014.4.15)
- 電気通信普及財団第30回テレコムシステム技術賞 (共著)(2015.3)
- IEEE Best paper award, Apr. 2015 (共著).
学会活動
- ICA2003: Organizing committee member
- IWAENC2003: finance chair
- EUSIPCO2006: Technical Program Committee Member, Special session co-organizer (on Underdetermined Sparse Audio Source Separation)
- WASPAA2007: Registration co-chairs
- ISCAS2008: Special session co-organizer (on Blind Separation and Dereverberation of Speech and Audio Signals)
- SiSEC2008(Signal Separation Evaluation Campaign): Evaluation chairs
- SiSEC2010(Signal Separation Evaluation Campaign): Evaluation chairs
- SiSEC2011(Signal Separation Evaluation Campaign): Evaluation chairs
- 電子情報通信学会 和文論文誌A 編集委員
- IEEE Audio & Acoustic Signal Processing Technical Committee Member, 2014年1月〜2018年12月
- IEEE WIE (Women in Engineering), Kansai Section, Vice Chair, 2014年2月〜2015年12月
- IEEE WIE (Women in Engineering), Kansai Section, Chair Jan. 2016-- Dec. 2017.
- IEEE Signal Processing Society HSCMA (Hands-free Speech Communication and Microphone Arrays) 2017, Technical Program Chair, Mar. 2016 -- Mar. 2017.
- IEEE WASPAA (Workshop on Applications of Signal Processing to Audio and Acoustics) 2017, Far East Liaison, Sept. 2016 -- Nov. 2017.
- IEEE IWAENC (International Workshop on Acoustic Signal Enhancement) 2018, Publications Chair, July 2017 -- Sept. 2018.
- 日本音響学会 理事 (広報電子化担当) 2017年5月〜2021年5月
その他
- 東京大学大学院 情報理工学研究科 非常勤講師 (システム情報工学特論 I) 2004.4.
- Winter School on Neuroinformatics, invited lecturer, Sogang University, Seoul, January 29-30, 2009.
- 同志社大学 理工学部 嘱託講師(特別講義B:音声情報処理技術), 秋学期, 2009-2010.
- 奈良先端科学技術大学院大学 講師(ゼミナールI) 2012年7月
修士(東大安藤研)
: 蝸牛基底膜への高効率伝達理論とその音響センサへの応用
<-目的- 蝸牛構造の理解と蝸牛基底膜を模したセンサの感度向上>
基底膜を模したfishbone型音響センサの
入力インピーダンスを純抵抗にする方策を模索
↓
センサのインピーダンスと空気のインピーダンスをマッチさせる
エクスポネンシャルホーンを作成
↓
信号検出回路を用いて、センサ出力の増加を確認
-修論発表OHP(2MB)
-荒木,日下部,小野,安藤,``蝸牛基底膜を模擬したfishbone音響センサの最適検出機構とその実験'', 第38回計測自動制御学会学術講演会予稿集, 1999 (ps.gz)
-荒木,日下部,小野,安部,安藤,``入力インピーダンスに着目した蝸牛基底膜モデルの解析と応用'',電気学会センサシステム応用研究会資料 pp43-48, 1998 (ps.gz)
(訂正:1頁目最後の式 Y(x,w) → Y(x,w)^{-1})
-安藤,小野,荒木, "蝸牛コルチ器のFM-AM検出モデルとそのセンサ応用," 電気学会センサシステム応用研究会, SSA-98-18, pp.31--36, 東京, 11月, 1998
-安藤,荒木,小野,来海,原田,池内, "全ディジタル型可変周波数特性フィッシュボーン音響センサ," 電気学会センサシステム応用研究会, SSA-00-8, pp.41-46, 東京, 3月, 2000
-S. Ando, S. Araki, N. Ono, A. Kimachi, M. Harada and N. Ikeuchi, "Fishbone Acoustic Sensor with Digital PWM Controlled Frequency Characteristics," Technical Digest of the 17th Sensor Symposium, pp.359--362, Kawasaki, May 2000
学士(東大藤村研)
: 適応的カテゴリー分解に関する研究
<-内容- 多重分光リモートセンシング画像の、各カテゴリーのスペクトルと占有面積率の推定>
-竹内, 荒木, 喜安, 藤村,``適応的カテゴリー分解による画素内混在比の推定'', 第37回計測自動制御学会学術講演会予稿集, Vol.1, pp.191-192 (1998) abstract
-S. Kiyasu, S. Araki , H. Takeuchi and S. Fujimura
``Adaptive Spectral Unmixing for Estimation of Component Proportion'',
Proc. of the 1998 International Symposium on Noise Reduction for
Imaging and Communication Systems (ISNIC'98), pp.239-244 (1998)
back to HOME