Yasunori Ohishi

Yasunori Ohishi

Senior Research Scientist, Supervisor

NTT Communication Science Laboratories

Biography

Yasunori Ohishi is a Senior Research Scientist, Supervisor, Group Leader of Computational Modeling Research Group in NTT Communication Science Laboratories. His research interests include acoustic signal processing, multimedia content analysis, and music information retrieval.

Interests
  • Acoustic signal processing
  • Multimedia content analysis
  • Music information retrieval
Education
  • PhD in Information Science, 2009

    Nagoya University

  • MSc in Information Science, 2006

    Nagoya University

  • BEng in Electrical and Electronic Engineering and Information Engineering, 2004

    Nagoya University

Skills

Acoustic signal processing

Expert

Multimedia content analysis

Expert

Music information retrieval

Expert

Machine learning

Advanced

Bayesian statistics and modeling

Advanced

Python

Advanced

Recent Publications

(2024). M2D-CLAP: Masked Modeling Duo Meets CLAP for Learning General-purpose Audio-Language Representation. In Interspeech.

Cite Code arXiv

(2024). Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval. In EUSIPCO.

Cite arXiv

(2024). Exploring Pre-trained General-purpose Audio Representations for Heart Murmur Detection. In EMBC.

Cite Code arXiv

Projects

ConceptBeam
Target speech extraction based on “concept” or semantic information.
The Places Japanese audio caption corpus
Japanese spoken captions for the Places205 image dataset

Experience

 
 
 
 
 
Senior Research Scientist, Supervisor
Nov 2024 – Present Atsugi, Kanagawa, Japan
 
 
 
 
 
Senior Manager
Nov 2022 – Oct 2024 Keihanna, Kyoto, Japan
Head of Human Resources in NTT Communication Science Laboratories.
 
 
 
 
 
Senior Research Scientist
Jun 2021 – Oct 2022 Atsugi, Kanagawa, Japan
Leading basic researches on acoustic signal processing, crossmodal semantic learning, audio captioning, and sound event localization and detection. Presented in major international conferences such as ICASSP, Interspeech, and DCASE.
 
 
 
 
 
Assistant Manager
Apr 2019 – May 2021 Atsugi, Kanagawa, Japan
Administrative and clerical supports for researchers and budget managements in Media Information Laboratory and Moriya Research Laboratory.
 
 
 
 
 
Senior Research Scientist
Oct 2017 – Mar 2019 Atsugi, Kanagawa, Japan
Leading basic researches on crossmodal semantic learning and multimedia event detection.
 
 
 
 
 
Deputy Manager
Oct 2014 – Sep 2017 Toyosu, Tokyo, Japan
Project leader for robust media search services, ‘MediaSearch+
 
 
 
 
 
Research Scientist
Apr 2009 – Sep 2014 Atsugi, Kanagawa, Japan
Leading basic researches on singing voice information processing and sound event detection. Presented in major international conferences such as ICASSP and Interspeech.
 
 
 
 
 
PhD Student
Apr 2006 – Mar 2009 Nagoya, Aichi, Japan
Research for analysis-synthesis model of singing voice that characterize varied singing behaviors and its practical applications.
 
 
 
 
 
MSc Student
Apr 2004 – Mar 2006 Nagoya, Aichi, Japan
Research for discrimination between singing and speaking voices.
 
 
 
 
 
BEng Student
Apr 2003 – Mar 2004 Nagoya, Aichi, Japan
Research for statistical analysis for the word hierarchy using an encyclopedic corpus.

Contact