Yasunori Ohishi

Senior Manager

NTT Communication Science Laboratories

Biography

Yasunori Ohishi is a senior manager and head of human resources at NTT Communication Science Laboratories. His research interests include acoustic signal processing, multimedia content analysis, and music information retrieval.

Interests
  • Acoustic signal processing
  • Multimedia content analysis
  • Music information retrieval
Education
  • PhD in Information Science, 2009

    Nagoya University

  • MSc in Information Science, 2006

    Nagoya University

  • BEng in Electrical and Electronic Engineering and Information Engineering, 2004

    Nagoya University

Skills

  • Acoustic signal processing: Expert
  • Multimedia content analysis: Expert
  • Music information retrieval: Expert
  • Machine learning: Advanced
  • Bayesian statistics and modeling: Advanced
  • Python: Advanced

Recent Publications

(2023). First-shot anomaly sound detection for machine condition monitoring: A domain generalization baseline. In EUSIPCO.

(2023). Masked Modeling Duo for Speech: Specializing General-Purpose Audio Representation to Speech using Denoising Distillation. In Interspeech.

(2023). Investigating the impact of patching methods on the use of transformer-based image classification models for audio classification. In the 2023 Spring Meeting of the Acoustical Society of Japan.

Projects

ConceptBeam
Target speech extraction based on “concept” or semantic information.
The Places Japanese audio caption corpus
Japanese spoken captions for the Places205 image dataset

Experience

Senior Manager
Nov 2022 – Present Keihanna, Kyoto, Japan
Head of human resources at NTT Communication Science Laboratories.

Senior Research Scientist
Jun 2021 – Oct 2022 Atsugi, Kanagawa, Japan
Led basic research on acoustic signal processing, crossmodal semantic learning, audio captioning, and sound event localization and detection. Presented at major international conferences such as ICASSP, Interspeech, and DCASE.

Assistant Manager
Apr 2019 – May 2021 Atsugi, Kanagawa, Japan
Administrative and clerical support for researchers, and budget management, in the Media Information Laboratory and Moriya Research Laboratory.

Senior Research Scientist
Oct 2017 – Mar 2019 Atsugi, Kanagawa, Japan
Led basic research on crossmodal semantic learning and multimedia event detection.

Deputy Manager
Oct 2014 – Sep 2017 Toyosu, Tokyo, Japan
Project leader for robust media search services, ‘MediaSearch+’.

Research Scientist
Apr 2009 – Sep 2014 Atsugi, Kanagawa, Japan
Led basic research on singing voice information processing and sound event detection. Presented at major international conferences such as ICASSP and Interspeech.

PhD Student
Apr 2006 – Mar 2009 Nagoya, Aichi, Japan
Research on an analysis-synthesis model of the singing voice that characterizes varied singing behaviors, and on its practical applications.

MSc Student
Apr 2004 – Mar 2006 Nagoya, Aichi, Japan
Research on discrimination between singing and speaking voices.

BEng Student
Apr 2003 – Mar 2004 Nagoya, Aichi, Japan
Research on statistical analysis of word hierarchy using an encyclopedic corpus.

Contact