Overview
We aim to develop a system that can give feedback or instructions to a person who wishes to better do something or do new things. Unlike the existing personal assistance methods based on manually defined rules, our goal is to develop a system with the following advantages: (1) individuality, meaning that the system output must be suitable for individual persons; (2) concreteness, so that the instructions are concrete enough for users to easily understand; and (3) automaticity, so that the system performs the above process automatically. To this end, we propose several kinds of learning-based (specifically, deep learning-based) approaches. We believe that these approaches will also lead to a generic media generation technique that will meet a variety of demands in the near future.
[Paper]
[Abstract]
[Poster]
[Paper (Japanese)]
[Abstract (Japanese)]
[Poster (Japanese)]
Publications
Generative Controller
Takuhiro Kaneko, Kaoru Hiramatsu, Kunio Kashino
CVPR 2017
[Paper] [Supplemental] [Project]
Feedback Generation

Takuhiro Kaneko, Kaoru Hiramatsu, Kunio Kashino
ACMMM 2016
[Paper]
Realistic Speech Synthesis & Voice Conversion
Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, and Nobukatsu Hojo
arXiv:2008.12604, Aug. 2020
[Paper]
Hirokazu Kameoka, Wen-Chin Huang, Kou Tanaka, Takuhiro Kaneko, Nobukatsu Hojo, Tomoki Toda
arXiv:2005.08445, May 2020
[Paper]
Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko, Nobukatsu Hojo
IEEE/ACM Trans. Audio Speech Lang. Process. (arXiv:1811.01609, Nov. 2018)
[Paper] [IEEE Xplore] [Project]
(Alternative title: ACVAE-VC: Non-parallel Many-to-Many Voice Conversion with Auxiliary Classifier Variational Autoencoder) Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo
IEEE/ACM Trans. Audio Speech Lang. Process. 27(9), Sept. 2019 (arXiv:1808.05092, Aug. 2018)
[Paper] [IEEE Xplore] [Project]
Nobukatsu Hojo, Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko
EUSIPCO 2018
[Paper]
Keisuke Oyamada, Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo, Hiroyasu Ando
EUSIPCO 2018 (arXiv:1804.02181, Apr. 2018)
[Paper]
Keisuke Oyamada, Hirokazu Kameoka, Takuhiro Kaneko, Hiroyasu Ando, Kaoru Hiramatsu, Kunio Kashino
APSIPA ASC 2017
[Paper]

Takuhiro Kaneko, Hirokazu Kameoka, Kaoru Hiramatsu, Kunio Kashino
Interspeech 2017
[Paper]

Takuhiro Kaneko, Hirokazu Kameoka, Nobukatsu Hojo, Yusuke Ijima, Kaoru Hiramatsu, Kunio Kashino
ICASSP 2017
[Paper]
Crossmodal
Review Paper
[Invited Review] Generative Adversarial Networks: Foundations and Applications
Takuhiro Kaneko
Acoustical Science and Technology 39(3), May 2018
[Paper]
[Paper (Japanese)]
Generative Personal Assistance with Audio and Visual Examples
Takuhiro Kaneko, Kaoru Hiramatsu, Kunio Kashino
NTT Technical Review 15(11), Nov. 2017
[Paper]
[Paper (Japanese)]
Talks & Exhibitions
[Tutorial] Foundations, Advances, and Applications of Generative Adversarial Networks
New!
Takuhiro Kaneko
JSAI 2020
(in Japanese)
[Program (Japanese)]
[Slides (Japanese)]
[Invited Talk] Generative Adversarial Image Synthesis with Decision Tree Latent Controller (CVPR 2018)
Takuhiro Kaneko, Kaoru Hiramatsu, Kunio Kashino
FIT 2019
(in Japanese)
[Program (Japanese)]
[Tutorial] Foundations, Advances, and Applications of Generative Adversarial Networks
Takuhiro Kaneko
MIRU 2019
(in Japanese)
[Abstract (Japanese)]
[Slides (Japanese)]
[Invited Talk] Foundations, Advances, and Applications of Generative Adversarial Networks: From Image Generation to Speech Synthesis and Voice Conversion
Takuhiro Kaneko
75th JSAI Seminar
(in Japanese)
[Abstract (Japanese)]
[Invited Talk] Generative Adversarial Networks: Foundations and Applications
Takuhiro Kaneko
JAMIT 2018
(in Japanese)
[Abstract (Japanese)]
Creating Favorite Images with Selective Decisions: Hierarchical Image Analysis and Synthesis with DTLC-GAN
Takuhiro Kaneko
NTT Communication Science Laboratories Open House 2018
[Poster]
[Poster (Japanese)]
Free-Feature-Point Image Generation: Interactive and Flexible Image Generation with Deep Learning
Takuhiro Kaneko
NTT R&D Forum 2018
[Poster]
[Poster (Japanese)]
[Invited Talk] Generative Attribute Controller with Conditional Filtered Generative Adversarial Networks (CVPR 2017)
Takuhiro Kaneko, Kaoru Hiramatsu, Kunio Kashino
MIRU 2017
(in Japanese)
[Program (Japanese)]
Generative Personal Assistance with Audio and Visual Examples: Deep Learning Opens the Way to Innovative Media Generation
Takuhiro Kaneko
NTT Communication Science Laboratories Open House 2017
[Paper]
[Abstract]
[Poster]
[Paper (Japanese)]
[Abstract (Japanese)]
[Poster (Japanese)]
Contact
Takuhiro Kaneko
NTT Communication Science Laboratories, NTT Corporation
takuhiro.kaneko.tb at hco.ntt.co.jp