Non-negative temporal decomposition (NTD)

Hiroya, S., "Non-negative temporal decomposition of speech parameters," in Proc. ICASSP, pp. 5066-5069, March 2010.

We present a non-negative temporal decomposition method for line spectrum pair and articulatory parameters. Based on multiplicative update rules derived from a non-negative matrix factorization algorithm, these parameters are decomposed into a set of temporally overlapping event functions, restricted to the range [0,1], and corresponding event vectors. With the proposed method, the RMS error between the measured and estimated articulatory parameters is 0.16 mm, and the spectral distance between the measured and estimated line spectrum pair parameters is 1.97 dB. These results also show that the estimation errors of the proposed method are significantly smaller than those of the conventional method. This technique will be useful for many applications, such as speech coding and speech modification.
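The kind of factorization described above can be illustrated with a minimal sketch. It is not the algorithm from the paper: it uses the standard Euclidean multiplicative updates for non-negative matrix factorization and assumes a non-negative parameter matrix Y (one row per parameter, one column per frame). The names nmf_events and frobenius_error, and the per-event rescaling used to keep event functions in [0,1], are assumptions made for illustration only.

```python
import random


def matmul(a, b):
    """Plain-Python matrix product of two lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(m):
    return [list(col) for col in zip(*m)]


def nmf_events(Y, n_events, n_iter=200, eps=1e-9, seed=0):
    """Approximate Y (P x T, non-negative) as A (P x K) times Phi (K x T).

    Columns of A play the role of event vectors, rows of Phi the role of
    event functions. Hypothetical helper, not the paper's method.
    """
    rng = random.Random(seed)
    P, T = len(Y), len(Y[0])
    A = [[rng.random() + 0.1 for _ in range(n_events)] for _ in range(P)]
    Phi = [[rng.random() + 0.1 for _ in range(T)] for _ in range(n_events)]
    for _ in range(n_iter):
        # Multiplicative update of the event functions with A held fixed.
        At = transpose(A)
        num = matmul(At, Y)
        den = matmul(matmul(At, A), Phi)
        Phi = [[Phi[k][t] * num[k][t] / (den[k][t] + eps) for t in range(T)]
               for k in range(n_events)]
        # Multiplicative update of the event vectors with Phi held fixed.
        Pt = transpose(Phi)
        num = matmul(Y, Pt)
        den = matmul(A, matmul(Phi, Pt))
        A = [[A[p][k] * num[p][k] / (den[p][k] + eps) for k in range(n_events)]
             for p in range(P)]
        # Assumed constraint handling: divide each event function by its peak
        # and move that scale into the matching event vector, so the product
        # A * Phi is unchanged while Phi stays inside [0, 1].
        for k in range(n_events):
            peak = max(Phi[k])
            if peak > 0.0:
                Phi[k] = [v / peak for v in Phi[k]]
                for p in range(P):
                    A[p][k] *= peak
    return A, Phi


def frobenius_error(Y, A, Phi):
    """Frobenius norm of the reconstruction residual Y - A * Phi."""
    R = matmul(A, Phi)
    return sum((y - r) ** 2 for yr, rr in zip(Y, R) for y, r in zip(yr, rr)) ** 0.5


if __name__ == "__main__":
    # Toy data: 3 parameters observed over 4 frames (values are made up).
    Y = [[1.0, 2.0, 3.0, 2.0], [2.0, 1.0, 0.5, 1.0], [0.5, 1.5, 2.5, 1.5]]
    A, Phi = nmf_events(Y, n_events=2)
    print("residual:", frobenius_error(Y, A, Phi))
```

Rescaling after each iteration is only one simple way to respect the [0,1] range; the paper may enforce the restriction differently, and parameters that take negative values would need an offset or another treatment before a sketch like this applies.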

IEEE Copyright Notice

©2010 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.
