WORLD

Publications

When citing the latest version of WORLD in your paper, please use the phrase "WORLD [1] (D4C edition [2])" and cite the following references.
[1] M. Morise, F. Yokomori, and K. Ozawa, ``WORLD: a vocoder-based high-quality speech synthesis system for real-time applications,'' IEICE Transactions on Information and Systems, vol. E99-D, no. 7, pp. 1877-1884, 2016.
[2] M. Morise, ``D4C, a band-aperiodicity estimator for high-quality speech synthesis,'' Speech Communication, vol. 84, pp. 57-65, Nov. 2016.

Concept of WORLD

  • M. Morise, F. Yokomori, and K. Ozawa, ``WORLD: a vocoder-based high-quality speech synthesis system for real-time applications,'' IEICE Transactions on Information and Systems, vol. E99-D, no. 7, pp. 1877-1884, 2016. (Note: This paper describes version 0.1.4, which includes DIO, CheapTrick, and PLATINUM.)
  • M. Morise, ``An attempt to develop a singing synthesizer by collaborative creation,'' Proc. of the Stockholm Music Acoustics Conference 2013 (SMAC2013), pp. 287-292, Stockholm, July 30 - Aug. 3, 2013.
  • M. Morise, T. Nishiura, and H. Kawahara, ``Proposal of WORLD, a high-quality voice analysis, manipulation and synthesis system and its evaluation,'' ASJ technical report, vol. 41, no. 7, pp. 555-560, Toyama, Oct. 1-2, 2011. (in Japanese)

Fundamental frequency estimation method: DIO

  • M. Morise, H. Kawahara and T. Nishiura, ``Rapid F0 Estimation for High-SNR Speech Based on Fundamental Component Extraction,'' Trans. IEICE, vol. J93-D, no. 2, pp. 109-117, Feb. 2010. (in Japanese)
  • M. Morise, H. Kawahara and H. Katayose, ``Fast and reliable F0 estimation method based on the period extraction of vocal fold vibration of singing voice and speech,'' AES 35th International Conference, CD-ROM, London, UK, Feb. 11-13, 2009.

Spectral envelope estimation method: CheapTrick (Version 0.1.4 and later)

  • M. Morise, ``CheapTrick, a spectral envelope estimator for high-quality speech synthesis,'' Speech Communication, vol. 67, pp. 1-7, March 2015.
  • M. Morise, ``Error evaluation of an F0-adaptive spectral envelope estimator in robustness against the additive noise and F0 error,'' IEICE Transactions on Information and Systems, vol. E98-D, no. 7, pp. 1405-1408, July 2015.

Spectral envelope estimation method: STAR (Until Version 0.1.3)

  • M. Morise and Y. Yamashita, ``A method to estimate a temporally stable spectral envelope for periodic signals,'' Proc. ICA2013, 1aSCb, 6 pages, Montreal, Canada, June 2-7, 2013.
  • M. Morise, T. Matsubara, K. Nakano, and T. Nishiura, ``A rapid spectrum envelope estimation technique of vowel for high-quality speech synthesis,'' Trans. IEICE, vol. J94-D, no. 7, pp. 1079-1087, July 2011. (in Japanese)

Aperiodicity estimation method: D4C (Version 0.2.0 and later)

  • M. Morise, ``D4C, a band-aperiodicity estimator for high-quality speech synthesis,'' Speech Communication, vol. 84, pp. 57-65, Nov. 2016.
  • M. Morise, ``A band-aperiodicity estimator and its error evaluation,'' IEICE Technical report, vol. 115, no. 99, pp. 13-18, Niigata, June 18-19, 2015. (in Japanese)

Excitation signal extraction method: PLATINUM (Until Version 0.1.4)

  • M. Morise, ``PLATINUM: A method to extract excitation signals for voice synthesis system,'' Acoust. Sci. & Tech., vol. 33, no. 2, pp. 123-125, March 2012.