Voice Conversion Approach through Feature Statistical Mapping

Over the past few decades the field of speech processing has undergone tremendous changes and grown to be important both theoretically and technologically. Great advances have already been made in a broad range of applications such as speech analysis and synthesis techniques, voice recognition, text...

Full description

Saved in:
Bibliographic Details
Main Author: Nasr, Abdulbaset M.
Format: Thesis
Language:English
English
Published: 2001
Subjects:
Online Access:http://psasir.upm.edu.my/id/eprint/11181/1/FK_2001_63.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
id my-upm-ir.11181
record_format uketd_dc
spelling my-upm-ir.111812024-05-31T08:48:31Z Voice Conversion Approach through Feature Statistical Mapping 2001-01 Nasr, Abdulbaset M. Over the past few decades the field of speech processing has undergone tremendous changes and grown to be important both theoretically and technologically. Great advances have already been made in a broad range of applications such as speech analysis and synthesis techniques, voice recognition, text to speech conversion and speech coding techniques to name a few. On the process of development of these applications, voice conversion (VC) technique has recently emerged as a new branch of speech synthesis dealing with the speaker identity. The basic idea behind VC is to modify one person's speech so that it is recognized as being uttered by another person. There are numerous applications of voice conversion technique. Examples include the personalization of text to speech (TTS) systems to reduce the need for a large speech database. It could also be used in the entertainment industry. VC technology could be used to dub movies more effectively by allowing the dubbing actor to speak with the voice of the original actor but in a different language. Voice conversion can also be used in the language translation applications to create the identity of a foreign speaker. This project proposes a simple parametric approach to VC through the use of the well-known speech analysis technique namely Linear Prediction (LP). LP is used as analysis tool to extract the most important acoustic parameters of a person's speech signal. These parameters are the pitch period, LP coefficients, the voicing decision and the speech signal energy. Then, the features of the source speaker are mapped to match those of the target speaker through the use of statistical mapping technique. To illustrate the feasibility of the proposed approach. a simple to use voice conversion software was developed. The program code was written in C++ and implemented using Microsoft Foundation C lass (MFC). The proposed scheme to the problem has shown satisfactory results, where the synthesized speech signal has come as c lose as possible to match that of a target speaker. 2001-01 Thesis http://psasir.upm.edu.my/id/eprint/11181/ http://psasir.upm.edu.my/id/eprint/11181/1/FK_2001_63.pdf text en public masters Universiti Putra Malaysia Faculty of Engineering Hassan, Md. Mahmud English
institution Universiti Putra Malaysia
collection PSAS Institutional Repository
language English
English
advisor Hassan, Md. Mahmud
topic


spellingShingle


Nasr, Abdulbaset M.
Voice Conversion Approach through Feature Statistical Mapping
description Over the past few decades the field of speech processing has undergone tremendous changes and grown to be important both theoretically and technologically. Great advances have already been made in a broad range of applications such as speech analysis and synthesis techniques, voice recognition, text to speech conversion and speech coding techniques to name a few. On the process of development of these applications, voice conversion (VC) technique has recently emerged as a new branch of speech synthesis dealing with the speaker identity. The basic idea behind VC is to modify one person's speech so that it is recognized as being uttered by another person. There are numerous applications of voice conversion technique. Examples include the personalization of text to speech (TTS) systems to reduce the need for a large speech database. It could also be used in the entertainment industry. VC technology could be used to dub movies more effectively by allowing the dubbing actor to speak with the voice of the original actor but in a different language. Voice conversion can also be used in the language translation applications to create the identity of a foreign speaker. This project proposes a simple parametric approach to VC through the use of the well-known speech analysis technique namely Linear Prediction (LP). LP is used as analysis tool to extract the most important acoustic parameters of a person's speech signal. These parameters are the pitch period, LP coefficients, the voicing decision and the speech signal energy. Then, the features of the source speaker are mapped to match those of the target speaker through the use of statistical mapping technique. To illustrate the feasibility of the proposed approach. a simple to use voice conversion software was developed. The program code was written in C++ and implemented using Microsoft Foundation C lass (MFC). The proposed scheme to the problem has shown satisfactory results, where the synthesized speech signal has come as c lose as possible to match that of a target speaker.
format Thesis
qualification_level Master's degree
author Nasr, Abdulbaset M.
author_facet Nasr, Abdulbaset M.
author_sort Nasr, Abdulbaset M.
title Voice Conversion Approach through Feature Statistical Mapping
title_short Voice Conversion Approach through Feature Statistical Mapping
title_full Voice Conversion Approach through Feature Statistical Mapping
title_fullStr Voice Conversion Approach through Feature Statistical Mapping
title_full_unstemmed Voice Conversion Approach through Feature Statistical Mapping
title_sort voice conversion approach through feature statistical mapping
granting_institution Universiti Putra Malaysia
granting_department Faculty of Engineering
publishDate 2001
url http://psasir.upm.edu.my/id/eprint/11181/1/FK_2001_63.pdf
_version_ 1804888621361135616