High quality time-scale modification of speech using a peak alignment overlap-add algorithm (PAOLA)


Dorran, David and Lawlor, Bob and Coyle, Eugene (2003) High quality time-scale modification of speech using a peak alignment overlap-add algorithm (PAOLA). In: 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). IEEE, pp. 700-703. ISBN 0780376633

[img]
Preview
Download (272kB) | Preview



Add this article to your Mendeley library


Abstract

The duration of a speech passage can be altered using audio time-scale modification techniques. Time-scale modification can be achieved in the time domain by segmenting the input signal into overlapping frames and recombining the frames with an overlap differing from the analysis overlap. We present a time-scale modification algorithm that uses a simple peak alignment technique to synchronize overlapping synthesis frames. The peak alignment overlap-add (PAOLA) algorithm also takes advantage of waveform properties to ensure a high quality output for the minimum number of iterations. The new algorithm produces a time-scaled output of approximately equal quality to that of an adaptive implementation of the commercially popular synchronised overlap-add (SOLA) algorithm, but offers a computational saving ranging from a factor of 15 (for a time-scale factor of 0.5) to 170 (for a time-scale factor of 1.1).

Item Type: Book Section
Keywords: speech processing; time-domain analysis; synchronisation; speech synthesis; audio signal processing;
Academic Unit: Faculty of Science and Engineering > Electronic Engineering
Item ID: 8791
Identification Number: 10.1109/ICASSP.2003.1198877
Depositing User: Robert Lawlor
Date Deposited: 11 Sep 2017 15:46
Publisher: IEEE
Refereed: Yes
URI:

Repository Staff Only(login required)

View Item Item control page

Document Downloads

More statistics for this item...