TY - GEN
T1 - Tempo modification of music signal using sinusoidal model and LPC-based residue model
AU - Ito, Akinori
AU - Igarashi, Yuki
AU - Ito, Masashi
AU - Nose, Takashi
PY - 2014/1/1
Y1 - 2014/1/1
N2 - Changing tempo of the music signal is one of the most basic signal processing applied to music signals. Traditional algorithms such as phase vocoder and PSOLA uniformly stretch and shrink the input signal. Therefore, those methods change not only the tempo but also the structure of the signal of the instrumental sound, such as attack and decay time, which changes the timbre of the instruments. To change the tempo of the music signal while keeping the Attack-Decay-Sustain-Release structure of the instruments, we need a non-linear modification of the time scale. To realize this, we propose a two-stage modeling of the music signal. The first stage models the music signal using the sinusoidal model that expresses the harmonic part of the signal using sum of sinusoids with temporally-variable amplitude and frequency. Because non-harmonic component of the signal cannot be modeled using the sinusoidal model, the residue of the sinusoidal model is analyzed using the linear-prediction coding (LPC) in the second stage, which expresses the reverberation of the impulsive sound. Then the residue of the LPC analysis is stretched or shrunk non-linearly according to the short-term power, where only the parts with small power are modified because the parts with larger power correspond to the attack parts. Finally, the modified residue is used to synthesis the modified signal using the LPC synthesis filter and sinusoidal synthesizer. We conducted experiments to modify the tempo of piano sounds, and compared the result with the conventional time stretch methods.
AB - Changing tempo of the music signal is one of the most basic signal processing applied to music signals. Traditional algorithms such as phase vocoder and PSOLA uniformly stretch and shrink the input signal. Therefore, those methods change not only the tempo but also the structure of the signal of the instrumental sound, such as attack and decay time, which changes the timbre of the instruments. To change the tempo of the music signal while keeping the Attack-Decay-Sustain-Release structure of the instruments, we need a non-linear modification of the time scale. To realize this, we propose a two-stage modeling of the music signal. The first stage models the music signal using the sinusoidal model that expresses the harmonic part of the signal using sum of sinusoids with temporally-variable amplitude and frequency. Because non-harmonic component of the signal cannot be modeled using the sinusoidal model, the residue of the sinusoidal model is analyzed using the linear-prediction coding (LPC) in the second stage, which expresses the reverberation of the impulsive sound. Then the residue of the LPC analysis is stretched or shrunk non-linearly according to the short-term power, where only the parts with small power are modified because the parts with larger power correspond to the attack parts. Finally, the modified residue is used to synthesis the modified signal using the LPC synthesis filter and sinusoidal synthesizer. We conducted experiments to modify the tempo of piano sounds, and compared the result with the conventional time stretch methods.
UR - http://www.scopus.com/inward/record.url?scp=84922662100&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84922662100&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84922662100
T3 - 21st International Congress on Sound and Vibration 2014, ICSV 2014
SP - 928
EP - 935
BT - 21st International Congress on Sound and Vibration 2014, ICSV 2014
PB - International Institute of Acoustics and Vibrations
T2 - 21st International Congress on Sound and Vibration 2014, ICSV 2014
Y2 - 13 July 2014 through 17 July 2014
ER -