窦维蓓

Personal Information

Professor

English name: DOU Weibei

Pinyin name: douweibei

Office: Room 4-102, Rohm Building, Tsinghua University

Contact: Email: douwb@tsinghua.edu.cn; Tel: 010-62781703

Degree: Doctorate

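Alma maters: University of Electronic Science and Technology of China (bachelor's), University of Rennes, France (master's), University of Caen, France (doctorate)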

Academic Papers


MDCT Sinusoidal Analysis for Audio Signals Analysis and Processing


Impact factor: 1.877

DOI: 10.1109/TASL.2013.2250963

Journal: IEEE Transactions on Audio, Speech, and Language Processing

Keywords: Frequency estimation; MDCT; pseudo-magnitude; pseudo-phase; window function

Abstract: The Modified Discrete Cosine Transform (MDCT) is widely used in audio signal compression, but its use is mostly limited to representing audio signals. This is because the MDCT is a real transform: phase information is missing, and spectral power varies from frame to frame even for pure sine waves. We make a key observation about the structure of the MDCT spectrum of a sine wave: across frames, the complete spectrum changes substantially, but when it is separated into even and odd subspectra, neither changes except for scaling. Inspired by this observation, we find that the MDCT spectrum of a sine wave can be represented as an envelope factor times a phase-modulation factor. The first is shift-invariant and depends only on the sine wave's amplitude and frequency, so it stays constant over frames. The second takes one form for all odd bins and another for all even bins, which gives the subspectra their constant shapes. However, it depends on the start point of a transform frame, so it changes at each new frame and thereby changes the whole spectrum. We apply this formulation of the MDCT spectral structure to frequency estimation in the MDCT domain, both for pure sine waves and for sine waves in noise. Compared with existing methods, ours is more accurate and more general (not limited to the sine window). We also apply the spectral structure to stereo coding. A pure-tone or tone-dominant stereo signal may have very different left and right MDCT spectra, but their subspectra have similar shapes. One ratio for the even bins and one ratio for the odd bins is enough to reconstruct the right channel from the left, saving half the bitrate. This scheme is simple and at the same time more efficient than traditional Intensity Stereo (IS).
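The even/odd subspectrum observation in the abstract can be checked numerically. The following is a minimal sketch, not the authors' code: it assumes the textbook MDCT definition evaluated directly, a sine window, a half-frame length of N = 256, a hop of N samples, and a test tone at 40.8 bin widths. All of these values, and the helper names `mdct_frames` and `abs_cosine`, are illustrative assumptions. The sketch compares the spectral shape of frame 0 with that of later frames using the absolute cosine similarity, which ignores scaling and sign.

```python
import numpy as np

def mdct_frames(x, N, window):
    """MDCT of consecutive 2N-sample frames with hop N (direct definition)."""
    n = np.arange(2 * N)
    k = np.arange(N)
    # Basis cos(pi/N * (n + 1/2 + N/2) * (k + 1/2)), shape (N, 2N)
    basis = np.cos(np.pi / N * np.outer(k + 0.5, n + 0.5 + N / 2))
    num_frames = len(x) // N - 1
    frames = np.stack([x[f * N : f * N + 2 * N] for f in range(num_frames)])
    return (frames * window) @ basis.T  # shape (num_frames, N)

def abs_cosine(a, b):
    """Absolute cosine similarity: 1.0 means equal shape up to a scale factor."""
    return abs(np.dot(a, b)) / (np.linalg.norm(a) * np.linalg.norm(b))

N = 256                                    # assumed half-frame length
n = np.arange(2 * N)
window = np.sin(np.pi * (n + 0.5) / (2 * N))  # sine window (assumed choice)
omega = np.pi * 40.8 / N                   # assumed tone frequency, between bins
x = np.cos(omega * np.arange(8 * N))       # pure sine wave, 8N samples

X = mdct_frames(x, N, window)              # 7 frames of N coefficients each

# Compare frame 0 against every later frame.
later = range(1, len(X))
whole_sim = min(abs_cosine(X[0], X[f]) for f in later)
even_sim = min(abs_cosine(X[0, 0::2], X[f, 0::2]) for f in later)
odd_sim = min(abs_cosine(X[0, 1::2], X[f, 1::2]) for f in later)

print(f"whole spectrum, worst-case similarity: {whole_sim:.4f}")
print(f"even bins,      worst-case similarity: {even_sim:.6f}")
print(f"odd bins,       worst-case similarity: {odd_sim:.6f}")
```

Under these assumptions the even-bin and odd-bin similarities stay very close to 1 across frames while the whole-spectrum similarity drops well below it. The match is approximate rather than exact because the negative-frequency image of the tone leaks a small amount into every bin.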

Co-authors: Weibei Dou, Huazhong Yang (杨华中)

First author: Shuhua Zhang, Yu Gao

Paper type: Journal article

Corresponding authors: Shuhua Zhang, Weibei Dou

Volume: 21

Issue: 7

Pages: 1403-1414

ISSN: 1558-7916


Publication date: 2013-07-01