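Personal Information
Professor
English name: DOU Weibei
Pinyin name: douweibei
Office: Rohm Building 4-102, Tsinghua University
Contact: Email: douwb@tsinghua.edu.cn; Tel: 010-62781703
Degree: Doctorate
Alma mater: University of Electronic Science and Technology of China (bachelor's), University of Rennes, France (master's), University of Caen, France (doctorate)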
MDCT Sinusoidal Analysis for Audio Signals Analysis and Processing
Impact factor: 1.877
DOI: 10.1109/TASL.2013.2250963
Journal: IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING
Keywords: Frequency estimation; MDCT; pseudo-magnitude; pseudo-phase; window function
Abstract: The Modified Discrete Cosine Transform (MDCT) is widely used in audio signals compression, but mostly limited to representing audio signals. This is because the MDCT is a real transform: Phase information is missing and spectral power varies frame to frame even for pure sine waves. We have a key observation concerning the structure of the MDCT spectrum of a sine wave: Across frames, the complete spectrum changes substantially, but if separated into even and odd subspectra, neither changes except scaling. Inspired by this observation, we find that the MDCT spectrum of a sine wave can be represented as an envelope factor times a phase-modulation factor. The first one is shift-invariant and depends only on the sine wave's amplitude and frequency, thus stays constant over frames. The second one takes one form for all odd bins and another for all even bins, leading to subspectra's constant shapes. But this depends on the start point of a transform frame, therefore, changes at each new frame, and then changes the whole spectrum. We apply this formulation of the MDCT spectral structure to frequency estimation in the MDCT domain, both for pure sine waves and sine waves with noises. Compared to existing methods, ours are more accurate and more general (not limited to the sine window). We also apply the spectral structure to stereo coding. A pure tone or tone-dominant stereo signal may have very different left and right MDCT spectra, but their subspectra have similar shapes. One ratio for even bins and one ratio for odd bins will be enough to reconstruct the right from the left, saving half bitrate. This scheme is simple and at the same time more efficient than the traditional Intensity Stereo (IS).
Co-authors: Weibei Dou, Huazhong Yang (杨华中)
First author: Shuhua Zhang, Yu Gao
Paper type: Journal article
Corresponding author: Shuhua Zhang, Weibei Dou
Volume: 21
Issue: 7
Pages: 1403-1414
ISSN: 1558-7916
Translated work: No
Publication date: 2013-07-01
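
The abstract's key observation (across frames the full MDCT spectrum of a sine wave changes, while the even-bin and odd-bin subspectra change only by a scale factor) can be checked numerically. The following is a minimal sketch and not the authors' code. It assumes a direct O(N^2) MDCT with the standard kernel, a sine window, N = 256 coefficients per frame, a hop of N samples, and an arbitrarily chosen off-bin test frequency. The helper names `mdct` and `cosine_similarity` are hypothetical.

```python
import numpy as np

def mdct(frame, window):
    """Direct O(N^2) MDCT of one 2N-sample frame; returns N coefficients."""
    n_coef = len(frame) // 2
    n = np.arange(2 * n_coef)
    k = np.arange(n_coef)
    # Standard MDCT kernel: cos(pi/N * (n + 1/2 + N/2) * (k + 1/2))
    kernel = np.cos(np.pi / n_coef * np.outer(k + 0.5, n + 0.5 + n_coef / 2))
    return kernel @ (window * frame)

def cosine_similarity(a, b):
    """+1 or -1 means identical shape up to a scale factor."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

N = 256                                   # coefficients per frame (assumed)
omega = np.pi * 40.7 / N                  # off-bin test frequency (assumed)
t = np.arange(3 * N)
x = np.cos(omega * t)                     # pure sine wave, unit amplitude

# Sine window of length 2N (illustrative choice of window)
window = np.sin(np.pi * (np.arange(2 * N) + 0.5) / (2 * N))

# Two consecutive frames with 50% overlap (hop of N samples)
spec0 = mdct(x[0:2 * N], window)
spec1 = mdct(x[N:3 * N], window)

full_sim = cosine_similarity(spec0, spec1)
even_sim = cosine_similarity(spec0[0::2], spec1[0::2])
odd_sim = cosine_similarity(spec0[1::2], spec1[1::2])

print(f"full spectrum similarity: {full_sim:+.4f}")
print(f"even subspectrum similarity: {even_sim:+.6f}")
print(f"odd subspectrum similarity:  {odd_sim:+.6f}")
```

Under these assumptions the even and odd similarities should have magnitude very close to 1, while the full-spectrum similarity should be noticeably smaller in magnitude. The agreement is approximate rather than exact, because a small contribution from the sine wave's negative-frequency image does not follow the same scaling.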