• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索
High Impact Results & Cited Count Trend for Year Keyword Cloud and Partner Relationship

Query:

学者姓名:鲍长春

Refining:

Source

Submit Unfold

Co-Author

Submit Unfold

Clean All

Sort by:
Default
  • Default
  • Title
  • Year
  • WOS Cited Count
  • Impact factor
  • Ascending
  • Descending
< Page ,Total 48 >
Speech Enhancement With Robust Beamforming for Spatially Overlapped and Distributed Sources SCIE
期刊论文 | 2022 , 30 , 2778-2790 | IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING
Abstract&Keyword Cite

Abstract :

Most of the existing Beamforming methods are based on the assumptions that the sources are all point sources and the angular separation between the direction of arrival (DOA) of the source and the interference is large enough to assure good performance. In this paper, we consider a tough scenario where the target source and the interference are simultaneously spatially distributed and overlapped. To improve the performance of Beamforming in this scenario, we propose two approaches: the first approach exploits the non-Gaussianity as well as the spectrogram sparsity of the output of the microphone array; the second approach exploits the generalized sparsity with overlapped groups of the Beampattern. The proposed criteria are solved by methods based on linearized preconditioned alternating direction method of multipliers (LPADMM) with high accuracy and high computational efficiency. Numerical simulations and real data experiments show the advantages of the proposed approaches compared to previously proposed Beamforming methods for signal enhancement.

Keyword :

Speech enhancement Speech enhancement microphone array microphone array DOA DOA Correlation Correlation speech enhancement speech enhancement Interference Interference Arrays Arrays Array signal processing Array signal processing Signal to noise ratio Signal to noise ratio MVDR MVDR Microphone arrays Microphone arrays

Cite:

Copy from the list or Export to your reference management。

GB/T 7714 Xiong Wenmeng , Bao Changchun , Jia Maoshen et al. Speech Enhancement With Robust Beamforming for Spatially Overlapped and Distributed Sources [J]. | IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING , 2022 , 30 : 2778-2790 .
MLA Xiong Wenmeng et al. "Speech Enhancement With Robust Beamforming for Spatially Overlapped and Distributed Sources" . | IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING 30 (2022) : 2778-2790 .
APA Xiong Wenmeng , Bao Changchun , Jia Maoshen , Picheral, Jose . Speech Enhancement With Robust Beamforming for Spatially Overlapped and Distributed Sources . | IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING , 2022 , 30 , 2778-2790 .
Export to NoteExpress RIS BibTex
基于广义合成分析和深度神经网络的自回归系数估计方法 CQVIP
期刊论文 | 2021 , 49 (1) , 29-39 | 崔子豪
Abstract&Keyword Cite

Abstract :

基于广义合成分析和深度神经网络的自回归系数估计方法

Keyword :

广义合成分析 广义合成分析 莱文逊-杜宾迭代解 莱文逊-杜宾迭代解 深度神经网络 深度神经网络 AR系数 AR系数

Cite:

Copy from the list or Export to your reference management。

GB/T 7714 崔子豪 , 鲍长春 , 电子学报 . 基于广义合成分析和深度神经网络的自回归系数估计方法 [J]. | 崔子豪 , 2021 , 49 (1) : 29-39 .
MLA 崔子豪 et al. "基于广义合成分析和深度神经网络的自回归系数估计方法" . | 崔子豪 49 . 1 (2021) : 29-39 .
APA 崔子豪 , 鲍长春 , 电子学报 . 基于广义合成分析和深度神经网络的自回归系数估计方法 . | 崔子豪 , 2021 , 49 (1) , 29-39 .
Export to NoteExpress RIS BibTex
Power Exponent Based Weighting Criterion for DNN-Based Mask Approximation in Speech Enhancement EI
期刊论文 | 2021 , 28 , 618-622 | IEEE Signal Processing Letters
Abstract&Keyword Cite

Abstract :

In this letter, a novel weighted mean square error (WMSE) is proposed to improve the DNN-based mask approximation method for speech enhancement, in which the weighting is closely related to the power exponent about noisy spectrum amplitude (NSA) base. The power exponents 0 and 2 separately reflect ideal amplitude masking (IAM) without any clippings and the indirect mapping (IM) on short-time spectral amplitude (STSA), and it is highly related to the enhanced spectrum and the performance of the enhanced signal based on the tests. Also, the experimental results show that the outstanding weighting is the noisy spectrum base with the power exponent 1 for the phase-unaware masking and results in better harmonic structure restoration. The objective function with the WMSE on the NSA (WMSE-NSA) can averagely improve 0.1 on the test of perceptual evaluation of speech quality (PESQ) and 1.7% on the test of short-time objective intelligibility (STOI) compared with the MSE-based mask approximation methods. © 1994-2012 IEEE.

Keyword :

Deep neural networks Deep neural networks Photomapping Photomapping Speech intelligibility Speech intelligibility Speech enhancement Speech enhancement Mean square error Mean square error Approximation theory Approximation theory

Cite:

Copy from the list or Export to your reference management。

GB/T 7714 Cui, Zihao , Bao, Changchun . Power Exponent Based Weighting Criterion for DNN-Based Mask Approximation in Speech Enhancement [J]. | IEEE Signal Processing Letters , 2021 , 28 : 618-622 .
MLA Cui, Zihao et al. "Power Exponent Based Weighting Criterion for DNN-Based Mask Approximation in Speech Enhancement" . | IEEE Signal Processing Letters 28 (2021) : 618-622 .
APA Cui, Zihao , Bao, Changchun . Power Exponent Based Weighting Criterion for DNN-Based Mask Approximation in Speech Enhancement . | IEEE Signal Processing Letters , 2021 , 28 , 618-622 .
Export to NoteExpress RIS BibTex
Multi-Source DOA Estimation in Reverberant Environments by Jointing Detection and Modeling of Time-Frequency Points EI
期刊论文 | 2021 , 29 , 379-392 | ACM Transactions on Audio Speech and Language Processing
Abstract&Keyword Cite

Abstract :

In this article, the direction of arrival (DOA) estimation of multiple speech sources in reverberant environments is investigated based on the recording of a soundfield microphone. First, the recordings are analyzed in the time-frequency (T-F) domain to detect both 'points' (single T-F points) and 'regions' (multiple, adjacent T-F points) corresponding to a single source with low reverberation (known as low-reverberant-single-source (LRSS) points). Then, a LRSS point detection algorithm is proposed based on a joint dominance measure and instantaneous single-source point (SSP) identification. Following this, initial DOA estimates obtained for the detected LRSS points are analyzed using a Gaussian Mixture Model (GMM) derived by the Expectation-Maximization (EM) algorithm to cluster components into sources or outliers using a rule-based method. Finally, the DOA of each actual source is obtained from the estimated source components. Experiments on both simulated data and data recorded in an actual acoustic chamber demonstrate that the proposed algorithm exhibits improved performance for the DOA estimation in reverberant environments when compared to several existing approaches. © 2014 IEEE.

Keyword :

Gaussian distribution Gaussian distribution Reverberation Reverberation Frequency domain analysis Frequency domain analysis Frequency estimation Frequency estimation Clustering algorithms Clustering algorithms Maximum principle Maximum principle Direction of arrival Direction of arrival Image segmentation Image segmentation Audio recordings Audio recordings

Cite:

Copy from the list or Export to your reference management。

GB/T 7714 Jia, Maoshen , Wu, Yuxuan , Bao, Changchun et al. Multi-Source DOA Estimation in Reverberant Environments by Jointing Detection and Modeling of Time-Frequency Points [J]. | ACM Transactions on Audio Speech and Language Processing , 2021 , 29 : 379-392 .
MLA Jia, Maoshen et al. "Multi-Source DOA Estimation in Reverberant Environments by Jointing Detection and Modeling of Time-Frequency Points" . | ACM Transactions on Audio Speech and Language Processing 29 (2021) : 379-392 .
APA Jia, Maoshen , Wu, Yuxuan , Bao, Changchun , Ritz, Christian . Multi-Source DOA Estimation in Reverberant Environments by Jointing Detection and Modeling of Time-Frequency Points . | ACM Transactions on Audio Speech and Language Processing , 2021 , 29 , 379-392 .
Export to NoteExpress RIS BibTex
Multi-source localization by using offset residual weight SCIE
期刊论文 | 2021 , 2021 (1) | EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING
Abstract&Keyword Cite

Abstract :

Multiple sound source localization is a hot issue of concern in recent years. The Single Source Zone (SSZ) based localization methods achieve good performance due to the detection and utilization of the Time-Frequency (T-F) zone where only one source is dominant. However, some T-F points consisting of components from multiple sources are also included in the detected SSZ sometimes. Once a T-F point in SSZ is contributed by multiple components, this point is defined as an outlier. The existence of outliers within the detected SSZ is usually an unavoidable problem for SSZ-based methods. To solve this problem, a multi-source localization by using offset residual weight is proposed in this paper. In this method, an assumption is developed: the direction estimated by all the T-F points within the detected SSZ has a difference along with the actual direction of sources. But this difference is much smaller than the difference between the directions estimated by the outliers along with the actual source localization. After verifying this assumption experimentally, Point Offset Residual Weight (PORW) and Source Offset Residual Weight (SORW) are proposed to reduce the influence of outliers on the localization results. Then, a composite weight is formed by combining PORW and SORW, which can effectively distinguish the outliers and desired points. After that, the outliers are removed by composite weight. Finally, a statistical histogram of DOA estimation with outliers removed is used for multi-source localization. The objective evaluation of the proposed method is conducted in various simulated environments. The results show that the proposed method achieves a better performance compared with the reference methods in sources localization.

Keyword :

Multiple sound sources localization Multiple sound sources localization Direction of arrival estimation Direction of arrival estimation Soundfield microphone Soundfield microphone Reverberation Reverberation

Cite:

Copy from the list or Export to your reference management。

GB/T 7714 Jia, Maoshen , Gao, Shang , Bao, Changchun . Multi-source localization by using offset residual weight [J]. | EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING , 2021 , 2021 (1) .
MLA Jia, Maoshen et al. "Multi-source localization by using offset residual weight" . | EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING 2021 . 1 (2021) .
APA Jia, Maoshen , Gao, Shang , Bao, Changchun . Multi-source localization by using offset residual weight . | EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING , 2021 , 2021 (1) .
Export to NoteExpress RIS BibTex
GAN-Based Inter-Channel Amplitude Ratio Decoding in Multi-Channel Speech Coding EI
会议论文 | 2021 | 12th International Symposium on Chinese Spoken Language Processing, ISCSLP 2021
Abstract&Keyword Cite

Abstract :

In this paper, a multi-channel speech coding method based on down-mixing and inter-channel amplitude ratio (ICAR) decoding based on generative adversarial network (GAN) is proposed. Firstly, spatial parameter inter-channel time difference (ICTD) is extracted. In the short-time Fourier transform (STFT) domain, the amplitude of the down-mixed mono signal is obtained by adding and averaging the amplitude of the multi-channel speech signals, the phase of the down-mixed mono signal is replaced by the phase of the reference channel, the STFT of the down-mixed mono signal is obtained. Then, the inverse STFT is used to obtain the down-mixed mono signal. The amplitude ratio between multichannel speech signals and down-mixed signal (ICAR) is extracted. The down-mixed mono signal is coded by Speex codec, and ICTD is quantized by a uniform scalar quantizer. The ICAR needn't to be encoded. The ICAR is decoded from a well-trained GAN at the decoder based on the decoded mono signal. Finally, the decoded multi-channel speech signals are recovered by using the decoded down-mixed mono signal, decoded ICTD and the decoded ICAR. The experimental results show that the proposed multi-channel speech coding method can recover multi-channel speech signals with spatial information. © 2021 IEEE.

Keyword :

Signal reconstruction Signal reconstruction Speech communication Speech communication Decoding Decoding Inverse problems Inverse problems Speech coding Speech coding

Cite:

Copy from the list or Export to your reference management。

GB/T 7714 Zhu, Jinru , Bao, Changchun . GAN-Based Inter-Channel Amplitude Ratio Decoding in Multi-Channel Speech Coding [C] . 2021 .
MLA Zhu, Jinru et al. "GAN-Based Inter-Channel Amplitude Ratio Decoding in Multi-Channel Speech Coding" . (2021) .
APA Zhu, Jinru , Bao, Changchun . GAN-Based Inter-Channel Amplitude Ratio Decoding in Multi-Channel Speech Coding . (2021) .
Export to NoteExpress RIS BibTex
A multi-source localization method based on clustering and outlier removal CPCI-S
期刊论文 | 2021 , 950-955 | 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC)
Abstract&Keyword Cite

Abstract :

Multiple sound source localization is a hot topic of concern in recent years. In this paper, a multi-source localization method based on weight clustering and outlier removal is proposed to deal with the multiple source localization in the environment with high reverberation time. In this kind of environments, there are always some T-F points consisting of components from multiple sources mixed in the detected spares components. These T-F points, which are called outliers, usually carry the wrong information of localization and could lead to the decline of localization accuracy. To solve this problem, the Point Offset Residual Weight (PORW) and Source Offset Residual Weight (SORW) are introduced to measure the contribution of each T-F point to the localization. The binary clustering is proposed to distinguish and remove the outliers. After that, a statistical histogram of DOA estimation is drawn using the composite weight to weaken the effect of components that interfere with the localization. Finally, the multi-source localization is conducted through peak searching. The objective evaluation of the proposed method is conducted in various simulated environments. The results show that the proposed method achieves a better performance compared with the reference methods in sources localization.

Keyword :

reverberation reverberation direction of arrival estimation direction of arrival estimation multiple sources localization multiple sources localization sound field microphone sound field microphone sparsity sparsity

Cite:

Copy from the list or Export to your reference management。

GB/T 7714 Gao, Shang , Jia, Maoshen , Bao, Changchun . A multi-source localization method based on clustering and outlier removal [J]. | 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC) , 2021 : 950-955 .
MLA Gao, Shang et al. "A multi-source localization method based on clustering and outlier removal" . | 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC) (2021) : 950-955 .
APA Gao, Shang , Jia, Maoshen , Bao, Changchun . A multi-source localization method based on clustering and outlier removal . | 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC) , 2021 , 950-955 .
Export to NoteExpress RIS BibTex
利用概率混合模型的理想比率掩蔽多声源分离方法 CSCD
期刊论文 | 2021 , 37 (10) , 1806-1815 | 信号处理
Abstract&Keyword Cite

Abstract :

针对基于时频掩蔽的分离方法在多声源场景下的分离效果不佳的问题,论文提出一种利用概率混合模型的理想比率掩蔽多声源分离方法。首先,利用冯·米塞斯分布对时频点处方位角估计进行拟合以及拉普拉斯分布对归一化声压梯度信号向量进行拟合,由此建立概率混合模型。其次,利用期望最大化算法对模型参数进行求解,估计各声源对应的理想比率掩蔽。最后,利用估计出的理想比率掩蔽,从麦克风采集信号中分离得到各声源信号。实验结果表明,与现有基于时频掩蔽的多声源分离方法相比,论文所提方法在欠定场景下具有更好的分离效果。

Keyword :

概率混合模型 概率混合模型 多声源分离 多声源分离 理想比率掩蔽 理想比率掩蔽

Cite:

Copy from the list or Export to your reference management。

GB/T 7714 贾怡恬 , 杨淇善 , 贾懋珅 et al. 利用概率混合模型的理想比率掩蔽多声源分离方法 [J]. | 信号处理 , 2021 , 37 (10) : 1806-1815 .
MLA 贾怡恬 et al. "利用概率混合模型的理想比率掩蔽多声源分离方法" . | 信号处理 37 . 10 (2021) : 1806-1815 .
APA 贾怡恬 , 杨淇善 , 贾懋珅 , 许文杰 , 鲍长春 . 利用概率混合模型的理想比率掩蔽多声源分离方法 . | 信号处理 , 2021 , 37 (10) , 1806-1815 .
Export to NoteExpress RIS BibTex
Auto-Regressive Coefficient Estimation Based on the GABS and DNN EI
期刊论文 | 2021 , 49 (1) , 29-39 | Acta Electronica Sinica
Abstract&Keyword Cite

Abstract :

The auto-regressive (AR) model is an effective method to describe the correlation of time series.The classic AR coefficient estimation method utilizes a simple assumption about residual signal.It is a challenge to accurately estimate the auto-regressive coefficients in a complex environment such as noise or interference.Even though Deep Neural Networks (DNN)based AR (DNN-AR) coefficient estimation method can estimate the AR coefficients in a complex environment,the DNN-AR method is easily affected by the numerical stability of Levinson-Durbin recursion (LDR) approach during the training stage.The main target is to improve the stability and overall performance of the DNN-AR based method.In this paper,the precision transform method is utilized to improve computational efficiency while keeping system stability,and the generalized analysis-by-synthesis combing DNN (GABS-DNN) model is proposed for improving the accuracy of AR coefficient estimation and stability of the DNN training in the noisy environment.The GABS-DNN model consists of three main parts:spectrum enhancement network in the modifier,DNN preprocessing and LDR parameter estimation at the encoder,and the conversion from autoregressive coefficient to power spectrum at the decoder.In the process of optimizing the objective function,the error between the enhanced spectrum and the observed spectrum is added for reducing the influence of the gradient of the LDR on the enhanced network during back-propagation,which results in a stable estimation of the AR coefficients of noisy speech. © 2021, Chinese Institute of Electronics. All right reserved.

Keyword :

Backpropagation Backpropagation Deep neural networks Deep neural networks Numerical methods Numerical methods System stability System stability Complex networks Complex networks Computational efficiency Computational efficiency

Cite:

Copy from the list or Export to your reference management。

GB/T 7714 Cui, Zi-Hao , Bao, Chang-Chun . Auto-Regressive Coefficient Estimation Based on the GABS and DNN [J]. | Acta Electronica Sinica , 2021 , 49 (1) : 29-39 .
MLA Cui, Zi-Hao et al. "Auto-Regressive Coefficient Estimation Based on the GABS and DNN" . | Acta Electronica Sinica 49 . 1 (2021) : 29-39 .
APA Cui, Zi-Hao , Bao, Chang-Chun . Auto-Regressive Coefficient Estimation Based on the GABS and DNN . | Acta Electronica Sinica , 2021 , 49 (1) , 29-39 .
Export to NoteExpress RIS BibTex
基于瞬时相位差和深度学习的丢包隐藏方法 CSCD
期刊论文 | 2021 , 37 (10) , 1791-1798 | 信号处理
Abstract&Keyword Cite

Abstract :

实时IP语音通信在数据包会丢失的情况下,语音质量会受到严重影响。为了恢复传输过程中丢失的语音信息,本文提出了一种基于瞬时相位差(Instantaneous Phase Deviation, IPD)和深度神经网络(Deep Neural Network, DNN)的丢包隐藏(Packet Loss Concealment, PLC)方法。在训练阶段,将语音的对数功率谱(Log Power Spectrum, LPS)和IPD作为训练DNN的输入特征,以学习从接收包到丢失包的映射关系;在重构阶段,将丢包前接收到的语音包送入训练好的DNN中,恢复出丢失包的语音。实验结果表明,在不同丢包率下,所提方...

Keyword :

相位特征 相位特征 丢包隐藏 丢包隐藏 深度神经网络 深度神经网络

Cite:

Copy from the list or Export to your reference management。

GB/T 7714 黄晋维 , 鲍长春 . 基于瞬时相位差和深度学习的丢包隐藏方法 [J]. | 信号处理 , 2021 , 37 (10) : 1791-1798 .
MLA 黄晋维 et al. "基于瞬时相位差和深度学习的丢包隐藏方法" . | 信号处理 37 . 10 (2021) : 1791-1798 .
APA 黄晋维 , 鲍长春 . 基于瞬时相位差和深度学习的丢包隐藏方法 . | 信号处理 , 2021 , 37 (10) , 1791-1798 .
Export to NoteExpress RIS BibTex
10| 20| 50 per page
< Page ,Total 48 >

Export

Results:

Selected

to

Format:
Online/Total:3512/2642838
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.