CLC number: TP309
On-line Access: 2024-08-27
Received: 2023-10-17
Revision Accepted: 2024-05-08
FAN Xing, GU Wei-kang, YE Xiu-qing. Research on fast real-time adaptive audio mixing in multimedia conference[J]. Journal of Zhejiang University Science A, 2005, 6(6): 507-512.
@article{title="Research on fast real-time adaptive audio mixing in multimedia conference",
author="FAN Xing, GU Wei-kang, YE Xiu-qing",
journal="Journal of Zhejiang University Science A",
volume="6",
number="6",
pages="507-512",
year="2005",
publisher="Zhejiang University Press & Springer",
doi="10.1631/jzus.2005.A0507"
}
%0 Journal Article
%T Research on fast real-time adaptive audio mixing in multimedia conference
%A FAN Xing
%A GU Wei-kang
%A YE Xiu-qing
%J Journal of Zhejiang University SCIENCE A
%V 6
%N 6
%P 507-512
%@ 1673-565X
%D 2005
%I Zhejiang University Press & Springer
%DOI 10.1631/jzus.2005.A0507
TY - JOUR
T1 - Research on fast real-time adaptive audio mixing in multimedia conference
A1 - FAN Xing
A1 - GU Wei-kang
A1 - YE Xiu-qing
JO - Journal of Zhejiang University Science A
VL - 6
IS - 6
SP - 507
EP - 512
SN - 1673-565X
Y1 - 2005
PB - Zhejiang University Press & Springer
DO - 10.1631/jzus.2005.A0507
ER -
Abstract: In multimedia conferencing, audio processing is a basic capability with stringent real-time requirements. In this article, we categorize and analyze existing mixing schemes and propose several multipoint speech mixing schemes based on weighted algorithms that meet the practical demands of real-time multipoint speech mixing; of these, the ASW and AEW schemes are especially recommended. Because they apply adaptive algorithms, the proposed high-performance schemes avoid the saturation operation widely used in multimedia processing, so no additional noise is added to the output. The adaptive algorithms have relatively low computational complexity and good perceptual quality. The schemes are designed for parallel processing, can be easily implemented in hardware such as DSPs, and are widely applicable in multimedia conference systems.
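The contrast the abstract draws between saturation and weighted mixing can be illustrated with a minimal Python sketch. This is not the authors' ASW or AEW scheme, whose definitions are not given here; it assumes 16-bit PCM frames of equal length and a hypothetical weighting rule in which each stream's weight is proportional to its mean absolute amplitude over the frame. The function names `mix_saturating` and `mix_weighted` are illustrative only.

```python
from typing import List, Sequence

INT16_MIN, INT16_MAX = -32768, 32767


def mix_saturating(frames: Sequence[Sequence[int]]) -> List[int]:
    """Baseline: sum the streams sample by sample and clip to 16 bits."""
    return [max(INT16_MIN, min(INT16_MAX, sum(column)))
            for column in zip(*frames)]


def mix_weighted(frames: Sequence[Sequence[int]]) -> List[int]:
    """Illustrative weighted mix (assumed rule, not the paper's ASW/AEW).

    Weights are proportional to each stream's mean absolute amplitude
    over the frame and sum to 1, so every output sample is a convex
    combination of the input samples and stays within the 16-bit range
    without any clipping step.
    """
    if not frames:
        return []
    levels = [sum(abs(s) for s in frame) / len(frame) for frame in frames]
    total = sum(levels)
    if total == 0:  # all streams are silent in this frame
        return [0] * len(frames[0])
    weights = [level / total for level in levels]
    return [int(round(sum(w * s for w, s in zip(weights, column))))
            for column in zip(*frames)]


if __name__ == "__main__":
    a = [30000, -30000, 1000, 0]
    b = [20000, -20000, 1000, 0]
    print(mix_saturating([a, b]))  # loud samples are clipped
    print(mix_weighted([a, b]))    # loud samples are scaled, not clipped
```

Because the weights are recomputed per frame, each stream's mix is independent of the others once the levels are known, which fits the parallel-processing goal stated in the abstract; the actual weighting used by the paper may differ.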