  • Helsinki University of Technology (Finland) DirAC project: Directional audio coding

    http://www.acoustics.hut.fi/research/cat/DirAC/

    New audio technology could revolutionize the industry


    A group of scientists at the Helsinki University of Technology (TKK) has developed a new audio technology. The directional audio coding (DirAC) technique aims to produce the best possible sound for the listener regardless of how many speakers the sound system has and where they are located. TKK has now sold the patents protecting the technique to the German research organization Fraunhofer-Gesellschaft.

    The technique is based on Spatial Impulse Response Rendering (SIRR) and would use, for example, a remote control to measure and determine what kind of speaker setup surrounds the listener. By transmitting audio and metadata between the microphone, the audio system, and the speakers, it would produce optimal audio quality for the listener. The technology is said to be suitable for teleconferencing as well.

    According to TKK, DirAC could become the industry standard. Recordings would hold one to three channels of audio plus metadata. However, the studies are not yet finished.

    TKK and its researchers will continue to work on DirAC with Fraunhofer-Gesellschaft. 

    Source - Afterdawn.com

    Seems like a good idea, but everyone has different listening tastes: what sounds good to me might sound rubbish to you.

    Directional audio coding

     

    Directional audio coding (DirAC) is a technique for various tasks in spatial sound reproduction. It builds on Spatial Impulse Response Rendering (SIRR): it relies on the same principles and partly on the same methods. The processing can be divided into three steps:

    Analysis: the sound signals are divided into frequency bands using a filterbank or the short-time Fourier transform (STFT). The diffuseness and the direction of arrival of sound in each frequency band are analyzed as functions of time.

    Transmission: a mono channel is transmitted with directional information or, in applications targeting the best quality, all recorded channels are transmitted.

    Synthesis: the sound in each frequency channel is first divided into diffuse and non-diffuse streams. The diffuse stream is then reproduced using a method that produces a maximally diffuse perception of sound, and the non-diffuse stream is reproduced with a technique that produces as point-like a perception of the sound source as possible.

    Synthesis can be implemented in various ways, depending on microphone technique, transmission type, and reproduction system.
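
The analysis step and the diffuse/non-diffuse split can be sketched for a single frequency band as follows. This is a minimal illustration in plain Python, not the authors' implementation. It assumes horizontal-only B-format STFT coefficients (W, X, Y) with the traditional scaling in which a plane wave s arriving from azimuth θ gives W = s/√2, X = s·cos θ, Y = s·sin θ. It also assumes the energy-based diffuseness estimate and the square-root stream gains that are commonly described for DirAC. The names `analyze_band` and `split_streams` are made up for this example.

```python
import math


def analyze_band(frames):
    """Estimate (azimuth, diffuseness) for one frequency band.

    frames: list of (W, X, Y) complex STFT coefficients of the same band,
    taken from a short run of consecutive time frames (assumed input format).
    """
    ix = iy = energy = 0.0
    for w, x, y in frames:
        # Time-summed vector that points toward the source under the
        # assumed B-format convention.
        ix += (w.conjugate() * x).real
        iy += (w.conjugate() * y).real
        # Energy estimate that equals |s|^2 for a single plane wave.
        energy += abs(w) ** 2 + (abs(x) ** 2 + abs(y) ** 2) / 2.0
    if energy == 0.0:
        # Silent band: no direction; treating it as fully diffuse is an
        # arbitrary choice made for this sketch.
        return 0.0, 1.0
    azimuth = math.atan2(iy, ix)
    diffuseness = 1.0 - math.sqrt(2.0) * math.hypot(ix, iy) / energy
    # Clamp away small negative values caused by rounding.
    return azimuth, min(max(diffuseness, 0.0), 1.0)


def split_streams(value, diffuseness):
    """Split one band value into (non_diffuse, diffuse) parts.

    The square-root gains keep the summed energy of both parts equal to
    the energy of the input value.
    """
    non_diffuse = math.sqrt(1.0 - diffuseness) * value
    diffuse = math.sqrt(diffuseness) * value
    return non_diffuse, diffuse
```

With these assumptions, a single plane wave yields a diffuseness near 0 and an azimuth equal to its arrival direction, while two equally strong waves from opposite directions in successive frames cancel in the summed vector and yield a diffuseness of 1.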

    Example applications for DirAC

    Reproduction of B-format recordings. Demos are available for a 5.0 loudspeaker setup. Traditionally, B-format recordings are reproduced using, e.g., Ambisonics, which produces coherent loudspeaker signals. This results in a blurred spatial image and a small optimal listening area. In DirAC, the coherence can be avoided in both diffuse and non-diffuse reproduction, which produces less blurring and a larger listening area.
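
To see why a linear decode gives coherent loudspeaker signals, consider the sketch below. It is only an illustration: it assumes a very simple first-order decoder that points a virtual cardioid microphone at each loudspeaker, a 5.0 layout at 0°, ±30° and ±110°, and the B-format scaling W = s/√2, X = s·cos θ, Y = s·sin θ. Practical Ambisonics decoders differ in their weights, and the names here are hypothetical.

```python
import math

# Assumed 5.0 loudspeaker azimuths in degrees (counterclockwise positive).
LAYOUT_5_0 = [0.0, 30.0, -30.0, 110.0, -110.0]


def encode_plane_wave(s, azimuth_deg):
    """Encode one sample of a plane wave into horizontal B-format (W, X, Y)."""
    theta = math.radians(azimuth_deg)
    return s / math.sqrt(2.0), s * math.cos(theta), s * math.sin(theta)


def cardioid_decode(w, x, y, speaker_azimuths_deg):
    """Return one feed per loudspeaker using a virtual cardioid per speaker."""
    feeds = []
    for deg in speaker_azimuths_deg:
        phi = math.radians(deg)
        feeds.append(0.5 * (math.sqrt(2.0) * w + x * math.cos(phi) + y * math.sin(phi)))
    return feeds


# A single source straight ahead ends up in every loudspeaker.
w, x, y = encode_plane_wave(1.0, 0.0)
feeds = cardioid_decode(w, x, y, LAYOUT_5_0)
```

Each feed is the source signal scaled by 0.5·(1 + cos(θ − φ)), so all five loudspeakers carry copies of the same waveform that differ only in gain. That shared content is the coherence the paragraph refers to.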

    Transmission of spatial information as a side band to a mono signal in teleconferencing. Demos are available for a 5.0 loudspeaker setup. The microphone setup is a custom B-format microphone composed of four miniature capsules. Sound is transmitted as a mono signal, with a narrow side band containing the azimuth direction for each frequency band as a function of time.
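
One way to picture the narrow side band is as a small integer per frequency band and time frame. The sketch below quantizes an azimuth uniformly and estimates the resulting side-information rate. The bit depth, the uniform quantizer, and the function names are assumptions for illustration; the text does not say how the demos encode the azimuth.

```python
import math


def quantize_azimuth(azimuth, bits=6):
    """Map an azimuth in radians to an integer index in [0, 2**bits)."""
    levels = 1 << bits
    step = 2.0 * math.pi / levels
    return int(round((azimuth % (2.0 * math.pi)) / step)) % levels


def dequantize_azimuth(index, bits=6):
    """Map an index back to an azimuth in radians in (-pi, pi]."""
    step = 2.0 * math.pi / (1 << bits)
    angle = index * step
    return angle - 2.0 * math.pi if angle > math.pi else angle


def side_info_bits_per_second(num_bands, bits, frames_per_second):
    """Raw side-band rate when every band sends one index per frame."""
    return num_bands * bits * frames_per_second
```

With 6 bits the quantization step is 360°/64, so the round trip changes an azimuth by at most about 2.8°. Under these assumed parameters, 32 bands updated 50 times per second would need 9600 bits per second before any entropy coding.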

    The teleconferencing demos also demonstrate the application of DirAC as a new type of directional microphone for noisy recording environments. This is implemented by reproducing only the sound coming from the direction of the speech source. Even as the SNR decreases from 0 to -25 dB, speech remains somewhat intelligible, although the reproduced speech signal contains a lot of distortion.
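
The directional-microphone idea can be sketched as a per-band gate driven by the analyzed azimuth. This is a simplified illustration with assumed parameters (a 30° acceptance width and hard on/off gains) and hypothetical function names; the demos may use a different gain rule.

```python
import math


def angular_distance(a, b):
    """Smallest absolute difference between two angles, in radians."""
    return abs((a - b + math.pi) % (2.0 * math.pi) - math.pi)


def direction_gain(azimuth, target, width=math.radians(30.0), floor=0.0):
    """Gain 1 for bands arriving within width/2 of the target, else floor."""
    return 1.0 if angular_distance(azimuth, target) <= width / 2.0 else floor


def apply_directional_mic(band_values, band_azimuths, target):
    """Keep only the bands whose analyzed direction matches the target."""
    return [
        direction_gain(az, target) * value
        for value, az in zip(band_values, band_azimuths)
    ]
```

For example, with the target at 0°, bands analyzed at 0° and 350° pass while a band at 90° is muted. Switching bands on and off abruptly like this tends to produce audible artifacts, which is consistent with the distortion mentioned above.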

    Upmixing of stereo files to multichannel files. The stereophonic file is recorded with a simulated B-format microphone in simulated anechoic conditions. The sound can then be decoded to arbitrary reproduction systems.
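
The simulated recording can be pictured as two virtual loudspeakers playing the left and right channels in an anechoic space, captured by an ideal B-format microphone at the listening position. The sketch below assumes virtual loudspeakers at ±30°, horizontal-only B-format with W = s/√2, and a counterclockwise-positive azimuth with the left channel on the positive side. These values and the function name are illustrative assumptions, not taken from the source.

```python
import math


def encode_stereo_to_bformat(left, right, spread_deg=30.0):
    """Encode stereo sample lists into simulated B-format lists (W, X, Y).

    left is placed at +spread_deg and right at -spread_deg as plane waves.
    """
    az = math.radians(spread_deg)
    w = [(l + r) / math.sqrt(2.0) for l, r in zip(left, right)]
    # cos(-az) == cos(az), so both channels add in X.
    x = [(l + r) * math.cos(az) for l, r in zip(left, right)]
    # sin(-az) == -sin(az), so the channels subtract in Y.
    y = [(l - r) * math.sin(az) for l, r in zip(left, right)]
    return w, x, y
```

A sample present only in the left channel then appears to arrive from +30°, one only in the right channel from -30°, and identical content in both channels from straight ahead. The resulting B-format signal can go through the same direction and diffuseness analysis as a real recording before being rendered to the target loudspeaker layout.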


    Some selected publications, with short descriptions

    Laitinen MV, Kuech F, Disch S, and Pulkki V. "Reproducing Applause-Type Signals with Directional Audio Coding". J. Audio Eng. Soc., 59(1/2), 2011.
        Surrounding applause-type signals are very hard signals for many parametric spatial audio reproduction methods. This article shows, however, that such coding is possible, though the time-frequency resolution needed for such processing is very fine.

    Vilkamo J, Lokki T, and Pulkki V. "Directional audio coding: Virtual microphone-based synthesis and subjective evaluation". J. Audio Eng. Soc., 57(9), 2009.
        The use of virtual microphones in DirAC processing is presented here, and the quality produced by DirAC is shown to be very good in extensive listening tests.

    Pulkki V, Laitinen MV, and Erkut C. "Efficient spatial sound synthesis for virtual worlds". AES 35th International Conference, London, UK, February 11-13, 2009.
        The use of DirAC in virtual world audio rendering is shown here. DirAC can be used to position virtual sources, to control the spatial extent of the sources, and to provide reverberation efficiently. Also, recorded spatial sound scenes can easily be augmented with virtual sources.

    Laitinen MV and Pulkki V. "Binaural reproduction for directional audio coding". IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, USA, October 18-21, 2009.
        It is shown that DirAC provides a nicely externalized perception of spatial sound when head tracking is used in headphone listening.

    Pulkki V. "Directional audio coding in spatial sound reproduction and stereo upmixing". AES 28th Int. Conf., Piteå, Sweden, June 2006.
        The use of DirAC in high-fidelity reproduction of B-format recordings is presented here. Also, the idea of using DirAC in stereo to multichannel upmixing is presented.

    Pulkki V and Faller C. "Directional audio coding: Filterbank and STFT-based design". In 120th AES Convention, Paris, France, May 20-23, 2006. Audio Engineering Society. Paper # 6658.
        The use of DirAC in teleconferencing is presented here, with some discussion on the selection of time-frequency analysis methods in different applications.


    http://www.acoustics.hut.fi/research/cat/sirr/
    Modified: 23.5.2011 
  • Original article: https://www.cnblogs.com/gaozehua/p/2749567.html