Fraunhofer Showcases Dialogue Enhancement Technology for Improved Speech Intelligibility and Listening Control
Tested by BBC during Wimbledon, Fraunhofer’s new technology enables end users to adapt the volume of different audio elements within a single broadcast program. Fraunhofer IIS, the world’s renowned source for audio and multimedia technologies, is previewing its new dialogue enhancement technology at IFA and IBC 2011. It allows TV and radio audiences to individually adjust the volume of dialogue, music or sound effects within a single broadcast program. Fraunhofer’s technology will be the first of its kind to enable this level of audio control at the low bit-rates required for broadcasting.
Mixing the sound for broadcasting of live events, such as football or tennis, is not always an easy task due to the varying listening preference of each end user. Some sport fans would rather hear more stadium atmosphere to experience a better live feeling, while others prefer better intelligibility of the commentator’s voice. To solve this issue, Fraunhofer brings control to the listener with its new dialogue enhancement technology. It will help broadcasters meet the demand for better speech intelligibility and enable radio or TV listeners to increase or decrease the volume of specific audio elements to their taste.
Recently the subject of a BBC experiment during Wimbledon 2011, the technology enabled users of a specifically designed media player to listen to Radio 5 Live’s online stream and adjust the audio volume levels of the commentator or background ambience during live coverage of the match for a tailored audio experience. The user reaction was positive, with users reporting the new feature as a useful extension for broadcasting.
Fraunhofer’s dialogue enhancement technology allows for highly efficient transportation of individual audio objects, such as a commentator’s voice or stadium atmosphere, in a compatible mono or stereo downmix. The audio encoder receives these objects and produces a single mix, as well as a stream of parametric side information. The transmission of the mix, plus side information, is extremely bit-rate efficient, as each audio object only slightly increases the overall bitrate. The mix can be produced automatically or by a sound engineer. On the receiving side, the user is then able to adjust the volume of each object individually, to improve the intelligibility of the sports commentator.
Fraunhofer’s dialogue enhancement technology is completely compatible with existing transmission and playback equipment. Devices that are not capable of decoding the parametric side information will play back the mixed audio signal.
Fraunhofer will demonstrate the dialogue enhancement technology at IFA Berlin 2-7 Sept. 2011 at the Fraunhofer booth in hall 11.1 and at IBC in Amsterdam 9-13 Sept. 2011, at the Fraunhofer booth in hall 8, B80.