Jump to content
Wikipedia The Free Encyclopedia

Unified Speech and Audio Coding

From Wikipedia, the free encyclopedia
Audio compression standard

Unified Speech and Audio Coding (USAC) is an audio compression format and codec for both music and speech or any mix of speech and audio using very low bit rates between 12 and 64 kbit/s.[1] It was developed by Moving Picture Experts Group (MPEG) and was published as an international standard ISO/IEC 23003-3 (a.k.a. MPEG-D Part 3)[2] and also as an MPEG-4 Audio Object Type in ISO/IEC 14496-3:2009/Amd 3 in 2012.[3]

It uses time-domain linear prediction and residual coding tools (ACELP-like techniques) for speech signal segments and transform coding tools (MDCT-based techniques) for music signal segments and it is able to switch between the tool sets dynamically in a signal-responsive manner. It is being developed with the aim of a single, unified coder with performance that equals or surpasses that of dedicated speech coders and dedicated music coders over a broad range of bitrates. Enhanced variations of the MPEG-4 Spectral Band Replication (SBR) and MPEG-D MPEG Surround parametric coding tools are integrated into the USAC codec.[4] [5]

Extended HE-AAC

[edit ]

The MPEG-D USAC standard (ISO/IEC 23003-3) defines the Extended High Efficiency AAC profile, which contains all of the tools of the HE-AAC v2 profile plus the mono/stereo capabilities of the Baseline USAC profile. As a result, a decoder built according to the Extended High Efficiency AAC profile is able to also decode the bit streams created for the previous AAC family profiles. The Extended High Efficiency AAC profile was designed for applications relying on a consistent performance at low data rates while being able to decode all existing AAC-LC, HE-AAC and HE-AACv2 content.[6]

xHE-AAC

[edit ]
Logo for xHE-AAC used by Fraunhofer

Fraunhofer has defined the xHE-AAC codec as the combination of the Extended High Efficiency AAC profile and appropriate parts of the MPEG-D DRC Loudness Control Profile or Dynamic Range Control Profile.[7] xHE-AAC extends the operating range of the codec from 12 to 300 kbit/s for stereo signals and allows seamless switching between bitrates over this range for adaptive bitrate delivery (using standards such as MPEG-DASH or HLS for example). xHE-AAC also includes MPEG-D DRC mandatory loudness control to playback content at a consistent volume and offers new dynamic range control profiles for listening in noisy situations.[8]

While xHE-AAC decoders will be able to decode the bit streams created for the previous AAC family profiles, xHE-AAC encoders are typically intended for encoding of MPEG-D USAC audio object type (AOT 42) with MPEG-D DRC loudness metadata, though some may support encoding legacy AAC object types.[7]

xHE-AAC is a mandatory audio codec in the Digital Radio Mondiale standard[9] [10] [11] and is a trademark of Fraunhofer.[7]

In April 2016, Via Licensing announced the launch of a xHE-AAC patent pool licensing program for 2016.[12] In 2018, xHE-AAC was included in Via Licensing's AAC patent pool at no additional cost.[8] [13]

In January 2021, Fraunhofer announced a test service and trademark program for xHE-AAC and announced that the codec is being used by Netflix.[14] [15] Netflix reported that users switched from speakers to headphones 16% less often (due to poor sound quality or inadequate volume) on high dynamic range content when using xHE-AAC instead of HE-AAC. Netflix also explained that xHE-AAC allowed them to begin streaming with adaptive audio bitrates to Android devices.[16] Fraunhofer also announced xHE-AAC licenses to MainConcept,[17] Poikosoft,[18] and LG.[19] xHE-AAC is supported by the Bento4 DASH/HLS packager.[20] In January 2022, MainConcept established a web encoding service to test xHE-AAC. In October 2022, xHE-AAC decoding was added to Windows 11 and Xbox devices.[21]

Compatibility

[edit ]

xHE-AAC is supported in Android since Android Pie [8] and in iOS since iOS 13. It has been announced that it will be added to watchOS 7 [22] [23] and has been licensed to Microsoft.[24] Playing xHE-AAC audio files is supported in foobar2000 from version 2.25 onwards.[25] In October 2022, Windows 11 added support for xHE-AAC in the 22H2 update.[26]

See also

[edit ]
  • Opus (codec) – a royalty free alternative, low latency codec for a similar usage

References

[edit ]
  1. ^ MPEG. "Unified Speech and Audio Coding". The Moving Pictures Experts Group. Retrieved 2016年11月11日.
  2. ^ "ISO/IEC DIS 23003-3 - Information technology -- MPEG audio technologies -- Part 3: Unified speech and audio coding". 2011年02月15日. Retrieved 2011年07月18日.
  3. ^ "ISO/IEC 14496-3:2009/PDAM 3 - Transport of unified speech and audio coding (USAC)". 2011年06月30日. Retrieved 2011年07月18日.
  4. ^ Neuendorf; et al. (2013年12月20日), The ISO/MPEG Unified Speech and Audio Coding Standard—Consistent High Quality for All Content Types and at All Bit Rates , retrieved 2015年06月13日
  5. ^ Neuendorf; et al. (2012年04月26日), MPEG Unified Speech and Audio Coding-The ISO/MPEG standard for high-efficiency audio coding of all content types , retrieved 2015年06月13日
  6. ^ Neuendorf, Max; Multrus, Markus; Rettelbach, Nikolaus; Fuchs, Guillaume; Robilliard, Julien; Lecomte, Jérémie; Wilde, Stephan; Bayer, Stefan; Disch, Sascha; Helmrich, Christian; Lefebvre, Roch; Gournay, Philippe; Bessette, Bruno; Lapierre, Jimmy; Kjörling, Kristofer; Purnhagen, Heiko; Villemoes, Lars; Oomen, Werner; Schuijers, Erik; Kikuiri, Kei; Chinen, Toru; Norimatsu, Takeshi; Chong, Kok Seng; Oh, Eunmi; Kim, Miyoung; Quackenbush, Schuyler; Grill, Bernhard (2013年12月01日). "The ISO/MPEG Unified Speech and Audio Coding Standard - Consistent High Quality for all Content Types and at all Bit Rates". Journal of the Audio Engineering Society. 61 (12): 956–977. ISSN 0004-7554.
  7. ^ a b c "The xHE-AAC Trademark Program". Fraunhofer Institute for Integrated Circuits IIS. Retrieved 2021年02月11日.
  8. ^ a b c "Fraunhofer's xHE-AAC Audio Codec Software Extends Native AAC Support In Android P For Better Quality At Low Bitrates". Fraunhofer Institute for Integrated Circuits IIS. Retrieved 2020年07月11日.
  9. ^ "Technical Info | Digital Radio Mondiale". www.drm.org. Archived from the original on 2016年01月26日. Retrieved 2016年08月02日.
  10. ^ "xHE-AAC". Fraunhofer Institute for Integrated Circuits IIS. Retrieved 2016年08月02日.
  11. ^ xHE-AAC in Digital Radio Mondiale (DRM) (PDF). Fraunhofer IIS. 2015.
  12. ^ "Via Licensing Announces Extended High Efficiency AAC Patent Pool - Via Corp". www.via-corp.com. Archived from the original on 2016年06月18日. Retrieved 2016年08月02日.
  13. ^ "Via Adds MPEG-D DRC To Advanced Audio Coding Patent Pool – ViaCorp" . Retrieved 2020年07月11日.
  14. ^ "Fraunhofer IIS Introduces New Test Service and Trademark Program for xHE-AAC Audio Codec". www.businesswire.com. 2021年01月12日. Retrieved 2021年01月13日.
  15. ^ "Netflix Now Streaming with Fraunhofer's xHE-AAC Audio on Android Mobile". www.businesswire.com. 2021年01月12日. Retrieved 2021年01月13日.
  16. ^ Blog, Netflix Technology (2021年01月21日). "Optimizing the Aural Experience on Android Devices with xHE-AAC". Medium. Retrieved 2021年01月26日.
  17. ^ "MainConcept launches xHE-AAC FFmpeg Encoder Plugin based on audio codec software from Fraunhofer – Fraunhofer Audio Blog" . Retrieved 2021年10月06日.
  18. ^ "Poikosoft's EZ CD Audio Converter now supports xHE-AAC Audio Codec from Fraunhofer IIS – Fraunhofer Audio Blog" . Retrieved 2021年10月06日.
  19. ^ "LG Electronics licenses xHE-AAC and AAC-ELD audio codec software from Fraunhofer IIS – Fraunhofer Audio Blog" . Retrieved 2021年10月06日.
  20. ^ "xHE-AAC audio codec supported by Bento4 DASH/HLS Packager – Fraunhofer Audio Blog" . Retrieved 2021年10月06日.
  21. ^ "xHE-AAC Audio Codec now in Windows 11 – Fraunhofer Audio Blog" . Retrieved 2022年10月20日.
  22. ^ "Apple recommends xHE-AAC for streaming of all audio assets – Fraunhofer Audio Blog" . Retrieved 2020年07月11日.
  23. ^ "What's new in streaming audio for Apple Watch - WWDC 2020 - Videos". Apple Developer. Retrieved 2020年07月11日.
  24. ^ "Fraunhofer IIS licenses xHE-AAC audio codec software to Microsoft – Fraunhofer Audio Blog" . Retrieved 2020年07月11日.
  25. ^ "foobar2000: Release Notes". www.foobar2000.org. Retrieved 2025年09月14日.
  26. ^ "xHE-AAC Audio Codec now in Windows 11". 2022年10月20日. Retrieved 2024年04月20日.
[edit ]
Video
compression
ISO, IEC,
MPEG
ITU-T, VCEG
SMPTE
TrueMotion and AOMedia
Chinese Standard
  • AVS1 P2/AVS+(GB/T 20090.2/16)
  • AVS2 P2(GB/T 33475.2,GY/T 299.1)
    • HDR Vivid(GY/T 358)
  • AVS3 P2(GY/T 368)
Others
Audio
compression
ISO, IEC,
MPEG
ITU-T
IETF
3GPP
ETSI
Bluetooth SIG
Chinese Standard
Others
Image
compression
IEC, ISO, IETF,
W3C, ITU-T, JPEG
Others
Containers
ISO, IEC
ITU-T
IETF
SMPTE
Others
Collaborations
Methods
Lists
See Compression methods for techniques and Compression software for codecs
MPEG-1 Parts
MPEG-2 Parts
MPEG-4 Parts
MPEG-7 Parts
MPEG-21 Parts
MPEG-D Parts
MPEG-G Parts
MPEG-H Parts
MPEG-I Parts
MPEG-5 Parts
Other

AltStyle によって変換されたページ (->オリジナル) /