Meta Releases TorchCodec 0.14 with HDR Video and Fast Audio Decoding

Key Takeaways

▸New WavDecoder provides significantly faster audio decoding by bypassing FFmpeg and reading WAV data directly
▸HDR video decoding support preserves full color precision in float32 output for both CPU and CUDA
▸NPP dependency removed, reducing installation complexity for CUDA-accelerated decoding

Source:

Hacker Newshttps://github.com/meta-pytorch/torchcodec/releases/tag/v0.14.0↗

Summary

Meta has released TorchCodec 0.14, a major update to its open-source multimedia codec library for PyTorch. The new version introduces two significant features: a dedicated WavDecoder for fast audio decoding and native HDR (High Dynamic Range) video decoding support for both CPU and CUDA.

The WavDecoder bypasses FFmpeg entirely, reading WAV data directly from files, bytes, or file-like objects, resulting in significantly faster audio decoding. The decoder supports multiple sample formats including int16, int32, and float32. TorchCodec 0.14 also adds HDR video decoding capabilities that preserve full color precision when using float32 output, available for both CPU and CUDA processing, enabling more accurate representation of video content with extended dynamic range.

The release includes several additional improvements, including faster audio seeking, removal of the NVIDIA NPP library dependency (simplifying installation for CUDA users), and bug fixes for process teardown crashes and odd-dimension video handling. The library is compatible with PyTorch 2.11 and later versions.

Audio seeking performance substantially improved
HDR feature available in beta, with refinements expected based on user feedback

Meta Releases TorchCodec 0.14 with HDR Video and Fast Audio Decoding

Key Takeaways

Summary

More from Meta

Meta Brings PyTorch Monarch Fault-Tolerant Training Framework to AMD GPUs

TurboPrefill: Community Optimization Achieves 3.27× LLaMA.cpp Speedup

US Army Burned Through Annual AI Token Budget in Over a Month, Forcing Limits

Comments

Suggested

Toolgz Slashes LLM Tool-Definition Tokens 80% With Zero Accuracy Loss

Apertus 1.5 Brings Image Understanding and 4x Context Window to Open-Source LLM

Silicon Valley Splits Over Chinese AI: Safety vs. Access Debate Intensifies

Meta Releases TorchCodec 0.14 with HDR Video and Fast Audio Decoding

Key Takeaways

Summary

More from Meta

Meta Brings PyTorch Monarch Fault-Tolerant Training Framework to AMD GPUs

TurboPrefill: Community Optimization Achieves 3.27× LLaMA.cpp Speedup

US Army Burned Through Annual AI Token Budget in Over a Month, Forcing Limits

Comments

Suggested

Toolgz Slashes LLM Tool-Definition Tokens 80% With Zero Accuracy Loss

Apertus 1.5 Brings Image Understanding and 4x Context Window to Open-Source LLM

Silicon Valley Splits Over Chinese AI: Safety vs. Access Debate Intensifies