TY - JOUR
T1 - High-Frequency Enhanced Hybrid Neural Representation for video compression
AU - Yu, Li
AU - Li, Zhihui
AU - Xiao, Jimin
AU - Gabbouj, Moncef
N1 - Publisher Copyright:
© 2025 Elsevier Ltd
PY - 2025/7/1
Y1 - 2025/7/1
N2 - Neural Representations for Videos (NeRV) have simplified the video codec process and achieved swift decoding speeds by encoding video content into a neural network, presenting a promising solution for video compression. However, existing work overlooks the crucial issue that videos reconstructed by these methods lack high-frequency details. To address this problem, this paper introduces a High-Frequency Enhanced Hybrid Neural Representation Network. Our method focuses on leveraging high-frequency information to improve the synthesis of fine details by the network. Specifically, we design a wavelet high-frequency encoder that incorporates Wavelet Frequency Decomposer (WFD) blocks to generate high-frequency feature embeddings. Next, we design the High-Frequency Feature Modulation (HFM) block, which leverages the extracted high-frequency embeddings to enhance the fitting process of the decoder. Finally, with the refined Harmonic decoder block and a Dynamic Weighted Frequency Loss, we further reduce the potential loss of high-frequency information. Experiments on the Bunny and UVG datasets demonstrate that our method outperforms other methods, showing notable improvements in detail preservation and compression performance.
AB - Neural Representations for Videos (NeRV) have simplified the video codec process and achieved swift decoding speeds by encoding video content into a neural network, presenting a promising solution for video compression. However, existing work overlooks the crucial issue that videos reconstructed by these methods lack high-frequency details. To address this problem, this paper introduces a High-Frequency Enhanced Hybrid Neural Representation Network. Our method focuses on leveraging high-frequency information to improve the synthesis of fine details by the network. Specifically, we design a wavelet high-frequency encoder that incorporates Wavelet Frequency Decomposer (WFD) blocks to generate high-frequency feature embeddings. Next, we design the High-Frequency Feature Modulation (HFM) block, which leverages the extracted high-frequency embeddings to enhance the fitting process of the decoder. Finally, with the refined Harmonic decoder block and a Dynamic Weighted Frequency Loss, we further reduce the potential loss of high-frequency information. Experiments on the Bunny and UVG datasets demonstrate that our method outperforms other methods, showing notable improvements in detail preservation and compression performance.
KW - High-frequency information
KW - Neural representation for videos
KW - Video compression
KW - Wavelet transform
UR - http://www.scopus.com/inward/record.url?scp=105002784487&partnerID=8YFLogxK
U2 - 10.1016/j.eswa.2025.127552
DO - 10.1016/j.eswa.2025.127552
M3 - Article
AN - SCOPUS:105002784487
SN - 0957-4174
VL - 281
JO - Expert Systems with Applications
JF - Expert Systems with Applications
M1 - 127552
ER -