High-Frequency Enhanced Hybrid Neural Representation for video compression

Li Yu; Zhihui Li; Jimin Xiao; Moncef Gabbouj

doi:10.1016/j.eswa.2025.127552

High-Frequency Enhanced Hybrid Neural Representation for video compression

Li Yu^*, Zhihui Li, Jimin Xiao, Moncef Gabbouj

^*Corresponding author for this work

Department of Intelligent Science

Research output: Contribution to journal › Article › peer-review

Abstract

Neural Representations for Videos (NeRV) have simplified the video codec process and achieved swift decoding speeds by encoding video content into a neural network, presenting a promising solution for video compression. However, existing work overlooks the crucial issue that videos reconstructed by these methods lack high-frequency details. To address this problem, this paper introduces a High-Frequency Enhanced Hybrid Neural Representation Network. Our method focuses on leveraging high-frequency information to improve the synthesis of fine details by the network. Specifically, we design a wavelet high-frequency encoder that incorporates Wavelet Frequency Decomposer (WFD) blocks to generate high-frequency feature embeddings. Next, we design the High-Frequency Feature Modulation (HFM) block, which leverages the extracted high-frequency embeddings to enhance the fitting process of the decoder. Finally, with the refined Harmonic decoder block and a Dynamic Weighted Frequency Loss, we further reduce the potential loss of high-frequency information. Experiments on the Bunny and UVG datasets demonstrate that our method outperforms other methods, showing notable improvements in detail preservation and compression performance.

Original language	English
Article number	127552
Journal	Expert Systems with Applications
Volume	281
DOIs	https://doi.org/10.1016/j.eswa.2025.127552
Publication status	Published - 1 Jul 2025

Keywords

High-frequency information
Neural representation for videos
Video compression
Wavelet transform

Access to Document

10.1016/j.eswa.2025.127552

Cite this

@article{c59f204d936a4e7b919fca63c1a5c17b,

title = "High-Frequency Enhanced Hybrid Neural Representation for video compression",

abstract = "Neural Representations for Videos (NeRV) have simplified the video codec process and achieved swift decoding speeds by encoding video content into a neural network, presenting a promising solution for video compression. However, existing work overlooks the crucial issue that videos reconstructed by these methods lack high-frequency details. To address this problem, this paper introduces a High-Frequency Enhanced Hybrid Neural Representation Network. Our method focuses on leveraging high-frequency information to improve the synthesis of fine details by the network. Specifically, we design a wavelet high-frequency encoder that incorporates Wavelet Frequency Decomposer (WFD) blocks to generate high-frequency feature embeddings. Next, we design the High-Frequency Feature Modulation (HFM) block, which leverages the extracted high-frequency embeddings to enhance the fitting process of the decoder. Finally, with the refined Harmonic decoder block and a Dynamic Weighted Frequency Loss, we further reduce the potential loss of high-frequency information. Experiments on the Bunny and UVG datasets demonstrate that our method outperforms other methods, showing notable improvements in detail preservation and compression performance.",

keywords = "High-frequency information, Neural representation for videos, Video compression, Wavelet transform",

author = "Li Yu and Zhihui Li and Jimin Xiao and Moncef Gabbouj",

note = "Publisher Copyright: {\textcopyright} 2025 Elsevier Ltd",

year = "2025",

month = jul,

day = "1",

doi = "10.1016/j.eswa.2025.127552",

language = "English",

volume = "281",

journal = "Expert Systems with Applications",

issn = "0957-4174",

publisher = "Elsevier",

}

TY - JOUR

T1 - High-Frequency Enhanced Hybrid Neural Representation for video compression

AU - Yu, Li

AU - Li, Zhihui

AU - Xiao, Jimin

AU - Gabbouj, Moncef

PY - 2025/7/1

Y1 - 2025/7/1

N2 - Neural Representations for Videos (NeRV) have simplified the video codec process and achieved swift decoding speeds by encoding video content into a neural network, presenting a promising solution for video compression. However, existing work overlooks the crucial issue that videos reconstructed by these methods lack high-frequency details. To address this problem, this paper introduces a High-Frequency Enhanced Hybrid Neural Representation Network. Our method focuses on leveraging high-frequency information to improve the synthesis of fine details by the network. Specifically, we design a wavelet high-frequency encoder that incorporates Wavelet Frequency Decomposer (WFD) blocks to generate high-frequency feature embeddings. Next, we design the High-Frequency Feature Modulation (HFM) block, which leverages the extracted high-frequency embeddings to enhance the fitting process of the decoder. Finally, with the refined Harmonic decoder block and a Dynamic Weighted Frequency Loss, we further reduce the potential loss of high-frequency information. Experiments on the Bunny and UVG datasets demonstrate that our method outperforms other methods, showing notable improvements in detail preservation and compression performance.

AB - Neural Representations for Videos (NeRV) have simplified the video codec process and achieved swift decoding speeds by encoding video content into a neural network, presenting a promising solution for video compression. However, existing work overlooks the crucial issue that videos reconstructed by these methods lack high-frequency details. To address this problem, this paper introduces a High-Frequency Enhanced Hybrid Neural Representation Network. Our method focuses on leveraging high-frequency information to improve the synthesis of fine details by the network. Specifically, we design a wavelet high-frequency encoder that incorporates Wavelet Frequency Decomposer (WFD) blocks to generate high-frequency feature embeddings. Next, we design the High-Frequency Feature Modulation (HFM) block, which leverages the extracted high-frequency embeddings to enhance the fitting process of the decoder. Finally, with the refined Harmonic decoder block and a Dynamic Weighted Frequency Loss, we further reduce the potential loss of high-frequency information. Experiments on the Bunny and UVG datasets demonstrate that our method outperforms other methods, showing notable improvements in detail preservation and compression performance.

KW - High-frequency information

KW - Neural representation for videos

KW - Video compression

KW - Wavelet transform

UR - http://www.scopus.com/inward/record.url?scp=105002784487&partnerID=8YFLogxK

U2 - 10.1016/j.eswa.2025.127552

DO - 10.1016/j.eswa.2025.127552

M3 - Article

AN - SCOPUS:105002784487

SN - 0957-4174

VL - 281

JO - Expert Systems with Applications

JF - Expert Systems with Applications

M1 - 127552

ER -

High-Frequency Enhanced Hybrid Neural Representation for video compression

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this