TY - JOUR
T1 - Implicit Image-to-Image Schrödinger Bridge for image restoration
AU - Wang, Yuang
AU - Yoon, Siyeop
AU - Jin, Pengfei
AU - Tivnan, Matthew
AU - Song, Sifan
AU - Chen, Zhennong
AU - Hu, Rui
AU - Zhang, Li
AU - Li, Quanzheng
AU - Chen, Zhiqiang
AU - Wu, Dufan
N1 - Publisher Copyright:
© 2025 Elsevier Ltd
PY - 2025/9
Y1 - 2025/9
N2 - Diffusion-based models have demonstrated remarkable effectiveness in image restoration tasks; however, their iterative denoising process, which starts from Gaussian noise, often leads to slow inference speeds. The Image-to-Image Schrödinger Bridge (I2SB) offers a promising alternative by initializing the generative process from corrupted images while leveraging training techniques from score-based diffusion models. In this paper, we introduce the Implicit Image-to-Image Schrödinger Bridge (I3SB) to further accelerate the generative process of I2SB. I3SB restructures the generative process into a non-Markovian framework by incorporating the initial corrupted image at each generative step, effectively preserving and utilizing its information. To enable direct use of pretrained I2SB models without additional training, we ensure consistency in marginal distributions. Extensive experiments across many image corruptions—including noise, low resolution, JPEG compression, and sparse sampling—and multiple image modalities—such as natural, human face, and medical images— demonstrate the acceleration benefits of I3SB. Compared to I2SB, I3SB achieves the same perceptual quality with fewer generative steps, while maintaining or improving fidelity to the ground truth.
AB - Diffusion-based models have demonstrated remarkable effectiveness in image restoration tasks; however, their iterative denoising process, which starts from Gaussian noise, often leads to slow inference speeds. The Image-to-Image Schrödinger Bridge (I2SB) offers a promising alternative by initializing the generative process from corrupted images while leveraging training techniques from score-based diffusion models. In this paper, we introduce the Implicit Image-to-Image Schrödinger Bridge (I3SB) to further accelerate the generative process of I2SB. I3SB restructures the generative process into a non-Markovian framework by incorporating the initial corrupted image at each generative step, effectively preserving and utilizing its information. To enable direct use of pretrained I2SB models without additional training, we ensure consistency in marginal distributions. Extensive experiments across many image corruptions—including noise, low resolution, JPEG compression, and sparse sampling—and multiple image modalities—such as natural, human face, and medical images— demonstrate the acceleration benefits of I3SB. Compared to I2SB, I3SB achieves the same perceptual quality with fewer generative steps, while maintaining or improving fidelity to the ground truth.
KW - Diffusion model
KW - Image restoration
KW - Schrödinger Bridge
UR - https://www.scopus.com/pages/publications/105001944789
U2 - 10.1016/j.patcog.2025.111627
DO - 10.1016/j.patcog.2025.111627
M3 - Article
AN - SCOPUS:105001944789
SN - 0031-3203
VL - 165
JO - Pattern Recognition
JF - Pattern Recognition
M1 - 111627
ER -