TY - JOUR
T1 - Outpainting Natural Scenery Images by Fusing Forecasting Information
AU - Geng, Yujie
AU - Gao, Penglei
AU - Yang, Xi
AU - Yan, Yuyao
AU - Huang, Kaizhu
N1 - Publisher Copyright:
© Published under licence by IOP Publishing Ltd.
PY - 2022/6/1
Y1 - 2022/6/1
AB - Image outpainting has received relatively little attention due to the challenges of preserving spatial content consistency and maintaining high quality in the generated images. All-side image prediction is a generalized outpainting task that aims to extrapolate content beyond every side of an image; maintaining spatial and semantic consistency between the original input and the multi-step generated regions is particularly difficult. In this paper, we embed a novel Multi-view Recurrent Content Transfer module into an Encoder-Decoder architecture for long-range all-side image outpainting. A multi-head attention mechanism fuses information from different representation sub-spaces at different positions to strengthen the consistency between the generated images and the original input. The model captures sufficient temporal information when predicting the extended feature maps, which improves the quality of long-range image extrapolation. We experimentally demonstrate that the proposed method produces visually more appealing results for all-side image outpainting than state-of-the-art image inpainting and outpainting approaches. Modelling the temporal relationship helps generate the outside regions and reconstruct the input regions smoothly and realistically. In addition, the method makes a preliminary attempt to support arbitrary output resolutions.
UR - http://www.scopus.com/inward/record.url?scp=85132012677&partnerID=8YFLogxK
U2 - 10.1088/1742-6596/2278/1/012031
DO - 10.1088/1742-6596/2278/1/012031
M3 - Conference article
AN - SCOPUS:85132012677
SN - 1742-6588
VL - 2278
JO - Journal of Physics: Conference Series
JF - Journal of Physics: Conference Series
IS - 1
M1 - 012031
T2 - 2022 6th International Conference on Machine Vision and Information Technology, CMVIT 2022
Y2 - 25 February 2022
ER -