Abstract
Image outpainting receives less attention due to the challenges of predicting the spatial content consistency and maintaining the high quality in generated images. All-sides image prediction is a generalized task in outpainting, aiming to extrapolate the outside content for all sides of an image. It is a challenging task to maintain spatial and semantic consistency between the original input and the generated multi-step regions. In this paper, we embed a novel Multi-view Recurrent Content Transfer module into an Encoder-Decoder architecture for long-range all-side image outpainting. A multi-head attention mechanism is leveraged to fuse information from different representation sub-spaces at different positions to enhance the consistency of generated images and the original input. Our model could obtain sufficient temporal information in predicting the extended feature maps, which improves the quality of long-range images extrapolation. We experimentally demonstrate that our proposed method could produce visually appealing results for outside image outpainting against the state-of-the-art image inpainting and outpainting approaches. Modelling the temporal relationship could help generate the outside regions and reconstruct the input regions smoothly and realistically. In addition, an attempt to possibly allow for arbitrary output resolutions is supported as well.
| Original language | English |
|---|---|
| Article number | 012031 |
| Journal | Journal of Physics: Conference Series |
| Volume | 2278 |
| Issue number | 1 |
| DOIs | |
| Publication status | Published - 1 Jun 2022 |
| Event | 2022 6th International Conference on Machine Vision and Information Technology, CMVIT 2022 - Virtual, Online Duration: 25 Feb 2022 → … |
Fingerprint
Dive into the research topics of 'Outpainting Natural Scenery Images by Fusing Forecasting Information'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver