TY - JOUR
T1 - Three-dimensional shape generation via variational autoencoder generative adversarial network with signed distance function
AU - Ajayi, Ebenezer Akinyemi
AU - Lim, Kian Ming
AU - Chong, Siew Chin
AU - Lee, Chin Poo
N1 - Publisher Copyright:
© 2023 Institute of Advanced Engineering and Science. All rights reserved.
PY - 2023/8
Y1 - 2023/8
N2 - Mesh-based 3-dimensional (3D) shape generation from a 2-dimensional (2D) image using a convolutional neural network (CNN) framework is an open problem in the computer graphics and vision domains. Most existing CNN-based frameworks lack robust algorithms that can scale well without combining different shape parts. In addition, most CNN-based algorithms lack suitable 3D data representations that fit into a CNN without modification to produce high-quality 3D shapes. This paper presents an approach that integrates a variational autoencoder (VAE) and a generative adversarial network (GAN), called the 3-dimensional variational autoencoder signed distance function generative adversarial network (3D-VAE-SDFGAN), to create a 3D shape from a 2D image with considerably improved scalability and visual quality. The proposed method feeds only a single 2D image into the network to produce a mesh-based 3D shape. The network encodes the 2D image of a 3D object into a latent representation, and implicit surface representations of the 3D objects corresponding to those 2D images are subsequently generated. Accordingly, a signed distance function (SDF) is used to preserve object inside-outside information in the implicit surface representation. Polygon mesh surfaces are then produced using the marching cubes algorithm. The ShapeNet dataset was used in the experiments to evaluate the proposed 3D-VAE-SDFGAN. The experimental results show that 3D-VAE-SDFGAN outperforms other state-of-the-art models.
AB - Mesh-based 3-dimensional (3D) shape generation from a 2-dimensional (2D) image using a convolutional neural network (CNN) framework is an open problem in the computer graphics and vision domains. Most existing CNN-based frameworks lack robust algorithms that can scale well without combining different shape parts. In addition, most CNN-based algorithms lack suitable 3D data representations that fit into a CNN without modification to produce high-quality 3D shapes. This paper presents an approach that integrates a variational autoencoder (VAE) and a generative adversarial network (GAN), called the 3-dimensional variational autoencoder signed distance function generative adversarial network (3D-VAE-SDFGAN), to create a 3D shape from a 2D image with considerably improved scalability and visual quality. The proposed method feeds only a single 2D image into the network to produce a mesh-based 3D shape. The network encodes the 2D image of a 3D object into a latent representation, and implicit surface representations of the 3D objects corresponding to those 2D images are subsequently generated. Accordingly, a signed distance function (SDF) is used to preserve object inside-outside information in the implicit surface representation. Polygon mesh surfaces are then produced using the marching cubes algorithm. The ShapeNet dataset was used in the experiments to evaluate the proposed 3D-VAE-SDFGAN. The experimental results show that 3D-VAE-SDFGAN outperforms other state-of-the-art models.
KW - 3D shape generation
KW - Convolutional neural network
KW - Generative adversarial network
KW - Signed distance function
KW - Variational autoencoder
UR - http://www.scopus.com/inward/record.url?scp=85151858155&partnerID=8YFLogxK
U2 - 10.11591/ijece.v13i4.pp4009-4019
DO - 10.11591/ijece.v13i4.pp4009-4019
M3 - Article
AN - SCOPUS:85151858155
SN - 2088-8708
VL - 13
SP - 4009
EP - 4019
JO - International Journal of Electrical and Computer Engineering
JF - International Journal of Electrical and Computer Engineering
IS - 4
ER -