Arbitrary-Shaped Text Detection with Adaptive Text Region Representation

Xiufeng Jiang; Shugong Xu; Shunqing Zhang; Shan Cao

doi:10.1109/ACCESS.2020.2999069

Arbitrary-Shaped Text Detection with Adaptive Text Region Representation

Xiufeng Jiang, Shugong Xu^*, Shunqing Zhang, Shan Cao

^*Corresponding author for this work

Shanghai University

Research output: Contribution to journal › Article › peer-review

18 Citations (Scopus)

Abstract

Text detection/localization, as an important task in computer vision, has witnessed substantial advancements in methodology and performance with convolutional neural networks. However, the vast majority of popular methods use rectangles or quadrangles to describe text regions. These representations have inherent drawbacks, especially relating to dense adjacent text and loose regional text boundaries, which usually cause difficulty detecting arbitrarily shaped text. In this paper, we propose a novel text region representation method, with a robust pipeline, which can precisely detect dense adjacent text instances with arbitrary shapes. We consider a text instance to be composed of an adaptive central text region mask and a corresponding expanding ratio between the central text region and the full text region. More specifically, our pipeline generates adaptive central text regions and corresponding expanding ratios with a proposed training strategy, followed by a new proposed post-processing algorithm which expands central text regions to the complete text instance with the corresponding expanding ratios. We demonstrated that our new text region representation is effective, and that the pipeline can precisely detect closely adjacent text instances of arbitrary shapes. Experimental results on common datasets demonstrate superior performance of our work.

Original language	English
Article number	9104986
Pages (from-to)	102106-102118
Number of pages	13
Journal	IEEE Access
Volume	8
DOIs	https://doi.org/10.1109/ACCESS.2020.2999069
Publication status	Published - 2020
Externally published	Yes

Keywords

arbitrary-shaped
deformable convolutional network
Scene text detection
text region representation

Access to Document

10.1109/ACCESS.2020.2999069

Cite this

@article{8eeb704caff54144b43d686189e758b0,

title = "Arbitrary-Shaped Text Detection with Adaptive Text Region Representation",

abstract = "Text detection/localization, as an important task in computer vision, has witnessed substantial advancements in methodology and performance with convolutional neural networks. However, the vast majority of popular methods use rectangles or quadrangles to describe text regions. These representations have inherent drawbacks, especially relating to dense adjacent text and loose regional text boundaries, which usually cause difficulty detecting arbitrarily shaped text. In this paper, we propose a novel text region representation method, with a robust pipeline, which can precisely detect dense adjacent text instances with arbitrary shapes. We consider a text instance to be composed of an adaptive central text region mask and a corresponding expanding ratio between the central text region and the full text region. More specifically, our pipeline generates adaptive central text regions and corresponding expanding ratios with a proposed training strategy, followed by a new proposed post-processing algorithm which expands central text regions to the complete text instance with the corresponding expanding ratios. We demonstrated that our new text region representation is effective, and that the pipeline can precisely detect closely adjacent text instances of arbitrary shapes. Experimental results on common datasets demonstrate superior performance of our work.",

keywords = "arbitrary-shaped, deformable convolutional network, Scene text detection, text region representation",

author = "Xiufeng Jiang and Shugong Xu and Shunqing Zhang and Shan Cao",

note = "Publisher Copyright: {\textcopyright} 2013 IEEE.",

year = "2020",

doi = "10.1109/ACCESS.2020.2999069",

language = "English",

volume = "8",

pages = "102106--102118",

journal = "IEEE Access",

issn = "2169-3536",

}

TY - JOUR

T1 - Arbitrary-Shaped Text Detection with Adaptive Text Region Representation

AU - Jiang, Xiufeng

AU - Xu, Shugong

AU - Zhang, Shunqing

AU - Cao, Shan

PY - 2020

Y1 - 2020

N2 - Text detection/localization, as an important task in computer vision, has witnessed substantial advancements in methodology and performance with convolutional neural networks. However, the vast majority of popular methods use rectangles or quadrangles to describe text regions. These representations have inherent drawbacks, especially relating to dense adjacent text and loose regional text boundaries, which usually cause difficulty detecting arbitrarily shaped text. In this paper, we propose a novel text region representation method, with a robust pipeline, which can precisely detect dense adjacent text instances with arbitrary shapes. We consider a text instance to be composed of an adaptive central text region mask and a corresponding expanding ratio between the central text region and the full text region. More specifically, our pipeline generates adaptive central text regions and corresponding expanding ratios with a proposed training strategy, followed by a new proposed post-processing algorithm which expands central text regions to the complete text instance with the corresponding expanding ratios. We demonstrated that our new text region representation is effective, and that the pipeline can precisely detect closely adjacent text instances of arbitrary shapes. Experimental results on common datasets demonstrate superior performance of our work.

AB - Text detection/localization, as an important task in computer vision, has witnessed substantial advancements in methodology and performance with convolutional neural networks. However, the vast majority of popular methods use rectangles or quadrangles to describe text regions. These representations have inherent drawbacks, especially relating to dense adjacent text and loose regional text boundaries, which usually cause difficulty detecting arbitrarily shaped text. In this paper, we propose a novel text region representation method, with a robust pipeline, which can precisely detect dense adjacent text instances with arbitrary shapes. We consider a text instance to be composed of an adaptive central text region mask and a corresponding expanding ratio between the central text region and the full text region. More specifically, our pipeline generates adaptive central text regions and corresponding expanding ratios with a proposed training strategy, followed by a new proposed post-processing algorithm which expands central text regions to the complete text instance with the corresponding expanding ratios. We demonstrated that our new text region representation is effective, and that the pipeline can precisely detect closely adjacent text instances of arbitrary shapes. Experimental results on common datasets demonstrate superior performance of our work.

KW - arbitrary-shaped

KW - deformable convolutional network

KW - Scene text detection

KW - text region representation

UR - http://www.scopus.com/inward/record.url?scp=85086471849&partnerID=8YFLogxK

U2 - 10.1109/ACCESS.2020.2999069

DO - 10.1109/ACCESS.2020.2999069

M3 - Article

AN - SCOPUS:85086471849

SN - 2169-3536

VL - 8

SP - 102106

EP - 102118

JO - IEEE Access

JF - IEEE Access

M1 - 9104986

ER -

Arbitrary-Shaped Text Detection with Adaptive Text Region Representation

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this