Robust and fast text‐line extraction using local linearity of the text‐line

Hideaki Goto, Hirotomo Aso

Research output: Article, peer-reviewed

6 Citations (Scopus)

Abstract

Text region extraction is a necessary step before character recognition can be performed on document images. This paper describes a new algorithm, Linear Segment Linking (LSL), for text‐line extraction from document images. The algorithm groups together the piecewise linear elements in a document image, which can be assumed to be text lines, and then extracts them from the image. It requires less knowledge of document structure and is robust to image distortion. Primitive rectangles are introduced as an intermediate representation of the image; they are easier and faster to create than the usual circumscribing rectangles. A method for splitting the bridges between neighboring text lines is also proposed. By combining bridge splitting with text-line extraction, locally touching text lines can be extracted as individual lines.
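
The idea of grouping elements by local rather than global alignment can be illustrated with a minimal sketch. This is not the authors' LSL implementation, and it does not build their primitive rectangles or split bridges. It assumes the input is already a list of small axis-aligned rectangles, and the names `link_rectangles`, `max_gap`, and `min_overlap_ratio` are hypothetical, introduced only for this example. Each rectangle is linked to its nearest right-hand neighbor that sits at roughly the same height, and the resulting chains are treated as text lines.

```python
from typing import Dict, List, Tuple

# Assumed format: (x0, y0, x1, y1) in pixels, with x0 < x1 and y0 < y1.
Rect = Tuple[int, int, int, int]


def vertical_overlap(a: Rect, b: Rect) -> int:
    """Height shared by the vertical extents of a and b (0 if none)."""
    return max(0, min(a[3], b[3]) - max(a[1], b[1]))


def link_rectangles(rects: List[Rect], max_gap: int = 20,
                    min_overlap_ratio: float = 0.5) -> List[List[Rect]]:
    """Group rectangles into left-to-right chains using only local tests.

    Illustrative only: thresholds are arbitrary assumptions, not values
    taken from the paper.
    """
    parent = list(range(len(rects)))

    def find(i: int) -> int:
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # Visit rectangles from left to right.
    order = sorted(range(len(rects)), key=lambda i: rects[i][0])
    for pos, i in enumerate(order):
        a = rects[i]
        for j in order[pos + 1:]:
            b = rects[j]
            gap = b[0] - a[2]  # horizontal distance to the candidate
            if gap > max_gap:
                break  # candidates are sorted by x0, so later ones are farther
            shorter = min(a[3] - a[1], b[3] - b[1])
            if vertical_overlap(a, b) >= min_overlap_ratio * shorter:
                parent[find(i)] = find(j)  # link to the nearest match only
                break

    # Collect the chains and order them top to bottom.
    groups: Dict[int, List[Rect]] = {}
    for i, r in enumerate(rects):
        groups.setdefault(find(i), []).append(r)
    lines = [sorted(g, key=lambda r: r[0]) for g in groups.values()]
    lines.sort(key=lambda g: min(r[1] for r in g))
    return lines
```

Because each link depends only on a pair of neighbors, a line that drifts gradually up or down the page can still be followed, even when its first and last rectangles barely overlap vertically. That is the sense in which a purely local test tolerates skew or warping in this sketch.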

Original language: English
Pages (from-to): 21-31
Number of pages: 11
Journal: Systems and Computers in Japan
Volume: 26
Issue number: 13
DOI
Publication status: Published - 1995

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Information Systems
  • Hardware and Architecture
  • Computational Theory and Mathematics
