25.2.2.2.3 Text Line Extraction in Documents

Document Analysis. Text Line Extraction. Text Line Segmentation. Printed text, not cursive script.
See also Cursive Script, Historical Documents, Text Line Segmentation, Script Line, Segmentation, Text Line Extraction.

Chen, S., Haralick, R.M., Phillips, I.T.,
Extraction of Text Lines and Text Blocks on Document Images Based on Statistical Modeling,
IJIST(7), No. 4, Winter 1996, pp. 343-356. 9612
BibRef

Wu, J.C.[Jui-Chen], Hsieh, J.W.[Jun-Wei], Chen, Y.S.[Yung-Sheng],
Morphology-based text line extraction,
MVA(19), No. 3, May 2008, pp. 195-207.
Springer DOI 0803
BibRef

van Beusekom, J.[Joost], Shafait, F.[Faisal], Breuel, T.M.[Thomas M.],
Text-line examination for document forgery detection,
IJDAR(16), No. 2, June 2013, pp. 189-207.
Springer DOI 1306
See also Document cleanup using page frame detection. BibRef

Kramer, M.[Martin], Afzal, M.Z.[Muhammad Zeshan], Bukhari, S.S.[Syed Saqib], Shafait, F.[Faisal], Breuel, T.M.[Thomas M.],
Robust stereo correspondence for documents by matching connected components of text-lines with dynamic programming,
ICPR12(734-737).
WWW Link. 1302
BibRef

Afzal, M.Z.[Muhammad Zeshan], Bukhari, S.S.[Syed Saqib], Kramer, M.[Martin], Shafait, F.[Faisal], Breuel, T.M.[Thomas M.],
Robust stereo matching for document images using parameter selection of text-line extraction,
ICPR12(331-334).
WWW Link. 1302
BibRef

Chakraborty, D.[Dibyayan], Pal, U.[Umapada],
Baseline detection of multi-lingual unconstrained handwritten text lines,
PRL(74), No. 1, 2016, pp. 74-81.
Elsevier DOI 1604
Handwriting recognition BibRef

Chaudhuri, B.B.[Bidyut B.], Adak, C.[Chandranath],
An approach for detecting and cleaning of struck-out handwritten text,
PR(61), No. 1, 2017, pp. 282-294.
Elsevier DOI 1705
Crossed-out text BibRef

Vo, Q.N.[Quang Nhat], Kim, S.H.[Soo Hyung], Yang, H.J.[Hyung Jeong], Lee, G.S.[Guee Sang],
Text line segmentation using a fully convolutional network in handwritten document images,
IET-IPR(12), No. 3, March 2018, pp. 438-446.
DOI Link 1802
BibRef

Dinh, T.N.[Toan Nguyen], Park, J.H.[Jong-Hyun], Lee, G.S.[Guee-Sang],
Text localization using image cues and text line information,
ICIP10(2261-2264).
IEEE DOI 1009
BibRef

Pastor, M.[Moisés],
Text baseline detection, a single page trained system,
PR(94), 2019, pp. 149-161.
Elsevier DOI 1906
BibRef

Grüning, T.[Tobias], Leifert, G.[Gundram], Strauß, T.[Tobias], Michael, J.[Johannes], Labahn, R.[Roger],
A two-stage method for text line detection in historical documents,
IJDAR(22), No. 3, September 2019, pp. 285-302.
Springer DOI 1909
BibRef

Leow, C.S.[Chee Siang], Yajima, H.[Hideaki], Kitagawa, T.[Tomoki], Nishizaki, H.[Hiromitsu],
Single-Line Text Detection in Multi-Line Text with Narrow Spacing for Line-Based Character Recognition,
IEICE(E106-D), No. 12, December 2023, pp. 2097-2106.
WWW Link. 2312
BibRef

Li, Z.Y.[Zi-Yan], Jin, L.W.[Lian-Wen], Zhang, C.Q.[Cheng-Quan], Zhang, J.X.[Jia-Xin], Xie, Z.C.[Ze-Cheng], Lyu, P.Y.[Peng-Yuan], Yao, K.[Kun],
Irregular text block recognition via decoupling visual, linguistic, and positional information,
PR(153), 2024, pp. 110516.
Elsevier DOI Code:
WWW Link. 2405
Scene text recognition, Irregular text recognition, Text block recognition, Character spotting, Linkage reasoning BibRef


Li, D.[Deng], Wu, Y.[Yue], Zhou, Y.C.[Yi-Cong],
Linecounter: Learning Handwritten Text Line Segmentation By Counting,
ICIP21(929-933)
IEEE DOI 2201
Deep learning, Training, Image segmentation, Text recognition, Annotations, Semantics, handwritten text line segmentation, document analysis BibRef

Quirós, L.[Lorenzo], Vidal, E.[Enrique],
Learning to Sort Handwritten Text Lines in Reading Order through Estimated Binary Order Relations,
ICPR21(7661-7668)
IEEE DOI 2105
Text analysis, Text recognition, Layout, Probabilistic logic, Indexes, Task analysis, Sorting, document layout analysis, reading order BibRef

Boillet, M.[Mélodie], Kermorvant, C.[Christopher], Paquet, T.[Thierry],
Multiple Document Datasets Pre-training Improves Text Line Detection With Deep Neural Networks,
ICPR21(2134-2141)
IEEE DOI 2105
Training, Measurement, Image segmentation, Analytical models, Text analysis, Layout, Neural networks, Document Layout Analysis, Deep Learning BibRef

Wödlinger, M.[Matthias], Sablatnig, R.[Robert],
Text Baseline Recognition Using a Recurrent Convolutional Neural Network,
ICPR21(4673-4679)
IEEE DOI 2105
Deep learning, Image segmentation, Handwriting recognition, Text analysis, Text recognition, Pipelines, Neural networks, Historical document analysis BibRef

Vo, Q.N., Lee, G.,
Dense prediction for text line segmentation in handwritten document images,
ICIP16(3264-3268)
IEEE DOI 1610
Decision support systems BibRef

de Lima, O., Janakiraman, S., Saber, E., Day, D.C., Bauer, P., Shaw, M., Twede, R., Lea, P.,
Signature line detection in scanned documents,
ICIP16(3254-3258)
IEEE DOI 1610
Decision support systems BibRef

Li, W.[Wei], Breier, M.[Matthias], Merhof, D.[Dorit],
Skew correction and line extraction in binarized printed text images,
ICIP15(472-476)
IEEE DOI 1511
binary image processing BibRef

Moysset, B.[Bastien], Kermorvant, C.[Christopher], Wolf, C.[Christian], Louradour, J.[Jerome],
Paragraph text segmentation into lines with Recurrent Neural Networks,
ICDAR15(456-460)
IEEE DOI 1511
BibRef

Wang, L.[Liuan], Fan, W.[Wei], Sun, J.[Jun], Naoi, S.[Satoshi], Hiroshi, T.[Tanaka],
Text line extraction in document images,
ICDAR15(191-195)
IEEE DOI 1511
MSER BibRef

Ha, S.J.[Seong Jong], Jin, B.[Bora], Cho, N.I.[Nam Ik],
Fast text line extraction in document images,
ICIP12(797-800).
IEEE DOI 1302
BibRef

Pastor-Pellicer, J.[Joan], España-Boquera, S.[Salvador], Castro-Bleda, M.J., Zamora-Martinez, F.[Francisco],
A combined Convolutional Neural Network and Dynamic Programming approach for text line normalization,
ICDAR15(341-345)
IEEE DOI 1511
BibRef

Kimura, T.[Tomotaka], Premachandra, C.[Chinthaka], Kawanaka, H.[Hiroharu],
Simultaneous Mixed Vertical and Horizontal Handwritten Japanese Character Line Detection,
ICCVG16(564-572).
Springer DOI 1611
BibRef

Premachandra, C.[Chinthaka], Goto, K.[Katsunari], Tsuruoka, S.[Shinji], Kawanaka, H.[Hiroharu], Takase, H.[Haruhiko],
Speedy Character Line Detection Algorithm Using Image Block-Based Histogram Analysis,
ICIAR15(481-488).
Springer DOI 1507
BibRef

Hyun, J.I.[Jung Il], Kim, H.K.[Hae Kwang], Oh, W.G.[Weon Gun],
Fast text line detection by finding linear connected components on Canny edge image,
FCV15(1-4)
IEEE DOI 1506
edge detection BibRef

Javed, M.[Mohammed], Nagabhushan, P., Chaudhuri, B.B.,
Automatic extraction of correlation-entropy features for text document analysis directly in run-length compressed domain,
ICDAR15(1-5)
IEEE DOI 1511
BibRef
And:
A direct approach for word and character segmentation in run-length compressed documents with an application to word spotting,
ICDAR15(216-220)
IEEE DOI 1511
BibRef
Earlier:
Extraction of line-word-character segments directly from run-length compressed printed text-documents,
NCVPRIPG13(1-4)
IEEE DOI 1408
Compressed document feature extraction. document image processing BibRef

Bukhari, S.S., Shafait, F., Breuel, T.M.,
Towards Generic Text-Line Extraction,
ICDAR13(748-752)
IEEE DOI 1312
document image processing BibRef

Itani, Y., Hirano, T., Ishii, J.,
Text Line Extraction Method Using Domain-Based Active Contour Model,
ICDAR13(1230-1234)
IEEE DOI 1312
document image processing BibRef

dos Santos, R.P.[Rodolfo P.], Clemente, G.S.[Gabriela S.], Ren, T.I.[Tsang Ing], Cavalcanti, G.D.C.[George D.C.],
Text Line Segmentation Based on Morphology and Histogram Projection,
ICDAR09(651-655).
IEEE DOI 0907
BibRef

Bai, Z.L.[Zhen-Long], Huo, Q.A.[Qi-Ang],
A goal-oriented verification-based approach for target text line extraction from a document image captured by a pen scanner,
ICPR04(II: 574-577).
IEEE DOI 0409
BibRef

Bai, Z.L.[Zhen-Long], Huo, Q.A.[Qi-Ang],
Underline detection and removal in a document image using multiple strategies,
ICPR04(II: 578-581).
IEEE DOI 0409
BibRef
Earlier:
An approach to extracting the target text line from a document image captured by a pen scanner,
ICDAR03(76-80).
IEEE DOI 0311
BibRef

Nakano, Y.[Yasuaki], Hananoi, T.[Toshihiro], Miyao, H.[Hidetoshi], Maruyama, M.[Minoru], Maruyama, K.I.[Ken-Ichi],
A Document Analysis System Based on Text Line Matching of Multiple OCR Outputs,
DAS04(463-471).
Springer DOI 0505
BibRef

Deforges, O., Barba, D.,
A fast multiresolution text line and non text-line structures extraction and discrimination scheme for document image analysis,
ICIP94(I: 134-138).
IEEE DOI 9411
BibRef

Chapter on OCR, Document Analysis and Character Recognition Systems continues in
Text Detection, Find Text in General Scenes, Scene Text.


Last update: Sep 28, 2024 at 17:47:54