25.2.2.2 Document Layout, Document Segmentation, Page Layout, Structure Analysis

Chapter Contents (Back)
Document Analysis. Segmentation. Document Segmentation. Layout Analysis. Document Layout. Page Segmentation. Application, Document Layout. Application, Page Layout.
See also Page Segmentation, General Evaluations.


See also Block Segmentation and Text Extraction in Mixed Text/Image Documents.

Goldwasser, S.M., Troxel, D.E.,
Page Composition of Continuous Tone Imagery,
CVGIP(26), No. 1, April 1984, pp. 30-44.
Elsevier DOI BibRef 8404

Okawa, Y.[Yoshikuni],
A Structural Analysis of Visual Form on Packaging Graphics and Its Use in an Automated Design System,
CVGIP(43), No. 2, August 1988, pp. 265-278.
Elsevier DOI BibRef 8808
Earlier:
Identification of Packaged-in-a-box Goods for Designing a Part of an Intelligent Cash Register,
ICPR80(150-152). Where to put the graphics to not hide the picture. BibRef

Srihari, S.N., and Govindaraju, V.,
Analysis of Textual Images Using the Hough Transform,
MVA(2), 1989, pp. 141-153. BibRef 8900

Peppers, N.A.[Norman A.], Young, J.R.[James R.], Nishi, H.[Hisami], Ueno, H.[Hiroshi],
Page segmentor,
US_Patent4,817,169, March 28, 1989.
WWW Link. BibRef 8903

O'Gorman, L.,
The Document Spectrum for Page Layout Analysis,
PAMI(15), No. 11, November 1993, pp. 1162-1173.
IEEE DOI Determine the structure of the document for storage and recognition. (For evaluation:
See also Empirical Performance Evaluation Methodology and Its Application to Page Segmentation Algorithms. ) BibRef 9311

Krishnamoorthy, M., Nagy, G., Seth, S., and Viswanathan, M.,
Syntatic Segmentation and Labeling of Digitized Pages from Technical Journals,
PAMI(15), No. 7, July 1993, pp. 737-747.
IEEE DOI A more complete version of the following paper and system. Error correction with backtracking. Computationally complex. Understanding of how documents are put together. BibRef 9307

Viswanathan, M.,
Analysis of Scanned documents: A Syntactic Approach,
SDIA92(xx-yy). BibRef 9200

Hones, F., Lichter, J.,
Layout Extraction of Mixed-Mode Documents,
MVA(7), No. 4, 1994, pp. 237-246. BibRef 9400

Saitoh, T., Yamaai, T., Tachikawa, M.,
Document Image Segmentation and Layout Analysis,
IEICE(Info Sys 77), No. 7, 1994, pp. 778-784. BibRef 9400

Saitoh, T., Pavlidis, T.,
Page segmentation without rectangle assumption,
ICPR92(II:277-280).
IEEE DOI 9208
BibRef

Cullen, J.F.[John F.], Ejiri, K.[Koichi],
Segmentation of text, picture and lines of a document image,
US_Patent5,335,290, August 2, 1994.
WWW Link. BibRef 9408
And: US_Patent5,465,304, Nov 7, 1995
WWW Link. BibRef

Peairs, M.[Mark],
Method of selecting a target document using features of an example page,
US_Patent5,717,940, Feb 10, 1998
WWW Link. BibRef 9802

Peairs, M.[Mark], Hull, J.J.[Jonathan J.], Cullen, J.F.[John F.],
Automatic document classification using text and images,
US_Patent7,039,856, May 2, 2006
WWW Link. BibRef 0605

Kopec, G.E., Chou, P.A.,
Document Image Decoding Using Markov Source Models,
PAMI(16), No. 6, June 1994, pp. 602-617.
IEEE DOI BibRef 9406
Earlier:
Document image decoding,
ICIP94(II: 36-40).
IEEE DOI 9411
BibRef
Earlier:
Automatic Generation of Custom Document Image Decoders,
ICDAR93(xx-yy). BibRef

Kam, A.C.,
Heuristic Document Image Decoding Using Markov Source Models,
MITMasters Thesis, June 1993. BibRef 9306

Kam, A.C., Kopec, G.E.,
Document Image Decoding by Heuristic Search,
PAMI(18), No. 9, September 1996, pp. 945-950.
IEEE DOI Heuristic Search. BibRef 9609
Earlier:
Separable Source Models for Document Image Decoding,
SPIE(2422), February 1995, pp. 84-97. BibRef

Shiau, J.N.[Jeng-Nan],
Automatic image segmentation for color documents,
US_Patent5,341,226, August 23, 1994.
WWW Link. BibRef 9408

Hayashi, N.[Naoki], Saito, K.[Kazuo],
Document layout processing method and device for carrying out the same,
US_Patent5,379,373, January 3, 1995.
WWW Link. BibRef 9501

Ozaki, M.[Masaharu],
Method and apparatus for document segmentation by background analysis,
US_Patent5,555,556, September 10, 1996
WWW Link. BibRef 9609

Kopec, G.E.[Gary E.], Lomelin, M.[Mauricio],
Supervised Template Estimation for Document Image Decoding,
PAMI(19), No. 12, December 1997, pp. 1313-1324.
IEEE DOI 9712
BibRef
Earlier:
Document image Decoding Approach to Character Template Estimation,
ICIP96(II: 213-216).
IEEE DOI BibRef
And:
Document Specific Character Template Estimation,
SPIE(2660), 1996, pp. 14-26. Templates for recognizing characters. BibRef

Kopec, G.E.[Gary E.],
Multilevel Character Templates for Document Image Decoding,
SPIE(3027), 1997, pp. xx-yy. BibRef 9700
Earlier:
Document Image Decoding in the Berkeley Digital Library Project,
SPIE(2660), 1996, pp. 2-13. BibRef
And:
Document Image Decoding in the Berkeley Digital Library,
ICIP96(II: 769-772).
IEEE DOI BibRef

Dengel, A.R., Dubiel, F.,
Computer Understanding of Document Structure,
IJIST(7), No. 4, Winter 1996, pp. 271-278. 9612
BibRef

Niyogi, D., Srihari, S.N.,
Integrated Approach to Document Decomposition and Structural-Analysis,
IJIST(7), No. 4, Winter 1996, pp. 330-342. 9612
BibRef
Earlier:
Knowledge-Based Derivation of Document Logical Structure,
ICDAR95(472-475). Bottom up approach. 3 levels of rules, knowledge, control and strategy. Accuracy varies (48-100%). Has 160 rules. BibRef

Niyogi, D., Srihari, S.N.,
A Rule-Based System for Document Understanding,
AAAI-86(789-793). BibRef 8600

Simon, A., Pret, J.C., Johnson, A.P.,
A Fast Algorithm for Bottom-Up Document Layout Analysis,
PAMI(19), No. 3, March 1997, pp. 273-277.
IEEE DOI 9704
BibRef

Liu, J.M.[Ji-Ming], Tang, Y.Y.[Yuan Y.], Suen, C.Y.[Ching Y.],
Chinese Document Layout Analysis Based on Adaptive Split-and-Merge and Qualitative Spatial Reasoning,
PR(30), No. 8, August 1997, pp. 1265-1278.
Elsevier DOI 9708
BibRef

Tang, Y.Y., Ma, H., Liu, J.M., Li, B.F., Xi, D.H.,
Multiresolution Analysis in Extraction of Reference Lines from Documents with Gray-Level Background,
PAMI(19), No. 8, August 1997, pp. 921-926.
IEEE DOI 9709
Find reference lines to determine the structure of the document. BibRef

Bayer, T.A.[Thomas A.], Kressel, U.[Ulrich], Mogg-Schneider, H.U.[Heike U.], Renz, I.[Ingrid],
Categorizing Paper Documents,
CVIU(70), No. 3, June 1998, pp. 299-306.
DOI Link BibRef 9806

Caelli, T.M.[Terry M.], and Dillon, C.[Craig],
CITE: A Trainable Image Annotation System,
PRL(18), No. 11-13, November 1997, pp. 1247-1252. 9806
BibRef

Dillon, C.[Craig], and Caelli, T.M.[Terry M.],
Learning Image Annotation: The Cite System,
Videre(1), No. 2, Winter 1998, pp. 90-121. Generate automatic annotations. Apply to airports and office scenes. Region and color based analysis.
HTML Version.
PDF File. BibRef 9800

Cooperman, R.S.[Robert S.],
System for document layout analysis,
US_Patent5,784,487, Jul 21, 1998
WWW Link. BibRef 9807

Ancin, H.[Hakan],
Document segmentation system,
US_Patent5,956,468, September 21, 1999.
WWW Link. Text and graphics. BibRef 9909

Chao, H.[Hui], Bloomberg, D.S.[Dan S.],
Method and system for document segmentation,
US_Patent6,904,170, June 7, 2005.
WWW Link. Projection profiles. BibRef 0506

Bloomberg, D.S.[Dan S.],
Method and article of manufacture for determining whether a scanned image is an original image or fax image,
US_Patent5,828,771, Oct 27, 1998
WWW Link. BibRef 9810

Nakayama, T.[Takehiro],
Method and apparatus for document classification from degraded images,
US_Patent5,909,510, Jun 1, 1999
WWW Link. BibRef 9906

Crabtree, R.N.[Ralph N.], Peng, A.[Antai],
Knowledge-based document analysis system,
US_Patent5,937,084, Aug 10, 1999
WWW Link. BibRef 9908

Li, J., Gray, R.M.,
Context-Based Multiscale Classification of Document Images Using Wavelet Coefficient Distributions,
IP(9), No. 9, September 2000, pp. 1604-1616.
IEEE DOI 0008
BibRef

Ageenko, E., Fränti, P.,
Context-based filtering of document images,
PRL(21), No. 6-7, June 2000, pp. 483-491. 0006
BibRef

Lee, K.H.[Kyong-Ho], Choy, Y.C.[Yoon-Chul], Cho, S.B.[Sung-Bae],
Geometric Structure Analysis of Document Images: A Knowledge-Based Approach,
PAMI(22), No. 11, November 2000, pp. 1224-1240.
IEEE DOI 0012
Journal pages. Combine botton-up and top-down approach. Segmentation then identification. BibRef

Shin, C.[Christian], Doermann, D.S.[David S.], Rosenfeld, A.[Azriel],
Classification of document pages using structure-based features,
IJDAR(3), No. 4, 2001, pp. 232-247.
Springer DOI 0106
BibRef

Liang, J.S.[Ji-Sheng], Phillips, I.T.[Ihsin T.], Haralick, R.M.[Robert M.],
An Optimization Methodology for Document Structure Extraction on Latin Character Documents,
PAMI(23), No. 7, July 2001, pp. 719-734.
IEEE DOI 0108
BibRef

Chetverikov, D., Liang, J., Komuves, J., Haralick, R.M.,
Zone Classification Using Texture Features,
ICPR96(III: 676-680).
IEEE DOI 9608
(Hungarian Academy of Sciences, H) BibRef

Liang, J., Ha, J., Haralick, R.M., and Phillips, I.T.,
Document Layout Structure Extraction Using Bounding Boxes of Different Entities,
WACV96(278-283).
IEEE DOI 9609
BibRef

Haralick, R.M.,
Document Image Analysis: Geometric and Logical Layout,
CVPR94(385-390).
IEEE DOI BibRef 9400

Klink, S.[Stefan], Kieninger, T.[Thomas],
Rule-based document structure understanding with a fuzzy combination of layout and textual features,
IJDAR(4), No. 1, 2001, pp. 18-26.
Springer DOI 0111
BibRef

Altamura, O.[Oronzo], Esposito, F.[Floriana], Malerba, D.[Donato],
Transforming paper documents into XML format with WISDOM++,
IJDAR(4), No. 1, 2001, pp. 2-17.
Springer DOI 0111
BibRef

Lee, S.W.[Seong-Whan], Ryu, D.S.[Dae-Seok],
Parameter-Free Geometric Document Layout Analysis,
PAMI(23), No. 11, November 2001, pp. 1240-1256.
IEEE DOI 0112
Segment into maximal homogeneous regions, identify as text, graphics, etc. Periodicity measure for text. BibRef

Ryu, D.S., Kang, S.M., Lee, S.W.,
Parameter-independent Geometric Document Layout Analysis,
ICPR00(Vol IV: 397-400).
IEEE DOI 0009
BibRef

Hull, J.J., Lee, D.S.,
Simultaneous Highlighting of Paper and Electronic Documents,
ICPR00(Vol IV: 401-404).
IEEE DOI 0009
BibRef

Acharyya, M.[Mausumi], Kundu, M.K.[Malay K.],
Document image segmentation using wavelet scale-space features,
CirSysVideo(12), No. 12, December 2002, pp. 1117-1127.
IEEE Top Reference. 0301
BibRef
Earlier:
Multiscale Segmentation of Document Images Using M-Band Wavelets,
CAIP01(510 ff.).
Springer DOI 0210

See also adaptive approach to unsupervised texture segmentation using M-Band wavelet transform, An. BibRef

Lee, J.Y.[Ji-Yeon], Park, J.S.[Jeong-Seon], Byun, H.R.[Hye-Ran], Moon, J.[Jongsub], Lee, S.W.[Seong-Whan],
Automatic generation of structured hyperdocuments from document images,
PR(35), No. 2, February 2002, pp. 485-503.
Elsevier DOI 0201
BibRef

Lee, J.Y., Choi, S.H., Lee, S.W.,
Automatic Generation of Structured Hyperdocuments from Multi-column Document Images,
ICPR00(Vol IV: 422-425).
IEEE DOI 0009
BibRef

Lam, W.[Wai], Han, Y.[Yiqiu],
Automatic textual document categorization based on generalized instance sets and a metamodel,
PAMI(25), No. 5, May 2003, pp. 628-633.
IEEE Abstract. 0304
Generalized instance set. (GIS) BibRef

Bagdanov, A.D.[Andrew D.], Worring, M.[Marcel],
Multiscale Document Description Using Rectangular Granulometries,
IJDAR(6), No. 3, March 2004, pp. 181-191.
Springer DOI 0406
BibRef
Earlier: DAS02(445 ff.).
Springer DOI 0303
BibRef
Earlier:
Fine-grained document genre classification using first order random graphs,
ICDAR01(79-83).
IEEE DOI 0109
BibRef

Chang, F.[Fu], Chu, S.Y.[Shih-Yu], Chen, C.Y.[Chi-Yen],
Chinese document layout analysis using an adaptive regrouping strategy,
PR(38), No. 2, February 2005, pp. 261-271.
Elsevier DOI 0411
BibRef

Wu, C.C.[Chung-Chih], Chou, C.H.[Chien-Hsing], Chang, F.[Fu],
A machine-learning approach for analyzing document layout structures with two reading orders,
PR(41), No. 10, October 2008, pp. 3200-3213.
Elsevier DOI 0808
Binary decision; Document layout analysis; Reading order; Support vector machine; Taboo box; Textline; Text region BibRef

Altamura, O.[Oronzo], Berardi, M.[Margherita], Ceci, M.[Michelangelo], Malerba, D.[Donato], Varlaro, A.[Antonio],
Using colour information to understand censorship cards of film archives,
IJDAR(9), No. 2-4, April 2007, pp. 281-297.
Springer DOI 0704
BibRef
Earlier: A2, A1, A3, A4, Only:
A color-based layout analysis to process censorship cards of film archives,
ICDAR05(II: 1110-1114).
IEEE DOI 0508
BibRef

Natarajan, P.[Prem], Prasad, R.[Rohit], Subramanian, K.[Krishna], Saleem, S.[Shirin], Choi, F.[Fred], Schwartz, R.[Rich],
Finding structure in noisy text: Topic classification and unsupervised clustering,
IJDAR(10), No. 3-4, December 2007, pp. 187-198.
Springer DOI 0712
BibRef

Subramanian, K.[Krishna], Prasad, R.[Rohit], Natarajan, P.[Prem],
Robust named entity detection from optical character recognition output,
IJDAR(14), No. 2, June 2011, pp. 189-200.
WWW Link. 1106
BibRef

Cao, H.[Huaigu], Prasad, R.[Rohit], Saleem, S.[Shirin], Natarajan, P.[Premkumar],
Unsupervised HMM Adaptation Using Page Style Clustering,
ICDAR09(1091-1095).
IEEE DOI 0907
BibRef

Lemaitre, A.[Aurélie], Camillerapp, J.[Jean], Coüasnon, B.[Bertrand],
Multiresolution cooperation makes easier document structure recognition,
IJDAR(11), No. 2, November 2008, pp. xx-yy.
Springer DOI 0810
BibRef
Earlier:
Contribution of Multiresolution Description for Archive Document Structure Recognition,
ICDAR07(247-251).
IEEE DOI 0709
BibRef

Jamieson, M.[Michael], Fazly, A.[Afsaneh], Stevenson, S.[Suzanne], Dickinson, S.J.[Sven J.], Wachsmuth, S.[Sven],
Using Language to Learn Structured Appearance Models for Image Annotation,
PAMI(32), No. 1, January 2010, pp. 148-164.
IEEE DOI 0912
BibRef
Earlier: A1, A2, A4, A3, A5:
Learning Structured Appearance Models from Captioned Images of Cluttered Scenes,
ICCV07(1-8).
IEEE DOI 0710
Given an unstructured collection of captioned images of cluttered scenes featuring a variety of objects. Few image features relate to captions, some caption words do not relate to image features. Learn meaningful feature configurations across multiple images. Graph based model. BibRef

Jamieson, M.[Michael], Eskin, Y.[Yulia], Fazly, A.[Afsaneh], Stevenson, S.[Suzanne], Dickinson, S.J.[Sven J.],
Discovering hierarchical object models from captioned images,
CVIU(116), No. 7, July 2012, pp. 842-853.
Elsevier DOI 1202
BibRef
Earlier:
Discovering Multipart Appearance Models from Captioned Images,
ECCV10(V: 183-196).
Springer DOI 1009
Language-vision integration; Object recognition; Automatic image annotation; Learning hierarchical models BibRef

Moringen, J.[Jan], Wachsmuth, S.[Sven], Dickinson, S.J.[Sven J.], Stevenson, S.[Suzanne],
Learning Visual Compound Models from Parallel Image-Text Datasets,
DAGM08(xx-yy).
Springer DOI 0806
BibRef

Anaya-Sanchez, H.[Henry], Pons-Porrata, A.[Aurora], Berlanga-Llavori, R.[Rafael],
A document clustering algorithm for discovering and describing topics,
PRL(31), No. 6, 15 April 2010, pp. 502-510.
Elsevier DOI 1004
BibRef
Earlier:
A New Document Clustering Algorithm for Topic Discovering and Labeling,
CIARP08(161-168).
Springer DOI 0809
Document clustering; Topic discovery; Topic description BibRef

Lias-Rodríguez, A.[Alexsey], Pons-Porrata, A.[Aurora],
BR: A New Method for Computing All Typical Testors,
CIARP09(433-440).
Springer DOI 0911
BibRef

Fonseca-Bruzón, A.[Adrian], Gil-García, R.[Reynaldo], Pons-Porrata, A.[Aurora],
Using the alpha-beta-Neighborhood for Adaptive Document Filtering,
CIARP08(783-790).
Springer DOI 0809
BibRef

Gil-García, R.[Reynaldo], Pons-Porrata, A.[Aurora],
Improving the Dynamic Hierarchical Compact Clustering Algorithm by Using Feature Selection,
CIARP10(113-120).
Springer DOI 1011
BibRef

Pons-Porrata, A.[Aurora], Gil-García, R.[Reynaldo], Berlanga-Llavori, R.[Rafael],
Using Typical Testors for Feature Selection in Text Categorization,
CIARP07(643-652).
Springer DOI 0711
BibRef

Gil-García, R.[Reynaldo], Pons-Porrata, A.[Aurora],
Dynamic hierarchical algorithms for document clustering,
PRL(31), No. 6, 15 April 2010, pp. 469-477.
Elsevier DOI 1004
BibRef
Earlier:
A Speed-Up Hierarchical Compact Clustering Algorithm for Dynamic Document Collections,
CIARP09(379-386).
Springer DOI 0911
BibRef
Earlier:
Hierarchical Star Clustering Algorithm for Dynamic Document Collections,
CIARP08(187-194).
Springer DOI 0809
BibRef
Earlier:
A New Nearest Neighbor Rule for Text Categorization,
CIARP06(814-823).
Springer DOI 0611
Hierarchical clustering; Dynamic clustering; Overlapped clustering BibRef

Artigas-Fuentes, F.J.[Fernando José], Gil-García, R.[Reynaldo], Badía-Contelles, J.M.[José Manuel], Pons-Porrata, A.[Aurora],
Fast k-NN Classifier for Documents Based on a Graph Structure,
CIARP10(228-235).
Springer DOI 1011
BibRef

Gil-García, R.J.[Reynaldo J.], Badía-Contelles, J.M.[Jose M.], Pons-Porrata, A.[Aurora],
A General Framework for Agglomerative Hierarchical Clustering Algorithms,
ICPR06(II: 569-572).
IEEE DOI 0609
BibRef

Erkilinc, S.[Sezer], Jaber, M.[Mustafa], Saber, E.[Eli], Bauer, P.[Peter], Depalov, D.[Dejan],
Analysis and classification for complex scanned documents,
SPIE(Newsroom), August 8, 2011.
DOI Link 1108
A fast and robust algorithm classifies text, image, strong edges or lines, and background regions in different types of scanned documents from plain fax covers to colorful flyers. BibRef

Rangoni, Y.[Yves], Belaïd, A.[Abdel], Vajda, S.[Szilárd],
Labelling logical structures of document images using a dynamic perceptive neural network,
IJDAR(15), No. 1, March 2012, pp. 45-55.
WWW Link. 1203
BibRef

Rangoni, Y.[Yves], Belaid, A.[Abdel],
Document Logical Structure Analysis Based on Perceptive Cycles,
DAS06(117-128).
Springer DOI 0602
BibRef
Earlier:
Data categorization for a context return applied to logical document structure recognition,
ICDAR05(I: 297-301).
IEEE DOI 0508
BibRef

Zhang, X., Hu, X., Hu, T., Park, E.K., Zhou, X.,
Utilizing Different Link Types to Enhance Document Clustering Based on Markov Random Field Model With Relaxation Labeling,
SMC-A(42), No. 5, September 2012, pp. 1167-1182.
IEEE DOI 1208
BibRef

Lee, I.[Ingyu], On, B.W.[Byung-Won],
An effective web document clustering algorithm based on bisection and merge,
AIR(36), No. 1, June 2011, pp. 69-85.
WWW Link. 1208
BibRef

Nielsen, F.,
Jeffreys Centroids: A Closed-Form Expression for Positive Histograms and a Guaranteed Tight Approximation for Frequency Histograms,
SPLetters(20), No. 7, 2013, pp. 657-660.
IEEE DOI 1307
document handling; document classification BibRef

Sun, M.[Ming], Priebe, C.E.[Carey E.],
Efficiency investigation of manifold matching for text document classification,
PRL(34), No. 11, 1 August 2013, pp. 1263-1269.
Elsevier DOI 1306
Manifold matching; MDS Procrustes; CCA; JOFC; Efficiency; Classification BibRef

Shen, C.C.[Cen-Cheng], Vogelstein, J.T.[Joshua T.], Priebe, C.E.[Carey E.],
Manifold matching using shortest-path distance and joint neighborhood selection,
PRL(92), No. 1, 2017, pp. 41-48.
Elsevier DOI 1705
Nonlinear transformation BibRef

Cote, M.[Melissa], Albu, A.B.[Alexandra Branzan],
Texture sparseness for pixel classification of business document images,
IJDAR(17), No. 3, September 2014, pp. 257-273.
WWW Link. 1408
BibRef
And:
Sparseness-Based Descriptors for Texture Segmentation,
ICPR14(1108-1113)
IEEE DOI 1412
Accuracy BibRef

Delaye, A.[Adrien], Lee, K.[Kibok],
A flexible framework for online document segmentation by pairwise stroke distance learning,
PR(48), No. 4, 2015, pp. 1197-1210.
Elsevier DOI 1502
Document analysis and recognition BibRef

Tran, T.A.[Tuan Anh], Na, I.S.[In Seop], Kim, S.H.[Soo Hyung],
Page segmentation using minimum homogeneity algorithm and adaptive mathematical morphology,
IJDAR(19), No. 3, September 2016, pp. 191-209.
Springer DOI 1609
BibRef

Dai-Ton, H.[Ha], Duc-Dung, N.[Nguyen], Duc-Hieu, L.[Le],
An adaptive over-split and merge algorithm for page segmentation,
PRL(80), No. 1, 2016, pp. 137-143.
Elsevier DOI 1609
Document analysis and recognition BibRef

Dey, S.[Soumyadeep], Mukherjee, J.[Jayanta], Sural, S.[Shamik],
Consensus-based clustering for document image segmentation,
IJDAR(19), No. 4, December 2016, pp. 351-368.
Springer DOI 1611
BibRef

Forczmanski, P.[Pawel], Markiewicz, A.[Andrzej],
Two-stage approach to extracting visual objects from paper documents,
MVA(27), No. 8, November 2016, pp. 1243-1257.
Springer DOI 1612
BibRef
Earlier: A2, A1:
Detection and Classification of Interesting Parts in Scanned Documents by Means of AdaBoost Classification and Low-Level Features Verification,
CAIP15(II:529-540).
Springer DOI 1511
BibRef

Eskenazi, S.[Sébastien], Gomez-Krämer, P.[Petra], Ogier, J.M.[Jean-Marc],
A comprehensive survey of mostly textual document segmentation algorithms since 2008,
PR(64), No. 1, 2017, pp. 1-14.
Elsevier DOI 1701
Document BibRef

Quirós, L.[Lorenzo], Martínez-Hinarejos, C.D.[Carlos D.], Toselli, A.H.[Alejandro H.], Vidal, E.[Enrique],
Interactive Layout Detection,
IbPRIA17(161-168).
Springer DOI 1706
BibRef

Drira, F.[Fadoua], Le Bourgeois, F.[Frank],
Mean-Shift segmentation and PDE-based nonlinear diffusion: Toward a common variational framework for foreground/background document image segmentation,
IJDAR(20), No. 3, September 2017, pp. 201-222.
Springer DOI 1708
BibRef

Drira, F.[Fadoua], Le Bourgeois, F.[Franck],
Denoising Textual Images Using Local/Non-local Smoothing Filters: A Comparative Study,
FHR12(521-526).
IEEE DOI 1302
BibRef

Drira, F.[Fadoua], Le Bourgeois, F.[Frank], Emptoz, H.[Hubert],
A Coupled Mean Shift-Anisotropic Diffusion Approach for Document Image Segmentation and Restoration,
ICDAR07(814-818).
IEEE DOI 0709
BibRef

Zhang, Y., Er, M.J., Zhao, R., Pratama, M.,
Multiview Convolutional Neural Networks for Multidocument Extractive Summarization,
Cyber(47), No. 10, October 2017, pp. 3230-3242.
IEEE DOI 1709
Computational modeling, Data mining, Feature extraction, Machine learning, Neural networks, Semantics, Convolutional neural networks (CNNs), deep learning, multidocument summarization (MDS), multiview learning, word, embedding BibRef

Zhu, A.[Anna], Zhang, C.[Chen], Li, Z.[Zhi], Xiong, S.[Shengwu],
Coarse-to-fine document localization in natural scene image with regional attention and recursive corner refinement,
IJDAR(22), No. 3, September 2019, pp. 351-360.
Springer DOI 1909
BibRef

Binmakhashen, G.M.[Galal M.], Mahmoud, S.A.[Sabri A.],
Document Layout Analysis: A Comprehensive Survey,
Surveys(52), No. 6, October 2019, pp. xx-yy.
DOI Link 2001
Survey, Document Layout. document structure analysis, Document segmentation, layout analysis, document image retrieval, document image understanding BibRef

Lu, T.[Tan], Dooms, A.[Ann],
Probabilistic homogeneity for document image segmentation,
PR(109), 2021, pp. 107591.
Elsevier DOI 2009
Probabilistic local text homogeneity, Random walk-and-check simulation, Bayesian cue integration, Document image segmentation BibRef

Li, Y.J.[Yu-Jie], Zhang, P.F.[Peng-Fei], Xu, X.[Xing], Lai, Y.[Yi], Shen, F.M.[Fu-Min], Chen, L.J.[Li-Jiang], Gao, P.X.[Peng-Xiang],
Few-shot prototype alignment regularization network for document image layout segementation,
PR(115), 2021, pp. 107882.
Elsevier DOI 2104
Meta-learning, Few-shot learning, Metric learning, Semantic segmentation BibRef

Raman, N.[Natraj], Shah, S.[Sameena], Veloso, M.[Manuela],
Synthetic document generator for annotation-free layout recognition,
PR(128), 2022, pp. 108660.
Elsevier DOI 2205
Synthetic image generation, Bayesian network, Layout analysis BibRef

Xu, J.S.[Jian-Shuang], Klein, J.[Johannes], Jochims, J.[Jörn], Weissner, N.[Niklas], Kays, R.[Rüdiger],
A reliable and unobtrusive approach to display area detection for imperceptible display camera communication,
JVCIR(85), 2022, pp. 103510.
Elsevier DOI 2205
Whiteboard scanning. Display area detection, Display camera communication, Rectangle detection, Pattern recognition, Visible light communication BibRef

Pisaneschi, L.[Lorenzo], Gemelli, A.[Andrea], Marinai, S.[Simone],
Automatic generation of scientific papers for data augmentation in document layout analysis,
PRL(167), 2023, pp. 38-44.
Elsevier DOI 2303
Document layout analysis, Transformers, Automatic document generation, Deep learning, Generative models BibRef

Wang, C.J.[Chao-Jie], Chen, B.[Bo], Duan, Z.B.[Zhi-Bin], Chen, W.C.[Wen-Chao], Zhang, H.[Hao], Zhou, M.Y.[Ming-Yuan],
Generative Text Convolutional Neural Network for Hierarchical Document Representation Learning,
PAMI(45), No. 4, April 2023, pp. 4586-4604.
IEEE DOI 2303
Probabilistic logic, Computational modeling, Semantics, Analytical models, Task analysis, Vocabulary, Data models, variational autoencoder BibRef

Mishra, P.[Prerna],
Domain adaptive learning for document layout analysis and object detection using classifier alignment mechanism,
SP:IC(116), 2023, pp. 116986.
Elsevier DOI 2307
Domain adaptation, Document object detection, Layout analysis, Classifier alignment, Deep neural network BibRef

Fei, Y.F.[Yue-Fan], Xu, X.L.[Xiao-Long],
GFMRC: A machine reading comprehension model for named entity recognition,
PRL(172), 2023, pp. 97-105.
Elsevier DOI 2309
Named entity recognition, Machine reading comprehension, Feature extraction, Context BibRef

Guo, P.C.[Peng-Cheng], Song, Y.H.[Yong-Hong], Deng, Y.[Yongbiao], Xie, K.K.[Kang-Kang], Xu, M.J.[Ming-Jie], Liu, J.H.[Jia-Hao], Ren, H.J.[Hai-Jun],
DCMAI: A Dynamical Cross-Modal Alignment Interaction Framework for Document Key Information Extraction,
CirSysVideo(34), No. 1, January 2024, pp. 504-517.
IEEE DOI 2401
BibRef


Cheng, H.[Hiuyi], Zhang, P.[Peirong], Wu, S.[Sihang], Zhang, J.X.[Jia-Xin], Zhu, Q.Y.[Qi-Yuan], Xie, Z.C.[Ze-Cheng], Li, J.[Jing], Ding, K.[Kai], Jin, L.W.[Lian-Wen],
M6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis,
CVPR23(15138-15147)
IEEE DOI 2309
BibRef

Tang, Z.N.[Zi-Neng], Yang, Z.[Ziyi], Wang, G.X.[Guo-Xin], Fang, Y.W.[Yu-Wei], Liu, Y.[Yang], Zhu, C.G.[Chen-Guang], Zeng, M.[Michael], Zhang, C.[Cha], Bansal, M.[Mohit],
Unifying Vision, Text, and Layout for Universal Document Processing,
CVPR23(19254-19264)
IEEE DOI 2309
BibRef

Shi, Y.Z.[Yu-Zhi], Kim, M.[Mijung], Chae, Y.[Yeongnam],
Multi-scale Cell-based Layout Representation for Document Understanding,
WACV23(3659-3668)
IEEE DOI 2302
Deep learning, Adaptation models, Codes, Computational modeling, Layout, Graphics processing units BibRef

de Nardin, A.[Axel], Zottin, S.[Silvia], Paier, M.[Matteo], Foresti, G.L.[Gian Luca], Colombi, E.[Emanuela], Piciarelli, C.[Claudio],
Efficient few-shot learning for pixel-precise handwritten document layout analysis,
WACV23(3669-3677)
IEEE DOI 2302
Training, Measurement, Text analysis, Semantic segmentation, Layout, Supervised learning, Optical character recognition, and un-supervised learning BibRef

Mathur, P.[Puneet], Jain, R.[Rajiv], Mehra, A.[Ashutosh], Gu, J.X.[Jiu-Xiang], Dernoncourt, F.[Franck], Anandhavelu, N., Tran, Q.[Quan], Kaynig-Fittkau, V.[Verena], Nenkova, A.[Ani], Manocha, D.[Dinesh], Morariu, V.I.[Vlad I.],
LayerDoc: Layer-wise Extraction of Spatial Hierarchical Structure in Visually-Rich Documents,
WACV23(3599-3609)
IEEE DOI 2302
Visualization, Automation, Layout, Semantics, Feature extraction, Mobile applications, Algorithms: Vision + language and/or other modalities BibRef

Yang, H.[Huichen], Hsu, W.[William],
Transformer-Based Approach for Document Layout Understanding,
ICIP22(4043-4047)
IEEE DOI 2211
Visualization, Layout, Pipelines, Neural networks, Object detection, Benchmark testing, Document Layout Understanding, Document Structure Extraction BibRef

Zhou, E.[Ejian], Wu, X.J.[Xing-Jiao], Xiao, L.[Luwei], Du, X.C.[Xiang-Cheng], Ma, T.L.[Tian-Long], He, L.[Liang],
Document Layout Analysis Via Positional Encoding,
ICIP22(1156-1160)
IEEE DOI 2211
Analytical models, Text analysis, Image coding, Layout, Predictive models, Maintenance engineering, deep learning BibRef

Gu, Z.X.[Zhang-Xuan], Meng, C.[Changhua], Wang, K.[Ke], Lan, J.[Jun], Wang, W.Q.[Wei-Qiang], Gu, M.[Ming], Zhang, L.Q.[Li-Qing],
XYLayoutLM: Towards Layout-Aware Multimodal Networks For Visually-Rich Document Understanding,
CVPR22(4573-4582)
IEEE DOI 2210
Visualization, Layout, Optical character recognition, Transformers, Encoding, Noise measurement, Document analysis and understanding, Vision + language BibRef

Shekhar, S.[Sumit], Guda, B.P.R.[Bhanu Prakash Reddy], Chaubey, A.[Ashutosh], Jindal, I.[Ishan], Jain, A.[Avneet],
OPAD: An Optimized Policy-based Active Learning Framework for Document Content Analysis,
FaDE-TCV22(2825-2835)
IEEE DOI 2210
Deep learning, Measurement, Learning systems, Text analysis, Annotations, Layout, Natural languages BibRef

Minouei, M.[Mohammad], Soheili, M.R.[Mohammad Reza], Stricker, D.[Didier],
Document Layout Analysis with an Enhanced Object Detector,
IPRIA21(1-5)
IEEE DOI 2201
Text analysis, Layout, Detectors, Optical computing, Optical detectors, Optical imaging, deep learning BibRef

Yang, H.C.[Hui-Chen], Hsu, W.H.[William H.],
Vision-Based Layout Detection from Scientific Literature using Recurrent Convolutional Neural Networks,
ICPR21(6455-6462)
IEEE DOI 2105
Training, Adaptation models, Text analysis, Layout, Transfer learning, Training data, Object detection, Document Layout Analysis BibRef

Davoudi, H.[Homa], Fiorucci, M.[Marco], Traviglia, A.[Arianna],
Ancient Document Layout Analysis: Autoencoders meet Sparse Coding,
ICPR21(5936-5942)
IEEE DOI 2105
Training, Image segmentation, Text analysis, Image analysis, Layout, Neural networks, Training data BibRef

Liebl, B.[Bernhard], Burghardt, M.[Manuel],
An Evaluation of DNN Architectures for Page Segmentation of Historical Newspapers,
ICPR21(5153-5160)
IEEE DOI 2105
Training, Systematics, Particle separators, Layout, Transfer learning, Training data, Computer architecture, optical character recognition software BibRef

Sarkar, M.[Mausoom], Aggarwal, M.[Milan], Jain, A.[Arneh], Gupta, H.[Hiresh], Krishnamurthy, B.[Balaji],
Document Structure Extraction Using Prior Based High Resolution Hierarchical Semantic Segmentation,
ECCV20(XXVIII:649-666).
Springer DOI 2011
BibRef

Li, K., Wigington, C., Tensmeyer, C., Zhao, H., Barmpalios, N., Morariu, V.I., Manjunatha, V., Sun, T., Fu, Y.,
Cross-Domain Document Object Detection: Benchmark Suite and Method,
CVPR20(12912-12921)
IEEE DOI 2008
Portable document format, US Department of Defense, Object detection, Benchmark testing, Layout, Detectors BibRef

Patil, A.G., Ben-Eliezer, O., Perel, O., Averbuch-Elor, H.,
READ: Recursive Autoencoders for Document Layout Generation,
WTDDL20(2316-2325)
IEEE DOI 2008
Layout, Training, Task analysis, Training data, Decoding, Neural networks, Semantics BibRef

Bakkali, S., Ming, Z., Coustaty, M., Rusiñol, M.,
Cross-Modal Deep Networks For Document Image Classification,
ICIP20(2556-2560)
IEEE DOI 2011
BibRef
And:
Visual and Textual Deep Feature Fusion for Document Image Classification,
WTDDL20(2394-2403)
IEEE DOI 2008
Visualization, Feature extraction, Optical character recognition software, Bit error rate, deep CNNs. Task analysis, Semantics, Neural networks BibRef

Rastogi, M., Ali, S.A., Rawat, M., Vig, L., Agarwal, P., Shroff, G., Srinivasan, A.,
Information Extraction from Document Images via FCA based Template Detection and Knowledge Graph Rule Induction,
WTDDL20(2377-2385)
IEEE DOI 2008
Semantics, Visualization, Information retrieval, Data mining, Noise measurement, Machine learning, Lattices BibRef

Singh, P.[Pranaydeep], Varadarajan, S.[Srikrishna], Singh, A.N.[Ankit Narayan], Srivastava, M.M.[Muktabh Mayank],
Multi-domain Document Layout Understanding Using Few-shot Object Detection,
ICIAR20(II:89-99).
Springer DOI 2007
BibRef

Haurilet, M., Al-Halah, Z., Stiefelhagen, R.,
SPaSe - Multi-Label Page Segmentation for Presentation Slides,
WACV19(726-734)
IEEE DOI 1904
image segmentation, learning (artificial intelligence), multilabel page segmentation, presentation slides, Semantics BibRef

Carraggi, A.[Angelo], Cornia, M.[Marcella], Baraldi, L.[Lorenzo], Cucchiara, R.[Rita],
Visual-Semantic Alignment Across Domains Using a Semi-Supervised Approach,
MultLearnApp18(VI:625-640).
Springer DOI 1905
BibRef

Baraldi, L.[Lorenzo], Cornia, M.[Marcella], Grana, C., Cucchiara, R.[Rita],
Aligning Text and Document Illustrations: Towards Visually Explainable Digital Humanities,
ICPR18(1097-1102)
IEEE DOI 1812
Visualization, Task analysis, Training, Semantics, Kernel, Pattern recognition, Art BibRef

Li, X., Yin, F., Liu, C.,
Page Object Detection from PDF Document Images by Deep Structured Prediction and Supervised Clustering,
ICPR18(3627-3632)
IEEE DOI 1812
Image segmentation, Object detection, Proposals, Portable document format, Convolutional neural networks, structured prediction BibRef

Viana, M.P., Oliveira, D.A.B.,
Fast CNN-Based Document Layout Analysis,
CEFR-LCV17(1173-1180)
IEEE DOI 1802
Computer architecture, Databases, Image segmentation, Layout, Text analysis, Training, BibRef

Yang, X., Yumer, E., Asente, P., Kraley, M., Kifer, D., Giles, C.L.,
Learning to Extract Semantic Structure from Documents Using Multimodal Fully Convolutional Neural Networks,
CVPR17(4342-4351)
IEEE DOI 1711
Decoding, Image reconstruction, Image segmentation, Semantics, Training, Visualization BibRef

Nair, R.R., Kota, B.U., Nwogu, I., Govindaraju, V.,
Segmentation of highly unstructured handwritten documents using a neural network technique,
ICPR16(1291-1296)
IEEE DOI 1705
Convolution, Image segmentation, Layout, Libraries, Neural networks, Text analysis, Writing BibRef

Fan, J.,
Detection of quadrilateral document regions from digital photographs,
WACV16(1-9)
IEEE DOI 1606
Cameras BibRef

Amorim, G.F.[Glauco F.], dos Santos, J.A.F.[Joel A. F.], Muchaluat-Saade, D.C.[Débora C.],
XTemplate 4.0: Providing Adaptive Layouts and Nested Templates for Hypermedia Documents,
MMMod16(I: 642-653).
Springer DOI 1601
BibRef

Minaee, S.[Shervin], Wang, Y.[Yao],
Screen content image segmentation using sparse decomposition and total variation minimization,
ICIP16(3882-3886)
IEEE DOI 1610
BibRef
Earlier:
Screen content image segmentation using least absolute deviation fitting,
ICIP15(3295-3299)
IEEE DOI 1511
Algorithm design and analysis BibRef

Eskenazi, S.[Sebastien], Gomez-Kramer, P.[Petra], Ogier, J.M.[Jean-Marc],
Let's be done with thresholds!,
ICDAR15(851-855)
IEEE DOI 1511
document authentication. The document and the copy should give the same result. BibRef

Chazalon, J.[Joseph], Rusinol, M.[Marcal], Ogier, J.M.[Jean-Marc], Llados, J.[Josep],
A semi-automatic groundtruthing tool for mobile-captured document segmentation,
ICDAR15(621-625)
IEEE DOI 1511
BibRef

Henter, D.[Dominik], Stahl, A.[Armin], Ebbecke, M.[Markus], Gillmann, M.[Michael],
Classifier self-assessment: active learning and active noise correction for document classification,
ICDAR15(276-280)
IEEE DOI 1511
BibRef

Dejean, H.[Herve],
Extracting structured data from unstructured document with incomplete resources,
ICDAR15(271-275)
IEEE DOI 1511
Document Layout Analysis; data extraction BibRef

Wang, S.M.[Si-Meng], Gao, L.C.[Liang-Cai], Wang, Y.H.[Yue-Han],
Classification of forms with similar layouts based on Mixed Gaussian Weighted Mask,
ICDAR15(111-115)
IEEE DOI 1511
Mixed Gaussian Weighted Mask; distance measurement; form classification BibRef

Kepa, M.[Marcin], Szymanski, J.[Julian],
Two Stage SVM and kNN Text Documents Classifier,
PReMI15(279-289).
Springer DOI 1511
BibRef

Rodríguez-Osoria, V.[Victor], Nuño-Maganda, M.A.[Marco Aurelio], Hernández-Mier, Y.[Yahir], Torres-Huitzil, C.[Cesar],
Embedded Image Processing System for Automatic Page Segmentation of Open Book Images,
ISVC14(II: 531-540).
Springer DOI 1501
BibRef

Gao, H.X.[Hong-Xing], Rusinol, M.[Marcal], Karatzas, D.[Dimosthenis], Llados, J.[Josep],
Embedding Document Structure to Bag-of-Words through Pair-wise Stable Key-Regions,
ICPR14(2903-2908)
IEEE DOI 1412
Algorithm design and analysis BibRef

Daher, H.[Hani], Bouguelia, M.R.[Mohamed-Rafik], Belaid, A.[Abdel], d'Andecy, V.P.[Vincent Poulain],
Multipage Administrative Document Stream Segmentation,
ICPR14(966-971)
IEEE DOI 1412
Accuracy BibRef

Javed, M., Nagabhushan, P., Chaudhuri, B.B.,
Extraction of Projection Profile, Run-Histogram and Entropy Features Straight from Run-Length Compressed Text-Documents,
ACPR13(813-817)
IEEE DOI 1408
data compression BibRef

Bhardwaj, K., Chaudhury, S., Roy, S.D.,
Augmented paper system: A framework for User's Personalized Workspace,
NCVPRIPG13(1-4)
IEEE DOI 1408
rendering (computer graphics) BibRef

Bouguelia, M.R.[Mohamed-Rafik], Belaid, Y.[Yolande], Belaid, A.[Abdel],
Document image and zone classification through incremental learning,
ICIP13(4230-4234)
IEEE DOI 1402
Document Classification BibRef

Clausner, C., Pletschacher, S., Antonacopoulos, A.,
The Significance of Reading Order in Document Recognition and Its Evaluation,
ICDAR13(688-692)
IEEE DOI 1312
document image processing BibRef

Kang, L.[Le], Kumar, J.[Jayant], Ye, P.[Peng], Li, Y.[Yi], Doermann, D.S.[David S.],
Convolutional Neural Networks for Document Image Classification,
ICPR14(3168-3172)
IEEE DOI 1412
Accuracy BibRef

Kumar, J., Doermann, D.S.,
Unsupervised Classification of Structurally Similar Document Images,
ICDAR13(1225-1229)
IEEE DOI 1312
decision trees BibRef

Kumar, J.[Jayant], Pillai, J.[Jaishanker], Doermann, D.S.[David S.],
Document Image Classification and Labeling Using Multiple Instance Learning,
ICDAR11(1059-1063).
IEEE DOI 1111
BibRef

Diamantatos, P., Verras, V., Kavallieratou, E.,
Detecting Main Body Size in Document Images,
ICDAR13(1160-1164)
IEEE DOI 1312
document image processing BibRef

Chen, K.[Kai], Yin, F.[Fei], Liu, C.L.[Cheng-Lin],
Hybrid Page Segmentation with Efficient Whitespace Rectangles Extraction and Grouping,
ICDAR13(958-962)
IEEE DOI 1312
document image processing BibRef

Cruz, F.[Francisco], Terrades, O.R.[Oriol Ramos],
EM-Based Layout Analysis Method for Structured Documents,
ICPR14(315-320)
IEEE DOI 1412
Computational modeling BibRef

Álvaro, F.[Francisco], Cruz, F.[Francisco], Sánchez, J.A.[Joan-Andreu], Terrades, O.R.[Oriol Ramos],
Page Segmentation of Structured Documents Using 2D Stochastic Context-Free Grammars,
IbPRIA13(133-140).
Springer DOI 1307
BibRef

Fernandez, F.C.[Francisco Cruz], Terrades, O.R.[Oriol Ramos],
Document segmentation using Relative Location Features,
ICPR12(1562-1565).
WWW Link. 1302
BibRef

Zirari, F.[Fattah], Ennaji, A.[Abdellatif], Nicolas, S.[Stephane], Mammass, D.[Driss],
A Document Image Segmentation System Using Analysis of Connected Components,
ICDAR13(753-757)
IEEE DOI 1312
BibRef
Earlier: A1, A4, A2, A3:
A Graph Based Approach for Heterogeneous Document Segmentation,
ICISP12(424-431).
Springer DOI 1208
BibRef

Kapoor, A., Pandey, P., Biswas, K.K.,
Fuzzy Rule Based Document Image Segmentation for Component Labeling,
NCVPRIPG11(11-14).
IEEE DOI 1205
BibRef

Bastos dos Santos, J.E.[Jose Eduardo],
Automatic Content Extraction on Semi-structured Documents,
ICDAR11(1235-1239).
IEEE DOI 1111
BibRef

Kuster, M.W.[Marc Wilhelm],
The Four and a Half Challenges of Humanities Data,
ICDAR11(1017-1023).
IEEE DOI 1111
Text with various characteristics: unusual characters, unusual layout, unusual semantics an dsegmentations. BibRef

Louradour, J.[Jerome], Kermorvant, C.[Christopher],
Sample-Dependent Feature Selection for Faster Document Image Categorization,
ICDAR11(309-313).
IEEE DOI 1111
BibRef

Diem, M.[Markus], Kleber, F.[Florian], Sablatnig, R.[Robert],
Text Classification and Document Layout Analysis of Paper Fragments,
ICDAR11(854-858).
IEEE DOI 1111
BibRef

Winder, A.[Amy], Andersen, T.[Tim], Barney Smith, E.H.[Elisa H.],
Extending Page Segmentation Algorithms for Mixed-Layout Document Processing,
ICDAR11(1245-1249).
IEEE DOI 1111
BibRef

Clausner, C., Pletschacher, S., Antonacopoulos, A.,
Scenario Driven In-depth Performance Evaluation of Document Layout Analysis Methods,
ICDAR11(1404-1408).
IEEE DOI 1111
BibRef

Baechler, M.[Micheal], Ingold, R.[Rolf],
Multi Resolution Layout Analysis of Medieval Manuscripts Using Dynamic MLP,
ICDAR11(1185-1189).
IEEE DOI 1111
BibRef

Hadjar, K.[Karim], Ingold, R.[Rolf],
Minimizing User Annotations in the Generation of Layout Ground-Truthed Data,
ICDAR11(703-707).
IEEE DOI 1111
BibRef

Santosh Kumar, S.A., Shreyamsha Kumar, B.K.,
Edge envelope based reconstruction of torn document,
ICCVGIP10(391-397).
DOI Link 1111
BibRef

Baechler, M.[Micheal], Bloechle, J.L.[Jean-Luc], Ingold, R.[Rolf],
Semi-automatic Annotation Tool for Medieval Manuscripts,
FHR10(182-187).
IEEE DOI 1011
BibRef

Chanda, S.[Sukalpa], Franke, K.[Katrin], Pal, U.[Umapada],
Document-Zone Classification in Torn Documents,
FHR10(25-30).
IEEE DOI 1011
BibRef

Usilin, S.[Sergey], Nikolaev, D.[Dmitry], Postnikov, V.[Vassili], Schaefer, G.[Gerald],
Visual appearance based document image classification,
ICIP10(2133-2136).
IEEE DOI 1009
BibRef

Sankarasubramaniam, Y.[Yogesh], Munnangi, K.[Krusheel], Banerjee, S.[Serene], Kuchibhotla, A.[Anjaneyulu],
Paper widgets: Visually aesthetic 'smarts' for document images,
ICIP10(2137-2140).
IEEE DOI 1009
BibRef

Gordo, A.[Albert], Perronnin, F.[Florent],
A Bag-of-Pages Approach to Unordered Multi-page Document Classification,
ICPR10(1920-1923).
IEEE DOI 1008
BibRef

An, C.[Chang], Yin, D.W.[Da-Wei], Baird, H.S.[Henry S.],
Document Segmentation Using Pixel-Accurate Ground Truth,
ICPR10(245-248).
IEEE DOI 1008
BibRef

Pletschacher, S.[Stefan], Antonacopoulos, A.[Apostolos],
The PAGE (Page Analysis and Ground-Truth Elements) Format Framework,
ICPR10(257-260).
IEEE DOI 1008
BibRef

Idarrou, A.[Ali], Mammass, D.[Driss], Dupuy, C.S.[Chantal Soulé], Valles-Parlangeau, N.[Nathalie],
Classification of Multi-structured Documents: A Comparison Based on Media Image,
ICISP10(428-438).
Springer DOI 1006
BibRef

Chaudhury, S.[Santanu], Jindal, M.[Megha], Roy, S.D.[Sumantra Dutta],
Model-Guided Segmentation and Layout Labelling of Document Images Using a Hierarchical Conditional Random Field,
PReMI09(375-380).
Springer DOI 0912
BibRef

Wang, S.Y.[Sui-Yu], Baird, H.S.[Henry S.], An, C.[Chang],
Document Content Extraction Using Automatically Discovered Features,
ICDAR09(1076-1080).
IEEE DOI 0907
BibRef

Lecerf, L.[Loic], Chidlovskii, B.[Boris],
Scalable Feature Extraction from Noisy Documents,
ICDAR09(361-365).
IEEE DOI 0907
Determine frequent patterns for use in layout recognition. BibRef

Ferilli, S.[Stefano], Basile, T.M.A.[Teresa M.A.], Esposito, F.[Floriana], Biba, M.[Marenglen],
A Contour-Based Progressive Technique for Shape Recognition,
ICDAR11(723-727).
IEEE DOI 1111
BibRef
Earlier: A1, A4, A3, A2:
A Distance-Based Technique for Non-Manhattan Layout Analysis,
ICDAR09(231-235).
IEEE DOI 0907
BibRef

Smith, R.W.[Raymond W.],
Hybrid Page Layout Analysis via Tab-Stop Detection,
ICDAR09(241-245).
IEEE DOI 0907
BibRef

Antonacopoulos, A.[Apostolos], Bridson, D.[David], Papadopoulos, C.[Christos], Pletschacher, S.[Stefan],
A Realistic Dataset for Performance Evaluation of Document Layout Analysis,
ICDAR09(296-300).
IEEE DOI 0907
BibRef

Tatsumi, I.[Itaru], Habe, H.[Hitoshi], Kidode, M.[Masatsugu],
Context-oriented Layout Optimization of Large-Print Textbooks,
ICDAR09(1016-1020).
IEEE DOI 0907
BibRef

Malleron, V.[Vincent], Eglin, V.[Véronique], Emptoz, H.[Hubert], Dord-Crouslé, S.[Stéphanie], Régnier, P.[Philippe],
Hierarchical Decomposition of Handwritten Manuscripts Layouts,
CAIP09(221-228).
Springer DOI 0909
BibRef

Gordo, A.[Albert], Valveny, E.[Ernest],
A Rotation Invariant Page Layout Descriptor for Document Classification and Retrieval,
ICDAR09(481-485).
IEEE DOI 0907
BibRef
And:
The Diagonal Split: A Pre-segmentation Step for Page Layout Analysis and Classification,
IbPRIA09(290-297).
Springer DOI 0906
BibRef

Grim, J.[Jiri], Novovicova, J.[Jana], Somol, P.[Petr],
Structural poisson mixtures for classification of documents,
ICPR08(1-4).
IEEE DOI 0812
BibRef

Agrawal, M.[Mudit], Doermann, D.S.[David S.],
Voronoi++: A Dynamic Page Segmentation Approach Based on Voronoi and Docstrum Features,
ICDAR09(1011-1015).
IEEE DOI 0907
BibRef

Seo, W.[Wontaek], Agrawal, M.[Mudit], Doermann, D.S.[David S.],
Performance Evaluation Tools for Zone Segmentation and Classification (PETS),
ICPR10(503-506).
IEEE DOI 1008
Document zone segmentation BibRef

Abd-Almageed, W.[Wael], Agrawal, M.[Mudit], Seo, W.[Wontaek], Doermann, D.S.[David S.],
Document-zone classification using partial least squares and hybrid classifiers,
ICPR08(1-4).
IEEE DOI 0812
BibRef

Gaceb, D.[Djamel], Eglin, V.[Véronique], Le Bourgeois, F.[Frank], Emptoz, H.[Hubert],
Graph b-Coloring for Automatic Recognition of Documents,
ICDAR09(261-265).
IEEE DOI 0907
BibRef
Earlier:
Application of graph coloring in physical layout segmentation,
ICPR08(1-4).
IEEE DOI 0812

See also Improvement of postal mail sorting system. BibRef

Ceci, M.[Michelangelo], Berardi, M.[Margherita], Porcelli, G., Malerba, D.[Donato],
A Data Mining Approach to Reading Order Detection,
ICDAR07(924-928).
IEEE DOI 0709
BibRef

Gupta, M.D.[M. Das], Sarkar, P.,
A Shared Parts Model for Document Image Recognition,
ICDAR07(1163-1172).
IEEE DOI 0709
BibRef

Dasigi, P.[Praveen], Jain, R.[Raman], Jawahar, C.V.,
Document Image Segmentation as a Spectral Partitioning Problem,
ICCVGIP08(305-312).
IEEE DOI 0812
BibRef

Kumar, K.S.S., Kumar, S., Jawahar, C.V.,
On Segmentation of Documents in Complex Scripts,
ICDAR07(1243-1247).
IEEE DOI 0709
BibRef

Xia, Y., Xiao, B.H., Wang, C.H., Dai, R.W.,
Integrated Segmentation and Recognition of Mixed Chinese/English Document,
ICDAR07(704-708).
IEEE DOI 0709
BibRef

Baird, H.S., Moll, M.A.,
Document Content Inventory and Retrieval,
ICDAR07(93-97).
IEEE DOI 0709
BibRef

Gu, G., Han, W.,
Adaptive Window Based Uneven Lighting Document Segmentation,
ICDAR07(223-226).
IEEE DOI 0709
BibRef

Cao, H.[Huaigu], Prasad, R.[Rohit], Natarajan, P.[Prem], MacRostie, E.[Ehry],
Robust Page Segmentation Based on Smearing and Error Correction Unifying Top-down and Bottom-up Approaches,
ICDAR07(392-396).
IEEE DOI 0709
BibRef

Gao, D., Wang, Y., Hindi, H., Do, M.,
Decompose Document Image Using Integer Linear Programming,
ICDAR07(397-401).
IEEE DOI 0709
BibRef

Nicolas, S., Dardenne, J., Paquet, T., Heutte, L.,
Document Image Segmentation Using a 2D Conditional Random Field Model,
ICDAR07(407-411).
IEEE DOI 0709
BibRef

Gao, D.S.[Da-Shan], Wang, Y.Z.[Yi-Zhou],
Decomposing Document Images by Heuristic Search,
EMMCVPR07(97-111).
Springer DOI 0708
BibRef

Kumar, K.S.S.[K.S. Sesh], Namboodiri, A.M.[Anoop M.], Jawahar, C.V.,
Learning Segmentation of Documents with Complex Scripts,
ICCVGIP06(749-760).
Springer DOI 0612
BibRef

Hernández-Reyes, E.[Edith], Martínez-Trinidad, J.F., Carrasco-Ochoa, J.A., García-Hernández, R.A.[René A.],
Document Representation Based on Maximal Frequent Sequence Sets,
CIARP06(854-863).
Springer DOI 0611
BibRef

Lakhani, G.[Gopal],
Improving Image Decomposition Method of the 3-MRC Coding of Scanned Compound Document Images,
ICCVGIP08(289-296).
IEEE DOI 0812
BibRef

Lakhani, G., Subedi, R.,
Optimal Filling of FG/BG Layers of Compound Document Images,
ICIP06(2273-2276).
IEEE DOI 0610
BibRef

Baird, H.S.[Henry S.], Casey, M.R.[Matthew R.],
Towards Versatile Document Analysis Systems,
DAS06(280-290).
Springer DOI 0602
BibRef

Bloechle, J.L.[Jean-Luc], Lalanne, D.[Denis], Ingold, R.[Rolf],
OCD: An Optimized and Canonical Document Format,
ICDAR09(236-240).
IEEE DOI 0907
BibRef

Bloechle, J.L.[Jean-Luc], Rigamonti, M.[Maurizio], Hadjar, K.[Karim], Lalanne, D.[Denis], Ingold, R.[Rolf],
XCDF: A Canonical and Structured Document Format,
DAS06(141-152).
Springer DOI 0602
BibRef

Sternby, J., Ericsson, A.,
Core points: A framework for structural parameterization,
ICDAR05(I: 217-221).
IEEE DOI 0508
BibRef

Lin, X.,
Active document layout synthesis,
ICDAR05(I: 86-90).
IEEE DOI 0508
BibRef

Sun, H.M.[Hung-Ming],
Page segmentation for Manhattan and non-Manhattan layout documents via selective CRLA,
ICDAR05(I: 116-120).
IEEE DOI 0508
BibRef

Shi, Z.X.[Zhi-Xin], Govindaraju, V.,
Multi-scale techniques for document page segmentation,
ICDAR05(II: 1020-1024).
IEEE DOI 0508
BibRef

Berardi, M.[Margherita], Lapi, M.[Michele], Malerba, D.[Donato],
An Integrated Approach for Automatic Semantic Structure Extraction in Document Images,
DAS04(179-190).
Springer DOI 0505
BibRef

Ceci, M., Berardi, M.[Margherita], Malerba, D.[Donato],
Relational learning techniques for document image understanding: comparing statistical and logical approaches,
ICDAR05(I: 473-482).
IEEE DOI 0508
BibRef

Malerba, D., Esposito, F., Altamura, O., Ceci, M., Berardi, M.,
Correcting the document layout: a machine learning approach,
ICDAR03(97-102).
IEEE DOI 0311
BibRef

Malerba, D.[Donato], Esposito, F.[Floriana], Lisi, F.A., Altamura, O.[Oronzo],
Automated discovery of dependencies between logical components in document image understanding,
ICDAR01(174-178).
IEEE DOI 0109
BibRef

Huang, M., DeMenthon, D.F., Doermann, D.S., Golebiowski, L., Hamilton, B.A.,
Document ranking by layout relevance,
ICDAR05(I: 362-366).
IEEE DOI 0508
BibRef

Waked, B., Suen, C.Y.[Ching Y.], Bergler, S.,
Segmenting document images using diagonal white runs and vertical edges,
ICDAR01(194-199).
IEEE DOI 0109
BibRef

Yingsaeree, C., Kawtrakul, A.,
Rule-based middle-level character detection for simplifying Thai document layout analysis,
ICDAR05(II: 888-892).
IEEE DOI 0508
BibRef

Adam, S.[Sébastien], Rigamonti, M.[Maurizio], Clavier, E.[Eric], Trupin, E.[Eric], Ogier, J.M.[Jean-Marc], Tombre, K.[Karl], Gardes, J.[Joël],
DocMining: A Document Analysis System Builder,
DAS04(472-483).
Springer DOI 0505
BibRef

Carmagnac, F.[Fabien], Héroux, P.[Pierre], Trupin, É.[Éric],
Multi-view HAC for Semi-supervised Document Image Classification,
DAS04(191-200).
Springer DOI 0505
BibRef

Antonacopoulos, A.[Apostolos], Karatzas, D.[Dimosthenis],
Semantics-based content extraction in typewritten historical documents,
ICDAR05(I: 48-53).
IEEE DOI 0508
BibRef
Earlier:
A Complete Approach to the Conversion of Typewritten Historical Documents for Digital Archives,
DAS04(90).
Springer DOI 0505
BibRef
And:
Document image analysis for World War II personal records,
DIAL04(336-341).
IEEE DOI 0404

See also Colour text segmentation in web images based on human perception. BibRef

Mao, S., Kim, J.W., Thoma, G.R.,
A dynamic feature generation system for automated metadata extraction in preservation of digital materials,
DIAL04(225-232).
IEEE DOI 0404
BibRef

Gattani, A., Mukerji, M., Gur, H.,
A fast multifunctional approach for document image analysis,
ICDAR03(1178-1182).
IEEE DOI 0311
BibRef

Hoque, S., Selim, H., Howells, W.G.J., Fairhurst, M.C., Deravi, F.,
SAGENT: a novel technique for document modeling for secure access and distribution,
ICDAR03(1257-1261).
IEEE DOI 0311
BibRef

Howells, W.G.J., Selim, H., Hoque, S., Fairhurst, M.C., Deravi, F.,
The autonomous document object (ADO) model,
ICDAR01(977-981).
IEEE DOI 0109
BibRef

Klein, B., Agne, S., Bagdanov, A.D.,
Understanding document analysis and understanding (through modeling),
ICDAR03(1218-1222).
IEEE DOI 0311
BibRef

Breuel, T.M.[Thomas M.],
An algorithm for finding maximal whitespace rectangles at arbitrary orientations for document layout analysis,
ICDAR03(66-70).
IEEE DOI 0311
BibRef
Earlier:
Two Geometric Algorithms for Layout Analysis,
DAS02(188 ff.).
Springer DOI 0303
BibRef

Lee, K.H.[Kyong-Ho], Choy, Y.C.[Yoon-Chul], Cho, S.B.[Sung-Bae], Tang, X.[Xiao], McCrary, V.[Victor],
Document Reverse Engineering: From Paper to XML,
DAS02(503 ff.).
Springer DOI 0303
BibRef

Liang, J.[Jian], Doermann, D.S.[David S.],
Logical Labeling of Document Images Using Layout Graph Matching with Adaptive Learning,
DAS02(224 ff.).
Springer DOI 0303
BibRef

Bagdanov, A.D., Worring, M.,
Granulometric analysis of document images,
ICPR02(I: 468-471).
IEEE DOI 0211
BibRef

Tam, V., Santoso, A., Setiono, R.,
A comparative study of centroid-based, neighborhood-based and statistical approaches for effective document categorization,
ICPR02(IV: 235-238).
IEEE DOI 0211
BibRef

Popat, K., Greene, D.H., Poo, T.L.[Tze-Lei],
Adaptive stack algorithm in document image decoding,
ICPR02(IV: 231-234).
IEEE DOI 0211
BibRef

Liang, J.[Jian], Doermann, D.S., Ma, M.[Matthew], Guo, J.K.,
Page classification through logical labelling,
ICPR02(III: 477-480).
IEEE DOI 0211
BibRef

Valveny, E., Marti, E.,
Learning of structural descriptions of graphic symbols using deformable template matching,
ICDAR01(455-459).
IEEE DOI 0109
BibRef

Valveny, E., Lamiroy, B.,
Sean-to-XML: automatic generation of browsable technical documents,
ICPR02(III: 188-191).
IEEE DOI 0211
BibRef

Duong, J., Emptoz, H., Cote, M.,
Features for printed document image analysis,
ICPR02(III: 245-248).
IEEE DOI 0211
BibRef

da Silva, J.M.M.[João Marcelo Monte], Lins, R.D.[Rafael Dueire],
Color Document Synthesis as a Compression Strategy,
ICDAR07(466-470).
IEEE DOI 0709
BibRef

Lins, R.D.[Rafael Dueire], da Silva, J.M.M.[João Marcelo Monte],
Generating Color Documents from Segmented and Synthetic Elements,
ICIAR07(1217-1228).
Springer DOI 0708
BibRef

Pappas, T., Tseng, S., Kosiba, D.,
A Robust and Efficient Algorithm for Bilevel Document Block Classification,
ICIP01(I: 1122-1125).
IEEE DOI 0108
BibRef

Sylwester, D., Seth, S.,
Adaptive segmentation of document images,
ICDAR01(827-831).
IEEE DOI 0109
BibRef

Nagy, G., Kanai, J., Krishnamoorthy, M., Thomas, M., Viswanathan, M.,
Two Complementary Techniques for Digitized Document Analysis,
ACM DPS88(169-176), December 1988. 0101
top-down/bottom-up. Publication specific pages. BibRef

Gatos, B., Papamarkos, N.,
Applying fast segmentation techniques at a binary image represented by a set of non-overlapping blocks,
ICDAR01(1147-1151).
IEEE DOI 0109
BibRef

Nattee, C., Numao, M.,
Geometric method for document understanding and classification using online machine learning,
ICDAR01(602-606).
IEEE DOI 0109
BibRef

Eglin, W., Gagneux, A.,
Visual exploration and functional document labeling,
ICDAR01(816-820).
IEEE DOI 0109
BibRef

Kise, K., Miki, Y., Matsumoto, K.,
Backgrounds as Information Carriers for Printed Documents,
ICPR00(Vol IV: 380-384).
IEEE DOI 0009
BibRef

Okun, O., Pietikäinen, M.,
Automatic Ground-truth Generation for Skew-tolerance Evaluation of Document Layout Analysis Methods,
ICPR00(Vol IV: 376-379).
IEEE DOI 0009
BibRef

Maderlechner, G.[Gerd], Panyr, J.[Jiri], Suda, P.[Peter],
Finding Captions in PDF-Documents for Semantic Annotations of Images,
SSPR06(422-430).
Springer DOI 0608
BibRef

Maderlechner, G., Schreyer, A., Suda, P.,
Extraction of Relevant Information from Document Images Using Measures of Visual Attention,
ICPR00(Vol IV: 385-388).
IEEE DOI 0009
BibRef

Watanabe, T., Sobue, T.,
Layout Analysis of Complex Documents,
ICPR00(Vol IV: 447-450).
IEEE DOI 0009
BibRef

Aiyer, A.[Anuradha], Gray, R.M.[Robert M.],
A Fast, Table-Lookup Algorithm for Classifying Document Images,
ICIP99(I:590-594).
IEEE DOI BibRef 9900

Stevens, J., Gee, A., Dance, C.,
Automatic Processing of Document Annotations,
BMVC98(xx-yy). BibRef 9800

Takasu, A.[Atsuhiro],
Document filtering for fast approximate string matching of erroneous text,
ICDAR01(916-920).
IEEE DOI 0109
BibRef

Takasu, A.[Atsuhiro],
Probabilistic Interpage Analysis for Article Extraction from Document Images,
ICPR98(Vol I: 932-935).
IEEE DOI 9808
BibRef

Leung, M.[Maylor], Twan, T.[Ting],
Linear Layout Processing,
ICPR98(Vol I: 403-405).
IEEE DOI 9808
BibRef

Robert, L., Likforman-Sulem, L., Lecolinet, E.,
Image and Text Coupling for Creating Electronic Books from Manuscripts,
ICDAR97(823-826).
IEEE DOI 9708
BibRef

Hong, T., Srihari, S.N.,
Representing OCRed Documents in HTML,
ICDAR97(831-834).
IEEE DOI 9708
BibRef

Rus, D.[Daniela], de Santis, P.[Peter],
The Self-Organizing Desk,
IJCAI97(758-763). extracting and organizing document information given a camera viewing a physical desktop. BibRef 9700

Menier, G., Lorette, G.,
Lexical Analyzer Based on a Self-Organizing Feature Map,
ICDAR97(1067-1071).
IEEE DOI 9708
BibRef

Brugger, R., Zramdini, A.[Abdelwahab], Ingold, R.[Rolf],
Modeling Documents for Structure Recognition Using Generalized N-Grams,
ICDAR97(56-60).
IEEE DOI 9708
BibRef

Baird, H.S., Gilbert, D., Ittner, D.J.,
A family of European page readers,
ICPR94(B:540-543).
IEEE DOI 9410
BibRef

Baird, H.S., Ittner, D.,
Language-Free Layout Analysis,
ICDAR93(336-340). BibRef 9300

Kornai, A., Connell, S.D.,
Statistical Zone Finding,
ICPR96(III: 818-822).
IEEE DOI 9608
(IBM Almaden Res. Center, USA) BibRef

Liu, J.M.[Ji-Ming], Tang, Y.Y.[Yuan Y.], He, Q.C.[Qi-Chao], Suen, C.Y.[Ching Y.],
Adaptive document segmentation and geometric relation labeling: algorithms and experimental results,
ICPR96(III: 763-767).
IEEE DOI 9608
(Hong Kong Baptist Univ., HK) BibRef

Ramel, J.Y., Vincent, N., Emptoz, H.,
Combining global and local vision for technical document understanding,
ICPR96(III: 773-777).
IEEE DOI 9608
(Laboratoire de Reconnaissance, F) BibRef

Sainz, G., Izquierdo, J., Dimitriadis, Y., Lopez Coronado, J.,
A New Neuro-Fuzzy System for Logical Labeling of Documents,
ICPR96(IV: 431-435).
IEEE DOI 9608
(Univ. of Valladolid, E) BibRef

Esposito, F., Malbera, D., Semeraro, G.,
A Knowledge-Based Approach to the Layout Analysis,
ICDAR95(466-471). BibRef 9500
Earlier:
Automated Acquisition of Rules for Document Understanding,
ICDAR93(650-654). Hybrid approach. Independent of document type. For simple layout such as letters. BibRef

Esposito, F., Malbera, D., Semeraro, G., Annese, E., and Scafuro, G.,
An Experimental Page Layout Recognition System for Office Document Automatic Classification: An Integrated Approach for Inductive Generalization,
ICPR90(I: 557-562).
IEEE DOI BibRef 9000

Antonacopoulos, A., Ritchings, R.T.,
Flexible page segmentation using the background,
ICPR94(B:339-344).
IEEE DOI 9410
BibRef

Bussi, S.[Silvia], Mangili, F.[Fulvia],
A semi-automatic method for form layout description,
CIAP95(539-544).
Springer DOI 9509
BibRef

Tateisi, Y., Itoh, N.,
Using stochastic syntactic analysis for extracting a logical structure from a document image,
ICPR94(B:391-394).
IEEE DOI 9410
BibRef

Ciardello, G., Scafuro, G., de Grandi, M.T., Spada, M.R., Roccotelli, M.P.,
An Experimental System for Office Document Handling and Text Recognition,
ICPR88(739-743).
IEEE Top Reference. BibRef 8800

Meynieux, E., Seisen, S., Tombre, K.,
Bilevel Information Recognition and Coding in Office Paper Documents,
ICPR86(442-445). BibRef 8600

Kida, H., Iwaki, O., Kawada, K.,
Document Recognition System for Office Automation,
ICPR86(446-448). BibRef 8600

Hase, M., Suzuki, G., Itoh, H.,
A Method for Extracting Marked Regions from Document Images,
ICPR86(780-782). BibRef 8600

Derrien-Peden, D.,
Frame-Based System for Macro-Typographical Structure Analysis in Scientific Papers,
ICDAR91(311-319). Gets text in reading order. BibRef 9100

Ingold, R., Armangil, D.,
A Top-down Document Analysis Method for Logical Structure Recognition,
ICDAR91(41-49). BibRef 9100

Zen, H., Ozawa, S.,
Extraction of the Fair Document from Mixed Mode Manuscript,
CVPR85(544-549). BibRef 8500

Chapter on OCR, Document Analysis and Character Recognition Systems continues in
Page Segmentation, General Evaluations .


Last update:Mar 16, 2024 at 20:36:19