Filters for graph-based keyword spotting in historical handwritten documents

dc.contributor.authorStauffer, Michael
dc.contributor.authorFischer, Andreas
dc.contributor.authorRiesen, Kaspar
dc.date.accessioned2018-04-25T09:39:29Z
dc.date.issued2020-06
dc.description.abstractThe accessibility to handwritten historical documents is often constrained by the limited feasibility of automatic full transcriptions. Keyword Spotting (KWS), that allows to retrieve arbitrary query words from documents, has been proposed as alternative. In the present paper, we make use of graphs for representing word images. The actual keyword spotting is thus based on matching a query graph with all documents graphs. However, even with relative fast approximation algorithms the shear amount of matchings might limit the practical application of this approach. For this reason we present two novel filters with linear time complexity that allow to substantially reduce the number of graph matchings actually required. In particular, these filters estimate a graph dissimilarity between a query graph and all document graphs based on their node and edge distribution in a polar coordinate system. Eventually, all graphs from the document with distributions that differ to heavily from the query’s node/edge distribution are eliminated. In an experimental evaluation on four different historical documents, we show that about 90% of the matchings can be omitted, while the KWS accuracy is not negatively affected.en_ZA
dc.description.departmentInformaticsen_ZA
dc.description.embargo2019-03-30
dc.description.librarianhj2018en_ZA
dc.description.sponsorshipThe Hasler Foundation Switzerland.en_ZA
dc.description.urihttp://www.elsevier.com/locate/patrecen_ZA
dc.identifier.citationStauffer, M., Fischer, A. & Riesen, K. 2020, 'Filters for graph-based keyword spotting in historical handwritten documents', Pattern Recognition Letters, vol. 134, pp. 125-134.en_ZA
dc.identifier.issn0167-8655 (print)
dc.identifier.issn1872-7344 (online)
dc.identifier.other10.1016/j.patrec.2018.03.030
dc.identifier.urihttp://hdl.handle.net/2263/64717
dc.language.isoenen_ZA
dc.publisherElsevieren_ZA
dc.rights© 2018 Elsevier B.V. All rights reserved. Notice : this is the author’s version of a work that was accepted for publication in Pattern Recognition Letters. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. A definitive version was subsequently published in Pattern Recognition Letters, vol. 134, pp. 125-134, 2020. doi : 10.1016/j.patrec.2018.03.030.en_ZA
dc.subjectGraph theoryen_ZA
dc.subjectKeyword spottingen_ZA
dc.subjectGraph representationen_ZA
dc.subjectFilter methoden_ZA
dc.subjectFast rejectionen_ZA
dc.subjectPattern matchingen_ZA
dc.subjectHistoryen_ZA
dc.subjectGraphic methodsen_ZA
dc.subjectBandpass filtersen_ZA
dc.subjectApproximation algorithmsen_ZA
dc.subjectHandwritten keyword spottingen_ZA
dc.subjectBipartite graph matchingen_ZA
dc.titleFilters for graph-based keyword spotting in historical handwritten documentsen_ZA
dc.typePostprint Articleen_ZA

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Stauffer_Filters_2020.pdf
Size:
6.08 MB
Format:
Adobe Portable Document Format
Description:
Postprint Article

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.75 KB
Format:
Item-specific license agreed upon to submission
Description: