Please use this identifier to cite or link to this item: http://hdl.handle.net/1893/28051
Full metadata record
DC Field	Value	Language
dc.contributor.author	Böschen, Falk	en_UK
dc.contributor.author	Scherp, Ansgar	en_UK
dc.contributor.editor	Görg, S	en_UK
dc.contributor.editor	Bergmann, R	en_UK
dc.contributor.editor	Müller, G	en_UK
dc.date.accessioned	2018-11-06T14:29:50Z	-
dc.date.available	2018-11-06T14:29:50Z	-
dc.date.issued	2015-12-31	en_UK
dc.identifier.uri	http://hdl.handle.net/1893/28051	-
dc.description.abstract	We propose a pipeline for text extraction from infographics that makes use of a novel combination of data mining and computer vision techniques. The pipeline defines a sequence of steps to identify characters, cluster them into text lines, determine their rotation angle, and apply state-of-the-art OCR to recognise the text. In this paper, we formally define the pipeline and present its current implementation. In addition, we have conducted preliminary evaluations over a data corpus of 121 manually annotated infographics from a broad range of illustration types such as bar charts, pie charts, and line charts, maps, and others. We assess the results of our text extraction pipeline by comparing it with two baselines. Finally, we sketch an outline for future work and possibilities for improving the pipeline.	en_UK
dc.publisher	CEUR Workshop Proceedings	en_UK
dc.relation	Böschen F & Scherp A (2015) Formalization and preliminary evaluation of a pipeline for text extraction from infographics. In: Görg S, Bergmann R & Müller G (eds.) Proceedings of the LWA 2015 Workshops: KDML, FGWM, IR, and FGDB, volume 1458. CEUR Workshop Proceedings, 1458. LWA 2015 Workshops: KDML, FGWM, IR, FGD, Trier, Germany, 07.10.2015-09.10.2015. Aachen, Germany: CEUR Workshop Proceedings, pp. 20-31. http://ceur-ws.org/Vol-1458/D03_CRC13_Boeschen.pdf	en_UK
dc.relation.ispartofseries	CEUR Workshop Proceedings, 1458	en_UK
dc.rights	The copyright is owned by default by the authors. Copying is permitted only for private and academic purposes. The permission for academic use implies an attribution obligation, i.e., you must properly cite the items that you use in your own published work. Modification is not permitted unless a suitable license is granted by its copyright owners. Copying or use for commercial purposes is forbidden unless an explicit permission is acquired from the copyright owners.	en_UK
dc.subject	Infographics	en_UK
dc.subject	OCR	en_UK
dc.subject	multi-oriented text extraction	en_UK
dc.subject	formalization	en_UK
dc.title	Formalization and preliminary evaluation of a pipeline for text extraction from infographics	en_UK
dc.type	Conference Paper	en_UK
dc.citation.jtitle	CEUR Workshop Proceedings	en_UK
dc.citation.issn	1613-0073	en_UK
dc.citation.volume	1458	en_UK
dc.citation.spage	20	en_UK
dc.citation.epage	31	en_UK
dc.citation.publicationstatus	Published	en_UK
dc.type.status	VoR - Version of Record	en_UK
dc.identifier.url	http://ceur-ws.org/Vol-1458/D03_CRC13_Boeschen.pdf	en_UK
dc.citation.btitle	Proceedings of the LWA 2015 Workshops: KDML, FGWM, IR, and FGDB	en_UK
dc.citation.conferencedates	2015-10-07 - 2015-10-09	en_UK
dc.citation.conferencelocation	Trier, Germany	en_UK
dc.citation.conferencename	LWA 2015 Workshops: KDML, FGWM, IR, FGD	en_UK
dc.citation.isbn	N/A	en_UK
dc.publisher.address	Aachen, Germany	en_UK
dc.contributor.affiliation	University of Kiel	en_UK
dc.contributor.affiliation	Leibniz Information Centre for Economics - ZBW	en_UK
dc.identifier.scopusid	2-s2.0-84944322158	en_UK
dc.identifier.wtid	1007296	en_UK
dc.contributor.orcid	0000-0002-2653-9245	en_UK
dc.date.accepted	2015-08-17	en_UK
dcterms.dateAccepted	2015-08-17	en_UK
dc.date.filedepositdate	2018-10-22	en_UK
rioxxterms.apc	not required	en_UK
rioxxterms.type	Conference Paper/Proceeding/Abstract	en_UK
rioxxterms.version	VoR	en_UK
local.rioxx.author	Böschen, Falk|	en_UK
local.rioxx.author	Scherp, Ansgar|0000-0002-2653-9245	en_UK
local.rioxx.project	Internal Project|University of Stirling|https://isni.org/isni/0000000122484331	en_UK
local.rioxx.contributor	Görg, S|	en_UK
local.rioxx.contributor	Bergmann, R|	en_UK
local.rioxx.contributor	Müller, G|	en_UK
local.rioxx.freetoreaddate	2018-10-22	en_UK
local.rioxx.licence	http://www.rioxx.net/licenses/all-rights-reserved|2018-10-22|	en_UK
local.rioxx.filename	Böschen-Scherp-2015.pdf	en_UK
local.rioxx.filecount	1	en_UK
local.rioxx.source	N/A	en_UK
Appears in Collections: Computing Science and Mathematics Conference Papers and Proceedings

Files in This Item:
File	Description	Size	Format
Böschen-Scherp-2015.pdf	Fulltext - Published Version	433.5 kB	Adobe PDF


This item is protected by original copyright
Items in the Repository are protected by copyright, with all rights reserved, unless otherwise indicated.

The metadata of the records in the Repository are available under the CC0 public domain dedication: No Rights Reserved https://creativecommons.org/publicdomain/zero/1.0/

If you believe that any material held in STORRE infringes copyright, please contact library@stir.ac.uk providing details and we will remove the Work from public display in STORRE and investigate your claim.