Please use this identifier to cite or link to this item: http://hdl.handle.net/1893/27854
Full metadata record
DC Field | Value | Language
dc.contributor.author | Beck, Tilman | en_UK
dc.contributor.author | Böschen, Falk | en_UK
dc.contributor.author | Scherp, Ansgar | en_UK
dc.contributor.editor | Elloumi, M | en_UK
dc.contributor.editor | Granitzer, M | en_UK
dc.contributor.editor | Hameurlain, A | en_UK
dc.contributor.editor | Seifert, C | en_UK
dc.contributor.editor | Stein, B | en_UK
dc.contributor.editor | Tjoa, AM | en_UK
dc.contributor.editor | Wagner, R | en_UK
dc.date.accessioned | 2018-09-27T13:27:13Z | -
dc.date.available | 2018-09-27T13:27:13Z | -
dc.date.issued | 2018-12-31 | en_UK
dc.identifier.uri | http://hdl.handle.net/1893/27854 | -
dc.description.abstract | The vast amount of scientific literature poses a challenge when one is trying to understand a previously unknown topic. Selecting a representative subset of documents that covers most of the desired content can solve this challenge by presenting the user a small subset of documents. We build on existing research on representative subset extraction and apply it in an information retrieval setting. Our document selection process consists of three steps: computation of the document representations, clustering, and selection of documents. We implement and compare two different document representations, two different clustering algorithms, and three different selection methods using a coverage and a redundancy metric. We execute our 36 experiments on two datasets, with 10 sample queries each, from different domains. The results show that there is no clear favorite and that we need to ask the question whether coverage and redundancy are sufficient for evaluating representative subsets. | en_UK
dc.language.iso | en | en_UK
dc.publisher | Springer International Publishing | en_UK
dc.relation | Beck T, Böschen F & Scherp A (2018) What to Read Next? Challenges and Preliminary Results in Selecting Representative Documents. In: Elloumi M, Granitzer M, Hameurlain A, Seifert C, Stein B, Tjoa A & Wagner R (eds.) Database and Expert Systems Applications. DEXA 2018. Communications in Computer and Information Science, 903. 29th International Conference on Database and Expert Systems Applications, DEXA 2018, Regensburg, Germany, 03.09.2018-06.09.2018. Cham, Switzerland: Springer International Publishing, pp. 230-242. https://doi.org/10.1007/978-3-319-99133-7_19 | en_UK
dc.relation.ispartofseries | Communications in Computer and Information Science, 903 | en_UK
dc.rights | This is a post-peer-review, pre-copyedit version of a paper published in Elloumi M. et al. (eds) Database and Expert Systems Applications. DEXA 2018. Communications in Computer and Information Science, vol 903. The final authenticated version is available online at: https://doi.org/10.1007/978-3-319-99133-7_19 | en_UK
dc.subject | Representative document selection | en_UK
dc.subject | Document clustering | en_UK
dc.title | What to Read Next? Challenges and Preliminary Results in Selecting Representative Documents | en_UK
dc.type | Conference Paper | en_UK
dc.identifier.doi | 10.1007/978-3-319-99133-7_19 | en_UK
dc.citation.issn | 1865-0937 | en_UK
dc.citation.issn | 1865-0929 | en_UK
dc.citation.spage | 230 | en_UK
dc.citation.epage | 242 | en_UK
dc.citation.publicationstatus | Published | en_UK
dc.type.status | AM - Accepted Manuscript | en_UK
dc.contributor.funder | European Commission | en_UK
dc.citation.btitle | Database and Expert Systems Applications. DEXA 2018 | en_UK
dc.citation.conferencedates | 2018-09-03 - 2018-09-06 | en_UK
dc.citation.conferencelocation | Regensburg, Germany | en_UK
dc.citation.conferencename | 29th International Conference on Database and Expert Systems Applications, DEXA 2018 | en_UK
dc.citation.date | 07/08/2018 | en_UK
dc.citation.isbn | 9783319991320 | en_UK
dc.citation.isbn | 9783319991337 | en_UK
dc.publisher.address | Cham, Switzerland | en_UK
dc.contributor.affiliation | University of Kiel | en_UK
dc.contributor.affiliation | University of Kiel | en_UK
dc.contributor.affiliation | Computing Science | en_UK
dc.identifier.isi | WOS:000460552400019 | en_UK
dc.identifier.scopusid | 2-s2.0-85051950111 | en_UK
dc.identifier.wtid | 972876 | en_UK
dc.contributor.orcid | 0000-0003-4223-5353 | en_UK
dc.contributor.orcid | 0000-0002-2653-9245 | en_UK
dc.date.accepted | 2018-05-18 | en_UK
dcterms.dateAccepted | 2018-05-18 | en_UK
dc.date.filedepositdate | 2018-09-27 | en_UK
rioxxterms.apc | not required | en_UK
rioxxterms.type | Conference Paper/Proceeding/Abstract | en_UK
rioxxterms.version | AM | en_UK
local.rioxx.author | Beck, Tilman| | en_UK
local.rioxx.author | Böschen, Falk|0000-0003-4223-5353 | en_UK
local.rioxx.author | Scherp, Ansgar|0000-0002-2653-9245 | en_UK
local.rioxx.project | Project ID unknown|European Commission (Horizon 2020)| | en_UK
local.rioxx.contributor | Elloumi, M| | en_UK
local.rioxx.contributor | Granitzer, M| | en_UK
local.rioxx.contributor | Hameurlain, A| | en_UK
local.rioxx.contributor | Seifert, C| | en_UK
local.rioxx.contributor | Stein, B| | en_UK
local.rioxx.contributor | Tjoa, AM| | en_UK
local.rioxx.contributor | Wagner, R| | en_UK
local.rioxx.freetoreaddate | 2018-09-27 | en_UK
local.rioxx.licence | http://www.rioxx.net/licenses/all-rights-reserved|2018-09-27| | en_UK
local.rioxx.filename | W43-BeckEtAl-Challenges and Preliminary Results in Selecting Representative Documents.pdf | en_UK
local.rioxx.filecount | 1 | en_UK
local.rioxx.source | 9783319991337 | en_UK
Appears in Collections: Computing Science and Mathematics Conference Papers and Proceedings

Files in This Item:
File | Description | Size | Format
W43-BeckEtAl-Challenges and Preliminary Results in Selecting Representative Documents.pdf | Fulltext - Accepted Version | 725.74 kB | Adobe PDF


This item is protected by original copyright



Items in the Repository are protected by copyright, with all rights reserved, unless otherwise indicated.

The metadata of the records in the Repository are available under the CC0 public domain dedication: No Rights Reserved https://creativecommons.org/publicdomain/zero/1.0/

If you believe that any material held in STORRE infringes copyright, please contact library@stir.ac.uk providing details and we will remove the Work from public display in STORRE and investigate your claim.