Please use this identifier to cite or link to this item: http://hdl.handle.net/1893/29995
Full metadata record
DC FieldValueLanguage
dc.contributor.authorConnor, Richarden_UK
dc.contributor.authorDearle, Alanen_UK
dc.contributor.authorVadicamo, Luciaen_UK
dc.contributor.editorAmato, Gen_UK
dc.contributor.editorMecella, Men_UK
dc.contributor.editorGennaro, Cen_UK
dc.date.accessioned2019-08-20T00:04:26Z-
dc.date.available2019-08-20T00:04:26Z-
dc.date.issued2019en_UK
dc.identifier.urihttp://hdl.handle.net/1893/29995-
dc.description.abstractSearching for similar strings is an important and frequent database task both in terms of human interactions and in absolute worldwide CPU utilisation. A wealth of metric functions for string comparison exist. However, with respect to the wide range of classification and other techniques known within vector spaces, such metrics allow only a very restricted range of techniques. To counter this restriction, various strategies have been used for mapping string spaces into vector spaces, approximating the string distances within the mapped space and therefore allowing vector space techniques to be used. In previous work we have developed a novel technique for mapping metric spaces into vector spaces, which can therefore be applied for this purpose. In this paper we evaluate this technique in the context of string spaces, and compare it to other published techniques for mapping strings to vectors. We use a publicly available English lexicon as our experimental data set, and test two different string metrics over it for each vector mapping. We find that our novel technique considerably outperforms previously used technique in preserving the actual distance.en_UK
dc.language.isoenen_UK
dc.publisherCEUR-WSen_UK
dc.relationConnor R, Dearle A & Vadicamo L (2019) Modelling string structure in vector spaces. In: Amato G, Mecella M & Gennaro C (eds.) 27th Italian Symposium on Advanced Database Systems. CEUR Workshop Proceedings, 2400. SEBD 2019: Italian Symposium on Advanced Database Systems, Castiglione della Pescaia (Grosseto), Italy, 16.06.2019-19.06.2019. Aachen: CEUR-WS. http://ceur-ws.org/Vol-2400/paper-45.pdfen_UK
dc.relation.ispartofseriesCEUR Workshop Proceedings, 2400en_UK
dc.rightsCopyright 2019 for the individual papers by the papers authors. Copying permitted for private and academic purposes. This volume is published and copyrighted by its editors. SEBD 2019, June 16-19, 2019, Castiglione della Pescaia, Italy.en_UK
dc.rights.urihttps://storre.stir.ac.uk/STORREEndUserLicence.pdfen_UK
dc.subjectMetric Mappingen_UK
dc.subjectn-Simplex projectionen_UK
dc.subjectPivoted embeddingen_UK
dc.subjectStringen_UK
dc.subjectJensen-Shannon distanceen_UK
dc.subjectLevenshtein distanceen_UK
dc.titleModelling string structure in vector spacesen_UK
dc.typeConference Paperen_UK
dc.citation.issn1613-0073en_UK
dc.citation.publicationstatusPublisheden_UK
dc.type.statusVoR - Version of Recorden_UK
dc.identifier.urlhttp://ceur-ws.org/Vol-2400/paper-45.pdfen_UK
dc.citation.btitle27th Italian Symposium on Advanced Database Systemsen_UK
dc.citation.conferencedates2019-06-16 - 2019-06-19en_UK
dc.citation.conferencelocationCastiglione della Pescaia (Grosseto), Italyen_UK
dc.citation.conferencenameSEBD 2019: Italian Symposium on Advanced Database Systemsen_UK
dc.citation.date09/07/2019en_UK
dc.publisher.addressAachenen_UK
dc.contributor.affiliationComputing Scienceen_UK
dc.contributor.affiliationUniversity of St Andrewsen_UK
dc.contributor.affiliationVisual Computing Group, CNR-ISTIen_UK
dc.identifier.scopusid2-s2.0-85069491938en_UK
dc.identifier.wtid1429786en_UK
dc.contributor.orcid0000-0003-4734-8103en_UK
dc.date.accepted2019-04-20en_UK
dcterms.dateAccepted2019-04-20en_UK
dc.date.filedepositdate2019-08-19en_UK
rioxxterms.apcnot requireden_UK
rioxxterms.typeConference Paper/Proceeding/Abstracten_UK
rioxxterms.versionVoRen_UK
local.rioxx.authorConnor, Richard|0000-0003-4734-8103en_UK
local.rioxx.authorDearle, Alan|en_UK
local.rioxx.authorVadicamo, Lucia|en_UK
local.rioxx.projectInternal Project|University of Stirling|https://isni.org/isni/0000000122484331en_UK
local.rioxx.contributorAmato, G|en_UK
local.rioxx.contributorMecella, M|en_UK
local.rioxx.contributorGennaro, C|en_UK
local.rioxx.freetoreaddate2019-08-19en_UK
local.rioxx.licencehttps://storre.stir.ac.uk/STORREEndUserLicence.pdf|2019-08-19|en_UK
local.rioxx.filenamepaper-45.pdfen_UK
local.rioxx.filecount1en_UK
local.rioxx.source1613-0073en_UK
Appears in Collections:Computing Science and Mathematics Conference Papers and Proceedings

Files in This Item:
File Description SizeFormat 
paper-45.pdfFulltext - Published Version2.98 MBAdobe PDFView/Open


This item is protected by original copyright



Items in the Repository are protected by copyright, with all rights reserved, unless otherwise indicated.

The metadata of the records in the Repository are available under the CC0 public domain dedication: No Rights Reserved https://creativecommons.org/publicdomain/zero/1.0/

If you believe that any material held in STORRE infringes copyright, please contact library@stir.ac.uk providing details and we will remove the Work from public display in STORRE and investigate your claim.