Please use this identifier to cite or link to this item:
http://hdl.handle.net/1893/27713
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Connor, Richard | en_UK |
dc.contributor.author | Moss, Robert | en_UK |
dc.contributor.editor | Navarro, Gonzalo | en_UK |
dc.contributor.editor | Pestov, Vladimir | en_UK |
dc.date.accessioned | 2018-09-05T13:55:46Z | - |
dc.date.available | 2018-09-05T13:55:46Z | - |
dc.date.issued | 2012-12-31 | en_UK |
dc.identifier.uri | http://hdl.handle.net/1893/27713 | - |
dc.description.abstract | We investigate a distance metric, previously defined for the measurement of structured data, in the more general context of vector spaces. The metric has a basis in information theory and assesses the distance between two vectors in terms of their relative information content. The resulting metric gives an outcome based on the dimensional correlation, rather than magnitude, of the input vectors, in a manner similar to Cosine Distance. In this paper the metric is defined, and assessed, in comparison with Cosine Distance, for its major properties: semantics, properties for use within similarity search, and evaluation efficiency. We find that it is fairly well correlated with Cosine Distance in dense spaces, but its semantics are in some cases preferable. In a sparse space, it significantly outperforms Cosine Distance over TREC data and queries, the only large collection for which we have a human-ratified ground truth. This result is backed up by another experiment over MovieLens data. In dense Cartesian spaces it has better properties for use with similarity indices than either Cosine or Euclidean Distance. In its definitional form it is very expensive to evaluate for high-dimensional sparse vectors; to counter this, we show an algebraic rewrite which allows its evaluation to be performed more efficiently. Overall, when a multivariate correlation metric is required over positive vectors, SED seems to be a better choice than Cosine Distance in many circumstances. | en_UK |
dc.language.iso | en | en_UK |
dc.publisher | Springer Verlag | en_UK |
dc.relation | Connor R & Moss R (2012) A multivariate correlation distance for vector spaces. In: Navarro G & Pestov V (eds.) Similarity Search and Applications: 5th International Conference, SISAP 2012, Toronto, ON, Canada, August 9-10, 2012. Proceedings. Lecture Notes in Computer Science, 7404. Similarity Search and Applications: 5th International Conference, SISAP 2012, Toronto, 09.08.2012-10.08.2012. Berlin, Heidelberg: Springer Verlag, pp. 209-225. https://doi.org/10.1007/978-3-642-32153-5_15 | en_UK |
dc.relation.ispartofseries | Lecture Notes in Computer Science, 7404 | en_UK |
dc.rights | The publisher does not allow this work to be made publicly available in this Repository. Please use the Request a Copy feature at the foot of the Repository record to request a copy directly from the author. You can only request a copy if you wish to use this work for your own research or private study. | en_UK |
dc.rights.uri | http://www.rioxx.net/licenses/under-embargo-all-rights-reserved | en_UK |
dc.subject | Distance metric | en_UK |
dc.subject | multivariate correlation | en_UK |
dc.subject | vector space | en_UK |
dc.subject | cosine distance | en_UK |
dc.subject | similarity search | en_UK |
dc.title | A multivariate correlation distance for vector spaces | en_UK |
dc.type | Conference Paper | en_UK |
dc.rights.embargodate | 2999-12-31 | en_UK |
dc.rights.embargoreason | [Connor Moss 2012.pdf] The publisher does not allow this work to be made publicly available in this Repository therefore there is an embargo on the full text of the work. | en_UK |
dc.identifier.doi | 10.1007/978-3-642-32153-5_15 | en_UK |
dc.citation.jtitle | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | en_UK |
dc.citation.issn | 0302-9743 | en_UK |
dc.citation.spage | 209 | en_UK |
dc.citation.epage | 225 | en_UK |
dc.citation.publicationstatus | Published | en_UK |
dc.type.status | VoR - Version of Record | en_UK |
dc.author.email | richard.connor@stir.ac.uk | en_UK |
dc.citation.btitle | Similarity Search and Applications: 5th International Conference, SISAP 2012, Toronto, ON, Canada, August 9-10, 2012. Proceedings | en_UK |
dc.citation.conferencedates | 2012-08-09 - 2012-08-10 | en_UK |
dc.citation.conferencelocation | Toronto | en_UK |
dc.citation.conferencename | Similarity Search and Applications: 5th International Conference, SISAP 2012 | en_UK |
dc.citation.isbn | 978-3-642-32152-8 | en_UK |
dc.publisher.address | Berlin, Heidelberg | en_UK |
dc.contributor.affiliation | University of Strathclyde | en_UK |
dc.contributor.affiliation | University of Strathclyde | en_UK |
dc.identifier.scopusid | 2-s2.0-84865484370 | en_UK |
dc.identifier.wtid | 956103 | en_UK |
dc.contributor.orcid | 0000-0003-4734-8103 | en_UK |
dcterms.dateAccepted | 2012-12-31 | en_UK |
dc.date.filedepositdate | 2018-08-16 | en_UK |
rioxxterms.apc | not required | en_UK |
rioxxterms.type | Conference Paper/Proceeding/Abstract | en_UK |
rioxxterms.version | VoR | en_UK |
local.rioxx.author | Connor, Richard|0000-0003-4734-8103 | en_UK |
local.rioxx.author | Moss, Robert| | en_UK |
local.rioxx.project | Internal Project|University of Stirling|https://isni.org/isni/0000000122484331 | en_UK |
local.rioxx.contributor | Navarro, Gonzalo| | en_UK |
local.rioxx.contributor | Pestov, Vladimir| | en_UK |
local.rioxx.freetoreaddate | 2262-12-01 | en_UK |
local.rioxx.licence | http://www.rioxx.net/licenses/under-embargo-all-rights-reserved|| | en_UK |
local.rioxx.filename | Connor Moss 2012.pdf | en_UK |
local.rioxx.filecount | 1 | en_UK |
local.rioxx.source | 978-3-642-32152-8 | en_UK |
Appears in Collections: | Computing Science and Mathematics Conference Papers and Proceedings |
Files in This Item:
File | Description | Size | Format | Access |
---|---|---|---|---|
Connor Moss 2012.pdf | Fulltext - Published Version | 873.04 kB | Adobe PDF | Under Permanent Embargo (Request a copy) |
This item is protected by original copyright
Items in the Repository are protected by copyright, with all rights reserved, unless otherwise indicated.
The metadata of the records in the Repository are available under the CC0 public domain dedication: No Rights Reserved https://creativecommons.org/publicdomain/zero/1.0/