Please use this identifier to cite or link to this item: http://hdl.handle.net/1893/28199
Appears in Collections:Computing Science and Mathematics Conference Papers and Proceedings
Author(s): Amato, Giuseppe
Chávez, Edgar
Connor, Richard
Falchi, Fabrizio
Gennaro, Claudio
Vadicamo, Lucia
Title: Re-ranking Permutation-Based Candidate Sets with the n-Simplex Projection
Editor(s): Marchand-Maillet, S.
Silva, Y. N.
Chavez, E.
Citation: Amato G, Chávez E, Connor R, Falchi F, Gennaro C & Vadicamo L (2018) Re-ranking Permutation-Based Candidate Sets with the n-Simplex Projection. In: Marchand-Maillet S, Silva YN & Chavez E (eds.) 11th International Conference on Similarity Search and Applications, SISAP 2018. Lecture Notes in Computer Science, 11223. SISAP 2018: International Conference on Similarity Search and Applications, Lima, Peru, 07.10.2018-09.10.2018. Cham, Switzerland: Springer Verlag, pp. 3-17. https://doi.org/10.1007/978-3-030-02224-2_1
Issue Date: 31-Dec-2018
Date Deposited: 9-Nov-2018
Series/Report no.: Lecture Notes in Computer Science, 11223
Conference Name: SISAP 2018: International Conference on Similarity Search and Applications
Conference Dates: 2018-10-07 - 2018-10-09
Conference Location: Lima, Peru
Abstract: In the realm of metric search, the permutation-based approaches have shown very good performance in indexing and supporting approximate search on large databases. These methods embed the metric objects into a permutation space where candidate results to a given query can be efficiently identified. Typically, to achieve high effectiveness, the permutation-based result set is refined by directly comparing each candidate object to the query one. Therefore, one drawback of these approaches is that the original dataset needs to be stored and then accessed during the refining step. We propose a refining approach based on a metric embedding, called n-Simplex projection, that can be used on metric spaces meeting the n-point property. The n-Simplex projection provides upper- and lower-bounds of the actual distance, derived using the distances between the data objects and a finite set of pivots. We propose to reuse the distances computed for building the data permutations to derive these bounds and we show how to use them to improve the permutation-based results. Our approach is particularly advantageous for all the cases in which the traditional refining step is too costly, e.g. very large dataset or very expensive metric function.
Status: AM - Accepted Manuscript
Rights: This is a post-peer-review, pre-copyedit version of a paper published in Marchand-Maillet S, Silva YN & Chavez E (eds.) 11th International Conference on Similarity Search and Applications, SISAP 2018. Lecture Notes in Computer Science, 11223. The final authenticated version is available online at: https://doi.org/10.1007/978-3-030-02224-2_1

Files in This Item:
File Description SizeFormat 
SISAP_2018_reranking.pdfFulltext - Accepted Version1.41 MBAdobe PDFView/Open



This item is protected by original copyright



Items in the Repository are protected by copyright, with all rights reserved, unless otherwise indicated.

The metadata of the records in the Repository are available under the CC0 public domain dedication: No Rights Reserved https://creativecommons.org/publicdomain/zero/1.0/

If you believe that any material held in STORRE infringes copyright, please contact library@stir.ac.uk providing details and we will remove the Work from public display in STORRE and investigate your claim.