Please use this identifier to cite or link to this item:
http://hdl.handle.net/1893/26956
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Christie, Lee A | en_UK |
dc.contributor.author | Brownlee, Alexander | en_UK |
dc.contributor.author | Woodward, John R | en_UK |
dc.date.accessioned | 2018-04-06T22:48:49Z | - |
dc.date.available | 2018-04-06T22:48:49Z | - |
dc.date.issued | 2018-04-30 | en_UK |
dc.identifier.uri | http://hdl.handle.net/1893/26956 | - |
dc.description.abstract | Benchmarks are important to demonstrate the utility of optimisation algorithms, but there is controversy about the practice of benchmarking; we could select instances that present our algorithm favourably, and dismiss those on which our algorithm under-performs. Several papers highlight the pitfalls concerned with benchmarking, some of which concern the context of the automated design of algorithms, where we use a set of problem instances (benchmarks) to train our algorithm. As with machine learning, if the training set does not reflect the test set, the algorithm will not generalize. This raises some open questions concerning the use of test instances to automatically design algorithms. We use differential evolution, and sweep the parameter settings to investigate the practice of benchmarking using the BBOB benchmarks. We make three key findings. Firstly, several benchmark functions are highly correlated. This may lead to the false conclusion that an algorithm performs well in general, when it performs poorly on a few key instances, possibly introducing unwanted bias to a resulting automatically designed algorithm. Secondly, the number of evaluations can have a large effect on the conclusion. Finally, a systematic sweep of the parameters shows how performance varies with time across the space of algorithm configurations. The data sets, including all computed features, the evolved policies, and their performances, and the visualisations for all feature sets, are available from http://hdl.handle.net/11667/109. | en_UK |
dc.language.iso | en | en_UK |
dc.publisher | University of Stirling | en_UK |
dc.relation | Christie LA, Brownlee A & Woodward JR (2018) Investigating Benchmark Correlations when Comparing Algorithms with Parameter Tuning (Detailed Experiments and Results). Not applicable. Stirling: University of Stirling. | en_UK |
dc.relation.uri | http://hdl.handle.net/11667/109 | en_UK |
dc.rights | Authors retains copyright. | en_UK |
dc.subject | benchmarks | en_UK |
dc.subject | BBOB | en_UK |
dc.subject | ranking | en_UK |
dc.subject | differential evolution | en_UK |
dc.subject | continuous optimisation | en_UK |
dc.subject | parameter tuning | en_UK |
dc.subject | automated design of algorithms | en_UK |
dc.title | Investigating Benchmark Correlations when Comparing Algorithms with Parameter Tuning (Detailed Experiments and Results) | en_UK |
dc.type | Technical Report | en_UK |
dc.contributor.sponsor | Not applicable | en_UK |
dc.citation.publicationstatus | Published | en_UK |
dc.citation.peerreviewed | Unrefereed | en_UK |
dc.type.status | AM - Accepted Manuscript | en_UK |
dc.contributor.funder | Engineering and Physical Sciences Research Council | en_UK |
dc.contributor.funder | Engineering and Physical Sciences Research Council | en_UK |
dc.author.email | alexander.brownlee@stir.ac.uk | en_UK |
dc.citation.date | 07/04/2018 | en_UK |
dc.publisher.address | Stirling | en_UK |
dc.description.notes | Work funded by UK EPSRC [grants EP/N002849/1, EP/J017515/1]. Results obtained using the EPSRC funded ARCHIE-WeSt HPC [EPSRC grant EP/K000586/1]. | en_UK |
dc.contributor.affiliation | Computing Science | en_UK |
dc.contributor.affiliation | Computing Science | en_UK |
dc.contributor.affiliation | Queen Mary, University of London | en_UK |
dc.identifier.wtid | 493456 | en_UK |
dc.contributor.orcid | 0000-0001-8878-0344 | en_UK |
dc.contributor.orcid | 0000-0003-2892-5059 | en_UK |
dcterms.dateAccepted | 2018-04-07 | en_UK |
dc.date.filedepositdate | 2018-04-11 | en_UK |
dc.relation.funderproject | FAIME: A Feature based Framework to Automatically Integrate and Improve Metaheuristics via Examples. | en_UK |
dc.relation.funderproject | DAASE: Dynamic Adaptive Automated Software Engineering | en_UK |
dc.relation.funderref | EP/N002849/1 | en_UK |
dc.relation.funderref | EP/J017515/1 | en_UK |
rioxxterms.apc | not required | en_UK |
rioxxterms.type | Technical Report | en_UK |
rioxxterms.version | AM | en_UK |
local.rioxx.author | Christie, Lee A|0000-0001-8878-0344 | en_UK |
local.rioxx.author | Brownlee, Alexander|0000-0003-2892-5059 | en_UK |
local.rioxx.author | Woodward, John R| | en_UK |
local.rioxx.project | EP/N002849/1|Engineering and Physical Sciences Research Council|http://dx.doi.org/10.13039/501100000266 | en_UK |
local.rioxx.project | EP/J017515/1|Engineering and Physical Sciences Research Council|http://dx.doi.org/10.13039/501100000266 | en_UK |
local.rioxx.freetoreaddate | 2018-04-11 | en_UK |
local.rioxx.licence | http://www.rioxx.net/licenses/all-rights-reserved|2018-04-11| | en_UK |
local.rioxx.filename | investigating-benchmark-correlations-techreport.pdf | en_UK |
local.rioxx.filecount | 1 | en_UK |
Appears in Collections: | Computing Science and Mathematics Technical Reports |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
investigating-benchmark-correlations-techreport.pdf | Fulltext - Accepted Version | 986.56 kB | Adobe PDF | View/Open |
This item is protected by original copyright |
Items in the Repository are protected by copyright, with all rights reserved, unless otherwise indicated.
The metadata of the records in the Repository are available under the CC0 public domain dedication: No Rights Reserved https://creativecommons.org/publicdomain/zero/1.0/
If you believe that any material held in STORRE infringes copyright, please contact library@stir.ac.uk providing details and we will remove the Work from public display in STORRE and investigate your claim.