Investigating Benchmark Correlations when Comparing Algorithms with Parameter Tuning (Detailed Experiments and Results)

Christie, Lee A; Brownlee, Alexander; Woodward, John R

Please use this identifier to cite or link to this item: http://hdl.handle.net/1893/26956

Full metadata record

DC Field	Value	Language
dc.contributor.author	Christie, Lee A	en_UK
dc.contributor.author	Brownlee, Alexander	en_UK
dc.contributor.author	Woodward, John R	en_UK
dc.date.accessioned	2018-04-06T22:48:49Z	-
dc.date.available	2018-04-06T22:48:49Z	-
dc.date.issued	2018-04-30	en_UK
dc.identifier.uri	http://hdl.handle.net/1893/26956	-
dc.description.abstract	Benchmarks are important for demonstrating the utility of optimisation algorithms, but the practice of benchmarking is controversial: we could select instances that present our algorithm favourably and dismiss those on which it under-performs. Several papers highlight the pitfalls of benchmarking, some of which arise in the automated design of algorithms, where a set of problem instances (benchmarks) is used to train the algorithm. As in machine learning, if the training set does not reflect the test set, the algorithm will not generalise. This raises open questions about the use of test instances to automatically design algorithms. We use differential evolution and sweep its parameter settings to investigate the practice of benchmarking on the BBOB benchmarks. We make three key findings. Firstly, several benchmark functions are highly correlated. This may lead to the false conclusion that an algorithm performs well in general when it performs poorly on a few key instances, possibly introducing unwanted bias into a resulting automatically designed algorithm. Secondly, the number of evaluations can have a large effect on the conclusion. Finally, a systematic sweep of the parameters shows how performance varies with time across the space of algorithm configurations. The data sets, including all computed features, the evolved policies, and their performances, and the visualisations for all feature sets, are available from http://hdl.handle.net/11667/109.	en_UK
dc.language.iso	en	en_UK
dc.publisher	University of Stirling	en_UK
dc.relation	Christie LA, Brownlee A & Woodward JR (2018) Investigating Benchmark Correlations when Comparing Algorithms with Parameter Tuning (Detailed Experiments and Results). Not applicable. Stirling: University of Stirling.	en_UK
dc.relation.uri	http://hdl.handle.net/11667/109	en_UK
dc.rights	Authors retain copyright.	en_UK
dc.subject	benchmarks	en_UK
dc.subject	BBOB	en_UK
dc.subject	ranking	en_UK
dc.subject	differential evolution	en_UK
dc.subject	continuous optimisation	en_UK
dc.subject	parameter tuning	en_UK
dc.subject	automated design of algorithms	en_UK
dc.title	Investigating Benchmark Correlations when Comparing Algorithms with Parameter Tuning (Detailed Experiments and Results)	en_UK
dc.type	Technical Report	en_UK
dc.contributor.sponsor	Not applicable	en_UK
dc.citation.publicationstatus	Published	en_UK
dc.citation.peerreviewed	Unrefereed	en_UK
dc.type.status	AM - Accepted Manuscript	en_UK
dc.contributor.funder	Engineering and Physical Sciences Research Council	en_UK
dc.contributor.funder	Engineering and Physical Sciences Research Council	en_UK
dc.author.email	alexander.brownlee@stir.ac.uk	en_UK
dc.citation.date	07/04/2018	en_UK
dc.publisher.address	Stirling	en_UK
dc.description.notes	Work funded by UK EPSRC [grants EP/N002849/1, EP/J017515/1]. Results obtained using the EPSRC funded ARCHIE-WeSt HPC [EPSRC grant EP/K000586/1].	en_UK
dc.contributor.affiliation	Computing Science	en_UK
dc.contributor.affiliation	Computing Science	en_UK
dc.contributor.affiliation	Queen Mary, University of London	en_UK
dc.identifier.wtid	493456	en_UK
dc.contributor.orcid	0000-0001-8878-0344	en_UK
dc.contributor.orcid	0000-0003-2892-5059	en_UK
dcterms.dateAccepted	2018-04-07	en_UK
dc.date.filedepositdate	2018-04-11	en_UK
dc.relation.funderproject	FAIME: A Feature based Framework to Automatically Integrate and Improve Metaheuristics via Examples.	en_UK
dc.relation.funderproject	DAASE: Dynamic Adaptive Automated Software Engineering	en_UK
dc.relation.funderref	EP/N002849/1	en_UK
dc.relation.funderref	EP/J017515/1	en_UK
rioxxterms.apc	not required	en_UK
rioxxterms.type	Technical Report	en_UK
rioxxterms.version	AM	en_UK
local.rioxx.author	Christie, Lee A|0000-0001-8878-0344	en_UK
local.rioxx.author	Brownlee, Alexander|0000-0003-2892-5059	en_UK
local.rioxx.author	Woodward, John R|	en_UK
local.rioxx.project	EP/N002849/1|Engineering and Physical Sciences Research Council|http://dx.doi.org/10.13039/501100000266	en_UK
local.rioxx.project	EP/J017515/1|Engineering and Physical Sciences Research Council|http://dx.doi.org/10.13039/501100000266	en_UK
local.rioxx.freetoreaddate	2018-04-11	en_UK
local.rioxx.licence	http://www.rioxx.net/licenses/all-rights-reserved|2018-04-11|	en_UK
local.rioxx.filename	investigating-benchmark-correlations-techreport.pdf	en_UK
local.rioxx.filecount	1	en_UK
Appears in Collections:	Computing Science and Mathematics Technical Reports

Files in This Item:

File	Description	Size	Format
investigating-benchmark-correlations-techreport.pdf	Fulltext - Accepted Version	986.56 kB	Adobe PDF

This item is protected by original copyright

Items in the Repository are protected by copyright, with all rights reserved, unless otherwise indicated.

The metadata of the records in the Repository are available under the CC0 public domain dedication: No Rights Reserved https://creativecommons.org/publicdomain/zero/1.0/

STORRE: Stirling Online Research Repository