Mixed Order Hyper-Networks for Function Approximation and Optimisation

Swingler, Kevin

Please use this identifier to cite or link to this item: http://hdl.handle.net/1893/25349

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	Smith, Leslie	-
dc.contributor.advisor	Hussain, Amir	-
dc.contributor.author	Swingler, Kevin	-
dc.date.accessioned	2017-05-17T12:25:12Z	-
dc.date.available	2017-05-17T12:25:12Z	-
dc.date.issued	2016-05	-
dc.identifier.uri	http://hdl.handle.net/1893/25349	-
dc.description.abstract	Many systems take inputs, which can be measured and sometimes controlled, and produce outputs, which can also be measured and which depend on the inputs. Taking numerous measurements from such systems produces data, which may be used either to model the system, with the goal of predicting the output associated with a given input (function approximation, or regression), or to find the input settings required to produce a desired output (optimisation, or search). Approximating or optimising a function is central to the field of computational intelligence. There are many existing methods for performing regression and optimisation based on samples of data, but they all have limitations. Multi-layer perceptrons (MLPs) are universal approximators, but they suffer from the black box problem, which means that their structure and the function they implement are opaque to the user. They also suffer from a propensity to become trapped in local minima or large plateaux in the error function during learning. A regression method with a structure that allows models to be compared, human knowledge to be extracted, optimisation searches to be guided and model complexity to be controlled is desirable. This thesis presents such a method: a single framework for both regression and optimisation, the mixed order hyper network (MOHN). A MOHN implements a function f: {-1,1}^n -> R to arbitrary precision. The structure of a MOHN makes explicit the ways in which input variables interact to determine the function output, which allows human insights and complexity control that are very difficult in neural networks with hidden units. The explicit structure representation also allows efficient algorithms for searching for an input pattern that leads to a desired output. A number of learning rules for estimating the weights based on a sample of data are presented, along with a heuristic method for choosing which connections to include in a model. 
Several methods for searching a MOHN for inputs that lead to a desired output are compared. Experiments compare a MOHN to an MLP on regression tasks. The MOHN is found to achieve a level of accuracy comparable to that of an MLP, but it suffers less from local minima in the error function and shows less variance across multiple training trials. MOHNs are also easier to interpret and to combine from an ensemble. The trade-off between the fit of a model to its training data and its fit to an independent set of test data is shown to be easier to control in a MOHN than in an MLP. A MOHN is also compared to a number of existing optimisation methods, including estimation of distribution algorithms, genetic algorithms and simulated annealing. The MOHN is able to find optimal solutions in far fewer function evaluations than these methods on tasks selected from the literature.	en_GB
dc.language.iso	en	en_GB
dc.publisher	University of Stirling	en_GB
dc.subject	Neural networks	en_GB
dc.subject	Optimisation	en_GB
dc.subject	Machine Learning	en_GB
dc.subject	Estimation of Distribution Algorithms	en_GB
dc.subject.lcsh	Computers Data processing	en_GB
dc.subject.lcsh	Computer algorithms	en_GB
dc.subject.lcsh	Machine learning	en_GB
dc.subject.lcsh	Neural networks	en_GB
dc.subject.lcsh	Combinatorial analysis Data processing	en_GB
dc.title	Mixed Order Hyper-Networks for Function Approximation and Optimisation	en_GB
dc.type	Thesis or Dissertation	en_GB
dc.type.qualificationlevel	Doctoral	en_GB
dc.type.qualificationname	Doctor of Philosophy	en_GB
dc.author.email	kms@cs.stir.ac.uk	en_GB
Appears in Collections:	Computing Science and Mathematics eTheses
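
The abstract states that a MOHN implements f: {-1,1}^n -> R with explicit input interactions, but it does not give the model's equation. The following is a minimal sketch, assuming the output is a weighted sum of products of inputs, with one product per connection (a subset of input indices) and the empty subset acting as a bias. The function names, the exhaustive weight estimate and the example target are illustrative assumptions, not the thesis's own learning rules or code.

```python
from itertools import combinations, product
from math import prod


def all_connections(n, max_order):
    """Every subset of input indices up to max_order; () is the bias term."""
    return [c for k in range(max_order + 1) for c in combinations(range(n), k)]


def mohn_output(weights, x):
    """Weighted sum over connections of the product of the inputs they join."""
    return sum(w * prod(x[i] for i in conn) for conn, w in weights.items())


def fit_exact(f, n, max_order):
    """Set each weight to the mean of f(x) * prod(x_i) over all 2^n inputs.

    This exhaustive estimate is only feasible for small n; it is used here
    purely to illustrate the structure (assumption, not the thesis's rule).
    """
    patterns = list(product((-1, 1), repeat=n))
    return {
        conn: sum(f(x) * prod(x[i] for i in conn) for x in patterns) / len(patterns)
        for conn in all_connections(n, max_order)
    }


def target(x):
    """Hypothetical target: one first-order and one second-order interaction."""
    return 2.0 * x[0] - 3.0 * x[1] * x[2] + 0.5


weights = fit_exact(target, n=3, max_order=2)
# Keep only the connections that carry weight, to expose the structure.
active = {conn: w for conn, w in weights.items() if w != 0}
print(active)
```

In this sketch the non-zero weights name exactly the interactions present in the target, which is the kind of readable structure the abstract contrasts with hidden units in an MLP.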

Files in This Item:

File	Description	Size	Format
thesisSwingler.pdf	Thesis	1.6 MB	Adobe PDF

This item is protected by original copyright

Items in the Repository are protected by copyright, with all rights reserved, unless otherwise indicated.

The metadata of the records in the Repository are available under the CC0 public domain dedication: No Rights Reserved https://creativecommons.org/publicdomain/zero/1.0/

If you believe that any material held in STORRE infringes copyright, please contact library@stir.ac.uk providing details and we will remove the Work from public display in STORRE and investigate your claim.

STORRE: Stirling Online Research Repository