Please use this identifier to cite or link to this item:
http://hdl.handle.net/1893/21796
Appears in Collections: | Computing Science and Mathematics Journal Articles |
Peer Review Status: | Refereed |
Title: | A neurally-inspired musical instrument classification system based upon the sound onset |
Author(s): | Newton, Michael; Smith, Leslie |
Contact Email: | l.s.smith@stir.ac.uk |
Issue Date: | Jun-2012 |
Date Deposited: | 21-May-2015 |
Citation: | Newton M & Smith L (2012) A neurally-inspired musical instrument classification system based upon the sound onset. Journal of the Acoustical Society of America, 131 (6), pp. 4785-4798. https://doi.org/10.1121/1.4707535 |
Abstract: | Physiological evidence suggests that sound onset detection in the auditory system may be performed by specialized neurons as early as the cochlear nucleus. Psychoacoustic evidence shows that the sound onset can be important for the recognition of musical sounds. Here the sound onset is used in isolation to form tone descriptors for a musical instrument classification task. The task involves 2085 isolated musical tones from the McGill dataset across five instrument categories. A neurally inspired tone descriptor is created using a model of the auditory system's response to sound onset. A gammatone filterbank and spiking onset detectors, built from dynamic synapses and leaky integrate-and-fire neurons, create parallel spike trains that emphasize the sound onset. These are coded as a descriptor called the onset fingerprint. Classification uses a time-domain neural network, the echo state network. Reference strategies, based upon mel-frequency cepstral coefficients, evaluated either over the whole tone or only during the sound onset, provide context to the method. Classification success rates for the neurally-inspired method are around 75%. The cepstral methods perform between 73% and 76%. Further testing with tones from the Iowa MIS collection shows that the neurally inspired method is considerably more robust when tested with data from an unrelated dataset. |
DOI Link: | 10.1121/1.4707535 |
Rights: | Publisher policy allows this work to be made available in this repository. Published in Journal of the Acoustical Society of America by Acoustical Society of America. The original publication is available at: http://scitation.aip.org/content/asa/journal/jasa/131/6/10.1121/1.4707535 |
Files in This Item:
File | Description | Size | Format |
---|---|---|---|
FINAL_JAS004785.pdf | Fulltext - Published Version | 1.31 MB | Adobe PDF |
This item is protected by original copyright.
Items in the Repository are protected by copyright, with all rights reserved, unless otherwise indicated.
The metadata of the records in the Repository are available under the CC0 public domain dedication: No Rights Reserved https://creativecommons.org/publicdomain/zero/1.0/
If you believe that any material held in STORRE infringes copyright, please contact library@stir.ac.uk with details; we will remove the Work from public display in STORRE and investigate your claim.