The utility of image descriptions in the initial stages of vision: a case study of printed text

Watt, Roger; Dakin, Steven C

doi:10.1348/000712608X379070

Please use this identifier to cite or link to this item: http://hdl.handle.net/1893/954

Appears in Collections:	Psychology Journal Articles
Peer Review Status:	Refereed
Title:	The utility of image descriptions in the initial stages of vision: a case study of printed text
Author(s):	Watt, Roger Dakin, Steven C
Contact Email:	r.j.watt@stir.ac.uk
Keywords:	Vision Research Computational neuroscience Vision physiology
Issue Date:	Feb-2010
Date Deposited:	17-Mar-2009
Citation:	Watt R & Dakin SC (2010) The utility of image descriptions in the initial stages of vision: a case study of printed text. British Journal of Psychology, 101 (1), pp. 1-26. http://onlinelibrary.wiley.com/doi/10.1348/000712608X379070/abstract; https://doi.org/10.1348/000712608X379070
Abstract:	Vision research has made very substantial progress towards understanding how we see. It is one area of psychology where the three-way thrust of behavioural measurements (psychophysics), brain imaging, and computational studies have been combined quite routinely for some years. The purpose of this paper is to demonstrate a relatively unusual form of computational modelling which we characterise as involving image descriptions. Image descriptions are statements about structures in images and relationships between structures. Most modelling in vision is either conceived in fairly abstract terms, or is done at the level of images. Neither is entirely satisfactory, and image descriptions are a simple formulation of age-old ideas about a vocabulary of image features that are detected and parameterized from actual digital images. For our example, we use the domain of the visual perception of printed text. This is an area that has been characterized by thorough, robust psychophysical experiments. The fundamental requirements of visual processing in this domain are: grouping of some parts if the image into words; at the same time segmenting words from each other. We show how these are readily understood in terms of our model of image descriptions, and show quantitatively that typographical practice, refined over centuries, is about optimum for the visual system at least as represented by our model. In addition, we show that the same notion of image descriptions could, in principle, support word recognition in certain circumstances.
URL:	http://onlinelibrary.wiley.com/doi/10.1348/000712608X379070/abstract
DOI Link:	10.1348/000712608X379070
Rights:	Published in British Journal of Psychology by British Psychological Society.

Files in This Item:

File	Description	Size	Format
Watt3.pdf	Fulltext - Accepted Version	1.21 MB	Adobe PDF	View/Open

This item is protected by original copyright

View License

Show full item record

Items in the Repository are protected by copyright, with all rights reserved, unless otherwise indicated.

The metadata of the records in the Repository are available under the CC0 public domain dedication: No Rights Reserved https://creativecommons.org/publicdomain/zero/1.0/

If you believe that any material held in STORRE infringes copyright, please contact library@stir.ac.uk providing details and we will remove the Work from public display in STORRE and investigate your claim.

STORRE

STORRE: Stirling Online Research Repository