Image Processing With Neural Networks-A Review: M. Egmont-Petersen, D. de Ridder, H. Handels
Image Processing With Neural Networks-A Review: M. Egmont-Petersen, D. de Ridder, H. Handels
Image Processing With Neural Networks-A Review: M. Egmont-Petersen, D. de Ridder, H. Handels
www.elsevier.com/locate/patcog
of Information and Computing Sciences, Utrecht University, P.O.B. 80 089, 3508 TB Utrecht, Netherlands Recognition Group, Department of Applied Physics, Delft University of Technology, Delft, Netherlands c Department of Medical Informatics, Lubeck Medical University, Lubeck, Germany Received 12 May 2001; accepted 21 August 2001
Abstract We review more than 200 applications of neural networks in image processing and discuss the present and possible future role of neural networks, especially feed-forward neural networks, Kohonen feature maps and Hopeld neural networks. The various applications are categorised into a novel two-dimensional taxonomy for image processing algorithms. One dimension species the type of task performed by the algorithm: preprocessing, data reduction=feature extraction, segmentation, object recognition, image understanding and optimisation. The other dimension captures the abstraction level of the input data processed by the algorithm: pixel-level, local feature-level, structure-level, object-level, object-set-level and scene characterisation. Each of the six types of tasks poses specic constraints to a neural-based approach. These specic conditions are discussed in detail. A synthesis is made of unresolved problems related to the application of pattern recognition techniques in image processing and specically to the application of neural networks. Finally, we present an outlook into the future application of neural networks and relate them to novel developments. ? 2002 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved.
Keywords: Neural networks; Digital image processing; Invariant pattern recognition; Preprocessing; Feature extraction; Image compression; Segmentation; Object recognition; Image understanding; Optimization
1. Introduction Techniques from statistical pattern recognition have, since the revival of neural networks, obtained a widespread use in digital image processing. Initially, pattern recognition problems were often solved by linear and quadratic discriminants [1] or the (non-parametric) k-nearest neighbour classier and the Parzen density estimator [2,3]. In the mid-eighties, the PDP group [4] together with others, introduced the back-propagation learning algorithm for neural networks. This algorithm for the rst time made it feasible to train a non-linear
Corresponding author. Tel.: +31-30-253-4129; fax: +31-30-251-3791 E-mail address: michael@cs.uu.nl (M. Egmont-Petersen). URL: http:==www.cs.uu.nl=people=michael=nn-review.html.
neural network equipped with layers of the so-called hidden nodes. Since then, neural networks with one or more hidden layers can, in theory, be trained to perform virtually any regression or discrimination task. Moreover, no assumptions are made as with respect to the type of underlying (parametric) distribution of the input variables, which may be nominal, ordinal, real or any combination hereof. In their 1993 review article on image segmentation, Pal and Pal predicted that neural networks would become widely applied in image processing [5]. This prediction turned out to be right. In this review article, we survey applications of neural networks developed to solve different problems in image processing (for a review of neural networks used for 1D signal processing, see Ref. [6]). There are two central questions which we will try to answer in this review article:
0031-3203/02/$22.00 ? 2002 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved. PII: S 0 0 3 1 - 3 2 0 3 ( 0 1 ) 0 0 1 7 8 - 9
2280
1. What are major applications of neural networks in image processing now and in the nearby future? 2. Which are the major strengths and weaknesses of neural networks for solving image processing tasks? To facilitate a systematic review of neural networks in image processing, we propose a two-dimensional taxonomy for image processing techniques in Section 2. This taxonomy establishes a framework in which the advantages and unresolved problems can be structured in relation to the application of neural networks in image processing (Section 3). Section 4 discusses some real-world applications of neural networks in image processing. In Section 5, identied problems are considered and Section 6 presents an overview of future research issues which need to be resolved or investigated further as to expedite the application of neural networks in image processing. A number of future trends are also brie y sketched. In the paper, we will not consider the basic theory of neural networks. The reader is referred to standard text books, e.g., Ref. [7]. 2. Taxonomy for image processing algorithms Traditional techniques from statistical pattern recognition like the Bayesian discriminant and the Parzen windows were popular until the beginning of the 1990s. Since then, neural networks (ANNs) have increasingly been used as an alternative to classic pattern classiers and clustering techniques. Non-parametric feed-forward ANNs quickly turned out to be attractive trainable machines for feature-based segmentation and object recognition. When no gold standard is available, the self-organising feature map (SOM) is an interesting alternative to supervised techniques. It may learn to discriminate, e.g., di erent textures when provided with powerful features. The current use of ANNs in image processing exceeds the aforementioned traditional applications. The role of feed-forward ANNs and SOMs has been extended to encompass also low-level image processing tasks such as noise suppression and image enhancement. Hopeld ANNs were introduced as a tool for nding satisfactory solutions to complex (NP-complete) optimisation problems. This makes them an interesting alternative to traditional optimisation algorithms for image processing tasks that can be formulated as optimisation problems. The di erent problems addressed in the eld of digital image processing can be organised into what we have chosen to call the image processing chain. We make the following distinction between steps in the image processing chain (see Fig. 1): 1. Preprocessing=ltering. Operations that give as a result a modied image with the same dimensions as
2.
3.
4.
5. 6.
the original image (e.g., contrast enhancement and noise reduction). Data reduction=feature extraction. Any operation that extracts signicant components from an image (window). The number of extracted features is generally smaller than the number of pixels in the input window. Segmentation. Any operation that partitions an image into regions that are coherent with respect to some criterion. One example is the segregation of di erent textures. Object detection and recognition. Determining the position and, possibly, also the orientation and scale of specic objects in an image, and classifying these objects. Image understanding. Obtaining high level (semantic) knowledge of what an image shows. Optimisation. Minimisation of a criterion function which may be used for, e.g., graph matching or object delineation.
Optimisation techniques are not seen as a separate step in the image processing chain but as a set of auxiliary techniques, which support the other steps. Besides the actual task performed by an algorithm, its processing capabilities are partly determined by the abstraction level of the input data. We distinguish between the following abstraction levels: A. Pixel level. The intensities of individual pixels are provided as input to the algorithm. B. Local feature level. A set of derived, pixel-based features constitutes the input. C. Structure (edge) level. The relative location of one or more perceptual features (e.g., edges, corners, junctions, surfaces, etc.). D. Object level. Properties of individual objects. E. Object set level. The mutual order and relative location of detected objects. F. Scene characterisation. A complete description of the scene possibly including lighting conditions, context, etc. Table 1 contains the taxonomy of image processing algorithms that results from combining the steps of the image processing chain with the abstraction level of the input data. 3. Neural networks in image processing In this section, we will review neural networks trained to perform one of the six tasks in the image processing chain (3.13.6). 3.1. Preprocessing The rst step in the image processing chain consists of preprocessing. Loosely dened, by preprocessing we
2281
Scene analysis Object arrangement
Preprocessing
Data reduction
Segmentation
Object recognition
Image understanding
Optimisation
Fig. 1. The image processing chain containing the ve di erent tasks: preprocessing, data reduction, segmentation, object recognition and image understanding. Optimisation techniques are used as a set of auxiliary tools that are available in all steps of the image processing chain. Table 1 The image processing tasks categorised into a two-dimensional taxonomya Preprocessing Pixel Feature Structure Object Object set Scene 26 4 Compression= feature extract 25 2 Segmentation 39 19 2 Recognition 51 38 6 2 Image understanding 3 2 2 Optimisation 5 3 5 1
a Each cell contains the number of applications in our survey where neural networks accomplish a specic task in the image processing chain.
mean any operation of which the input consists of sensor data, and of which the output is a full image. Preprocessing operations generally fall into one of three categories: image reconstruction (to reconstruct an image from a number of sensor measurements), image restoration (to remove any aberrations introduced by the sensor, including noise) and image enhancement (accentuation of certain desired features, which may facilitate later processing steps such as segmentation or object recognition). Applications of ANNs in these three preprocessing categories will be discussed separately below. The majority of the ANNs were applied directly to pixel data (level A); only four networks were applied to more high-level data (features, level B). 3.1.1. Image reconstruction Image reconstruction problems often require quite complex computations and a unique approach is needed for each application. In Ref. [8], an ADALINE network is trained to perform an electrical impedance tomography (EIT) reconstruction, i.e., a reconstruction of a 2D image
based on 1D measurements on the circumference of the image. Srinivasan et al. [9] trained a modied Hopeld network to perform the inverse Radon transform (e.g., for reconstruction of computerised tomography images). The Hopeld network contained summation layers to avoid having to interconnect all units. Meyer and Heindl [10] used regression feed-forward networks (that learn the mapping E(y|x), with x the vector of input variables and y the desired output vector) to reconstruct images from electron holograms. Wang and Wahl trained a Hopeld ANN for reconstruction of 2D images from pixel data obtained from projections [11]. 3.1.2. Image restoration The majority of applications of ANNs in preprocessing can be found in image restoration [1231]. In general, one wants to restore an image that is distorted by the (physical) measurement system. The system might introduce noise, motion blur, out-of-focus blur, distortion caused by low resolution, etc. Restoration can employ all information about the nature of the distortions introduced
2282
by the system, e.g., the point spread function. The restoration problem is ill-posed because con icting criteria need to be fullled: resolution versus smoothness. The neural-network applications we reviewed had various designs ranging from relatively straightforward to highly complex, modular approaches. In the most basic image restoration approach, noise is removed from an image by simple ltering. Greenhil and Davies [18] used a regression feed-forward network in a convolution-like way to suppress noise (with a 5 5 pixel window as input and one output node). De Ridder et al. built a modular feed-forward ANN approach that mimics the behaviour of the Kuwahara lter, an edge-preserving smoothing lter [16]. Their experiments showed that the mean squared error used in ANN training may not be representative of the problem at hand. Furthermore, unconstrained feed-forward networks often ended up in a linear approximation to the Kuwahara lter. Chua and Yang [14,15] used cellular neural networks (CNNs) for image processing. A CNN is a system in which nodes are locally connected [23]. Each node contains a feedback template and a control template, which to a large extent determine the functionality of the network. For noise suppression, the templates implement an averaging function; for edge detection, a Laplacian operator. The system operates locally, but multiple iterations allow it to distribute global information throughout the nodes. Although quite fast in application, a disadvantage is that the parameters in uencing the network behaviour (the feedback and control templates) have to be set by hand. Others have proposed methods for training CNNs, e.g., using gradient descent or genetic algorithms (grey-value images, Zamparelli [30]). CNNs were also applied for restoration of colour images by Lee and Degyvez [21]. Another interesting ANN architecture is the generalised adaptive neural lter (GANF) [20,31] which has been used for noise suppression. A GANF consists of a set of neural operators, based on stack lters [12] that uses binary decompositions of grey-value data. Finally, fuzzy ANNs [27,28] and the neurochips described in Ref. [22] have been applied to image restoration as well. Traditional methods for more complex restoration problems such as deblurring and diminishing out-offocus defects, are maximum a posteriori estimation (MAP) and regularisation. Applying these techniques entails solving high-dimensional convex optimisation tasks. The objective functions of MAP estimation or the regularisation problem can both be mapped onto the energy function of the Hopeld network [13,17,24,29]. Often, mapping the problem turned out to be di cult, so in some cases the network architecture had to be modied as well. Other types of networks have also been applied to image restoration. Qian et al. [26] developed a hybrid system consisting of order statistic lters for noise removal and a Hopeld network for deblurring (by optimising a
criterion function). The modulation transfer function had to be measured in advance. Guan et al. [19] developed a so-called network-of-networks for image restoration. Their system consists of loosely coupled modules, where each module is a separate ANN. Phoha and Oldham [25] proposed a layered, competitive network to reconstruct a distorted image. 3.1.3. Image enhancement The goal of image enhancement is to amplify specic (perceptual) features. Among the applications where ANNs have been developed for image enhancement [32 42], one would expect most applications to be based on regression ANNs [37,38,40,42]. However, several enhancement approaches rely on a classier, typically resulting in a binary output image [32,35,36,39]. The most well-known enhancement problem is edge detection. A straightforward application of regression feed-forward ANNs, trained to behave like edge detectors, was reported by Pugmire et al. [38]. Chandresakaran et al. [32] used a novel feed-forward architecture to classify an input window as containing an edge or not. The weights of the network were set manually instead of being obtained from training. A number of more complex, modular systems were also proposed [37,40]. Formulating edge detection as an optimisation problem made it possible for Tsai et al. to train a Hopeld network for enhancement of endocardiac borders [41]. Some enhancement approaches utilise other types of ANNs. Shih et al. [39] applied an ART network for binary image enhancement. Moh and Shih [36] describe a general approach for implementation of morphological image operations by a modied feed-forward ANN using shunting mechanisms, i.e., neurons acting as switches. Waxman et al. [42] consider the application of a centre-surround shunting feed-forward ANN (proposed by Grossberg) for contrast enhancement and colour night vision. 3.1.4. Applicability of neural networks in preprocessing There seem to be three types of problems in preprocessing (unrelated to the three possible operation types) to which ANNs can be applied:
optimisation of an objective function dened by a approximation of a mathematical transformation used mapping by an ANN trained to perform a certain task,
To solve the rst type of problems, traditional methods for optimisation of some objective function may be replaced by a Hopeld network. For a further
2283
discussion of the suitability of Hopeld networks for solving optimisation problems, see Section 3.6. For the approximation task, regression (feed-forward) ANNs could be applied. Although some applications such ANNs were indeed successful, it would seem that these applications call for more traditional mathematical techniques, because a guaranteed (worst-case) performance is crucial in preprocessing. In several other applications, regression or classication (mapping) networks were trained to perform image restoration or enhancement directly from pixel data. A remarkable nding was that non-adaptive ANNs (e.g., CNNs) were often used for preprocessing. Secondly, when networks were adaptive, their architectures usually di ered much from those of the standard ANNs: prior knowledge about the problem was used to design the networks that were applied for image restoration or enhancement (e.g., by using shunting mechanisms to force a feed-forward ANN to make binary decisions). The interest in non-adaptive ANNs indicates that the fast, parallel operation and the ease with which ANNs can be embedded in hardware may be important criteria when choosing for a neural implementation of a specic preprocessing operation. However, the ability to learn from data is apparently of less importance in preprocessing. While it is relatively easy to construct a linear lter with a certain, desired behaviour, e.g., by specifying its frequency prole, it is much harder to obtain a large enough data set to learn the optimal function as a high-dimensional regression problem. This holds especially when the desired network behaviour is only critical for a small subset of all possible input patterns (e.g., in edge detection). Moreover, it is not at all trivial to choose a suitable error measure for supervised training, as simply minimising the mean squared error might give undesirable results in an image processing setting. An important caveat is that the network parameters are likely to become tuned to one type of image (e.g., a specic sensor, scene setting, scale, etc.), which limits the applicability of the trained ANN. When the underlying conditional probability distributions, p(x|!j ) or p(y|x), change, the classication or regression networklike all statistical modelsneeds to be retrained. 3.2. Data reduction and feature extraction Two of the most important applications of data reduction are image compression and feature extraction. In general, an image compression algorithm, used for storing and transmitting images, contains two steps: encoding and decoding. For both these steps, ANNs have been used. Feature extraction is used for subsequent segmentation or object recognition. The kind of features one wants to extract often correspond to particular geometric
or perceptual characteristics in an image (edges, corners and junctions), or application dependent ones, e.g., facial features. 3.2.1. Image compression applications Two di erent types of image compression approaches can be identied: direct pixel-based encoding=decoding by one ANN [4351] and pixel-based encoding=decoding based on a modular approach [5258]. Di erent types of ANNs have been trained to perform image compression: feed-forward networks [44,49 54,56 58], SOMs [43,46 48], adaptive fuzzy leader clustering (a fuzzy ART-like approach) [55], a learning vector quantier [49,58] and a radial basis function network [50]. For a more extensive overview, see [45]. Auto-associator networks have been applied to image compression where the input signal was obtained from a convolution window [50,56,58]. These networks contain at least one hidden layer, with less units than the input and output layers. The network is then trained to recreate the input data. Its bottle-neck architecture forces the network to project the original data onto a lower dimensional (possibly non-linear) manifold from which the original data should be predicted. Other approaches rely on a SOM, which after training acts as a code book [43,46]. The most advanced approaches are based on specialised compression modules. These approaches either combine di erent ANNs to obtain the best possible image compression rate or they combine more traditional statistical methods with one or more ANNs. Dony and Haykin have developed an approach based on di erent, specialised modules [53]. In their approach, a supervisor ANN can choose which processing module is best suited for the compression task at hand. Wang et al. also present a modular coding approach based on specialised ANNs [57]. ANN approaches have to compete with well-established compression techniques such as JPEG, which should serve as a reference. The major advantage of ANNs is that their parameters are adaptable, which may give better compression rates when trained on specic image material. However, such a specialisation becomes a drawback when novel types of images have to be compressed. For a discussion of how to evaluate image compression algorithms see, e.g., Ref. [52]. 3.2.2. Feature extraction applications Feature extraction can be seen as a special kind of data reduction of which the goal is to nd a subset of informative variables based on image data. Since image data are by nature very high dimensional, feature extraction is often a necessary step for segmentation or object recognition to be successful. Besides lowering the computational cost, feature extraction is also a means for
2284
controlling the so-called curse of dimensionality. 1 When used as input for a subsequent segmentation algorithm, one wants to extract those features that preserve the class separability well [2,3]. There is a wide class of ANNs that can be trained to perform mappings to a lower-dimensional space, for an extensive overview see Ref. [60]. A well-known feature-extraction ANN is Ojas neural implementation of a one-dimensional principal component analysis (PCA) [61], later extended to multiple dimensions [62]. In Ref. [63], Baldi and Hornik proved that training three-layer auto-associator networks corresponds to applying PCA to the input data. Later [64,65], auto-associator networks with ve layers were shown to be able to perform non-linear dimensionality reduction (i.e., nding principal surfaces [66]). It is also possible to use a mixture of linear subspaces to approximate a non-linear subspace (see, e.g., Ref. [67]). Another approach to feature extraction is rst to cluster the high-dimensional data, e.g., by a SOM, and then use the cluster centres as prototypes for the entire cluster. Among the ANNs that have been trained to perform feature extraction [6877], feed-forward ANNs have been used in most of the reviewed applications [70,74,75,77]. SOMs [7173] and Hopeld ANNs [76] have also been trained to perform feature extraction. Most of the ANNs trained for feature extraction obtain pixel data as input. Neural-network feature extraction was performed for
subsequent automatic target recognition in remote
Some feature extraction approaches were designed to cope explicitly with (changes in) orientation of objects. It is important to make a distinction between application of supervised and unsupervised ANNs for feature extraction. For a supervised auto-associator ANN, the information loss implied by the data reduction can be measured directly on the predicted output variables, which is not the case for unsupervised feature extraction by the SOM. Both supervised and unsupervised ANN feature extraction methods have advantages compared to traditional techniques such as PCA. Feed-forward ANNs with several hidden layers can be trained to perform non-linear feature extraction, but lack a formal, statistical basis (see Section 5.3). 3.3. Image segmentation Segmentation is the partitioning of an image into parts that are coherent according to some criterion. When considered as a classication task, the purpose of segmentation is to assign labels to individual pixels or voxels. Some neural-based approaches perform segmentation directly on the pixel data, obtained either from a convolution window (occasionally from more bands as present in, e.g., remote sensing and MR images), or the information is provided to a neural classier in the form of local features. 3.3.1. Image segmentation based on pixel data Many ANN approaches have been presented that segment images directly from pixel or voxel data [78113]. Several di erent types of ANNs have been trained to perform pixel-based segmentation: feed-forward ANNs [90,94,102,105,106], SOMs [78,82,84,87,91,92,94,98, 102,109], Hopeld networks [83,85,96,103,110], probabilistic ANNs [94,112], radial basis function networks [94], CNNs [108], constraint satisfaction ANNs [79] and RAM-networks [104]. A self-organising architecture with fuzziness measures was used in Ref. [86]. Also, biologically inspired neural-network approaches have been proposed: the perception model developed by Grossberg [88,89], which is able to segment images from surfaces and their shading, and the brain-like networks proposed by Opara and Worgotter [99]. Hierarchical segmentation approaches have been designed to combine ANNs on di erent abstraction levels [105,110]. The guiding principles behind hierarchical approaches are specialisation and bottomup processing: one or more ANNs are dedicated to low level feature extraction=segmentation, and their results are combined at a higher abstraction level where another (neural) classier performs the nal image segmentation. Reddick et al. developed a pixel-based two-stage approach where a SOM is trained to segment multispectral MR images [102]. The segments are subsequently classied into white matter, grey matter, etc., by a feed-forward
sensing (accounting for orientation) [72] and character recognition [75,76]; subsequent segmentation of food images [74] and of magnetic resonance (MR) images [71]; nding the orientation of objects (coping with rotation) [49,70]; nding control points of deformable models [77]; clustering low-level features found by the Gabor lters in face recognition and wood defect detection [73]; subsequent stereo matching [69]; clustering the local content of an image before it is encoded [68].
In most applications, the extracted features were used for segmentation, image matching or object recognition. For (anisotropic) objects occurring at the same scale, rotation causes the largest amount of intra-class variation.
1 The curse of dimensionality is a property of a classication or regression problem. It expresses that a higher dimensionality of the feature space leads to an increased number of parameters, which need to be estimated. The risk of overtting the model will increase with the number of parameters, which will often lead to peaking (i.e., the best generalisation performance is obtained when using a subset of the available features) [59].
2285
ANN. Non-hierarchical, modular approaches have also been developed [78,105,107]. In general, pixel-based (often supervised) ANNs have been trained to classify the image content based on
texture [78,82,87,94,100,101,104,107,113]; a combination of texture and local shape [81,90,95,
A segmentation task that is most frequently performed by feature-based ANNs is texture segregation, which is typically based on
co-occurrence matrices [118,119,128]; wavelet features [123]; multiresolution features extracted from the Gabor spatial derivatives computed in the linear scale-space
105,112].
ANNs have also been developed for pre- and postprocessing steps in relation to segmentation, e.g., for delineation of contours [80,108]; connecting edge pixels [103]; identication of surfaces [88,89]; deciding whether a pixel occurs inside or outside a segment [110]; defuzzifying the segmented image [86];
and for
clustering of pixels [98,109]; motion segmentation [97].
In most applications, ANNs were trained as supervised classiers to perform the desired segmentation. One feature that most pixel-based segmentation approaches lack is a structured way of coping with variations in rotation and scale. This shortcoming may deteriorate the segmentation result. 3.3.2. Image segmentation based on features Several feature-based approaches apply ANNs for segmentation of images [32,71,92,114 129]. Di erent types of ANNs have been trained to perform feature-based image segmentation: feed-forward ANNs [71,114,118, 119,125], recursive networks [127], SOMs [71,92,119 121,129], variants of radial basis function networks [117] and CNNs [116], Hopeld ANNs [126], principal component networks [129] and a dynamic ANN [32]. Hierarchical network architectures have been developed for optical character recognition [122] and for segmentation of range images [92]. Feature-based ANNs have been trained to segment images based on the di erences in
texture [119,122124,126 128]; a combination of texture and local shape [118,121,125].
The Gabor and wavelet-based features, and features extracted from the linear scale-space provide information at several scales to the classier, which, however, needs to cope explicitly with variations in scale. As with respect to orientation, the Gabor and wavelet-based approaches are, in general, sensitive to horizontal, vertical and diagonal features. These three directions can be combined into a local orientation measure such that rotation invariance is obtained. The scale-space features can be reduced to a few invariants that are indeed rotation invariant [130]. The generalised co-occurrence matrices cope with variations in orientation by averaging over four orthogonal orientations. Scale can also be taken into account by varying the distance parameter used to compute the co-occurrence matrix. 3.3.3. Open issues in segmentation by ANNs Three central problems in image segmentation by ANNs are: how to incorporate context information, the inclusion of (global) prior knowledge, and the evaluation of segmentation approaches. In the approaches we reviewed, context information was obtained from, e.g., multiscale wavelet features or from features derived from the linear scale space (computed at a coarse scale). How context information can best be incorporated, is an interesting issue for further research. The general problem of how to include a priori knowledge in a segmentation approach is considered in Section 5.2. A caveat is how to obtain a gold standard for the (in most cases supervised) segmentation algorithms. In general, the true class membership of the pixels=voxels in the training set is known with varying degrees of condence. In Ref. [119], this problem is addressed by letting an expert demarcate the inner parts of areas with a similar (coherent) texture but leaving the transition areas unclassied. Certainly, intra- and inter-observer variability needs to be assessed thoroughly (e.g., by the kappa statistic [131]) before suitable training and test images can be compiled. Even when a reliable gold standard is available, objective performance assessment entails more than simply computing error rates on novel test images. There is not
2286
yet a single measure capable of unequivocally quantifying segmentation quality. Besides statistical performance aspects such as coverage, bias and dispersion [131], desirable properties such as within-region homogeneity and between-region heterogeneity [132] are also important (for an overview of segmentation quality measures see Ref. [133]). 3.4. Object recognition Object recognition consists of locating the positions and possibly orientations and scales of instances of objects in an image. The purpose may also be to assign a class label to a detected object. Our survey of the literature on object recognition using ANNs indicates that in most applications, ANNs have been trained to locate individual objects based directly on pixel data. Another less frequently used approach is to map the contents of a window onto a feature space that is provided as input to a neural classier. 3.4.1. Object recognition based on pixel data Among the ANN approaches developed for pixel-based object recognition [39,42,67,70,72,134 179], several types of ANNs can be distinguished: feed-forward-like ANNs [70,147149,152,164,165,171,172], variants using weight sharing [144,159,160], recurrent networks [179], the ART networks introduced by Grossberg [39,139], mixtures-of-experts [173], (evolutionary) fuzzy ANNs [155], bi-directional auto-associative memories [157], the Neocognitron introduced by Fukushima [150,162] and variants hereof [137,161], piecewise-linear neural classiers [168], higher-order ANNs [169,170] and Hopeld ANNs [135,175,176]. Besides, interesting hardware ANNs have been built for object recognition: the RAM network [143,145] and optical implementations [154,167]. Finally, SOMs are occasionally used for feature extraction from pixel data [171,177]; the output of the map is then propagated to a (neural) classier. Several novel network architectures have been developed specically to cope with concomitant object variations in position, (in-plane or out-of-plane) rotation and scale (in one case, an approach has been developed that is invariant to changes in illumination [167]). It is clear that a distinction needs to be made between invariant recognition in 2D (projection or perspective) images and in 3D volume images. An interesting approach that performs object recognition, which is invariant to 2D translations, in-plane rotation and scale, is the neurally inspired what-and-where lter [139]. It combines a multiscale oriented lter bank (what) with an invariant matching module (where). Other approaches rely on learning the variations explicitly by training [141,147,148,164]. Egmont-Petersen and Arts built a statistical intensity model of the object that should be detected [147,148]. The convolution ANN
was trained using synthetic images of the (modelled) object with randomly chosen orientations. Penedo et al. developed a two-stage ANN approach for recognition of nodules in chest radiographs [164]. These ANNs were trained partly with synthetic subimages of nodules. Others have developed approaches that are invariant to both translation and 2D rotation [144,178], or systems that through their architectures perform processing in a translation-invariant way and=or at di erent scales (e.g., the Neocognitron [150] and the shared weight networks [158,159]). Fukumi et al. developed a hierarchical approach for rotation-invariant object recognition [70]. This approach, like its predecessor [149], maps the image to a polar space in which rotation-invariant recognition takes place. Clearly, when object recognition is performed by teaching a classier to recognise the whole object from a spatial pattern of pixel intensities, the complexity of the classier grows exponentially with the size of the object and with the number of dimensions (2D versus 3D). An interesting approach that circumvents this problem is iterative search through the image for the object centre [143]. The output of the ANN is the estimated displacement vector to the object centre. Depending on the contents of the scene, even context information may be required before the objects of interest can be recognised with condence. The incorporation of context information may again lead to a large number of extra parameters and thereby a more complex classier. To cope with this problem the so-called multiresolution approaches have been developed [171,175,176], which combine the intensities from pixels located on di erent levels of a pyramid [180] but centred around the same location. This provides the classier with context information, but a combinatorial explosion in the number of parameters is circumvented. Still, variations in scale have to be learned explicitly by the classier. A disadvantage of ANN pyramid approaches is that they sample the scale space coarsely as the resolution is reduced with a factor two at each level in the pyramid (in, e.g., the linear scale space [181], scale is a continuous variable). A special type of ANN that incorporates the scale information directly in a pyramidal form is the so-called higher-order ANN [169,170]. This network builds up an internal scale-space-like representation by what is called coarse coding. However, higher-order ANNs need to learn variations in scale explicitly too. They should be used with caution because the coarse coding scheme may lead to aliasing, as the high-resolution images are not blurred before computing the coarser image at the next level. Rare conditions such as object occlusion or the occurrence of multiple objects within the (sub)image that is processed by the classier have hardly been considered explicitly. An experimental architecture developed by McQuiod is capable of recognising multiple objects
2287
simultaneously within an image [161] (see also Section 3.6). Recurrent ANNs (with feed-back loops [182]) can be used to develop special approaches for object recognition [179]. The added value of a recurrent network architecture lies in its memory: the current state contains information about the past, which may constitute valuable context information. The recurrent network developed by Ziemke [179] performs a convolution with an image in order to detect oil spills. The recurrence principle introduces averaging, which can give a more robust performance. Several of the approaches for object detection and classication operate on binary images [137,139,143145]. Although binarisation simplies the recognition problem considerably, it generally decreases the recognition performance of an ANN. 3.4.2. Object recognition based on features Several neural-network approaches have been developed for feature-based object recognition [152,164,171, 177,183209] including: feed-forward ANNs [152,171, 177,184 187,190,193197,200,205,207209], Hopeld ANNs [201], a fuzzy ANN [186] and RAM ANNs [192,202]. SOMs are occasionally used to perform feature extraction prior to object recognition [177,197], although SOMs have also been trained to perform object classication [206]. The smaller variety of neural architectures developed for feature-based object recognition compared to the pixel-based approaches discussed in the previous section, re ects the fact that most e ort is focused on developing and choosing the best features for the recognition task. Common for many feature-based approaches is that variations in rotation and scale are coped with by the features, e.g., statistical moments. A certain amount of noise will in uence the computed features and deteriorate the recognition performance [203]. So the major task of the subsequent classier is to lter out noise and distortions propagated by the features. Moreover, when the object to be detected is large and needs to be sampled densely, feature extraction is inevitable. Otherwise, a neural classier will contain so many parameters that a good generalisation will be impeded. In general, the types of features that are used for object recognition di er from the features used by the neural-based segmentation approaches already reviewed. For object recognition, the features typically capture local geometric properties:
points with a high curvature on the detected object (Gabor) lter banks [201,207] including wavelets dedicated features: stellate features [194] and OCR
projection of the (sub)image onto the x- and y-axis principal components obtained from the image (distances to) feature space trajectories [210], which
[184];
describe objects in all rotations, translations or scales [187]; The Fourier descriptors derived from the image [209]; The Zernike moments [195] and the moments of Hu [203]. The Fourier descriptors, the Zernike moments and the moments of Hu are invariant to changes in object position, orientation and scale [203,211]. For a discussion of moments and invariance to grey-level transformations, see Ref. [211]. Multiresolution approaches have also been developed for object recognition based on features from
the linear scale-space [185,193]; the Gauss pyramid [171]; the Laplace pyramid [205].
Also, the positions of detected edges (input level C) may serve as features for a classier [171]. Finally, a set of features has been developed that is invariant to changes in colour [192]. Which set of features is best suited for a particular recognition task, depends on the variations among the objects (and of the background) with respect to position, (in-plane) orientation and scale. Knowledge of the degrees of freedom the approach has to cope with is needed for choosing a suited set of features (feature selection is discussed in Section 5.1). 3.4.3. Using pixels or features as input? Most ANNs that have been trained to perform image segmentation or object recognition obtain as input either pixel=voxel data (input level A) or a vector consisting of local, derived features (input level B). For pixel- and voxel-based approaches, all information (within a window) is provided directly to the classier. The perfect (minimal error-rate) classier should, when based directly on pixel data, be able to produce the best result if the size of the window is comparable to that of the texture elements (texels) or the window encompasses the object and the (discriminative) surrounding background. When, on the other hand, the input to the classier consists of a feature vector, the image content is always compressed. Whether su cient discriminative information is retained in the feature vector, can only be resolved experimentally. Two-dimensional image modalities such as radiography, 2D ultrasound and remote sensing often exhibit concomitant variations in rotation and scale. If such invariances are not built into a pixel-based ANN, careful
features [190];
2288
calibration (estimation of the physical size of a pixel) and subsequent rescaling of the image to a standard resolution are required steps to ensure a condent result. When only rotations occur, features obtained from a polar mapping of the window may ensure a good segmentation or detection result [70,149]. In many applications, however, calibration is unfeasible and 2D=3D rotation and scale invariance needs to be incorporated into the ANN. For pixel-based approaches, invariance can be either built directly into the neural classier (e.g., using weight sharing [159] or by taking symmetries into account [212]), or the classier has to be trained explicitly to cope with the variation by including training images in all relevant orientations and scales. A major disadvantage of these approaches is that object variations in rotation and scale have to be learned explicitly by the classier (translation can usually be coped with by convolution). This again calls for a very large, complete training set and a classier that can generalise well. Model-based approaches have been presented that can generate such a complete training set [147,148,164,185], see the discussion above. How to design robust pixel-based algorithms for segmentation and object recognition that can cope with the three basic a ne transforms, is a challenging subject for future research. In situations where many concomitant degrees of freedom occur (2D or 3D rotation, scale, a ne greylevel transformations, changes in colour, etc.), only feature-based approaches may guarantee that the required invariance is fully obtained. It is clear that when variations in orientation and scale occur and reliable calibration is unfeasible, an ANN based on invariant features should be preferred above a pixel-based approach. Another advantage of feature-based approaches is that variations in rotation and scale may remain unnoticed by the user, who may then end up with a poor result. When there is no limited set of images on which an algorithm has to work (e.g., image database retrieval), the more exible pixel-based methods can prove useful. The recommendation to prefer feature-based over pixel=voxel-based image processing (when signicant variations in rotation and scale actually occur in the image material), puts emphasis on the art of developing and choosing features which, in concert, contain much discriminative power in relation to the particular image processing task. Prior knowledge regarding the image processing task (e.g., invariance) should guide the development and selection of discriminative features. Feature-based classiers will, in general, be easier to train when the chosen features cope adequately with the degrees of freedom intrinsic to the image material at hand. The removal of super uous features is often necessary to avoid the peaking phenomenon [59] and guarantee a good generalisation ability of the classier. This issue, which is a general problem in statistical pattern recognition, is discussed in Section 5.1.
3.5. Image understanding Image understanding is a complicated area in image processing. It couples techniques from segmentation or object recognition with knowledge of the expected image content. In two applications, ANNs were used in combination with background knowledge to classify objects such as chromosomes from extracted structures (input level C) [213] and to classify ships, which were recognised from pixel data (input level A) by an advanced modular approach [214]. In another application, ANNs were used to analyse camera images for robot control from local features (input level B) [215]. Neural (decision) trees [216], semantic models based on extracted structures (input level C) [217] or neural belief networks [218] can be used to represent knowledge about the expected image content. This knowledge is then used to restrict the number of possible interpretations of single objects as well as to recognise di erent congurations of image objects. Especially, the approaches by Reinus et al. [217] and Stassopoulou et al. [218] perform genuine image interpretation. Reinus trains an ANN to diagnose bone tumours. The recognition approach of Stassopoulou et al. predicts the degree of desertication of an area from a set of detected objects=segments, such as rocks, eroded areas, etc., in remote sensing images (input level E). A major problem when applying ANNs for high level image understanding is their black-box character. It is virtually impossible to explain why a particular image interpretation is the most likely one. As a remedy, Stassopoulou et al. mapped the trained ANN onto a Bayesian belief network after training had been performed. An alternative approach to coping with the black-box problem is to use the generic explanation facility developed for ANNs [219] or to use rule extraction [220]. Another problem in image understanding relates to the level of the input data. When, e.g., seldom occurring features (input level C) or object positions (input level E) are provided as input to a neural classier, a large number of images are required to establish statistically representative training and test sets. We feel that image understanding is the most dubious application of ANNs in the image processing chain. 3.6. Optimisation Some image processing (sub)tasks such as graphand stereo-matching can best be formulated as optimisation problems, which may be solved by Hopeld ANNs [11,76,103,221230]. In some applications, the Hopeld network obtained pixel-based input (input level A) [11,76,103,226,230], in other applications the input consisted of local features (input level B) [224,228] or detected structures (typically edges, input level C) [222,223,225,227,229].
2289
list a number of application areas in which ANN-based systems are often encountered:
industrial inspection: quality and process control, e.g.,
by connecting edge pixels [103,226] (input level A); thresholding images by relaxation [230] (input level A); two-dimensional [76,227,229] and three-dimensional object recognition by (partial) graph matching [222,228] (input level C); establishing correspondence between stereo images based on features (landmarks) [224] and stereo correspondence between line cameras from detected edges [225]; approximation of a polygon from detected edge points [223]; controlling Voronoi pyramids [221].
the detection of defect objects in the production of steel, textiles, fruit, vegetables, plants or other food products; document processing: computerised reading of machine-generated and hand-written text used for, e.g., automatic processing of forms and mail sorting; identication and authentication: e.g., license plate recognition, ngerprint analysis and face detection=verication [236]; medical diagnosis: e.g., screening for cervical cancer [237] or breast tumours [238,239]; defence: various navigation and guidance systems, target recognition systems, etc. [240,241].
Hopeld ANNs have mainly been applied to segmentation and recognition tasks that are too di cult to realise with conventional neural classiers because the solutions entail partial graph matching or recognition of three-dimensional objects. Matching and recognition are both solved by letting the network converge to a stable state while minimising the energy function. It was also shown that iterating the Hopeld network can be interpreted as a form of probabilistic relaxation [231]. In most of the applications reviewed, casting the actual problem to the architecture of the Hopeld network turned out to be di cult. Occasionally, the original problem had to be modied before it could be solved by the Hopeld architecture. Also, convergence to a global optimum cannot be guaranteed. Finally, for Hopeld networks training and use both require complex computation, but this also holds for other more traditional algorithms for non-linear programming [232]. It should be kept in mind that some (constrained) non-linear programming problems can be solved optimally by traditional algorithmic approaches. The Hopeld network is really only an interesting approach for problems that lie beyond this subclass of solvable optimisation problems. 4. Real-world applications of neural networks This review has concentrated on applications of ANNs to image processing problems, which were reported in the scientic literature. However, as the eld matured, ANNs have gradually found their way into a large range of (commercial) applications. Unfortunately, commercial and other considerations often impede publication of scientic and technical aspects of such systems. In some research programmes, an overview of commercial applications of ANNs has been given, e.g., the SIENA project (ESPRIT project 9811) [233], the NeuroNet project [234] and the British NCTT project [235]. The project web sites
More information on the aforementioned applications can be found via the internet [233235]. 5. Discussion Two major advantages of ANNs is that they are applicable to a wide variety of problems and are relatively easy to use. There are, however, still caveats and fundamental problems that need to be investigated in the future. Some of these issues are general in the sense that they are not resolved by other, competing techniques from the pattern recognition eld. Other problems are caused by the strive to solve an image processing problem by means of a statistical, data-oriented technique. Finally, some problems are fundamental to the way ANNs approach pattern recognition problems. 5.1. Issues in pattern recognition When trying to solve a recognition problem, one may be faced with several problems that are fundamental to applied statistical pattern recognition: avoiding the curse of dimensionality, selecting the best features and achieving a good transferability. The rst problem, the curse of dimensionality, occurs when too many input variables are provided to a classier or regression function. The risk of ending up with a classier or regressor that generalises poorly on novel data, increases with the number of dimensions of the input space. The problem is caused by the inability of existing classiers to cope adequately with a large number of (possibly irrelevant) parameters, a deciency that makes feature extraction and=or feature selection necessary steps in classier development. Feature extraction has been discussed in detail in Section 3.2.2. Feature selection is by virtue of its dependence on a trained classier, an ill-posed problem [242244]. Besides o ering a way to control the curse of dimensionality, feature
2290
selection also provides insight into the properties of a classier and the underlying classication problem [242]. A problem that is especially important in applications such as medical image processing, is how to ensure the transferability of a classier. When trained to classify patterns obtained from one setting with a specic class distribution, P(!j ), a classier will have a poorer and possibly unacceptably low performance when transferred to a novel setting with another class distribution P (!j ). How to cope with varying prior class distributions, is a subject for future research. Another problem related to transferability is how to account for changing underlying feature distributions, p(x|!j ) or p(y|x). In general, the parameters of the classier or regression function need to be reestimated from a data set that is representative for the novel distribution. This problem is intrinsic to all statistical models as they are based on inductive inference. Note that for a classier that has been trained, e.g., to recognise objects appearing at a certain scale directly from pixel data, recognition of similar objects at a di erent scale is equivalent to classifying patterns from a novel distribution p (x|!j ). Classiers or regression models that have not been retrained, should catch patterns occurring outside the space spanned by the training cases and leave these patterns unprocessed, thereby avoiding the assignment of wild-guess class labels (see, e.g., Ref. [245]) or unreliable prediction of the conditional mean (in regression). Moreover, the question of how to incorporate costs of di erent misclassications (again, an important topic in, e.g., medical image processing) or the computational costs of features [246], is not yet fully answered. 5.2. Obstacles for pattern recognition in image processing Besides fundamental problems within the eld of pattern recognition, other problems arise because statistical techniques are used on image data. First, most pixel-based techniques consider each pixel as a separate random variable. A related problem is how one should incorporate prior knowledge into pattern recognition techniques. Also, the evaluation of image processing approaches is not always straightforward. A challenging problem in the application of pattern recognition techniques on images is how to incorporate context information and prior knowledge about the expected image content. This can be knowledge about the typical shape of objects one wants to detect, knowledge of the spatial arrangement of textures or objects, or prior knowledge of a good approximate solution to an optimisation problem. According to Perlovsky [247], the key to restraining the highly exible learning algorithms for ANNs, lies in the very combination with prior (geometric) knowledge. However, most pattern recognition methods do not even use the prior information that neighbouring pixel=voxel values are highly correlated.
This problem can be circumvented by extracting features from images rst, by using distance or error measures on pixel data which do take spatial coherency into account (e.g., Refs. [67,248]), or by designing an ANN with spatial coherency (e.g., Ref. [159]) or contextual relations between objects (e.g., Ref. [249]) in mind. Context information can also be obtained from the pyramid and scale space approaches discussed in Section 3.4.1. In the reviewed applications, prior knowledge was mainly used to identify local features (input level B) that were used as input to neural classiers. Fuzzy ANNs may play a special role because they can be initialised with (fuzzy) rules elicited from domain experts. Using prior knowledge to constrain the highly parameterised (neural) classiers, is a scientic challenge. There is a clear need for a thorough validation of the developed image processing algorithms. In the reviewed literature, validation on a large set of test images had only occasionally been performed. Validation and comparison of di erent algorithms are only possible when a reliable gold standard exists and meaningful (objective) quality measures are available. For, e.g., object recognition, a gold standard is in most cases easy to obtain. In other applications, di erent (human) observers may not fully agree about the gold standard (e.g., segmentation of medical images). Even with a reliable gold standard being available, it is clear that performance assessment entails much more than simply computing error rates on novel test images. Finally, in image processing, classication and regression problems quickly involve a very large number of input dimensions, especially when the algorithms are applied directly to pixel data. This is problematic, due to the curse of dimensionality already discussed. However, the most interesting future applications promise to deliver even more input. Whereas, in almost all reviewed articles, ANNs were applied to two-dimensional images, e.g., (confocal) microscopy and CT=MR (medical) imaging are three-dimensional modalities. One way to cope with this increased dimensionality is by feature-based pattern recognition, another way would be to develop an architecture that inherently downsamples the original image. As already mentioned, the search for the optimal set of features that in concert gives the best class separability is a never-ending quest. To avoid such a quest for all kinds of features that capture certain specic aspects in a (sub)image, a general mapping (invariant to changes in position, rotation and scale) of a (sub)image to a manifold subspace should be developed. This will change the focus from selection of individual features to optimisation of the sampling density in the invariant space. 5.3. Neural network issues A number of unresolved problems exist in the eld of ANNs. We will in turn consider the lack of
2291
a profound theoretical basis for ANNs, the problem of choosing the best architecture and the black-box problem. Several theoretical results regarding the approximation capabilities of ANNs have been proven. Although feed-forward ANNs with two hidden layers can approximate any (even discontinuous) function to an arbitrary precision, theoretical results on, e.g., the rate of convergence are lacking. For other (non)parametric classiers, the relation between the size of the training set and the expected error rate has been studied theoretically. One obstacle in developing a more profound statistical foundation for trained ANNs is that convergence to the global minimum of the risk function (squared error) cannot be guaranteed. Furthermore, there is always a danger of overtraining an ANN as minimising the error measure on a training set does not imply nding a well-generalising ANN. Nevertheless, the large body of work on application of ANNs presented in the last decade provides (novice) users with many rules of thumb on how to set the various parameters. Also, methods such as regularisation, early stopping and ensemble training=bagging can help in avoiding the problem of overtraining. Another problem is how to choose the best ANN architecture. Although there is some work on model selection [250], no general guidelines exist that guarantee the best trade-o between bias and variance of the classier for a particular size of the training set. Training unconstrained networks using standard performance measures such as the mean squared error might even give very unsatisfying results. This, we assume, is the reason why in a number of applications, networks were not adaptive at all (e.g., CNNs) or heavily constrained by their architecture (e.g., the Neocognitron and shared weight networks). Note that this does not automatically imply that unconstrained ANNs should not be applied to image processing. It does indicate that as much prior knowledge as possible should be used in both ANN design and training. ANNs su er from what is known as the black-box problem: given any input a corresponding output is produced, but it cannot be elucidated why this decision was reached, how reliable it is, etc. In image understanding, this is certainly problematic, so the use of ANNs in such applications will remain limited. Some fuzzy neural architectures facilitate extraction of fuzzy rules after training. We expect that fuzzy ANNs will be more applicable in image understanding. In some applications, e.g., process monitoring, electronic surveillance, biometrics, etc., a condence measure is highly necessary to prevent costly false alarms. In such areas, it might even be preferable to use other, less well-performing methods that do give statistically profound condence intervals.
6. Conclusion and future perspectives We have structured our survey according to the six steps in the image processing chain. ANNs have been trained to perform these six tasks with various degrees of success:
Image preprocessing is a popular application area.
Several (regression) ANNs were developed for image reconstruction, image restoration and image enhancement. Often, these networks were not (or only partially) adaptive. A general conclusion is that neural solutions are truly interesting when existing algorithms fail or when ANNs may reduce the amount of computation considerably. The largest risk in preprocessing is that training results in ANNs being tuned to specic image material. Image compression is an interesting application of ANNs. A caveat again is tuning to particular images. As there is no unique way of evaluating compression algorithms, approaches should be compared with competing compression algorithms on novel test images. Feature extraction is a useful application of, especially, the SOM. Also, the possibility of non-linear feature extraction by feed-forward ANNs with several hidden layers o ers additional functionality. Image segmentation and object detection have largely been performed by pixel-based or feature-based (low level) approaches. Pixel-based approaches provide the classier with all relevant information, but usually result in high-dimensional input spaces. A feature-based approach, however, essentially compresses the information obtained from a local neighbourhood into a vector of salient features. On the one hand, it cannot be guaranteed that the chosen features comprise most of the discriminative information. On the other hand, a feature-based approach may be the only way to guarantee rotation and scale invariance. A possible remedy is to develop novel pixel-based classication approaches in which neighbouring pixels are no longer regarded as completely separate variables. For object recognition, problems like object occlusion and multiple occurrences of objects remain unresolved. Image understanding is a dubious application of ANNs because of their black-box character and the need for a large number of images as training and test sets. As long as there is no accepted facility for explaining why a particular class label has been assigned to a pattern, black-box classiers will not be widely applied in image understanding. Neural-fuzzy architectures [251] and probabilistic networks [218] may lend themselves better for image understanding because of their transparent character and the possibility of initialisation by a priori rules or distributions. Optimisation problems have in most cases been approached by solutions based on Hopeld ANNs.
2292
Table 2 The di erent types of neural networks applied to the various tasks in the image processing chaina Segmentation, feature-based (3.3.2) Object recognition, feature-based (3.4.2) Image understanding (3.5)
Image reconstruction
Neural network type Feed-forward, regression Feed-forward, auto-association Feed-forward, classication Feed-forward, shared weights Feed-forward, recursive Perceptron Radial basis function network (RBF) Self-organising feature map (SOM) (Fuzzy) Learning vector quantization (LVQ) Hopeld Cellular (CNN) Generalized adaptive neural lters (GANF) Adaptive resonance theory (ART) Associative memories (and RAM) ADALINE Neocognitron Probabilistic Neural decision tree Neural belief network Higher order network Counterpropagation network Fuzzy neural= Neuro-fuzzy system Other
a The
numbers in the top row refer to the sections where the image processing task is being reviewed.
Nevertheless, several issues remain problematic such as casting the problem at hand to the Hopeld architecture and bypassing the high dependency of the initial conguration. Hopeld networks become an
interesting alternative to conventional optimisation techniques when the latter fail in solving the problem, either because of its non-linear character or because of the computational complexity.
Optimisation (3.6)
(3.1.1)
2293
An overview of ANN architectures used for di erent image processing tasks is given in Table 2. It shows that feed-forward ANNs, SOMs and Hopeld ANNs are the most frequently applied architectures, although many, more exotic designs have been applied to image processing problems as well. 6.1. Outlook This article has to a large extent been an overview of what can now perhaps be called the neural network hype in image processing: the approximately 15-year period following the exciting publications of Kohonen [252], Hopeld [253] and Rumelhart et al. [4]. Their work led many researchers to develop and apply various methods, which were originally inspired by the structure of the human brain. In some cases, the emphasis was on biological plausibility. Other applications focused on the possibility of parallel implementation. In most applications, however, the adaptive capabilities of feed-forward ANNs were used to build a classier. We believe that the last few years have seen a change in attitude towards ANNs, so that now ANNs are not anymore automatically seen as the best solution to any classication or regression problem. The eld of ANNs has to a large extent been reincorporated in the various disciplines that inspired it: pattern recognition, psychology and neurophysiology. ANNs are interesting as tools when there is a real need for an adaptive approach or a fast, parallel solution, but one should remain open to new interesting developments, such as the recently proposed support vector machines [254]. So what are the challenges left for ANNs in image processing? As we have discussed before, the main problems in many image processing applications still are the abundance of features and the di culty of coping with concomitant variations in position, orientation and scale. This clearly indicates the need for more intelligent, invariant feature extraction and feature selection mechanisms. Prior knowledge, e.g., about the aforementioned invariances or the expected image content, should play a large role in this, but could also be incorporated into the network architecture itself. A true challenge is to use ANNs as building blocks in large, adaptive systems consisting of collaborating modules. Such an adaptive system should be able to control each module and propagate feedback from the highest level (e.g., object detection) to the lowest level (e.g., preprocessing). Another interesting possibility for ANNs is what might be called on-the-job training, which makes possible the use of ANNs in changing environments. In many application areas, this would be a valuable improvement over current systems and facilitate transferability between di erent sites. The conclusion must be that ANNs can play a role in image processing, although it might be a role as a sup-
porting tool rather than a major one. ANNs are useful in image processing as either non-parametric classiers, non-linear regression functions, or for (un)supervised feature extraction. If, or when, the problems of ANN application outlined in this paper are gradually solved, this role may become increasingly larger. Acknowledgements This work was partly supported by the Dutch Cancer Foundation (KWF) on grant RUL-97-1509, the Foundation for Computer Science Research in the Netherlands (SION) and the Dutch Organisation for Scientic Research (NWO). We are highly grateful to R.P.W. Duin, J. Kittler and L.J. van Vliet for commenting on an earlier, draft version of this manuscript. We also thank an anonymous referee for useful remarks and suggestions. References
[1] J. Corneld, Statistical classication methods, Proceedings of the Second Conference on the Diagnostic Process, Computer Diagnosis and Diagnostic Methods, Chicago, 1972, pp. 108130. [2] P.A. Devijver, J. Kittler, Pattern Recognition: A Statistical Approach, Englewood Cli s, London, 1982. [3] K. Fukunaga, Introduction to Statistical Pattern Recognition, 2nd Edition, Academic Press, New York, 1990. [4] D.E. Rumelhart, G.E. Hinton, R.J. Williams, Learning internal representations by error propagation, in: D.E. Rumelhart, J.L. McClelland (Eds.), Parallel Distributed Processing: Explorations in the microstructure of Cognition, Vol. I, MIT Press, Cambridge, 1986, pp. 319362. [5] N.R. Pal, S.K. Pal, A review on image segmentation techniques, Pattern Recognition 26 (9) (1993) 1277 1294. [6] W.R.M. Dassen, M. Egmont-Petersen, R.G.A. Mulleneers, Articial neural networks in cardiology; a review, in: P.E. Vardas (Ed.), Cardiac Arrhythmias, Pacing & Electrophysiology, Kluwer Academic Publishers, London, 1998, pp. 205211. [7] C.M. Bishop, Neural Networks for Pattern Recognition, Oxford University Press, Oxford, 1995. [8] A. Adler, R. Guardo, A neural network image reconstruction technique for electrical impedance tomography, IEEE Trans. Med. Imaging 13 (4) (1994) 594600. [9] V. Srinivasan, Y.K. Han, S.H. Ong, Image reconstruction by a Hopeld neural network, Image Vision Comput. 11 (5) (1993) 278282. [10] R.R. Meyer, E. Heindl, Reconstruction of o -axis electron holograms using a neural net, J. Microsc. 191 (1) (1998) 5259. [11] Y.M. Wang, F.M. Wahl, Vector-entropy optimizationbased neural-network approach to image reconstruction
2294
M. Egmont-Petersen et al. / Pattern Recognition 35 (2002) 22792301 from projections, IEEE Trans. Neural Networks 8 (5) (1997) 10081014. N. Ansari, Z.Z. Zhang, Generalised adaptive neural lters, IEE Electron. Lett. 29 (4) (1993) 342343. L. Bedini, A. Tonazzini, Image restoration preserving discontinuities: the Bayesian approach and neural networks, Image Vision Comput. 10 (2) (1992) 108118. W. Chua, L. Yang, Cellular networks: theory, IEEE Trans. Circuits Systems 35 (10) (1988) 12571272. W. Chua, L. Yang, Cellular networks: applications, IEEE Trans. Circuits Systems 35 (10) (1988) 12731290. D. de Ridder, R.P.W. Duin, P.W. Verbeek et al., The applicability of neural networks to non-linear image processing, Pattern Anal. Appl. 2 (2) (1999) 111128. M.A.T. Figueiredo, J.M.N. Leitao, Sequential and parallel image restoration: neural network implementations, IEEE Trans. Image Process. 3 (6) (1994) 789801. D. Greenhil, E.R. Davies, Relative e ectiveness of neural networks for image noise suppression, Proceedings of the Pattern Recognition in Practice IV, Vlieland, 1994, pp. 367378. L. Guan, J.A. Anderson, J.P. Sutton, A network of networks processing model for image regularization, IEEE Trans. Neural Networks 8 (1) (1997) 169174. H. Hanek, N. Ansari, Speeding up the generalized adaptive neural lters, IEEE Trans. Image Process. 5 (5) (1996) 705712. C.C. Lee, J.P. Degyvez, Color image processing in a cellular neural-network environment, IEEE Trans. Neural Networks 7 (5) (1996) 10861098. T. Matsumoto, H. Kobayashi, Y. Togawa, Spatial versus temporal stability issues in image processing neuro chips, IEEE Trans. Neural Networks 3 (4) (1992) 540569. J.A. Nossek, T. Roska, Special issue on Cellular Neural Networks, IEEE Trans. Circuits Systems 40 (3) (1993). J.K. Paik, A.K. Katsaggelos, Image restoration using a modied Hopeld network, IEEE Trans. Image Process. 1 (1) (1992) 4963. V.V. Phoha, W.J.B. Oldham, Image recovery and segmentation using competitive learning in a layered network, IEEE Trans. on Neural Networks 7 (4) (1996) 843856. W. Qian, M. Kallergi, L.P. Clarke, Order statistic-neural network hybrid lters for gamma-camera-bremsstrahlung image restoration, IEEE Trans. Med. Imaging 12 (1) (1993) 5864. F. Russo, Hybrid neuro-fuzzy lter for impulse noise removal, Pattern Recognition 32 (11) (1999) 18431855. F. Russo, Image ltering using evolutionary neural fuzzy systems, in: S.K. Pal, A. Ghosh, M.K. Kundu (Eds.), Soft computing for image processing, Physica-Verlag, Heidelberg, 2000, pp. 2343. Y.L. Sun, S. Yu, Improvement on performance of modied Hopeld neural network for image restoration, IEEE Trans. Image Process. 4 (5) (1995) 683692. M. Zamparelli, Genetically trained cellular neural networks, Neural Networks 10 (6) (1997) 11431151. Z.Z. Zhang, N. Ansari, Structure and properties of generalized adaptive neural lters for signal enhancement, IEEE Trans. Neural Networks 7 (4) (1996) 857868. [32] V. Chandrasekaran, M. Palaniswami, T.M. Caelli, Range image segmentation by dynamic neural network architecture, Pattern Recognition 29 (2) (1996) 315329. [33] J. Chey, S. Grossberg, E. Mingolla, Neural dynamics of motion groupingfrom aperture ambiguity to object speed and direction [review], J. Opt. Soc. Am. A-Optics Image Sci. 14 (10) (1997) 25702594. [34] S.M. Courtney, L.H. Finkel, G. Buchsbaum, A multistage neural network for color constancy and color induction, IEEE Trans. Neural Networks 6 (4) (1995) 972985. [35] S. Lu, A. Szeto, Hierarchical articial neural networks for edge enhancement, Pattern Recognition 26 (8) (1993) 11491163. [36] J. Moh, F.Y. Shih, A general purpose model for image operations based on multilayer perceptrons, Pattern Recognition 28 (7) (1995) 10831090. [37] D.T. Pham, E.J. Bayro-Corrochano, Neural computing for noise ltering, edge detection and signature extraction, J. Systems Eng. 2 (2) (1992) 111222. [38] R.H. Pugmire, R.M. Hodgson, R.I. Chaplin, The properties and training of a neural network based universal window lter developed for image processing tasks, in: S. Amari, N. Kasabov (Eds.), Brain-like computing and intelligent information systems, Springer-Verlag, Singapore, 1998, pp. 4977. [39] F.Y. Shih, J. Moh, F.-C. Chang, A new ART-based neural architecture for pattern classication and image enhancement without prior knowledge, Pattern Recognition 25 (5) (1992) 533542. [40] V. Srinivasan, P. Bhatia, S.H. Ong, Edge detection using a neural network, Pattern Recognition 27 (12) (1994) 16531662. [41] C.-T. Tsai, Y.-N. Sun, P.-C. Chung et al., Endocardial boundary detection using a neural network, Pattern Recognition 26 (7) (1993) 10571068. [42] A.M. Waxman, M.C. Seibert, A. Gove et al., Neural processing of targets in visible multispectral IR and SAR imagery, Neural Networks 8 (78) (1995) 10291051. [43] C. Amerijckx, M. Verleysen, P. Thissen et al., Image compression by self-organized Kohonen map, IEEE Trans. Neural Networks 9 (3) (1998) 503507. [44] R.W. Brause, M. Rippl, Noise suppressing sensor encoding and neural signal orthonormalization, IEEE Trans. Neural Networks 9 (4) (1998) 613628. [45] R.D. Dony, S. Haykin, Neural network approaches to image compression, Proc. IEEE 83 (2) (1995) 288303. [46] W.-C. Fang, B.J. Sheu, O.T.-C. Chen et al., A VLSI neural processor for image data compression using self-organization networks, IEEE Trans. Neural Networks 3 (3) (1992) 506518. [47] G. Hauske, A self organizing map approach to image quality, Biosystems 40 (12) (1997) 93102. [48] J. Heikkonen, A computer vision approach to air ow analysis, Pattern Recognition Lett. 17 (4) (1996) 369385. [49] R.J.T. Morris, L.D. Rubin, H. Tirri, Neural network techniques for object orientation detection: solution by optimal feedforward network and learning vector quantization approaches, IEEE Trans. Pattern Anal. Mach. Intell. 12 (11) (1990) 11071115.
[18]
[26]
[27] [28]
M. Egmont-Petersen et al. / Pattern Recognition 35 (2002) 22792301 [50] S.A. Rizvi, L.C. Wang, N.M. Nasrabadi, Nonlinear vector prediction using feed-forward neural networks, IEEE Trans. Image Process. 6 (10) (1997) 14311436. [51] W. Skarbek, A. Cichocki, Robust image association by recurrent neural subnetworks, Neural Process. Lett. 3 (1996) 131138. [52] J.G. Daugman, Complete discrete 2-D Gabor transforms by neural networks for image analysis and compression, IEEE Trans. Acoustics, Speech Signal Process. 36 (7) (1988) 11691179. [53] R.D. Dony, S. Haykin, Optimally adaptive transform coding, IEEE Trans. Image Process. 4 (10) (1995) 1358 1370. [54] M.D. Garris, C.L. Wilson, J.L. Blue, Neural networkbased systems for handprint OCR applications, IEEE Trans. Image Process. 7 (8) (1998) 10971112. [55] S. Mitra, S.Y. Yang, High delity adaptive vector quantization at very low bit rates for progressive transmission of radiographic images, J. Electron. Imaging 8 (1) (1999) 2335. [56] D. Tzovaras, M.G. Strintzis, Use of nonlinear principal component analysis and vector quantization for image coding, IEEE Trans. Image Process. 7 (8) (1998) 1218 1223. [57] L.C. Wang, S.A. Rizvi, N.M. Nasrabadi, A modular neural network vector predictor for predictive image coding, IEEE Trans. Image Process. 7 (8) (1998) 1198 1217. [58] A. Weingessel, H. Bischof, K. Hornik et al., Adaptive combination of PCA and VQ networks, IEEE Trans. Neural Networks 8 (5) (1997) 12081211. [59] W.G. Waller, A.K. Jain, On the monotonicity of the performance of a Bayesian classier, IEEE Trans. Information Theory 24 (3) (1978) 392394. [60] J. Lampinen, E. Oja, Pattern recognition, in: C.T. Leondes (Ed.), Neural Network Systems, Techniques and Applications, Vol. 5, Image Processing and Pattern Recognition, Academic Press, New York, 1998, pp. 159. [61] E. Oja, A simplied neuron model as a principal component analyzer, J. Math. Biol. 15 (3) (1982) 267273. [62] E. Oja, Neural networks, principal components, and subspaces, Int. J. Neural Systems 1 (1) (1989) 6168. [63] P. Baldi, J. Hornik, Neural networks and principal component analysis: learning from examples without local minima, Neural Networks 2 (1) (1989) 5358. [64] M. Kramer, Nonlinear principal component analysis using autoassociative neural networks, Am. Ins. Chem. Eng. J. 37 (2) (1991) 223243. [65] S. Usui, S. Nakauchi, M. Nakano, Internal color representation acquired by a ve-layer neural network, Proceedings of the International Conference on Articial Neural Networks, Helsinki, Finland, 1991, pp. 867872. [66] T. Hastie, W. Stuetzle, Principal curves, J. Am. Statist. Assoc. 84 (406) (1989) 502516. [67] G.E. Hinton, P. Dayan, M. Revow, Modelling the manifolds of images of handwritten digits, IEEE Trans. Neural Networks 8 (1) (1997) 6574. [68] H.M. Abbas, M.M. Fahmy, Neural networks for maximum likelihood clustering, Signal Processing 36 (1) (1994) 111126.
2295
[69] J.M. Cruz, G. Pajares, J. Aranda et al., Stereo matching technique based on the perceptron criterion function, Pattern Recognition Lett. 16 (9) (1995) 933944. [70] M. Fukumi, S. Omatu, Y. Nishikawa, Rotation-invariant neural pattern recognition system estimating a rotation angle, IEEE Trans. Neural Networks 8 (3) (1997) 568581. [71] J.O. Glass, W.E. Reddick, Hybrid articial neural network segmentation and classication of dynamic contrast-enhanced MR imaging (DEMRI) of osteosarcoma, Magn. Resonance 2 Imaging 16 (9) (1998) 10751083. [72] V.Z. Kepuska, S.O. Mason, A hierarchical neural network system for signalized point recognition in aerial photographs, Photogrammetric Eng. Remote Sensing 61 (7) (1995) 917925. [73] J. Lampinen, E. Oja, Distortion tolerant pattern recognition based on self-organizing feature extraction, IEEE Trans. Neural Networks 6 (3) (1995) 539547. [74] D. Patel, E.R. Davies, I. Hannah, The use of convolution operators for detecting contaminants in food images, Pattern Recognition 29 (6) (1996) 10191029. [75] A. Shustorovich, A subspace projection approach to feature extractionthe 2-D Gabor transform for character recognition, Neural Networks 7 (8) (1994) 12951301. [76] P.N. Suganthan, H. Yan, Recognition of handprinted Chinese characters by constrained graph matching, Image Vision Comput. 16 (3) (1998) 191201. [77] C.K.I. Williams, M. Revow, G.E. Hinton, Instantiating deformable models with a neural net, Comput. Vision Image Understand 68 (1) (1997) 120126. [78] M.N. Ahmed, A.A. Farag, Two-stage neural network for volume segmentation of medical images, Pattern Recognition Lett. 18 (1113) (1997) 11431151. [79] C.T. Chen, E.C. Tsao, W.C. Lin, Medical image segmentation by a constraint satisfaction neural network, IEEE Trans. Nucl. Sci. 38 (2) (1991) 678686. [80] G.I. Chiou, J.N. Hwang, A neural network based stochastic active contour model (NNS-SNAKE) for contour nding of distinct features, IEEE Trans. Image Process. 4 (10) (1995) 14071416. [81] C. Chong, J. Jia, Assessments of neural network classier output codings using variability of Hamming distance, Pattern Recognition Lett. 17 (8) (1996) 811818. [82] M. Franzke, H. Handels, Topologische Merkmalskarten zur automatischen Mustererkennung in medizinischen Bilddaten, in: S. Fuchs, R. Ho mann (Eds.), Informatik Aktuell, Mustererkennung 1992, 14. DAGM-Symposium, Springer, Heidelberg, 1992, pp. 329334. [83] A. Ghosh, N.R. Pal, S.K. Pal, Image segmentation using a neural network, Biol. Cybernet. 66 (2) (1991) 151158. [84] A. Ghosh, S.K. Pal, Neural network, self-organization and object extraction, Pattern Recognition Lett. 13 (5) (1992) 387397. [85] A. Ghosh, N.R. Pal, S.K. Pal, Object background classication using Hopeld type neural network, Int. J. Pattern Recognition Artic. Intell. 6 (5) (1992) 9891008.
2296
M. Egmont-Petersen et al. / Pattern Recognition 35 (2002) 22792301 networks, IEEE Trans. Med. Imaging 16 (6) (1997) 911 918. S. Rout, S.P. Srivastava, J. Majumdar, Multi-modal image segmentation using a modied Hopeld neural network, Pattern Recognition 31 (6) (1998) 743750. A.J. Schoeld, P.A. Mehta, T.J. Stonham, A system for counting people in video images using neural networks to identify the background scene, Pattern Recognition 29 (8) (1996) 14211428. S.B. Serpico, L. Bruzzone, F. Roli, An experimental comparison of neural and statistical non-parametric algorithms for supervised classication of remote-sensing images, Pattern Recognition Lett. 17 (13) (1996) 1331 1341. R.H. Silverman, Segmentation of ultrasonic images with neural networks, Int. J. Pattern Recognition Artic. Intell. 5 (1991) 619628. M.M. van Hulle, T. Tollenaere, A modular articial neural network for texture processing, Neural Networks 6 (1) (1993) 732. D.L. Vilarino, V.M. Brea, D. Cabello et al., Discrete-time CNN for image segmentation by active contours, Pattern Recognition Lett. 19 (8) (1998) 721734. J. Waldemark, An automated procedure for cluster analysis of multivariate satellite data, Int. J. Neural Systems 8 (1) (1997) 315. T. Wang, X. Zhuang, X. Xing, Robust segmentation of noisy images using a neural network model, Image Vision Comput. 10 (4) (1992) 233240. D.L. Wang, D. Terman, Image segmentation based on oscillatory correlation, Neural Computation 9 (4) (1997) 805836. Y. Wang, T. Adali, S.Y. Kung et al., Quantication and segmentation of brain tissues from MR images a probabilistic neural network approach, IEEE Trans. Image Process. 7 (8) (1998) 11651181. A.J. Worth, D.N. Kennedy, Segmentation of magnetic resonance brain images using analogue constraint satisfaction neural networks, Image Vision Comput. 12 (6) (1994) 345354. N. Babaguchi, K. Yamada, K. Kise et al., Connectionist model binarization, Int. J. Pattern Recognition Artic. Intell. 5 (4) (1991) 629644. G.P. Babu, M.N. Murty, Optimal thresholding using multi state stochastic connectionist approach, Pattern Recognition Lett. 16 (1) (1995) 1118. J. Basak, B. Chanda, D.D. Majumder, On edge and line linking in graylevel images with connectionist models, IEEE Trans. Systems, Man Cybernet. 24 (3) (1994) 413428. A.G. Bors, I. Pitas, Optical ow estimation and moving object segmentation based on median radial basis function network, IEEE Trans. Image Process. 7 (5) (1998) 693702. D. DeKruger, B.R. Hunt, Image processing and neural networks for recognition of cartographic area features, Pattern Recognition 27 (4) (1994) 461483. M. Egmont-Petersen, E. Pelikan, Detection of bone tumours in radiographic images using neural networks, Pattern Anal. Appl. 2 (2) (1999) 172183.
[86] A. Ghosh, Use of fuzziness measures in layered networks for object extraction: a generalization, Fuzzy Sets Systems 72 (3) (1995) 331348. [87] M. Gro , F. Seibert, Visualization of multidimensional data sets using a neural network, Visual Comput. 10 (3) (1993) 145159. [88] S. Grossberg, E. Mingolla, Neural dynamics of surface perception: boundary webs, illuminants, and shape from shading, Comput. Vision, Graphics, Image Process. 37 (1) (1987) 116165. [89] S. Grossberg, N.P. Mcloughlin, Cortical dynamics of three-dimensional surface perceptionbinocular and half-occluded scenic images, Neural Networks 10 (9) (1997) 15831605. [90] L.O. Hall, A.M. Bensaid, L.P. Clarke et al., A comparison of neural network and fuzzy clustering techniques in segmenting magnetic resonance images of the brain, IEEE Trans. Neural Networks 3 (5) (1992) 672682. [91] H. Handels, C. Busch, J. Encarnacao et al., KAMEDIN: A telemedicine system for computer supported cooperative work and remote image analysis in radiology, Comput. Methods Programs Biomed. 52 (3) (1997) 175183. [92] J. Koh, M.S. Suk, S.M. Bhandarkar, A multilayer self organizing feature map for range image segmentation, Neural Networks 8 (1) (1995) 6786. [93] C. Kotropoulos, X. Magnisalis, I. Pitas et al., Nonlinear ultrasonic image processing based on signal-adaptive lters and self-organizing neural networks, IEEE Trans. Image Process. 3 (1) (1994) 6577. [94] D.X. Le, G.R. Thoma, H. Wechsler, Classication of binary document images into textual or nontextual data blocks using neural network models, Mach. Vision Appl. 8 (5) (1995) 289304. [95] W.-C. Lin, E.C.-K. Tsao, C.-T. Chen, Constraint satisfaction neural networks for image segmentation, Pattern Recognition 25 (7) (1992) 679693. [96] B.S. Manjunath, T. Simchony, R. Chellappa, Stochastic and deterministic networks for texture segmentation, IEEE Trans. Acoustics, Speech Signal Process. 38 (6) (1990) 10391049. [97] J.A. Marshall, Self-organizing neural networks for perception of visual motion, Neural Networks 3 (1) (1990) 4574. [98] S.C. Ngan, X. Hu, Analysis of functional magnetic resonance imaging data using self-organizing mapping with spatial connectivity, Magn. Resonance Med. 41 (5) (1999) 939946. [99] R. Opara, F. Worgotter, Using visual latencies to improve image segmentation, Neural Computation 8 (7) (1996) 14931520. [100] M. Ozkan, B.M. Dawant, R.J. Maciunas, Neuralnetwork-based segmentation of multi-modal medical imagesa comparative and prospective study, IEEE Trans. Med. Imaging 12 (3) (1993) 534544. [101] T.N. Pappas, An adaptive clustering algorithm for image segmentation, IEEE Trans. Signal Process. 40 (4) (1992) 901914. [102] W.E. Reddick, J.O. Glass, E.N. Cook et al., Automated segmentation and classication of multispectral magnetic resonance images of brain using articial neural
[103] [104]
[105]
[113]
[117]
[118] [119]
M. Egmont-Petersen et al. / Pattern Recognition 35 (2002) 22792301 [120] S. Ghosal, R. Mehrotra, Range surface characterization and segmentation using neural networks, Pattern Recognition 28 (5) (1995) 711727. [121] S. Haring, M.A. Viergever, J.N. Kok, Kohonen networks for multiscale image segmentation, Image Vision Comput. 12 (6) (1994) 339344. [122] S.Y. Kung, J.S. Taur, Decision based neural networks with signal image classication applications, IEEE Trans. Neural Networks 6 (1) (1995) 170181. [123] A. Laine, J. Fan, Texture classication by wavelet packet signatures, IEEE Trans. Pattern Anal. Mach. Intell. 15 (11) (1993) 11861191. [124] R.J. Machado, V.C. Barbosa, P.A. Neves, Learning in the combinatorial neural model, IEEE Trans. Neural Networks 9 (5) (1998) 831847. [125] M.F. McNittgray, H.K. Huang, J.W. Sayre, Feature selection in the pattern classication problem of digital chest radiograph segmentation, IEEE Trans. Med. Imaging 14 (3) (1995) 537547. [126] P.P. Raghu, R. Poongodi, B. Yegnanarayana, Unsupervised texture classication using vector quantization and deterministic relaxation neural network, IEEE Trans. Image Process. 6 (10) (1997) 13761387. [127] L. Sukissian, S. Kollias, Y. Boutalis, Adaptive classication of textured images using linear prediction and neural networks, Signal Processing 36 (2) (1994) 209232. [128] D.L. Toulson, J.F. Boyce, Segmentation of MR images using neural nets, Image Vision Comput. 10 (5) (1992) 324328. [129] E.C.-K. Tsao, W.-C. Lin, C.-T. Chen, Constraint satisfaction neural networks for image recognition, Pattern Recognition 26 (4) (1993) 553567. [130] L.M.J. Florack, The syntactical structure of scalar images, thesis, Image Sciences Institute, Utrecht University, Utrecht, 1993. [131] M. Egmont-Petersen, J.L. Talmon, J. Brender et al., On the quality of neural net classiers, Artic. Intell. Med. 6 (5) (1994) 359381. [132] J.M.H. Du Buf, M. Kardan, M. Spann, Texture feature performance for image segmentation, Pattern Recognition 23 (3 4) (1990) 291309. [133] Y.J. Zhang, A survey on evaluation methods for image segmentation, Pattern Recognition 29 (8) (1996) 1335 1346. [134] C. Alippi, Real-time analysis of ships in radar images with neural networks, Pattern Recognition 28 (12) (1995) 18991913. [135] M. Antonucci, B. Tirozzi, N.D. Yarunin et al., Numerical simulation of neural networks with translation and rotation invatiant pattern recognition, Int. J. Modern Phys. B 8 (1112) (1994) 15291541. [136] J.J. Atick, P.A. Gri n, A.N. Redlich, Statistical approach to shape from shadingreconstruction of 3-D face surfaces from single 2-D images, Neural Comput. 8 (6) (1996) 13211340. [137] J. Basak, S.K. Pal, PsyCOPA psychologically motivated connectionist system for object perception, IEEE Trans. Neural Networks 6 (6) (1995) 13371354. [138] J. Buhmann, J. Lange, C.v.d. Malsburg et al., Object recognition with Gabor functions in the dynamic link architectureparallel implementation on a transputer
2297
[139]
[140]
[141]
[142]
[143]
[144]
[148]
[149]
[153] [154]
network, in: B. Kosko (Ed.), Neural Networks for Signal Processing, 1992, Prentice-Hall, Englewood Cli s, NJ, pp. 121160. G.A. Carpenter, S. Grossberg, G.W. Lesher, The what-and-where ltera spatial mapping neural network for object recognition and image understanding, Comput. Vision Image Understand 69 (1) (1998) 122. C.A. Carson, J.M. Keller, K.K. McAdoo et al., Escherichia coli O157:H7 restriction pattern recognition by articial neural network, J. Clin. Microbiol. 33 (11) (1995) 28942898. H.-P. Chan, S.-C.B. Lo, B. Sahiner et al., Computer-aided detection of mammographic microcalcications: pattern recognition with an articial neural network, Med. Phys. 22 (10) (1995) 15551567. C.H. Chen, On the relationships between statistical pattern recognition and articial neural networks, Int. J Pattern Recognition Artic. Intell. 5 (4) (1991) 655661. S.S. Christensen, A.W. Andersen, T.M. JHrgensen et al., Visual guidance of a pig evisceration robot using neural networks, Pattern Recognition Lett. 17 (4) (1996) 345355. A. Delopoulos, A. Tirakis, S. Kollias, Invariant image classication using triple-correlation-based neural networks, IEEE Trans. Neural Networks 5 (3) (1994) 392408. E. do Valle Sim es, L.F. Uebel, D.A.C. Barone, o Hardware implementation of RAM neural networks, Pattern Recognition Lett. 17 (4) (1996) 421429. I.E. Dror, M. Zagaeski, C.F. Moss, 3-D target recognition via sonara neural network model, Neural Networks 8 (1) (1995) 149160. M. Egmont-Petersen, T. Arts, Recognition of radiopaque markers in X-ray images using a neural network as nonlinear lter, Pattern Recognition Lett. 20 (5) (1999) 521533. M. Egmont-Petersen, U. Schreiner, S.C. Tromp et al., Detection of leukocytes in contact with the vessel wall from in vivo microscope recordings using a neural network, IEEE Trans. Biomed. Eng. 47 (7) (2000) 941951. M. Fukumi, S. Omatu, F. Takeda et al., Rotationinvariant neural pattern-recognition system with application to coin recognition, IEEE Trans. Neural Networks 3 (2) (1992) 272279. K. Fukushima, Neocognitron: a hierarchical neural network capable of visual pattern recognition, Neural Networks 1 (2) (1988) 119130. H.P. Graf, C.R. Nohl, J. Ben, Image recognition with an analog neural net chip, Mach. Vision Appl. 8 (2) (1995) 131140. R.E. Hurst, R.B. Bonner, K. Ashenayi et al., Neural net-based identication of cells expressing the p300 tumor-related antigen using uorescence image analysis, Cytometry 27 (1) (1997) 3642. K. Itoh, ID number recognition of X-ray lms by a neural network, Comput. Methods Programs Biomed. 43 (12) (1994) 1518. B. Javidi, Q. Tang, Optical implementation of neural networks by the use of nonlinear joint transform correlators, Appl. Opt. 34 (20) (1995) 39503962.
2298
M. Egmont-Petersen et al. / Pattern Recognition 35 (2002) 22792301 [173] L. Wang, S. Der, N. Nasrabadi, Composite classiers for automatic target recognition, Opt. Eng. 37 (3) (1998) 858868. [174] C. Wohler, A time delay neural network algorithm for estimating image-pattern shape and motion, Image Vision Comput. 17 (3 4) (1999) 281294. [175] S.S. Young, P.D. Scott, C. Bandera, Foveal automatic target recognition using a multiresolution neural network, IEEE Trans. Image Process. 7 (8) (1998) 11221135. [176] S.S. Young, P.D. Scott, N.M. Nasrabadi, Object recognition using multilayer Hopeld neural network, IEEE Trans. Image Process. 6 (3) (1997) 357372. [177] Y. Zheng, J.F. Greenleaf, J.J. Gisvold, Reduction of breast biopsies with a modied self-organizing map, IEEE Trans. Neural Networks 8 (6) (1997) 13861396. [178] Z.G. Zhu, S.Q. Yang, G.Y. Xu et al., Fast road classication and orientation estimation using omni-view images and neural networks, IEEE Trans. Image Process. 7 (8) (1998) 11821197. [179] T. Ziemke, Radar image segmentation using recurrent articial neural networks, Pattern Recognition Lett. 17 (4) (1996) 319334. [180] B. Jahne, Digital image processing. Concepts, Algorithms and Scientic Applications, Springer, Berlin, 1995. [181] L.M.J. Florack, B.M. ter Haar Romeny, J.J. Koenderink et al., Scale and the di erential structure of images, Image Vision Comput. 10 (6) (1992) 376388. [182] Y. Bengio, P. Simard, P. Frasconi, Learning long-term dependencies with gradient descent is di cult, IEEE Trans. Neural Networks 5 (2) (1994) 157166. [183] R. Anand, K. Mehrotra, C.K. Mohan et al., Analyzing images containing multiple sparse patterns with neural networks, Pattern Recognition 26 (11) (1993) 1717 1724. [184] R. Bajaj, S. Chaudhury, Signature verication using multiple neural classiers, Pattern Recognition 30 (1) (1997) 17. [185] G.M.T. Brake, N. Karssemeijer, Single and multiscale detection of masses in digital mammograms, IEEE Trans. Med. Imag. 18 (7) (1999) 628639. [186] S.H. Chang, G.H. Han, J.M. Valverde et al., Cork quality classication system using a unied image processing and fuzzy-neural network methodology, IEEE Trans. Neural Networks 8 (4) (1997) 964974. [187] S. Desch nes, Y. Sheng, P.C. Chevrette, Threee dimensional object recognition from two-dimensional images using wavelet transforms and neural networks, Opt. Eng. 37 (3) (1998) 763770. [188] A.J. Einstein, J. Barba, P.D. Unger et al., Nuclear di useness as a measure of texture: denition and application to the computer-assisted diagnosis of parathyroid adenoma and carcinoma, J. Micros. 176 (2) (1994) 158166. [189] J. Heikkonen, M. Mantynen, A computer vision approach to digit recognition on pulp bales, Pattern Recognition Lett. 17 (4) (1996) 413419. [190] K. Huang, H. Yan, O -line signature verication based on geometric feature extraction and neural network classication, Pattern Recognition 30 (1) (1997) 917. [191] K.M. Iftekharuddin, T.D. Schechinger, K. Jemili et al., Feature-based neural wavelet optical character
[155] N.K. Kasabov, S.I. Israel, B.J. Woodford, Adaptive, evolving, hybrid connectionist systems for image pattern recognition, in: S.K. Pal, A. Ghosh, M.K. Kundu (Eds.), Soft computing for image processing, Physica-Verlag, Heidelberg, 2000. [156] J.H. Kim, H.S. Cho, Neural network-based inspection of solder joints using a circular illumination, Image Vision Comput. 13 (6) (1995) 479490. [157] B. Kosko, Adaptive bidirectional associative memories, Appl. Opt. 26 (23) (1987) 49474960. [158] Y. LeCun, B. Boser, J.S. Denker et al., Backpropagation applied to handwritten zip code recognition, Neural Comput. 1 (4) (1989) 541551. [159] Y. LeCun, L.D. Jackel, B. Boser et al., Handwritten digit recognitionapplications of neural network chips and automatic learning, IEEE Commun. Mag. 27 (11) (1989) 4146. [160] J.S. Lin, S.C.B. Lo, A. Hasegawa et al., Reduction of false positives in lung nodule detection using a two-level neural classication, IEEE Trans. Med. Imag. 15 (2) (1996) 206217. [161] M.R.J. McQuoid, Neural ensembles: simultaneous recognition of multiple 2-D visual objects, Neural Networks 6 (7) (1993) 917970. [162] C. Neubauer, Evaluation of convolutional neural networks for visual recognition, IEEE Trans. Neural Networks 9 (4) (1998) 685696. [163] E. Paquet, M. Rioux, H.H. Arsenault, Invariant pattern recognition for range images using the phase Fourier transform and a neural network, Opt. Eng. 34 (4) (1994) 11781183. [164] M.G. Penedo, M.J. Carreira, A. Mosquera et al., Computer-aided diagnosis: a neural-network-based approach to lung nodule detection, IEEE Trans. Med. Imag. 17 (6) (1998) 872880. [165] L.I. Perlovsky, W.H. Schoendorf, B.J. Burdick et al., Model-based neural network for target detection in SAR images, IEEE Trans. Image Process. 6 (1) (1997) 203216. [166] J.C. Principe, M. Kim, J.W. Fisher, Target discrimination in synthetic aperture radar using articial neural networks, IEEE Trans. Image Process. 7 (8) (1998) 11361149. [167] J.-Y. Shen, Y.-X. Zhang, G.-G. Mu, Optical pattern recognition system based on a winner-take-all model of a neural network, Opt. Eng. 32 (5) (1992) 10531056. [168] J. Sklansky, M. Vriesenga, Genetic selection and neural modelling of piecewise-linear classiers, Int. J. Pattern Recognition Artic. Intell. 10 (5) (1996) 587612. [169] L. Spirkovska, M.B. Reid, Coarse-coded higher-order neural networks for PRSI object recognition, IEEE Trans. Neural Networks 4 (2) (1993) 276283. [170] L. Spirkovska, M.B. Reid, Higher-order neural networks applied to 2D and 3D object recognition, Mach. Learning 15 (2) (1994) 169199. [171] M. Turner, J. Austin, N.M. Allinson et al., Chromosome location and feature extraction using neural networks, Image Vision Comput. 11 (4) (1993) 235239. [172] L.C. Wang, S.Z. Der, N.M. Nasrabadi, Automatic target recognition using a feature-decomposition and data-decomposition modular neural network, IEEE Trans. Image Process. 7 (8) (1998) 11131121.
M. Egmont-Petersen et al. / Pattern Recognition 35 (2002) 22792301 recognition system, Opt. Eng. 34 (11) (1995) 3193 3199. T.M. JHrgensen, S.S. Christensen, A.W. Andersen, Detecting danger labels with RAM-based neural networks, Pattern Recognition Lett. 17 (4) (1996) 399 412. H. Kai, H. Yan, O -line signature verication based on geometric feature extraction and neural network classication, Pattern Recognition 30 (1) (1997) 917. N. Karssemeijer, G.M.T. Brake, Detection of stellate distortions in mammograms, IEEE Trans. Med. Imag. 15 (5) (1996) 611619. A. Khotanzad, J.H. Lu, Classication of invariant image representations using a neural network, IEEE Trans. Acoustics, Speech Signal Process. 38 (6) (1990) 1028 1038. H.J. Kim, H.S. Yang, A neural network capable of learning and inference for visual pattern recognition, Pattern Recognition 27 (10) (1994) 12911302. S. Lawrence, C.L. Giles, A.C. Tsoi et al., Face recognitiona convolutional neural-network approach, IEEE Trans. Neural Networks 8 (1) (1997) 98113. D.A. Mitzias, B.G. Mertzios, Shape recognition with a neural network classier based on a fast polygon approximation technique, Pattern Recognition 27 (5) (1994) 627636. Y. Pan, A note on e cient parallel algorithms for the computation of 2D image moments, Pattern Recognition 24 (9) (1991) 917. Y. Park, A comparison of neural net classiers and linear tree classiers: their similarities and di erences, Pattern Recognition 27 (11) (1994) 14931503. P.P. Raghu, B. Yegnanarayana, Multispectral image classication using gabor lters and stochastic relaxation neural network, Neural Networks 10 (3) (1997) 561572. S. Ramanan, R.S. Petersen, T.G. Clarkson et al., pRAM nets for detection of small targets in sequences of infra-red images, Neural Networks 8 (78) (1995) 12271237. A. Ravichandran, B. Yegnanarayana, Studies on object recognition from degraded images using neural networks, Neural Networks 8 (3) (1995) 481488. F. Ros, S. Guillaume, G. Rabatel et al., Recognition of overlapping particles in granular product images using statistics and neural networks, Food Control 6 (1) (1995) 3743. P. Sajda, C.D. Spence, S. Hsu et al., Integrating neural networks with image pyramids to learn target context, Neural Networks 8 (78) (1995) 11431152. C. Strouthopoulos, N. Papamarkos, Text identication for document image analysis using a neural network, Image Vision Comput. 16 (1213) (1998) 879896. B. Tak c, L. Sadovnik, Three-dimensional target a recognition and tracking using neural networks trained on optimal views, Opt. Eng. 37 (3) (1998) 819828. D.-M. Tsai, R.-Y. Tsai, Use neural networks to determine matching order for recognizing overlapping objects, Pattern Recognition Lett. 17 (10) (1996) 10771088. J.Y. Wang, F.S. Cohen, 3-D Object recognition and shape estimation from image contours using b-splines, shape
2299
[192]
[202]
[220]
[203] [204]
[227]
invariant matching, and neural network.2, IEEE Trans. Pattern Anal. Mach. Intell. 16 (1) (1994) 1323. D. Casasent, L.M. Neiberg, M.A. Sipe, Feature space trajectory distorted object representation for classication and pose estimation, Opt. Eng. 37 (3) (1998) 914920. M. Sonka, V. Hlavac, R. Boyle, Image processing, Analysis, and Machine Vision, 2nd Edition, PWS Publishing, Pacic Grove, 1999. T. Sams, J.L. Hansen, Implications of physical symmetries in adaptive image classiers, Neural Networks 13 (6) (2000) 565570. B. Lerner, Toward a completely automatic neuralnetwork-based human chromosome analysis, IEEE Trans. System, Man, Cybernet. Part B: Cybernetics 28 (4) (1998) 544552. G. Pasquariello, G. Satalino, V.l. Forgia et al., Automatic target recognition for naval tra c control using neural networks, Image Vision Comput. 16 (2) (1998) 6773. G. Wells, C. Venaille, C. Torras, Promising research: vision-based robot positioning using neural networks, Image Vision Comput. 14 (10) (1996) 715732. G.L. Foresti, G. Pieroni, Exploiting neural trees in range image understanding, Pattern Recognition Lett. 19 (9) (1998) 869878. W.R. Reinus, A.J. Wilson, B. Kalman et al., Diagnosis of focal bone lesions using neural networks, Invest. Radiol. 29 (6) (1994) 606611. A. Stassopoulou, M. Petrou, J. Kittler, Bayesian and neural networks for geographic information processing, Pattern Recognition Lett. 17 (13) (1996) 13251330. M. Egmont-Petersen, W.R.M. Dassen, C.J.H.J. Kirchhof et al., An explanation facility for a neural network trained to predict arterial brillation directly after cardiac surgery, Proceedings of Computers in Cardiology 1998, Cleveland, 1998, pp. 489 492. A.B. Tickle, R. Andrews, M. Golea et al., The truth will come to light: directions and challenges in extracting the knowledge embedded within trained articial neural networks, IEEE Trans. Neural Networks 9 (6) (1998) 10571068. E. Bertin, H. Bischof, P. Bertolino, Voronoi pyramids controlled by Hopeld neural networks, Comput. Vision Image Understand. 63 (3) (1996) 462475. T.W. Chen, W.C. Lin, A neural network approach to CSG-based 3-D object recognition, IEEE Trans. Pattern Anal. Mach. Intell. 16 (7) (1994) 719726. P.C. Chung, C.T. Tsai, E.L. Chen et al., Polygonal approximation using a competitive Hopeld neural network, Pattern Recognition 27 (11) (1994) 15051512. N.M. Nasrabadi, C.Y. Choo, Hopeld network for stereo correspondence, IEEE Trans. Neural Networks 3 (1) (1992) 513. Y. Ruichek, J.-G. Postaire, A neural matching algorithm for 3-D reconstruction from stereo pairs of linear images, Pattern Recognition Lett. 17 (4) (1996) 387398. D. Shen, H.H.S. Ip, A Hopeld neural network for adaptive image segmentation: an active surface paradigm, Pattern Recognition Lett. 18 (1) (1997) 3748. P.N. Suganthan, E.K. Teoh, D.P. Mital, Pattern recognition by homomorphic graph matching using
2300
M. Egmont-Petersen et al. / Pattern Recognition 35 (2002) 22792301 Hopeld neural networks, Image Vision Comput. 13 (1) (1995) 4560. P.N. Suganthan, E.K. Teoh, D.P. Mital, Pattern recognition by graph matching using the Potts MFT neural networks, Pattern Recognition 28 (7) (1995) 9971009. P.N. Suganthan, E.K. Teoh, D.P. Mital, Optimal mapping of graph homomorphism onto self organising hopeld network, Image Vision Comput. 15 (9) (1997) 679694. S.-S. Yu, W.-H. Tsai, Relaxation by the Hopeld neural network, Pattern Recognition 25 (2) (1992) 197210. E.R. Hancock, J. Kittler, A Bayesian interpretation for the Hopeld network, Proceedings of the IEEE Conference on Neural Networks, San Francisco, CA, 1993, pp. 341346. F.S. Hiller, G.J. Lieberman, Introduction to operations research, 6th Edition, McGraw-Hill, New York, 1995. H.J. Kappen, W. Wiegerinck, T. Morgan, et al., Stimulation initiative for european neural applications (SIENA), in: B. Kappen, S. Gielen (Eds.), Neural networks: Best practice in Europe, Vol. 8, World Scientic, Singapore, 1997, pp. 18. (http:==www.mbfys.kun.nl= snn=Research=siena=index.html). http:==www.kcl.ac.uk=neuronet. http:==www.brainstorm.co.uk=nctt. D. Valentin, H. Abdi, A.J. OToole et al., Connectionist models of face processinga survey, Pattern Recognition 27 (9) (1994) 12091230. M. Cenci, C. Nagar, A. Vecchione, PAPNET-assisted primary screening of conventional cervical smears, Anticancer Res. 20 (5C) (2000) 38873889. G.M.T. Brake, N. Karssemeijer, J.H.C.L. Hendriks, An automatic method to discriminate malignant masses from normal tissue in digital mammograms, Phys. Med. Biol. 45 (10) (2000) 28432857. S.K. Lee, C.S. Lo, C.M. Wang et al., A computer-aided design mammography screening system for detection and classication of microcalcications, Int. J. Med. Inform. 60 (1) (2000) 2957. Armed Forces Communications and Electronics Association, DARPA neural network study, AFCEA, Fairfax, 1988. [241] M.W. Roth, Survey of neural network technology for automatic target recognition, IEEE Trans. Neural Networks 1 (1) (1990) 2843. [242] M. Egmont-Petersen, J.L. Talmon, A. Hasman et al., Assessing the importance of features for multi-layer perceptrons, Neural Networks 11 (4) (1998) 623635. [243] A.K. Jain, D. Zongker, Feature selection: Evaluation, application, and small sample performance, IEEE Trans. Pattern Anal. Mach. Intell. 19 (2) (1997) 153158. [244] P. Pudil, J. Novovicov , J. Kittler, Floating search a methods in feature selection, Pattern Recognition Lett. 15 (11) (1994) 11191125. [245] D.M.J. Tax, R.P.W. Duin, Support vector domain description, Pattern Recognition Lett. 20 (1113) (1999) 11911199. [246] M. Egmont-Petersen, W.R.M. Dassen, J.H.C. Reiber, Sequential selection of discrete features for neural networksa Bayesian approach to building a cascade, Pattern Recognition Lett. 20 (1113) (1999) 14391448. [247] L.I. Perlovsky, Conundrum of combinatorial complexity, IEEE Trans. Pattern Anal. Mach. Intell. 20 (6) (1998) 666670. [248] P. Simard, Y. LeCun, J. Denker, E cient pattern recognition using a new transformation distance, Proc. Adv. Neural Inform. Process. Systems, 1994, pp. 50 58. [249] W.J. Christmas, J. Kittler, M. Petrou, Analytical approaches to the neural net architecture design, Proceedings of Pattern Recognition in Practice IV, Vlieland, 1994, pp. 325 335. [250] N. Murata, S. Yoshizawa, S. Amari, Network information criteriondetermining the number of hidden units for an articial neural network model, IEEE Trans. Neural Networks 5 (6) (1994) 865872. [251] S.K. Pal, A. Ghosh, Neuro-fuzzy computing for image processing and pattern recognition, Int. J. Systems Sci. 27 (12) (1996) 11791193. [252] T. Kohonen, Self-organized formation of topologically correct feature maps, Biol. Cybernet. 43 (1) (1982) 5969. [253] J.J. Hopeld, Neural networks and physical systems with emergent collective computational abilities, Proc. Nat. Acad. Sci. U.S.A. 81 (1982) 30883092. [254] V.N. Vapnik, Statistical Learning Theory, Wiley, New York, 1998.
[228]
[229]
[230] [231]
[232] [233]
[239]
[240]
About the AuthorM. EGMONT-PETERSEN was born in Copenhagen, Denmark, in 1967. He received the combined B.S. and combined M.S. degrees in Computer Science=Business Administration from Copenhagen Business School in 1988 and 1990, respectively. He received the Ph.D. degree in Medical Informatics from Maastricht University, The Netherlands, in 1996. He has worked from 1997 to 2000 for the Division of Image Processing, Department of Radiology, Leiden University Medical Centre, as postdoctoral researcher. He is currently associated with the Department of Computer Science at the University of Utrecht, The Netherlands. Dr. Egmont-Petersen is currently developing novel learning algorithms for Bayesian Belief Networks. His main research interests include belief networks, neural networks, support vector machines, statistical classiers, feature selection, image understanding and invariant theory. He has published more than 35 papers in journals and conference proceedings. About the AuthorD. DE RIDDER received his M.Sc. degree in 1996 from the Department of Computer Science of the Delft University of Technology, The Netherlands, where he is currently a Ph.D. student in the Pattern Recognition Group at the Department of Applied Physics. His research interests include statistical pattern recognition, image processing and in particular the application of neural network techniques in the eld of non-linear image processing. Currently he is working on developing and extending tools for non-linear data analysis. He has written over 20 papers in journals and conference proceedings.
2301
About the AuthorRER. NAT. HABIL. HEINZ HANDELS was born in Wurselen, Germany, in 1960. After his study of Informatics, Mathematics and Physics he received his Ph.D. degree in Computer Science at the RWTH Aachen, Germany, in 1992. Since 1992 he is the head of the Medical Image Processing and Pattern Recognition Group of the Institute for Medical Informatics at the Medical University of Lubeck. In 1999, he nished his Habilitation for Medical Informatics at the Medical University of Lubeck. His current research interest is focussed on the development of methods and optimised systems for the analysis and recognition of pathological image structures like tumours in radiological and dermatological images. This includes the integration of image analysis algorithms and feature selection methods with neural pattern recognition techniques for medical decision support. He has published more than 70 papers in journals and conference proceedings.