University of Trento: Department of Information and Communication Technology
University of Trento: Department of Information and Communication Technology
University of Trento: Department of Information and Communication Technology
DEPARTMENT OF INFORMATION AND COMMUNICATION TECHNOLOGY 38050 Povo Trento (Italy), Via Sommarive 14 http://www.dit.unitn.it
LOCATION-AWARE COMPUTING: A NEURAL NETWORK MODEL FOR DETERMINING LOCATION IN WIRELESS LANS Roberto Battiti, Thang Le Nhat and Alessandro Villani
Location-aware computing: a neural network model for determining location in wireless LANs
Roberto Battiti, Thang Le Nhat, and Alessandro Villani
Universit` a di Trento, Dipartimento di Informatica e Telecomunicazioni via Sommarive 14, I-38050 Pant` e di Povo (TN), Italy battiti|thang|avillani@science.unitn.it
Abstract. The strengths of the RF signals arriving from more access points in a wireless LANs are related to the position of the mobile terminal and can be used to derive the location of the user. In a heterogeneous environment, e.g. inside a building or in a variegated urban geometry, the received power is a very complex function of the distance, the geometry, the materials. The complexity of the inverse problem (to derive the position from the signals) and the lack of complete information, motivate to consider exible models based on a network of functions (neural networks). Specifying the value of the free parameters of the model requires a supervised learning strategy that starts from a set of labeled examples to construct a model that will then generalize in an appropriate manner when confronted with new data, not present in the training set. The advantage of the method is that it does not require ad-hoc infrastructure in addition to the wireless LAN, while the exible modeling and learning capabilities of neural networks achieve lower errors in determining the position, are amenable to incremental improvements, and do not require the detailed knowledge of the access point locations and of the building characteristics. A user needs only a map of the working space and a small number of identied locations to train a system, as evidenced by the experimental results presented.
Keywords: location- and context-aware computing, wireless LANs, IEEE802.11b, neural networks, machine learning.
1 Introduction
Sentient computers, that take the current context (e.g. location, time, activity, previous history) into account when interacting with the user, hold signicant promises for a seamless use of tomorrows wireless networks in which mobile computing and Internet connectivity will be provided for professional and recreational activities through PDAs, smart phones, laptops, and other mobile appliances. Knowledge of the location and suitable models are important in order to reduce the cognitive burden on the users in context- and location-aware systems [8, 1, 15]. Location awareness is considered for example in the infostation-based hoarding work of [14], and
in the websign system of [17]. Some techniques for determining the location in indoor and urban context (where GPS assisted localization encounters problems) are based on pattern recognition: from the signature of the signal received by multiple antennas one derives the position of the mobile device [3]. A complication is caused by the fact that signal propagation is inuenced by environmental factors like, for example, the number of people located in the working area [2], the position and material of walls and, in general, the infrastructure contained in a building. The research in this paper proposes a method based on neural networks for reducing the errors in the determination of the current location of the user. One executes measurements of the strength of signals coming from the different antennas at a series of points distributed in the environment. These data are a training set that can be used by a learning algorithm (e.g. a neural net) to develop an association between signal strengths and location. We propose to use neural networks and a training algorithm based on secondorder information in order to develop exible models of the relationship between the raw signal measurements and the location data. The following part of this paper is organized as follows. Section 2 summarizes previous approaches to the problem, Section 3 describes the methodology for modeling the input-output relationship through multi-layer perceptron neural networks, Section 4 describes the system and the collection of data points for the experiments, Section 5 discusses the experiments dealing with the selection the neural architecture and the length of the training phase. Finally, Section 6 describes the obtained measurement error test results as a function of the number of training examples.
2 Previous approaches
Advances in indoor, short-range wireless communication technology and the increasing trend toward portable, hand-held personal computers equipped with high-speed radio access have made wireless LANs popular. Currently, there are several alternative wireless LAN technologies such as IEEE 802.11 a, b, HIPERLAN and Bluetooth. Among them, the IEEE 802.11 standard is gaining a growing support as a solution for transmitting/receiving data with high-speed rate in indoor networks with a bandwidth of up to 54Mbps [16]. Many different systems and technologies to determine the location of users for mobile computing applications have been proposed. Global Positioning System (GPS)is a satellite-based navigation aid originally developed by the US military. GPS systems receive signals from multiple satellites and use a triangulation process to determine physical locations with approximately 10 meters accuracy. GPS is very successful in open areas but ineffective for indoor use or in urban areas with tall buildings that shield the satellite signals. The Active Badge system [23], [9] is one of the earliest indoor systems for determining the location, based on diffused infrared technology. A badge emits a unique IR signal periodically or on demand. Infrared sensors placed in the building pick up these periodic signals and transfer them to a master station for processing. Although the Active Badge system provides accurate location information, it also subject to some
restrictions such as line-of-sight limitations, poor performance with uorescent lighting or direct sunlight. The Active Bat location system [24, 10] developed by AT&T researchers uses a combination of RF and ultrasound time-of-ight to estimate the distance. When a controller connected to the PC sends a radio request message, an Active Bat tag attached to the object reacts by emitting an ultrasonic pulse directed to a matrix of receiving elements mounted on the room ceiling. At the same time, the controlling PC sends a reset signal to the receivers over the serial network, so that they can measure the time interval and calculate the distances from the tag to the receivers. The use of ultrasound time-of-ight requires a large xed-sensor infrastructure throughout the ceiling and the accuracy, that can reach about 9 cm, is rather sensitive to the precise placement of the sensors. PinPoint Corp. develops a product named 3D-iD local positioning system [25] for determining the 3D locations of items inside buildings. In this architecture, 3D-iD readers emit codes that are received by the tags attached to mobile devices. Then the tags simply change the signals frequency and transmit back to the reader with tag ID information phase-modulated onto it. The reader extracts the tag ID from this returned signal and also determines the tags distance from the antenna by measuring the round trip time of ight. The PinPoint system is composed of cells within a building and uses spread-spectrum radio signals and multiple antennas (up to 16) at the cell controller to process the signal from a tag. It can detect reliably items from about a 30-meter distance with 1 to 3-meter accuracy. The disadvantages are that each antenna has a narrow cone of inuence, so that ubiquitous deployment becomes prohibitively expensive. Additional difculties arise when interoperating with the IEEE 802.11 wireless networking infrastructure because of radio spectrum collision [11]. Microsoft Research RADAR location system used the IEEE 802.11b wireless LAN technology [3]. In the RADAR system, the RF signal strength is used as a measure of distance between Access Point (AP) and mobile terminal, and then this information is used to compute the 2D position by triangulation, with both an empirical method and a signal propagation modeling method. The results show that the empirical method is superior in terms of accuracy with median resolution in the range of about 3 meters, while the signal propagation modeling method has 4.3 meters accuracy (median), but it makes deployment easier. Similar to Active Bat system, Cricket, a location-support system for in-building, mobile, location-dependent applications, uses a combination of RF and ultrasound hardware to enable a small device attached to mobile user (the listener) to estimate the distance to the nearest beacon[18]. The listener performs the timing and computation functions. On each transmission, a beacon, a small device attached to some locations within the geographic space, sends both space information and an ultrasonic pulse. When the listener hears the RF signal, it uses the rst few bits as training information and turns on it ultrasonic receiver to listen to the ultrasonic pulse, which arrive in short time later. Based on the time interval between the rst bit of RF information and the ultrasonic signal, the listener can determine the distance to the beacon. Crickets main features are user privacy, decentralized administration, network heterogeneity, low cost and a portion-of-a-room granularity of 4x4 feet.
SpotON, a new tagging technology for three-dimensional location sensing based on radio signal strength analysis was introduced in [12]. The system is built by using RFIDeas badge and AIRID base station - the product of Illinois Company and Hydra microwebserver that has both an Ethernet and serial port for the AIRID internetworking task. In general, the SpotON system is similar to Microsoft Research wireless LAN and the PinPoint system in developing a ne grained tagging technology based on RF signal strength. However, following the authors laboratory experiments, the SpotON can archive better resolution and accuracy than the Microsoft Research system with a much lower cost than the product from PinPoint.
!" $ #
(1)
is the transmitted power, where is the received power, , are transmitter # and receiver antenna gain respectively, is the &('0) separation in meters, is distance 2 # 4 1 3 the system loss factor not related to the propagation and is the wavelength in meters. More detailed radio propagation models for indoor environments are considered 65 for example in [3]. If one knows distances from the mobile terminal to at least three different APs, one can calculate the position of the mobile terminal in the system. However, in a variegated and heterogeneous environment, e.g. inside a building or in a complex urban geometry, the received power is a very complex function of the distance, the geometry of walls, the infrastructures contained in the building. Even if a detailed model of the building is available, solving the direct problem of deriving the signal strength given the location requires a lengthy simulation. The inverse problem, of deriving the location from the signal strengths is more complicated and very difcult to solve in realistic situations. Furthermore, in order to facilitate the deployment of the system, it is unrealistic to require a detailed exhaustive specication of the building geometry, materials, infrastructures. The two reasons, complexity of the problem and
%
lack of complete information, motivate to consider exible models based on networks of functions. These models are termed non-parametric models in statistics, and neural networks in other contexts. The non-linear transformation of each unit and a sufciently large number of free parameters guarantee that a neural network is capable of representing the relationship between inputs (signal strengths) and outputs (position). Let us note that the distance from the access points, and therefore the detailed knowledge of their position, is not required by the system: a user may train and use the system without requiring this information. Specifying the value of the free parameters of the model (also called weights of the network) requires a learning strategy that starts from a set of labeled examples to construct a model that will then generalize in an appropriate manner when confronted with new data, not present in the training set. 3.1 The One-Step Secant method for training neural networks Efcient optimization algorithms are crucial in the learning phase of models like neural networks and have been studied for example in [5], [6]. Let us briey dene the notation. We consider the standard multi-layer perceptron (MLP) architecture, with weights connecting only nearby layers and the sum-of-squared-differences energy function dened as: C C
78@9A
where E and P E are the target and the current output values for pattern S , respectively, 9 as a function of the parameters of the networks (weights ). The architecture of the multi-layer perceptron is organized as follows: the signals ow sequentially through the different layers from the input to the output layer. For each layer, each unit (neuron) rst calculates a scalar product between a vector of parameters (weights) and the vector given by the outputs of the previous layer. A transfer function is then applied to the result to produce the input for the function for the hidden layers UV W next 3a`4The 3YX layer. bdcVe transfer is the sigmoidal function: T , while for the output layer it is the identity function, so that the output signal is not bounded. It has been demonstrated that a network with a single hidden layer is sufcient to approximate any continuous function to a desired accuracy, provided that the number of hidden neurons is sufciently large [13]. In this work we consider a single-hidden-layer MLP and a training technique that uses second-derivatives information: the one-stepsecant method with fast line searches OSS, see [5], [4]. The standard back propagation technique uses only rst-order information (the gradient). In particular, the stochastic on-line back-propagation update is given by:
3 D 7 E EGFH
(2)
7 E where the pattern S is chosen randomly from the training set at each iteration, q is the gradient, and p is a xed learning rate. Faster training can be obtained by using also second derivatives, but computing all u2v uwY second derivatives (the Hessian) requires order t operations [7] and order t
(3)
memory to store the Hessian components. In addition, the solution xuyY of equation to nd operations, at least the step (or search direction) in Newtons method requires t when using traditional linear algebra routines. Fortunately, some second-order information can be calculated by starting from the last gradients, and therefore reducing the u computation and memory requirements to nd the search direction to t . Historically the one-step-secant method OSS is a variation of what is called one-step (memory-less) Broyden-Fletcher-Goldfarb-Shanno method, see [20]. The OSS method is described in detail and is used for multilayer perceptrons in [4] and [5]. Note that BFGS stores the whole approximated Hessian, while the one-step method fih H requires only vectors computed from gradients. In fact, the new search direction S is obtained as:
fih H fih fvf ` ff ' H ` (4) f f where the two scalars and f are following combination of scalar products of fih the H and f (last step, gradient and difference of the previously dened vectors , S f v f fGh H f fih H f ' 3r` f 6 ` f f f f f f
;
gradients):
fh vf i H f f
The search direction is the negative gradient at the beginning of learning and it is fih H u u restarted to ' every steps ( being the number of weights the network). fih in H is crucial to obtain The fast one-dimensional minimization along the direction S an efcient algorithm. The one-dimensional search is based on the backtracking strat3ef3 ) and the rst tentative egy. The last successful learning rate is increased ( 0d 7 step is executed. If the new value is not below the upper-limiting curve, then a new tentative step is tried by using successive quadratic interpolations the requirement #agihi after eachuntil is met. Note that the learning rate is decreased by unsuccessful trial. Quadratic interpolation is not wasting computation, in fact, after the rst trial one 7j 70j k has exactly the information that is needed to t a parabola: the value of and at 7%l mUV the initial point and the value of at the trial point. The parabola is:
mUVn7 j ` 7 j k U `po
and the minimizer ts
7 l
'
7 j
5fu s
'q
7 jk U r
(5)
7 jk 3 l v (6) B 3 Bmvxwty c w{ w z | ~ lvz } c ' gihi 5fu gihi If the gradient-multiplier is 0.5, the s that minimizes the parabola is less than , see [5] for the complete details. 5fu '
is:
the positions of the three Access Points (APs) are shown in the Fig. 1. The oor has and includes more than dimensions of 25.5 x 24.5 , for a total area of 624.75 eleven rooms (ofces and classrooms).
Fig. 1. The oor layout of the experiment, with access points locations
The origin of the coordinate system (0,0) is placed at the left bottom corner of the @Ud x e 3vde , map. The of access 3 x ecoordinates B the 3Ypoints are as follows: AP1 d e 3 e AP3 . AP2 The Access Points are AVAYA WP-II E model by Lucent Technologies Netherland B.V., two with external antennas. The wireless stations are Pentium-based Laptop computers running Linux version 7.2. Each Laptop is equipped with the ORiNOCO PC card - a wireless network interface card by Lucent Technologies. The network operates in the 2.4 GHz license-free ISM band and supports data rates of 1, 2, 5.5, and 11Mbps. The 2.4 GHz ISM band is divided into 13 channels (IEEE & ETSI Wireless LAN Standard). In our system we use three channels: channel 1 at 2412 MHz; channel 7 at 2442 MHz and channel 13 at 2472 MHz (European Channel Selectionnonoverlapping). Additional details of the system specications are collected in Table 1. 4.1 Collection of example patterns In order to facilitate the collection of labeled example patterns, the map of the area is stored on a laptop and a user interface has been designed based on a single click on the displayed map.
2.4 GHz ISM Bandwidth Direct Sequence Spread Spectrum CCK at 11 Mbps and 5.5 Mbps, DQPSK at 2 Mbps, DBPSK at 1 Mbps Media Access Protocol CSMA/CA (Collision Avoidance) with Acknowledgment (ACK) Bit Error Rate (BER) Better than Nominal Output Power 15 dBm External Antenna Gain 2.5dBi Transmission Speed Auto select 1, 2, 5.5, and 11Mbps Spreading 11chip Barker Sequence Encryption 128 bit - (RC4)- Gold, also supports 64-bit Wired Equivalent Privacy WEP (RC4)- Silver Receiver Sensitivity -83,-87,-91,-94 dBm at 11, 5.5, 2, 1Mbps Number of APs 3 Number of Measurement Points 56 Floor Dimensions 25.5 m x 24.5 m OS platform Linux 7.2 Table 1. The system specications
When the user is at an identiable position in the experimental area (e.g., at the entrance of a room, close to a corner, close to a column, etc.) he clicks on the displayed map in a point corresponding to the current position. Immediately after the click, the three received radio signal strengths from the APs are automatically measured and they are saved together with the points coordinates in a le, to prepare the examples for training and testing the neural network. A total of 56 measurement points are identied on the map and collected, during different periods of the day.
3 DC IU E @ U 9A ` @I E 9A 'QP E 'QP E (7) EGFH6 U 9A E @9A U where P E are the and coordinates obtained by the network and IU E I E and P and the correct target values. is the number of test or training patterns, 7
depending on the specic plot. Both the training error and the generalization error are shown in each gure. As expected, the training error decreases during training, while the generalization error rst decreases, then reaches a plateau value and nally tends to increase (over-training effect). The over-training effect is particularly strong for the architecture with 16 hidden units. The best generalization values (of about 1.52 meters) are reached after about 4,000 iterations for both the 8 and 16 hidden units architectures. The robustness of the MLP model for different architectures and for different lengths of the training phase is to be noted. When the architecture changes from 4 to 8 to 16 hidden units, the optimal generalization value changes only by less than 5% (from 1.6 meters to about 1.52 meters). When the number of iterations increases from 2,000 to 20,000 the generalization error worsens only by a few percent points, in particular for the more compact architectures (4 and 8 hidden units). After this series of tests, the architecture 3 8 2 apparently achieves close to optimal generalization values (of about 1.53 meters) and is less subject to overtraining that the more redundant 3 16 2 architecture. The structure of the neural network used in the subsequent tests consists of three layers as shown in Fig. 3: 3 input units, 8 hidden layer units and 2 outputs. The network structure is feed-forward and fully connected. The CPU time for a single training session (2,000 iterations with 300 examples) on the architecture 3 8 2 is of about 13.2 seconds on a 1400 MHz Pentium IV.
The purpose of the experiments in this section is to determine the architecture, in our case the number of hidden units, and the length of the training phase leading to the best generalization results. Fig. 2 describes a signicant summary of the results obtained in the experiments. The three architectures considered are given by 4, 8, and 16 hidden units. A set of labeled examples (signal strengths and correct location) has been collected as described in Sec. 4.1. Among all examples collected, 200 are extracted randomly and are used for the training phase, the remaining ones are used to test the generalization, at different steps during the training process. The plotted value is the average absolute distance 7 error over all patterns:
The rst experiment considers 56 examples 7 collected at a specic period during the as a function of the number of examples day. Fig. 4 shows the average distance error present in the training set, the remaining examples being used to test the trained neural net. The examples used for the training are selected randomly for each trial. We made 100 repetitions for the selection of the training sets. For convenience, the average over all trials is also plotted. It can be observed that about ve random examples are sufcient to produce a test distance error of less than 3 meters, already sufcient to localize a mobile terminal within a single room, in most cases. This is an indication that, once a map of the environment is available (without knowing the position of the three APs), a user may quickly train the system to recognize the position by visiting about ve different places and determining their positions on the given map. When the number for examples increases, the accuracy improves, to reach a value of about 1.9 meters for a number of examples equal to 45. After a careful examination of the data we discovered that in a fraction of the test points, only two of the three signals are present (when this event occurs one of the signals is set to the lowest possible value). This is also caused by the fact that only two APs are equipped with an external antenna. In this case, the distance error tends to be larger. A second experiment considers also the variability of the signal strengths during the day, a variability caused for example by the different number of people in the rooms affecting the signal propagation characteristics. A total of 8 collections of signal strengths has been executed at different times of the day, ranging from 8:30am to 6:30pm, for a total of 448 examples. Fig. 5 shows the (test) distance error obtained as a function of the number of training examples. For each trial, the specied number of examples is extracted randomly from the complete series, while the remaining examples are used for testing. For this experiment we repeat 100 times the random selection of the training sets. It can be observed that the distance error decreases rapidly as a function of the number for training examples, to reach a limiting value of about 1.5 meters for approximately 300 examples. Let us note that the second experiment is more difcult because now the environmental characteristics must be also taken into account by the neural network model. percentile The detailed histogram of the test error is shown in Fig. 6. The (median) is 1.69 meters. percentile) for For a comparison, the results presented in [3] are of 8.16 meters ( the strongest base station method (location of terminal guessed to be the same as the base station with the strongest signal), 2.94 meters for the empirical method proposed in the cited paper, and 2.75 meters by averaging over 5 nearest neighbors.
7 Conclusions
We considered a neural network (the multi-layer perceptron) for building a exible mapping between the raw signal measurements and the position of the mobile terminal. The average accuracy reached when the environmental changes during the day are also taken into account is of approximately 2.3 meters, therefore improving the previous
state of the art results [3]. The positive features of the system are its reliance on a standard wireless LAN infrastructure, its respect for privacy (the position computation is executed at the mobile station, the system is informed only if the user desires), its simplicity and speed. The training phase does not require the knowledge of the positions of the base stations and training can be done incrementally, by identifying points on a map and running the OSS algorithm. The collection of about ve known points is sufcient to determine the position within about 3 meters of accuracy. We plan to extend the present work to consider different neural network and machine learning methods, in particular by using the structured risk minimization principle presented in [21].
Acknowledgments
We would like to thank Pietro De Vigili, Dr. Kiss Kallo Csaba, and Dr. Davide Aprea for their help in collecting the examples and conducting the experiments, Dr. P. Bahl and Prof. A. Wolisz for their help in obtaining the software for getting the signal strengths data in the IEEE802.11b system.
References
1. J. Anhalt, A. Smailagic, D. P. Siewiorek, F. Gemperle, D. Salber, S. Weber, J. Beck, and J. Jennings. Toward context-aware computing: Experiences and lessons. IEE Intelligent System, 3(16):3846, May-June 2001. 2. P. Bahl, V. N. Padmanabhan, and A. Balachandran. A software system for locating mobile users: Design, evaluation, and lessons. Technical report, Microsoft Research, MSR-TR2000-12, April 2000. 3. Paramvir Bahl and Venkata N. Padmanabhan. RADAR: An in-building RF-based user location and tracking system. In IEEE INFOCOM 2000, pages 775784, March 2000. 4. R. Battiti. Accelerated back-propagation learning: Two optimization methods. Complex Systems, 3(4):331342, 1989. 5. R. Battiti. First-and second-order methods for learning: Between steepest descent and newtons method. Neural Computation, 4:141166, 1992. 6. R. Battiti and G. Tecchiolli. Training neural nets with the reactive tabu search. IEEE Transactions on Neural Networks, 6(5):11851200, 1995. 7. C. Bishop. Exact calculation of the hessian matrix for the multilayer perceptron. Neural Computation, 4:949501, 1992. 8. G.M. Djuknic and R.E. Richton. Geolocation and assisted GPS. IEEE Computer, pages 123125, February 2001. 9. Andy Harter and Andy Hopper. A distributed location system for the active ofce. IEEE Network, 6(1):6270, jan-feb 1994. 10. Andy Harter, Andy Hopper, Pete Steggles, Andy Ward, and Paul Webster. The anatomy of a context-aware application. In MOBICOM 1999, pages 5968, August 1999. 11. Jeffrey Hightower and Gaetano Borriello. Location system for ubiquitous computing. IEEE Computer, 34(8):5766, August 2001. 12. Jeffrey Hightower, Gaetano Borriello, and Roy Want. SpotON: An Indoor 3D Location Sensing Technology Based on RF Signal Strength. The University of Washington, Technical Report: UW-CSE 2000-02-02, February 2000.
13. Kurt Hornik. Approximation capabilities of multilayer feedforward networks. Neural Networks, 4:251257, 1991. 14. U. Kubach and K. Rothermel. Exploiting location information for infostation-based hoarding. In Seventh Annual International Conference on Mobile Compuiting and Networking, Rome, pages 1527, 2001. 15. A. Misra, S. Das, A. McAuley, and S. K. Das. Autoconguration, registration and mobility management for pervasive computing. IEEE Personal Communications (Special Issue on Pervasive Computing), 8(4):2431, Aug 2001. 16. Luis Munoz, Marta Garcia, Johnny Choque, Ramon Aguero, and Petri Mahonen. Optimizing internet ows over IEEE 802.11b wireless local area networks: A performance-enhancing proxy based on forward error correction. IEEE Communications Magazine, 39(12):6067, December 2001. 17. S. Pradhan, C. Brignone, J.-H. Cui, A. McReynolds, and M.T. Smith. Websigns: hyperlinking physical locations to the web. IEEE Computer, 34(8):4248, August 2001. 18. Nissanka B. Priyantha, Anit Chakraborty, and Hari Balakrishnan. The cricket locationsupport system. In MOBICOM 2000, pages 3243, August 2000. 19. T. S. Rapport. Wireless Communications - Principles and Practice. IEEE Press, 1996. 20. D. F. Shanno. Conjugate gradient methods with inexact searches. Mathematics of Operations Research, 3(3):244256, 1978. 21. V. N. Vapnik. The Nature of Statistical Learning Theory. Springer Verlag, 1995. 22. V.N. Vapnik and A. Ja. Chervonenkis. On the uniform convergence of relative frequencies of events to their probabilities. Theory Probab. Apl., 16:264280, 1971. 23. Roy Want, Andy Hopper, Veronica Falcao, and Jonathan Gibbons. The active badge location system. ACM Transaction on Information Systems, 10(1):91102, January 1992. 24. Andy Ward, Alan Jones, and Andy Hopper. A new location technique for the active ofce. IEEE Personal Communications, 4(5):4247, October 1997. 25. Jay Werb and Colin Lanzl. Designing a positioning system for nding things and people indoors. IEEE Spectrum, 35(9):7178, September 1998.
1.7 1.65 Average absolute error (meters) 1.6 1.55 1.5 1.45 1.4 1.35 1.3 1.25 1.2 0 1.7 1.65 Average absolute error (meters) 1.6 1.55 1.5 1.45 1.4 1.35 1.3 1.25 1.2 0 1.7 1.65 Average absolute error (meters) 1.6 1.55 1.5 1.45 1.4 1.35 1.3 1.25 1.2 0 2000 4000 6000 2000 4000 6000 2000 4000 6000
8000 10000 12000 14000 16000 18000 20000 Number of iterations test error training error
8000 10000 12000 14000 16000 18000 20000 Number of iterations test error training error
2 (top), 3
2 (middle), 3
16
Output Layer
Hidden Layer
Input Layer
AP1s Signals
average
2.5
1.5
10
15
20 25 30 Number of examples
35
40
45
50
Fig. 4. Reduction of average distance error (test) as a function of the number of training examples (patterns at a single time of the day)
"average"
2.5
1.5
50
100
350
400
450
Fig. 5. Reduction of average distance error (test) as a function of number of training examples (patterns at different times of the day).
0.25
0.15
0.1
0.05