Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

A Practical Implementation of The Box Co PDF

Download as pdf or txt
Download as pdf or txt
You are on page 1of 6

Computers & Geosciences Vol. 24, No. 1, pp.

95±100, 1998
# 1998 Elsevier Science Ltd. All rights reserved
Printed in Great Britain
PII: S0098-3004(97)00137-4 0098-3004/98 $19.00 + 0.00

SHORT NOTE
A PRACTICAL IMPLEMENTATION OF THE BOX
COUNTING ALGORITHM
GUIDO GONZATO
Dipartimento di Fisica, Settore di Geo®sica, UniversitaÁ di Bologna, Viale Berti Pichat 8, 40100
Bologna, Italy
(e-mail: guido@ibogfs.df.unibo.it)

(Received 22 November 1996; revised 1 August 1997)

INTRODUCTION: FRACTALS AND ALGORITHMS of the object. This step is repeated M times, using
squares of increasingly shorter side. The relation
Fractal geometry and fractal analysis have had a
between Zi and D is
huge impact on geosciences as a whole (see Scholz
and Mandelbrot, 1989; Turcotte, 1992, and refer- log…N…Zi †† ˆ a ÿ D log…Zi †: …2†
ences therein), though most published works deal
with fractals from a theoretical point of view. One then calculates D by ®tting the regression
Branches of geosciences like geomorphology or ge- line between the independent variable log(Zi) and
ography, which could bene®t from fractal analysis the dependent variable log(N(Zi)), where i = 1, . . . ,
M. D is given by the absolute value of the line
carried out on actual morphologic features, have in
slope.
practice gained less than expected from the fractal
To illustrate practically the use of the method, let
approach.
us imagine one wants to measure the fractal dimen-
This is perhaps a paradox, considering that the
sion of the fault lines distribution in an area: the
®rst instance of fractal analysis dates back to the
®rst step is to draw a tectonic map. The following
works of a geographer, Richardson (1961), in his
algorithm in Table 1 is then applied.
attempts to relate the length of a coastline and the
It must be stressed that in order to obtain reliable
length of the ``ruler'' used to measure it. The
results and a realistic estimate of D, two conditions
method he followed consisted in measuring the
are to be met: (1) a large range of square side
length of the coastline with a map divider, using
lengths must be used and (2) for each step, a large
increasingly shorter steps. Richardson found that
number of boxes must be ``full'' (that is, containing
L…Zi †AZi…1ÿD† …1† part of the object). Besides, particular care must be
taken with the subsequent regression analysis, see
where L(Zi) is the length of the coastline as a func- Davis (1973), Draper and Smith (1981) and Koch
tion of Zi, Zi is the i-th spread of the map divider, (1987) for examples of application of this technique.
and D is a constant. Mandelbrot (1967) interpreted Manual implementation of the method is tedious
D from a geometrical point of view, noting that and time consuming. As a result, the computer-shy
Equation (1) coincided with the mathematical de®- scholar is bound either to give up or to employ
nition of Hausdor€±Besicovich dimension. In this only few square lengths, thus breaking condition 1.
respect, D is a dimension value; Mandelbrot called However, even a PC-class computer can generate Zi,
it ``fractal dimension''. An introduction to fractals N(Zi) and D in a matter of few seconds. The follow-
and the techniques employed for estimating D can ing program has been written with ease of use and
be found in Roach and Fowler (1993). eciency in mind.
The procedure followed by Richardson is known
as the ``divider method'' or ``walker's ruler''
(Mandelbrot, 1982) (see Fig. 1), and in practice it is
THE PROGRAM vsbc
never used because of its main limitation: the object
being measured must be connected. Another algor- vsbc (virtual screen box counting) is a simple
ithm is the box counting (cf. Grassberger, 1993) yet powerful implementation of the classical box-
(see Fig. 2), which has no restrictions on the shape counting algorithm, aimed at fractal analysis of
of the object. To calculate D, one covers the object digitised maps. As its name suggests, it uses a ``vir-
with a grid of squares initially of side Z1, then tual screen'' that is necessary to improve scope and
counts the number N1 of squares that include part performance.
95
96 Short Note

Figure 1. Divider method algorithm.

Table 1. Box counting algorithm


VAR LogEta, LogNBox: ARRAY [1. . .30] of Real;
ScreenSize, Side, x, y,
NumSteps, NumBox: Integer;
D: Real;
BEGIN
ScreenSize:= ``screen width''; { e.g. 1000 pixels }
Side:= ScreenSize; { initial box side }
NumSteps:= 0;
REPEAT
NumBox:= 0;
y:= 0;
WHILE (y < =ScreenSize) DO BEGIN
WHILE (x < =ScreenSize) DO BEGIN
IF "the box (x,y,x + side,y + side) contains at least one pixel turned
on" THEN
NumBox:= NumBox + 1;
x:= x + Side
END; { WHILE x }
y:= y + Side
END; { WHILE y }
NumSteps:= NumSteps + 1;
LogEta(NumSteps):= log(Side);
LogNBox(NumSteps):= log(NumBox);
Side:= Side/sqrt(2)
UNTIL (Side < =1);
"calculate regression line between vectors LogEta and LogNBox, NumSteps
data each";
D:= abs("line slope")
END. { of box counting algorithm }

The box-counting algorithm requires that the


range of the box side length be as large as possible.
As a rule of thumb, the range should span at least
three orders of magnitude, which requires a
1000  1000-pixel screen. However, a standard PC
computer screen has a resolution of 800  600 or
1024  768 pixels. Since the boxes must be square,
this provides a usable 600  600 or 768  768 screen,
which is hardly enough for an accurate analysis.
To extend the screen resolution further there are
three ways. The ®rst is to buy expensive hardware
(namely, a big monitor and a memory-®lled video
card). The second is to employ the ``virtual screen''
facilities provided by the scrolling windows in some
environments, such as MS Windows or X-Window
System. The third way, which is the most e€ective Figure 3. 2  16 array of bytes acting as 16  16-pixel
and easiest to implement, is to drop the graphical virtual screen.
Short Note 97

Figure 2. Box counting algorithm.


98 Short Note

Figure 4. (A) straight line (D = 1). (B) Koch curve (D = 1.2618). (C) Sierpinsky carpet (D=1.5849).
Short Note 99

Table 2. Application of vsbc: theoretical and calculated D


Picture Theoretical D Calculated D Std. Dev. (D) Di€erence (%)

Straight line 1.0000 0.992 0.007 ÿ0.8


Koch curve 1.2618 1.279 0.010 +1.36
Sierpinsky carpet 1.5849 1.5647 0.016 ÿ1.27

approach completely and simulate a large screen one changes the directive #define NPIXELS
using conventional memory. 2048 in vscrn.c.
An S*S-pixel wide monochromatic screen can be The critical step in the box counting algorithm is
implemented using an S/8*S array of bytes, where when a box is explored to check whether it contains
each bit acts as a ``virtual pixel'' that is either on or data. In vsbc the search is simple, with the boxes
o€ (Fig. 3). This implementation is simple, memory systematically searched, pixel by pixel, until either
ecient, and very fast. In fact, accessing the single at least a full pixel is encountered or the last empty
``pixels'' in the virtual screen does not require any pixel is reached. Fast search techniques are beyond
BIOS or graphic library function calls and their in- the scope of this work; see Grassberger (1993) or
herent overhead. Furthermore, the screen resolution Wirth (1976) for further insight.
can be increased at will to increase precision, with vsbc has been compiled and tested using the C
the only limit being available memory. compiler gcc v. 2.7.0 under Linux 1.2.13, as well
Other practical details that can in¯uence the pre- as cc under Digital Unix 4.0, Djgpp v. 2.01 and
cision of the measured D are the scanner resolution, Turbo C++ 3.0 under MS-DOS. The program is
the displacement and the angle of the image on the available by anonymous FTP on ibogeo.df.unibo.it
virtual screen. According to our preliminary tests, (137.204.48.112), in directory /pub/vsbc, or from
the former point does not seem to a€ect D signi®- FTP.IAMG.ORG.
cantly, whereas the latter is likely to be the main
cause of error, and will be the subject of a sub-
sequent article. RESULTS

In order to test vsbc, a few di€erent images


have been generated, saved as PCX ®les, and sub-
IMPLEMENTATION mitted to the program (Fig. 4a±c). These pictures
represent a straight line, the Koch curve, and the
vsbc is the translation into ANSI C of the al-
Sierpinsky carpet; their theoretical dimension values
gorithm in Table 1. The program consists of two
are 1, 1.2618 and 1.5849, respectively.
modules: the ®rst, vscrn.c, contains the rou-
For each picture, vsbc outputs a ®le containing
tines for the virtual screen management, the second
pairs of values log(box side)±log(number of boxes),
is the main program.
on which one calculates the regression line, and
The module vscrn.c implements all the basic
®nally D. The results are shown in Table 2. Note
functions needed for graphics. In particular, the
how the calculated D is close to the theoretical
functions putpixel(), vgetpixel() and
value.
vline() are used to plot pixels, to read the pix-
els status (on or o€), and to draw lines; the func-
tions vpcxsave() and vpcxload() are CONCLUSIONS
used to save or load a virtual screen as a mono-
chromatic PCX ®le. This graphic format is particu- In our experience, vsbc has proven to be a
larly suitable for black and white pictures, as it is practical and useful program. We hope that its use
simple to implement and produces very compact may help the computer-shy researcher develop
®les that are amenable to further compression, sav- further insight into the application of fractal geome-
ing disk space. Its only drawback is that it is try on real-world objects.
becoming obsolete and being replaced by other for-
mats, like GIF or TIFF; as a result, not all graphic AcknowledgmentsÐThis work was performed with contri-
packages still support it. If necessary, butions from CNR-Gruppo Nazionale per la Vulcanologia
and Gruppo Nazionale Difesa dai Terremot.
vpcxsave() and vpcxload() can be
replaced by other functions implementing, say, the
TIFF format. REFERENCES
As it stands, vsbc uses a 2048  2048 screen,
therefore using only 2048*2048/8 = 524,288 bytes Davis, J. C. (1973) Statistics and Data Analysis in Geology.
John Wiley & Sons, New York, 646 pp.
for its array. This small amount of memory con- Draper, N. and Smith, H. (1981) Applied Regression
sumption makes it possible to use the program even Analysis, 2nd edn. John Wiley & Sons, New York, 709
on PC-class machines; to increase the screen width, pp.
100 Short Note

Grassberger, P. (1993) On ecient box counting algor- Richardson, L. F. (1961) The problem of contiguity: an
ithms. International Journal of Modern Physics C 4(3), appendix of statistics of deadly quarrels. General
515±523. Systems Yearbook 6, 139±187.
Koch, K. R. (1987) Parameter Estimation and Hypothesis Roach, D. E. and Fowler, A. D. (1993) Dimensionality
Testing in Linear Models. Springer Verlag, Berlin, 378 analysis of patterns: fractal measurements. Computers
pp. & Geosciences 19(6), 849±869.
Scholz, H. and Mandelbrot, B. B. (eds.) (1989) Fractals in
Mandelbrot, B. B. (1967) How long is the coast of Geophysics. BirkhaÈuser Verlag, Basel, 313 pp.
Britain? Statistical self-similarity and fractional dimen- Turcotte, D. L. (1992) Fractals and Chaos in Geology and
sion Science 156, 636±638. Geophysics. Cambridge University Press, Cambridge,
Mandelbrot, B. B. (1982) The Fractal Geometry of Nature. 221 pp.
W. H. Freeman and Co., San Francisco, California, Wirth, N. (1976) Algorithms + Data Structures = Programs.
468 pp. Prentice-Hall, Englewood Cli€s, New Jersey, 228 pp.

You might also like