been published in Oceanography,
Volume 17, Number 2, a quarterly journal of The Oceanography Society. Copyright 2003 by The Oceanography Society. All rights reserved. Reproduction of any portion of this artiJune 2004
32This article hasOceanography
cle by photocopy machine, reposting, or other means without prior authorization of The Oceanography Society is strictly prohibited. Send all correspondence to: info@tos.org or 5912 LeMay Road, Rockville, MD 20851-2326, USA.
From Meters
to Kilometers
A LO OK AT O CEANCOLOR SCALE S OF VARIABILITY,
SPATIAL COHERENCE, AND THE NEED FOR FINESCALE
REMOTE SENSING IN COASTAL O CEAN OPTICS
B Y W. PA U L B I S S E T T, R O B E R T A . A R N O N E , C U R T I S S O . D A V I S , T O M M Y D . D I C K E Y,
D A N I E L D Y E , D AV I D D . R . K O H L E R , A N D R I C H A R D W. G O U L D , J R .
Oceanography
June 2004
33
INTRODUCTION
explicit terms of cost-benefit analysis, such
The physical, biological, chemical, and opti-
discussions should be integral parts of the
(ONR) sponsored the Hyperspectral Coastal
cal processes of the ocean operate on a wide
scientific design of instruments, platforms,
Ocean Dynamics Experiment (HyCODE)
variety of spatial and temporal scales, from
and experiments aimed at resolving oceanic
(Dickey et al., this issue), which presented
seconds to decades and from micrometers to
processes.
the opportunity to study the question of
In 2001, the Office of Naval Research
thousands of kilometers (Dickey et al., this
The practical examples of this problem in
issue; Dickey, 1991). These processes drive
remote sensing include: “What is the optimal
Hyperspectral airborne sensors were de-
the accumulation and loss of living and non-
repeat coverage frequency?” and “What is the
ployed on several platforms at various al-
living mass constituents in the water column
optimal Ground Sample Distance (GSD) or
titudes. This coverage was supplemented
(e.g., nutrients, phytoplankton, detritus, sed-
pixel size of the data?” For the optical ocean-
by numerous space-borne, remote-sensing
iments). These mass constituents frequently
ographer, there is also the issue of optimal
satellites. The airborne instruments included
have unique optical characteristics that alter
spectral coverage needed to resolve the opti-
two versions of the Portable Hyperspectral
the clarity and color of the water column
cal constituents of interest (Chang et al., this
Imager for Low-Light Spectroscopy (PHILLS
(e.g., Preisendorfer, 1976). This alteration
issue). The sum of these considerations feed
1 and PHILLS 2) (Davis et al., 2002) op-
of the ocean color, or more specifically the
into the sensor, deployment platform, and
erating at an altitude of less than 10,000
change in the spectral “water-leaving radi-
deployment schedule decisions. For polar-
feet and 30,000 feet, respectively, as well as
ance,” Lw(λ), has led to the development of
orbiting and geo-stationary satellites that
the NASA Airborne Visible/Infrared Imag-
optical techniques to sample and study the
cost hundreds of millions of dollars, as well
ing Spectrometer (AVIRIS) sensor operat-
change in biological and chemical constitu-
as airborne sensors that have smaller upfront
ing at 60,000 feet. These sensors provided
ents (Schofield et al., this issue). Thus, these
costs but higher deployment costs, the deci-
hyperspectral data at 2 m, 9 m, and 20 m
optical techniques provide a mechanism to
sion of sampling frequency directly impacts
GSDs, respectively. The satellite data col-
study the effects of underlying biogeochemi-
the scientific use of the data stream, and
lected included the multi-spectral images
cal processes. In addition, because time- and
what processes may be addressed with data
from Sea-viewing Wide Field-of-view Sensor
space-dependent changes in Lw(λ) may be
streams collected by these sensors. These
(SeaWiFS), Moderate Resolution Imaging
measured remotely, optical oceanography
scientific cost-benefit analyses extend be-
Spectroradiometer (MODIS), Fengyun 1
provides a way to sample ecological interac-
yond the cost in dollars because the typical
C (FY1-C), Oceansat as well as the multi-
tions over a wide range of spatial and tem-
lifetime and replacement cycle of these sen-
spectral polarimeter Multiangle Imaging
poral scales.
sors is on the order of years to decades, and
SpectroRadiometer (MISR) sensor and sea
a poorly designed sensor package is very dif-
surface temperature (SST) sensor Advanced
ficult to replace.
Very High Resolution Radiometer (AVHRR).
The question often posed by scientists
trying to resolve problems involving the
scales of variability in remote-sensing data.
These collections provided a wealth of re-
temporal and spatial variation of oceanic
properties is: “What is the optimal time/
W. Paul Bissett (pbissett@flenvironmental.org)
mote-sensing and field data during a spa-
space sampling frequency?” The obvious an-
is Research Scientist, Florida Environmental Re-
tially and temporally intense oceanographic
swer is that the sampling frequency should
search Institute, Tampa, FL. Robert A. Arnone
field campaign, and they offered the ability
be one half the frequency of the variation
is Head, Ocean Sciences Branch, Naval Research
to begin to address the issue of optimal sam-
(i.e., Nyquist frequency) of the property of
Laboratory, Stennis Space Center, MS. Curtiss
pling scales for the coastal ocean.
interest. However, therein lies the rub for
O. Davis is at Remote Sensing Division, Naval
the oceanographer: the range of the relevant
Research Laboratory, Washington, DC. Tommy
data streams requires the calibration, vali-
scales is large, and the range of available
D. Dickey is Professor, Ocean Physics Laboratory,
dation, and atmospheric correction of the
resources and/or actual engineering capa-
University of California, Santa Barbara, Goleta,
sensor signals to retrieve estimates of Lw(λ),
bilities to sample all relevant scales is often
CA. Daniel Dye is at Florida Environmental Re-
or “remote sensing reflectance,” Rrs(λ), a
small. Hence, the decisions affecting re-
search Institute, Tampa, FL. David D.R. Kohler is
normalized measure of the Lw(λ). Our goals
source allocation become critical in order to
Senior Scientist, Florida Environmental Research
in this paper are to illuminate some of the
maximize the total data information in both
Institute, Tampa, FL. Richard W. Gould, Jr.
issues of remote sensing spatial scaling in
quantity and quality. While these scientific
is Head, Ocean Optics Center, Naval Research
the nearshore environment and attempt to
resource decisions are rarely discussed in
Laboratory, Stennis Space Center, MS.
derive some understanding of appropriate
34
Oceanography
June 2004
The use of these multiple remote-sensing
sampling scales in the nearshore environ-
cube. The calibration of the sensor (Kohler
data density is to reduce spectral resolution.
ment. We will focus on the data collected by
et al., 2002a) and the specific corrections for
However, reducing spectral resolution also
a single sensor (PHILLS 2) to reduce uncer-
the window as well as the atmospheric cor-
reduces the biogeochemical information that
tainties in the analysis that may result from
rection of these data are described elsewhere
may be derived from optical data. To look at
the different data processing techniques ap-
(Kohler et al., 2002b). The data values are
the impacts of spectral resolution reduction
plied to each of the individual sensors’ data.
given in Rrs(λ), units of 1/sr (Mobley, 1994).
on the ability to discern spatial variability
This single 124-band data cube from July 31,
in the spectral Rrs(λ) data, the hyperspectral
2001 represents 15 GB of raw data. This data
data were reduced in spectral resolution to
was calibrated, atmospherically corrected,
approximate the SeaWiFS bands. This was
and geo-rectified for the analyses presented
accomplished by multiplying the Rrs(λ) by
here.
the SeaWiFS wavelength response function
METHODS
The PHILLS 2 was deployed seven times
st
rd
th
st
st
(July 21 , 23 , 27 , 31 a.m., 31 p.m., and
st
nd
August 1 and 2 ) during the 2001 HyCODE
LEO-15 field program, and each mission
The engineering issues surrounding the
(Figure 2). This created an 8-band image,
generated nearly 4,000 square kilometers
collection, storage, and transmission of
with band centers located at 412, 443, 490,
of spectral data at 9 m resolution (Figure
higher spatial and spectral resolution sys-
510, 555, 670, 765, and 865 nm. These data
1). For this discussion on spatial scaling, we
tems are fairly cost intensive. If this image
are used to illuminate the different multi-
have chosen to focus on a single PHILLS 2
was collected from space, it would require
spectral and hyperspectral data streams to
image from July 31 , as the coverage pro-
over three hours to transmit the data to a
resolve information variability in the near-
vided by these data approximates the total
ground station over an X-band downlink
shore environment.
spatial extent of a satellite sensor, at a much
(for reference, a polar-orbiting satellite has
The autocorrelation function has previ-
higher spatial resolution, which allows us to
approximately an 11-minute transmission
ously been used in time-series studies to de-
explore scaling issues within a single image
window). One of the easiest ways to reduce
termine the optimal time frequency of sam-
st
Figure 1. A false color
composite of the 9 m Portable Hyperspectral Imager for Low-Light Spectroscopy (PHILLS 2) data
collected at an altitude of 30,000 feet on July 31, 2001 at the HyCODE LEO-15 study
site offshore of New Jersey. The inshore yellow dot represents the location of the LEO-15 profiling bio-optical
node. The offshore blue dot is the location of the UCSB OPL (University of California, Santa Barbara, Ocean Physics Laboratory)
bio-optical mooring. The inshore small red box and the offshore small green box represent regions of interest (ROIs) where the
variance of the SeaWiFS Band 5 Rrs, PC1 (SW) and PC1 (Hyp) were approximately the same, even though the mean was significantly different (see text and Table 1). The size of these boxes represents a mean ground sampling distance (GSD) of 441 m (49 pixels
on a side for a total of 2401 pixels equal to approximately 0.2 km2). The white line represents the transect data used in the variable
GSD study. Its selection was driven by the desire to use a single flight line of data for the variance calculation (see text).
Oceanography
June 2004
35
Figure 2. To evaluate the effect of reduced spectral resolution on spatial variability, a reduc-
Figure 3. The Simulated SeaWiFS Band 5 Rrs values (sr-1*10,000) along the sampling line
tion in the spectral resolution of the hyperspectral data was performed so as to approxi-
transect as shown in Figure 1. The vertical green and red lines denote the respective
mate that of SeaWiFS bands. Shown are the SeaWiFS wavelength response functions used
locations of offshore and inshore regions of interest from which the variance threshold
to transform the hyperspectral PHILLS 2 data into a simulated SeaWiFS-type data product.
for the GSD analysis was determined.
pling (e.g., Abbott and Letelier, 1998; Chang
to be different over cross-shore distances
would like the scene itself to describe the op-
et al., 2002; Dickey et al., 2001), which led
compared to along-shore distances. There
timal GSD based on the ability of the sensor
us to attempt a spatial autocorrelation
are other statistical methods for estimating
and hyperspectral data to resolve distinct,
to examine spatial variability of Rrs(555)
spatial variability, including those that de-
homogeneous waters. The more rigorous
along the transect shown in Figure 1 (Fig-
termine the anisotropy in the directionality
application of 2-D variance analyses is the
ure 3). However, results indicated a trend in
of the variance calculation (i.e., the variance
subject of a follow-on study.
this data record, with higher intensities of
is different in different directions) (Curran,
Rrs(555) nearshore. Autocorrelation stud-
1988; Dale et al., 2002). Variance ellipses
od of estimating spatial variability, one that
ies require the mean of any subsample of a
have been used to describe the variance in
focuses on the ability to separate the linearly
record to approximate the mean of the total
altimeter-derived velocities in the near-
additive noise of the image from the “real”
record. Any attempt to calculate a decorrela-
shore environment (e.g., Strub and James,
geophysical detail of the scene. Linearly ad-
tion scale from this transect would result in
2000). Of particular interest may be the use
ditive noise refers to the interference derived
a value for the decorrelation scale that had a
of semivariogram or variograms developed
from the noise of the sensor as well as any
direct proportional relationship to the total
in the soil research community to describe
noise generated from the atmosphere or
length of the transect. The statistical reason
“roughness” in the topology of spatial mea-
processing algorithms. If the noise is stable
for this is that data from the transect does
surements (Curran, 1988). These have been
and linear, then the true signal of interest
not represent a stationary function (i.e.,
used with satellite ocean remote-sensing
may be retrieved from a sample of a popula-
the mean changes as the transect length in-
data to describe larger scales of interest in
tion, provided the sample size is sufficiently
creases and, therefore, the sample variance
chlorophyll distributions (e.g., Yoder et al.,
large. In this case, we would expect that the
will increase with domain size) (Chilès and
1987). However, many of these methods
standard deviation of the Rrs(λ) signal to be
Delfiner, 1999) (see Statistics Review box).
require interpretations that are difficult to
a proxy for the total noise, and that over any
This suggests that the measure (decorrela-
definitively relate to geophysical parameters,
homogenous region of the scene it should be
tion scale) may be an improper statistic to
i.e., “sills” and “nuggets” in variograms. Here,
constant, regardless of the magnitude of the
use to describe the optimal spatial sampling
we are interested in determining an optimal
signal. Thus, any pixel in a homogenous re-
frequency for coastal-ocean data sets.
sampling size that is more easily discussed in
gion of interest would be equal to the mean
terms of this scene of interest and in terms
value of the region ± some random compo-
of the sensor capabilities. In other words, we
nent.
While not statistically explored here, the
change in mean (and thus, variance) appears
36
Oceanography
June 2004
For this study, we derived another meth-
background noise-generated standard devia-
data to separate water masses into distinct
tiplicative noise in our data. Multiplicative
tion must contain regions of distinctly dif-
optical regions. One approach to using the
noise occurs when the noise (i.e., standard
ferent optical constituents.
full spectrum simultaneously is to first lin-
This analysis would fail if there were mul-
deviation) is a direct function of the inten-
Most studies of spatial variation in radi-
early transform the n-dimensional spectral
sity (mean) of the signal. By assuming (and
ance fields focus on the variance within a
data (where n is the number of wavelengths)
confirming) that the noise of the scene is
single channel, or perhaps a combination
into a variance minimizing coordinate sys-
truly random and linear, any increase in the
of channels. In this study, we wish to assess
tem. When the “proper” or root vectors (ei-
standard deviation would thus be gener-
if there is any additional information to be
genvectors) of this new coordinate system
ated by a change in real geophysical proper-
retrieved from the continuous spectrum of
are orthogonal to each other, this type of
ties within the region of interest. Therefore,
reflectance data, as opposed to using only
transformation is called a Principal Com-
an increase in a region’s standard deviation
one or two bands individually. The question
ponent Analysis (PCA). A PCA allows the
above a background random noise-gener-
of how to use the entire hyperspectral data
user to focus on the vectors that describe the
ated standard deviation would suggest a
simultaneously to identify homogeneous
most variance (information) using the entire
nonhomogenous region of interest, i.e., one
regions of optical properties is an active area
spectral and image space, rather than focus-
with real differences in the region’s geophysi-
of research; as a first step, we would like to
ing only on the variance in the image at a
cal properties. Put simply, a region of inter-
be able to determine if the full spectrum of-
single wavelength A PCA is a powerful way
est with a standard deviation greater than a
fers any ability over single or multichannel
to look for patterns.
S TAT I S T I C S R E V I E W
When analyzing any data set, a good place to start is
data sets, one would expect 68 percent of the popula-
Other measures, such as sample variograms and
by calculating the data set’s mean (a measure of the
tion to fall with 1 standard deviation (1σ) around the
Pair Quadrat Variance (PQV), focus only on the change
central tendency) and variance (a measure of the dis-
population’s mean. The probabilities that any member
with lag distance. For a transect of n contiguous or
persion or variability). The mean is given by the fol-
of the population would fall within 2 and 3σ are ap-
equally spaced intervals (quadrats), a sample variogram
lowing equation:
proximately 95 and 99 percent, respectively.
for a given distance d is given by (Dale et al., 2002):
n
∑X
µ=
n−d
In spatial data analysis, one is frequently interested
i
in how a sample at one spot co-varies or correlates
i =1
N
γˆ ( d ) =
with the same measure of a sample in another location.
∑( X
i
− Xi + d )
2
i =1
N −d
where the capital Greek letter sigma (Σ) means sum-
Autocovariance and autocorrelation are simply mea-
mation over all values, Xi, in the population, divided by
sures of the covariance and correlation of the values of
sample variogram is constant with respect to direc-
the total number of values, N, in the population. The
a single variable for all pairs of points separated by a
tion, it is referred to as isotropic. If the variogram
variance is given by:
given spatial lag (Dale et al., 2002). An estimate of the
changes with respect to the direction with which it
autocovariance for samples at a distance d is given by:
was calculated, then it is referred to as anisotropic. It
n
σ2 =
∑( X
i
− µ)
2
n−d
∑( X
i =1
N
It is easily seen that the greater the separation of
the individual values, Xi, are from the mean, , the
Cov =
i
− µ ) ( Xi + d − µ )
i =1
N −d
The autocorrelation is given by dividing Cov by σ2.
Note that this equation is omnidirectional. If the
is clear from Figures 1 and 6 that there appears to be a
directionality component to the along shore and cross
shore variance, and thus this image would be consider
anisotropic.
larger the variability of the population represented in
The value of these statistics in describing the data set of
the data set grows. The square root of the variance is
interest depends on the validity of the underlying as-
REFERENCE S
called the standard deviation. When the population is
sumptions. A trend in the spatial data (similar to Figure
Dale, M.R.T., P. Dixon, M.-J. Fortin, P. Legendre, D.E. Myers,
normally distributed around its mean, the standard de-
3) violates the assumption of stationarity, i.e. the esti-
and M.S. Rosenberg, 2002: Conceptual and mathemati-
viation provides a measure that is easily conceptualized
mate of the mean and the autocorrelation are constant
cal relationships among methods for spatial analysis.
as a distance away from the mean. The standard devia-
with respect to distance along the record, and negates
Ecography, 25, 558-577.
tion may also be used to produce a confidence interval
the effectiveness of the autocovariance and autocorre-
in populations that are normally distributed. In such
lation in the overall analysis.
Oceanography
June 2004
37
It should be noted that great care must
the user to recognize that it is a hyperspec-
eigenvalues of the first eigenvector created
be used in analyzing a PCA transformation
tral vector itself, which could not have been
from a PCA (referred to as PCA Hyp) of the
of a hyperspectral image. There will be an
generated without the full spectral data set.
hyperspectral image. We used the Environ-
equal number of eigenvectors as there are
The easiest way to see the impacts of all of
ment for Visualizing Images (ENVI) soft-
spectral channels, but frequently only the
the wavelengths on the eigenvector is to
ware package from Research Systems, Inc., to
first 10 eigenvectors are necessary to de-
square the PCA eigenvectors to calculate
accomplish a PCA of the hyperspectral data.
scribe >99 percent of the total variance in
each channel’s percentage contribution to
The first eigenvector (PC1) of both the hy-
the scene. However, the number of eigenvec-
the description of scene’s spectral variation.
perspectral and two band images described
tors needed to describe the total variance in
We seek to use the hyperspectral data to
>95 percent of the variance of the images;
the scene is completely image dependent. If
separate homogenous water masses, and we
PCA Hyp PC1 = 95.6 percent and PCA SW
there is a large amount of spectral variation
believe that there is additional information
PC1 = 99.0 percent. The second and third ei-
in the scene, then more eigenvectors will
in the full spectrum of the radiance field,
genvector of the PCA Hyp accounted for 2.9
be needed to describe the majority of the
rather than in any single channel or combi-
percent, and 0.7 percent of the image’s spec-
scene variance. If there is a small amount of
nation of channels. In order to test this be-
tral variance, respectively. The total variance
spectral variation, then a smaller number of
lief, we will compare the spatial variability of
described by the remaining eigenvectors for
eigenvectors will be required. As an example,
three images created from the same hyper-
PCA Hyp is 1.43 percent. There are only two
many open-ocean images have been found
spectral data set. The first image is a single
eigenvectors for the PCA SW, and the second
to only need the first three eigenvectors to
simulated SeaWiFS band (Band 5). The sec-
accounts for 1 percent of the variance.
The first three eigenvectors from the PCA
describe 98 percent of the scene dependent
ond is an image of the eigenvalues of the first
variance (e.g., Mueller, 1976). For these
eigenvector created from a PCA (referred
Hyp as well as the percentage contribution
ocean images, a common error in PCA is to
to as PCA SW) of a simulated two band
from each spectral channel to each eigen-
assume that only three spectral channels are
SeaWiFS image (Bands 3 and 5). These two
vector, is shown in Figure 4. It is clear that
needed to describe the scene dependent vari-
bands were selected for this multispectral
while there are some dominant channels in
ance. It must be understood that the eigen-
test, since they are used in many common
the first eigenvector (i.e., approximately 560
vector is a measure of the variance across all
SeaWiFS chlorophyll algorithms (O’Reilly et
nm in PC1), it peaks at only approximately
bands simultaneously, and therefore requires
al., 1998). The third image is an image of the
6 percent, which means that the other wave-
A
B
Figure 4. So as to evaluate the entire spectral data set, Principal Component Analysis (PCA) was used to reduce the dimensionality of the image. PCA is a method of maintaining nearly all
of the characteristics of the original data set while reducing the number of parameters needed to describe the data. This is accomplished by reprojecting those data along orthogonal axes
that are positioned to best describe the variance of the data (eigenvalues and eigenvectors). The first three eigenvector principal components are displayed in (A). The influence that each
spectral band had on the first three principal components is displayed in (B).
38
Oceanography
June 2004
lengths contribute 94 percent of the influence on the variance described by this vector
Image Type
Region of
Interest, ROI
Mean of ROI
Standard
Deviation
of ROI
Inshore
52.77
1.54
Offshore
24.55
1.85
Inshore
13.38
1.79
Offshore
-21.68
1.98
Inshore
111.08
6.44
Offshore
-12.11
7.46
(Figure 4A). Therefore, it would not be accurate to say a single channel would describe
Simulated SeaWiFS Band 5
95.6 percent of the variance in this image. A
more accurate statement would be that the
spectral shape that describes the most variance in this image is demonstrated in the
PC1 of Simulated SeaWiFS
Bands 3 and 5
PC1 of Hyperspectral Cube
first eigenvector.
Two regions of interest (ROIs) in the
visually homogenous areas of the imagery
Standard
Deviation Used
in Analysis, σt
2.0
2.2
8.0
Table 1. The mean and standard deviation from the simulated single-band image (SeaWiFS Band 5) as well as the first Principal
Component Analysis eigenvalue images from simulated dual band (SeaWiFS Band 3 and 5), and hyperspectral data. Also, included is the test standard deviation, σt, used for the optimal GSD calculation.
were selected to confirm the hypothesis of
linear noise (which should be applicable to
the PCA because it is a linear transformation of the hyperspectral data) and to gen-
rections, while remaining centered on pixel
differences, probably resulting from the
erate a test standard deviation value. These
i, and the mean and standard deviations
sediments suspended during the passage of
two ROIs were approximately 441 m on a
were recalculated. This procedure continued
the weather front. This variability is repre-
side (49 pixels), (approximately 0.2 km ,
until σi was greater than σt, at which point
sented in Figure 6, as a false color composite
2401 pixels); one region was inshore while
the size of the previous non-failing ROIi was
of the PCA Hyp PC1 eigenvalues rendered
the other was offshore (Figure 1). Table 1
recorded. The size of the ROIi should then
in density slices. Clearly, there is a tremen-
gives the mean and standard deviations for
equate to the maximum size of a region with
dous amount of spatial variability inshore,
the two ROIs from the simulated SeaWiFS
homogenous optical properties.
which decreases as we move offshore. As we
2
move offshore past 20 km, the optimal GSD
Band 5 as well as the PC1 for PCA SW and
PCA Hyp. Note this standard deviation is
RE SULTS AND DISCUSSION
increases for each test. However, beyond
not normalized by the mean (e.g., Mahade-
The results of this approach in describing
this point there are significant differences
van and Campbell, 2002; Mahadevan and
the spatial variability of this coastal environ-
between the SW Band 5 and PCA SW and
Campbell, in press) because we are trying
ment may be found in Figure 5. Here, the
the PCA Hyp. The optimal average GSD
to separate random noise of the sensor and
largest GSD of the ROIi that has a standard
and median GSD grow to approximately 2
processing from the real geophysical changes
deviation greater than or equal to σt is plot-
km and approximately 1.5 km, respectively,
in the image. Theoretically, any homogenous
ted as a function of the position along the
for SW Band 5 as the water masses become
region of the same size should have a similar
transect for three images: single band (Fig-
more homogeneous with respect to this
standard deviation; otherwise, some real fea-
ure 5A), dual band PC1 (Figure 5B), and
wavelength. The average and median GSD
ture of interest has been included within the
hyperspectral PC1 (Figure 5C). It can be
for the PCA SW and PCA Hyp are less, as
study region. As the ROIs were selected with
seen that the size of the GSD increases when
the additional bands of information provide
an eye to a perfectly homogenous region, we
moving from onshore to offshore. The opti-
improved ability to delineate water-mass
allow for some error in our selection criteria.
mal GSD for each data set increases rapidly
types. There is some additional geophysical
Table 1 also provides the test standard devia-
out of the surf zone to an average of approx-
structure between 28 and 40 km that reduces
tion, σt, for each of the GSD calculations.
imately 100 m within 200 m of the shore. By
the optimal GSD back to the levels seen
about 10 km, the optimal GSD grows to >1
nearshore for all three tests. Once offshore
(Figure 1), a new ROIi was created with a
km. The average and median optimal GSD
more than 40 km, the optimal GSD grows to
minimum size of 3 X 3 pixels, or 27 X 27 m
for all vary between 150 and 200 m out to
> 6 km for the Band 5 test, and > 4 km for
(729 m ). The mean and standard deviation,
5 km, with the average GSD growing to ap-
the PCA SW. These larger GSDs approach
σ, of each region was calculated, and σi was
proximately 1 km beyond approximately 12
the scale of chlorophyll distributions de-
compared against σt. If σi was less than σt,
km from the shoreline.
scribed by others in the coastal environment
Next, at every pixel along the transect line
2
then ROIi increased in size by two pixels in
The variability inshore for each GSD
each of the along-track and cross-track di-
calculation is driven primarily by intensity
using multispectral data (e.g., Yoder et al.,
1987). However, the PCA Hyp drops back to
Oceanography
June 2004
39
A
B
Figure 5. (A) To determine the optimal GSD for the SW Band 5 Rrs, the real geophysical variation
along the flight line transect needed to be resolved. The data values show that nearshore (<10 km)
C
an optimal GSD would be less than 100 m to 200 m. These optimal GSDs grow to 1 km farther offshore. Note, however, that there are discontinuities in the progression of larger and larger GSDs as one
moves offshore. This may suggest the crossing of a frontal boundary, which would require a smaller
GSD to resolve. The blue and red lines are the mean and median, respectively, of the GSDs from a
particular point along the transect to the most inshore point. The vertical green and red lines denote
the respective locations of the inshore and offshore regions of interest (ROIs) from which the variance
threshold for the GSD analysis was determined. The horizontal grey line indicates the size of the region of interest from which the threshold was determined. (B) Determining the optimal GSD for the
simulated SeaWiFS PC1 image was accomplished in the same manner as Figure 5A. Similar to Figure
5A, this figure illustrates the same basic trend: smaller GSDs are required inshore while larger GSDs
are sufficient off shore. The description of the lines in the image are the same as in Figure 5A. (C) The
optimal GSD for the hyperspectral PC1 image was determined in the same manner as Figure 5A. In
shore, this analysis is in agreement with the results from the other two GSD studies. However, offshore
the variance found within the PC 1 (Hyp) was significantly greater than what was witnessed in the
other two studies resulting in smaller GSDs required to resolve what were thought to be regions of
homogeneous ocean color. The description of the lines in the image are the same as in Figure 5A.
Figure 6.
A false color composite of the inshore
variability of the GSD is
shown. The image displays
the eigenvalues associated with
the first eigenvector of the PCA Hyp
(PC1) image generated from the hyperspectral data. The eigenvalues are mapped
into linear density slices and colored using
a linear blue to red color table. A land mask
was applied prior to the PCA being applied to the
data set. Results suggest that the variability of the
40
Oceanography
June 2004
water color is greater as one approaches the shore.
levels seen nearshore, suggesting the hyper-
boundary). In addition, the input of fresh
signals may be a function of the total num-
spectral data offers additional information
water has its greatest impact on baroclinic-
ber of bands sampled.
with which to separate features in otherwise
ity in the nearshore environment. The use of
homogeneous-appearing waters.
optical tracers for salinity (Coble et al., this
data set of interest by rotating the coordi-
issue) may actually improve the understand-
nate system into one that minimizes the
confirm what many coastal oceanographers
ing and prediction of coastal circulation, a
variance across the entire data space. In
intuitively understand. The closer to shore
requirement for any study on the sources
the hyperspectral image cube of Figure 1, a
one approaches, the more variable the color
and fate of biogeochemically relevant mate-
single eigenvector (Figure 4) accounts for
of the water. In the optically deep waters off
rials. These results suggest that color studies
most of the variance in this image. While
the coast of New Jersey, the optical features
at the LEO-15 site may require GSDs ap-
this eigenvector (PC1) may appear similar to
are driven by the wind and tidal mixing of
proximately 100 m to resolve biogeochemi-
a water-leaving radiance vector in high-scat-
sediments as well as allochthonous inputs
cal processes from ocean-color data.
tering green waters, great care must be used
In many respects, these statistical results
A PCA reduces the dimensionality of a
of sediments, nutrients, and organic mate-
The optimal GSD is also a function of
in ascribing real geophysical properties to
rial from rivers and estuaries. In addition,
the information content of the data set. The
eigenvectors. Figure 4B does show why the
far field dynamics drive coastal jets that also
single-band data set shows less variability in
three techniques were so similar, particularly
bring in allochthonous material into this lo-
its standard deviation than the PC1 of the
in the nearshore region, as the wavelengths
cation (Chant et al., in press), which are tid-
dual-band data set, which in turn shows less
around 560 nm influenced the variance of
ally mixed with nearshore waters. This drives
variability in the standard deviation than the
the PC1 the most. PCA is a tool to describe
the variability of the optical signal to a very
PC1 of the hyperspectral data set. This effect
scene-dependent variance, and in this pa-
high level over small spatial distances. As we
results in lower mean and median optimal
per, we focus on the information content
move offshore into the deeper waters of the
GSDs as the number of bands used in the
of spectral data over small homogeneous
shelf, the impacts of tidal oscillations are less
analysis increases. This result suggests that
regions of water color, and intensity across
important. The change in water-mass optical
additional bands add information that may
a large scene of interest. The spatial scale
characteristics is driven by the interactions
be used to discriminate optically different
of homogeneous regions often depends on
of larger-scale physical features (i.e., mean
water masses, and perhaps retrieve estimates
the total number of bands used to describe
currents) and weather patterns. These larger-
of different optically active constituents.
that homogeneous region, particularly when
scale processes tend to homogenize water
In particular, beyond 40 km, there is a real
moving away from the shallow water regions
masses over kilometer scales, and as a result,
divergence between the GSDs of the PCA
impacted by high-energy mixing. It does
the GSD needed to adequately resolve the
Hyp and those derived from the simulated
not, however, necessarily suggest that eco-
real horizontal geophysical boundaries with-
SeaWiFS data set. Unsurprisingly, it also sug-
logical parameters of interest vary over these
in these homogenous waters grows in size.
gests that the optimal GSD for delineating
same scales. The determination of variance
the spectral variances in upwelling radiance
of ecological-relevant material would
In the nearshore environment (less than
10 km from the shore in this example), each
of these tests yield approximately the same
result (Figure 7). It would appear from this
result that to adequately describe the geo-
Figure 7. The inshore GSD for SW
Band 5, PCA SW, and PCA Hyp. The
physical features in the nearshore, the GSD
similarity within this region of the
must be < 200 m. Many important bio-
GSD trends is striking, and suggests
chemical processes occur within 10 km of
that variance within the inshore region may be driven by the concen-
the shore. River discharges of nutrients and
trations of suspended matter. The
organic matter have their greatest influences
vertical red line denotes the loca-
in this nearshore region, and the cycling of
tion of the inshore ROI from which
the variance threshold for the
these materials within the nearshore envi-
analysis was determined. The hori-
ronment may have large impacts on esti-
zontal grey line indicates the size of
mates of the fate of biogeochemical elements
the region of interest from which
the threshold was determined.
(e.g., carbon, at the terrestrial or ocean
Oceanography
June 2004
41
depend greatly on the algorithms that in-
the ROIs shown in Figure 1, the Band 3:5
1999; Lee and Carder, 2004; Louchard et al.,
vert color and intensity into optically active
ratio showed a significant difference in stan-
2003; Mobley et al., 2002). In addition, hy-
mass constituents (i.e., chlorophyll, colored
dard deviation, versus the similar standard
perspectral approaches may also yield infor-
dissolved organic matter [CDOM], sedi-
deviations calculated for each band in each
mation on phytoplankton speciation, which
ments).
ROI. This violates our primary assumption,
might allow for the remote identification of
Ocean-color research and applications are
and suggests that studies using ratio analyses
harmful algal blooms (HABs) (Roesler et
frequently more concerned with the prod-
should attempt to delineate the differences
al., in press). These hyperspectral imagery
ucts derived from Rrs(λ) estimates (e.g., total
in the variances of biogeochemical estimates
analysis techniques offer the potential to
absorption, total scattering, diver visibility,
that result from variance of the data versus
dramatically increase our ability to retrieve
total chlorophyll concentration), rather than
the mathematical variance created by the ap-
coastal zone information from ocean color
the Rrs data itself. It may be that the appro-
plication of the algorithm.
data streams and specifically address critical
priate spatial sampling frequency for these
This work suggests that future studies on
issues in coastal-zone management. As we
products is different than the sampling fre-
the optimal sampling frequency in the spa-
move from multispectral to hyperspectral
quency determined from the spatial varia-
tial domain of remote-sensing data begin
data products, we may also find a need for
tions in the radiance fields. However, using
with the Rrs data itself. Furthermore, the op-
higher spatial resolution data to better de-
the products produced from these same
timal sampling frequency may be a function
scribe the changes in the nearshore coastal
radiance fields to determine spatial sam-
of the total number of wavebands available
environment.
pling frequencies may produce vary different
for analysis. This work does not definitively
scaling results, strictly due to the method of
suggest that variations of optically-active
SUMMARY AND CONCLUSIONS
product generation. In this paper, we show
constituents may be retrieved at the same
There are probably many methods of de-
how the noise of the sensor and atmospheric
spatial resolution as the variation in the total
termining the optimal spatial sampling fre-
processing approximates a linear transform
Rrs vector. However, it does suggest that spa-
quency in the coastal zone. However, when
function, and we expect that the variance in
tial variations in ocean color depend on the
a statistical approach is used, care must be
radiance data above the background linear
number of channels used to described differ-
taken to use a method that is applicable to
noise represents true difference in ocean col-
ences between homogenous regions. If these
the sample, and to ensure the rigorous as-
or. However, many remote-sensing product
additional channels can be used to discrimi-
sumptions of stationary functions are not
calculations are nonlinear transforms of the
nate additional biological, chemical, and
violated. Otherwise, unclear results are ob-
Rrs data (e.g., Lee and Carder, 2002; O’Reilly
physical information, then the hyperspectral
tained that may lead to an inefficient scien-
et al., 1998). A nonlinear transform of the
ocean color signal will yield a greater ability
tific design of a remote sensing sensor or ex-
Rrs data will alter the mean and variance sta-
to identify, study, and predict important eco-
periment, for example, for the data discussed
tistics in ways that may alter scaling results
logical processes in the coastal environment.
here, the assumptions inherent to using the
New algorithms are being developed that
autocorrelation function were violated, in-
focus on relatively continuous spectral data
dicating that autocorrelation analysis is the
sis using a ratio of a simulated Band 3 to
rather than on the ratio of multispectral
wrong tool for this coastal data set.
Band 5 because our primary assumption is
channels. These new algorithms use a variety
The results described here suggest that the
that the sum of environmental and sensor
of techniques to take advantage of the great-
spatial resolution required for offshore stud-
noise is a linearly additive component of the
er degrees of freedom that the hyperspec-
ies may be dependent on the spectral resolu-
total signal, and the standard deviation of
tral data stream offers to the ocean-color
tion of the data stream. At LEO-15 between
this noise component of the signal would
scientist; algorithms are in development to
1 and 10 km, a 50- to 200-m GSD appears
be constant across varying levels of inten-
retrieve standard oceanographic products,
sufficient for single-band, dual-band, and
sity. If this assumption is true (and it would
such as total chlorophyll and CDOM, as well
hyperspectral-band data. Within 1 km of
appear so from this analysis), then a linear
as new products such as bathymetry, bottom
the shore, an even higher resolution sensor
addition of noise to a downward trending
type, and water-column Inherent Optical
might be needed to resolve the wind and tid-
numerator would yield an increasing vari-
Properties (IOPs) (e.g., Dierssen et al., 2003;
ally impacted features. In the optically deep
ance estimate of a ratio product. In fact, for
Hoge et al., 2003; Lee et al., 1998; Lee et al.,
offshore waters of LEO-15, bottom effects do
shown here.
We specifically did not include an analy-
42
Oceanography
June 2004
not impact Rrs. However, in optically shallow
ACKNOWLED GEMENTS
areas, the spatial heterogeneity of the bottom
This work was supported by the Office of
may further reduce the GSD required to re-
Naval Research. We would like to thank Sha-
solve the optical constituents near the coast
ron DeBra and Mubin Kadiwala for their
(see Philpot et al., this issue). Offshore of 10
help in editing and preparing this paper.
km, there is a significant difference in the
Lastly, the review and comments of Dr. Ri-
ability to discriminate optical boundaries
cardo Letelier were greatly appreciated.
using the single or dual band data compared
to the hyperspectral data. This suggests that
REFERENCE S
hyperspectral data may be better able to de-
Abbott, M.R., and R.M. Letelier, 1998: Decorrelation scales
of chlorophyll as observed from bio-optical drifters in
the California Current. Deep Sea Research II, 45, 1,6391,667.
Chang, G.C., T.D. Dickey, O.M. Schofield, A.D. Weidemann,
E. Boss, W.S. Pegau, M.A. Moline, and S.M. Glenn, 2002:
Nearshore physical processes and bio-optical properties
in the New York Bight. Journal of Geophysical Research,
107(C9), 3,133.
Chant, R.J., S. Glenn, and J. Kohut, in press: Flow reversals
during upwelling conditions on the New Jersey inner
shelf. Journal of Geophysical Research.
Chilès, J.-P., and P. Delfiner, 1999: Geostatistics: Modeling
Spatial Uncertainty. John Wiley and Sons, Inc., New York,
720 pp.
Curran, P.J., 1988: The semivariogram in remote sensing:
An introduction. Remote Sensing of Environment, 24(3),
493-507.
Dale, M.R.T., P. Dixon, M.-J. Fortin, P. Legendre, D.E. Myers,
and M.S. Rosenberg, 2002: Conceptual and mathematical relationships among methods for spatial analysis.
Ecography, 25, 558-577.
Davis, C.O., J. Bowles, R.A. Leathers, D. Korwan, T.V.
Downes, W.A. Snyder, W.J. Rhea, W. Chen, J. Fisher, W.P.
Bissett, and R.A. Reisse, 2002: Ocean PHILLS hyperspectral imager: Design, characterization, and calibration.
Optics Express, 10(4), 210-221.
Dickey, T., S. Zedler, X. Yu, S.C. Doney, D. Frye, H. Jannasch,
D. Manov, D. Sigurdson, J.D. McNeil, L. Dobeck, T. Gilboy, C. Bravo, D.A. Siegel, and N. Nelson, 2001: Physical
and biogeochemical variability from hours to years at the
Bermuda Testbed Mooring site: June 1994 March 1998.
Deep Sea Research II, 48, 2,105-2,140.
Dickey, T.D., 1991: The emergence of concurrent high-resolution physical and bio-optical measurements in the
upper ocean and their applications. Reviews of Geophysics, 29(3), 383-413.
Dierssen, H., R. Zimmerman, R. Leathers, T. Downes, and C.
Davis, 2003: Ocean-color remote sensing of seagrass and
bathymetry in the Bahamas Banks by high-resolution
airborne imagery. Limnology and Oceanography, 48(1,
part 2), 444-455.
Hoge, F.E., P.E. Lyon, C.D. Mobley, and L.K. Sundman, 2003:
Radiative transfer equation inversion: Theory and shape
factor models for retrieval of oceanic inherent optical
properties. Journal of Geophysical Research, 108(C12),
3,386.
Kohler, D.D.R., W.P. Bissett, C.O. Davis, J. Bowles, D. Dye, J.
Britt, J. Bailey, R. Steward, O.M. Schofield, M. Moline, S.
Glenn, and C. Orrico, 2002a: Characterization and calibration of a hyperspectral coastal ocean remote sensing
instrument. In: Proceedings of the AGU/ASLO Ocean
Sciences 2002 Meeting, Honolulu, HI, February 11-15.
American Geophysical Union, Washington, D.C.
lineate optically distinct regions in offshore
coastal waters and that scaling studies may
be dependent on the total number of spectral channels used in the analysis.
Nonlinear transforms of Rrs, typical of
algorithms for products such as chlorophyll
concentration, may alter variance calculations; optimal scaling results for these derived products may depend in part on the
transform of the data, or the algorithm used.
Care must be used when deriving statistics
on nonlinear transforms to avoid missing
real differences in ocean color and biogeophysical boundaries.
The real goal in optical oceanography is
to use optics to identify interesting oceanic
features as well as describe their time-dependent change. These features range from
individual, HAB-forming phytoplankton
species to the delineation of terrestrial outflow plumes from background coastal waters. The features may also include bottom
characteristics, such as seagrass beds or coral
formations. The future of hyperspectral imagery will depend on the ability to retrieve
these optically distinct constituents from the
Rrs(λ) data streams. One of the first steps in
retrieving biogeochemical information from
imagery data is to resolve the spatial variations distinguishable in these hyperspectral
scenes. The next step will be to focus on the
development of new algorithms that use the
entire water-leaving spectrum to assess and
monitor the coastal zone environment.
Kohler, D.D., W.P. Bissett, C.O. Davis, J. Bowles, D. Dye,
R.G. Steward, J. Britt, M. Montes, O. Schofield, and M.
Moline, 2002b: High resolution hyperspectral remote
sensing over oceanographic scales at the LEO 15 field
site. In: Ocean Optics XVI, conference held in Santa Fe,
NM, November 18-22. Office of Naval Research, Washington, D.C.
Lee, Z., K.L. Carder, C.D. Mobley, R.G. Steward, and J.S.
Patch, 1998: Hyperspectral remote sensing for shallow
waters: 1. A semianalytical model. Applied Optics, 37(27),
6,329-6,338.
Lee, Z., K.L. Carder, C.D. Mobley, R.G. Steward, and J.S.
Patch, 1999: Hyperspectral remote sensing for shallow
waters: 2. Deriving bottom depths and water properties
by optimization. Applied Optics, 38(18), 3,831-3,843.
Lee, Z.P., and K.L. Carder, 2002: Effects of spectral band
numbers on the retrieval of water column and bottom
properties from ocean color data. Applied Optics, 41(12),
2,191-2,201.
Lee, Z.P., and K.L. Carder, 2004: Absorption spectrum of
phytoplankton pigments derived from hyperspectral
remote-sensing reflectance. Remote Sensing of Environment, 89, 361-368.
Louchard, E., R. Reid, F. Stephens, C. Davis, R. Leathers,
and T. Downes, 2003: Optical remote sensing of benthic
habitats and bathymetry in coastal environments at Lee
Stocking Island, Bahamas: A comparative spectral classification approach. Limnol. Oceanogr., 48(1, part 2),
511-521.
Mahadevan, A., and J.W. Campbell, 2002: Biogeochemical
patchiness at the sea surface. Geophysical Research Letters,
29(19), 1,926.
Mahadevan, A., and J.W. Campbell, eds., in press: Biogeochemical variability at the sea surface: How it is linked to
process response times. Pp. 215-226 in Handbook of Scaling Methods in Aquatic Ecology: Measurement, Analysis,
Simulation. CRC Press, Boca Raton, FL.
Mobley, C.D., 1994: Light and Water. Academic Press, San
Diego, CA, 592 pp.
Mobley, C.D., L. Sundman, C.O. Davis, M. Montes, and W.P.
Bissett, 2002: A look-up-table approach to inverting
remotely sensed ocean color data. In: Ocean Optics XVI,
conference held in Santa Fe, NM, November 18-22. Office of Naval Research, Washington, D.C.
Mueller, J.L., 1976: Ocean color spectra measured off the
Oregon coast: Characteristic vectors. Applied Optics,
15(2), 394-402.
O’Reilly, J.E., S. Maritorena, B.G. Mitchell, D.A. Siegel, K.L.
Carder, S.A. Garver, M. Kahru, and C. McClain, 1998:
Ocean color chlorophyll algorithms for SeaWiFS. Journal
of Geophysical Research, 103, 24,937-24,953.
Preisendorfer, R.W., 1976. Hydrologic Optics, Volumes 1-6,
National Oceanic and Atmospheric Administration, Honolulu, Hawaii. NTIS PB-259 793. National Technology
Information Service, Springfield, VA, 1,727 pp.
Roesler, C.S., S.M. Etheridge, and G.C. Pitcher, in press: Application of an Ocean Color Algal Taxa Detection Model
to Red Tides in the Southern Benguela. In: Proceedings
of the Tenth International Conference for Harmful Algal
Blooms.
Strub, P.T., and C. James, 2000: Altimeter-derived variability
of surface velocities in the California Current System: 2.
Seasonal circulation and eddy statistics. Deep Sea Research II, 47, 831-870.
Yoder, J.A., C.R. McClain, J.O. Blanton, and L.Y. Oey, 1987:
Spatial scales in CZCS-chlorophyll imagery of the southeastern U.S. continental shelf. Limnol. Oceanogr., 32(4),
929-941.
Oceanography
June 2004
43