8.1 Local Inference Yields Better Data
We present the results of the multi-week experiment described in Section
7.2, showing that the
Local Inference mode yields better data than
Sense-and-Send and
Basic Thresholding, providing more interesting images, fewer uninteresting images, and processing more images overall.
Local Inference sends more interesting images: Figure
7(a) shows the number of interesting images—images containing people—that Camaroptera captures in the
Local Inference mode, compared to the
Sense-and-Send and
Basic Thresholding modes. Across all three light levels, Camaroptera with
Local Inference captures and sends a larger number of interesting images. This difference is greatest at low input power (15klx), with the
Local Inference mode enabling Camaroptera to send around
\(12\times\) more interesting images than the commonly used
Sense-and-Send mode, and around
\(3\times\) more than the
Basic Thresholding mode. The
Local Inference mode outperforms the other two modes as it uses energy judiciously, avoiding the costly transmission of uninteresting images.
The data show that in
Local Inference mode, Camaroptera transmits as many as
\(50\%\) of all interesting images, which is
12x more interesting images than
Sense-and-Send transmits. There are two main reasons that Camaroptera does not capture
\(100\%\) of interesting images in
Local Inference mode. First, Camaroptera spends time processing each image, creating a risk of not capturing an interesting image while processing an uninteresting one. Section
9 discusses strategies to further reduce this risk and capture more interesting events. Second, even in
Local Inference mode, Camaroptera spends time recharging energy spent transmitting interesting events, which blocks capturing new data.
Local Inference sends fewer uninteresting images: Camaroptera’s local inference avoids transmitting uninteresting images more effectively than other modes. Figure
7(b) shows how the number of uninteresting images sent varies with input power across the three operating modes.
Local Inference mode sends up to
\(6.5\times\) fewer uninteresting images than
Sense-and-Send mode. Additionally, while
Local Inference mode transmits a roughly constant number of uninteresting images across input power levels,
Sense-and-Send and
Basic Thresholding send more uninteresting images as input power increases.
Local Inference avoids the problem of eager transmission faced by
Sense-and-Send and
Basic Thresholding: As power increases, recharging becomes faster, enabling sending
more images. However, without the ability to discriminate interesting from uninteresting,
Sense-and-Send and
Basic Thresholding more quickly send more uninteresting images.
Local Inference captures more total images: Operating in the
Local Inference mode allows Camaroptera to avoid costly transmission of uninteresting images and use that energy to capture and process newer images. Figure
7(c) shows the total number of images that Camaroptera processes in all the operating modes across the whole trial.
Local Inference mode enables Camaroptera to capture upto
\(14.7\times\) more total images than the
Sense-and-Send mode, and upto
\(2.8\times\) more than the
Basic Thresholding mode. Processing more raw images by using local inference decreases the chance that Camaroptera misses a critical, interesting event.
Transmitting images is energy-expensive, and collecting that energy takes significant time. Avoiding unnecessary image transmission allows the
Local Inference mode to significantly reduce the energy collection time, resulting in less time spent on each image than the other two modes. Figures
7(d) to
7(f) show the distribution of total time spent on an image frame, across three light levels for each mode. The total time includes the time to capture, process and, if applicable, transmit the frame, as well as the time to collect the energy required for these tasks. As radio transmissions are energy-expensive, the frames that are transmitted incur a large latency, dominated by energy collection. This can be observed in Figure
7(d), which shows the
Sense-and-Send mode transmitting every image, and thus incurring large frame latencies (
\(60 s\) to
\(90 s\) at 15 klx) on all the frames it captures. Further, the frame latencies are higher when input power is low (
\(\ge \!\!60 s\) at 15 klx), due to slower energy collection, and lower at higher input power (
\(30 s\) to
\(40 s\) at 45 klx), where energy collection is faster. The
Basic Thresholding mode (Figure
7(e)) avoids transmitting unchanged images, which is reflected in the large number of frames with latency
\(\le \!\!10 s\), as these frames avoid large energy collection delays. However, the uninteresting images that the
Basic Thresholding mode transmits still incurs this large delay, as represented by the frames having large total times (between
\(50 s\) and
\(90 s\) at 15 klx). In contrast, the
Local Inference mode (Figure
7(f)) minimizes the total per-frame latency by using the high-energy radio only for transmitting the images it classifies as interesting for the application. This results in very few frames incurring the high energy collection cost of radio transmissions (frames with latency between
\(60 s\) to
\(90 s\) at 15 klx). Majority of the frames in the
Local Inference mode either take
\(\le \!\!10 s\) when eliminated by Image Differencing or take
\(10 s\) to
\(20 s\) when eliminated by the Inference module. Faster frame latencies allow the
Local Inference mode to capture more total images, increasing the effectiveness of Camaroptera in detecting interesting events.
8.2 Effect of Varying Event Composition
We studied Camaroptera’s sensitivity to varying the
True Positive (TP) rate of images encountered across Figures
8(a) and
8(d). We restrict this study to Camaroptera’s
Local Inference mode with a DNN having a ratio of False Positives to False Negatives of 10:10. The trials for this sensitivity study had 100 image events, and the results were averaged across three repetitions of each trial. We varied the TP rate from 20% (few people) to 80% (crowded area), representing different amounts of interesting event traffic. The data shows that higher interesting event traffic (TP rate) degrades Camaroptera’s ability to detect interesting events
only when input power is low. We expected that a higher TP rate would be detrimental in the
Local Inference mode, since it would require more energy-expensive image transmissions. This expectation holds true for total images captured, as seen in Figure
8(a), where the most images are captured for the lowest TP rate (
\(20\%\)). Figure
8(d) shows that as the TP rate increases, Camaroptera spends more time on image transmission and less on image capturing, resulting in fewer total images.
Figures
8(b) and
8(c) provide additional insights into how Camaroptera operates at different TP rates. As we expect, Camaroptera reports more total interesting events when there is a higher amount of interesting event traffic; however, the
fraction of interesting events it reports varies. At 10 klx, energy efficiency matters most, and a low TP rate helps Camaroptera avoid activating the expensive radio. Camaroptera reports the largest fraction of interesting events for the
\(20\%\) TP rate at 10 klx. At higher input power, a TP rate of
\(50\%\) presents a higher fraction of interesting events. Camaroptera reporting a larger fraction of interesting events at
\(50\%\) TP than
\(80\%\) TP is expected; more interesting events use the radio more frequently, missing the capture of newer interesting events. When comparing against a
\(20\%\) TP rate, Camaroptera misses consecutive interesting events due to its long processing latency. While Camaroptera misses consecutive event for all TP rates, this degrades performance the most at
\(20\%\) TP rate (since there are few interesting events to begin with). Figure
8 shows that Camaroptera’s ability to report interesting events is robust across a reasonable amount of event traffic (especially at
\(50\%\)) and can be deployed across a variety of environments.
8.3 Effect of Varying DNN Parameters
We measured Camaroptera’s sensitivity to variations in the False Positive and False Negative rate of its DNN classifier, evaluating its effectiveness with different learned inference models (Figures
9(a) to
9(c)). We evaluted three classifiers, representing differently tuned versions of the DNN in Section
6 with the same memory footprint. We identify a classifier by the ratio of its False Positives to its False Negatives, e.g., FP:FN = 10:10. Similar to Section
8.2, this sensitivity study was also conducted with trials comprising 100 image events, averaged across three repetitions.
Figures
9(a) and
9(b) show that a 10:10 classifier captures the most total images. However, the classifier that reports the largest fraction of interesting events depends on the input power. At low input power (
\(10-20 klx\)), a low FP rate (
\(10\%,20\%\)) leads to fewer uninteresting images transmitted (Figure
9(c)) than a
\(40\%\) FP rate, preserving the limited available energy. Camaroptera uses this energy to capture and report a higher fraction of interesting events. At higher input power (
\(\ge \!\!30 klx\)), energy is more abundant, and lowering the FN rate takes precedence; a
\(40\!\!:\!\!1\) classifier reports the largest fraction of interesting events, even when its high FP rate leads to transmitting the most uninteresting events. Figure
9 shows that designers using Camaroptera must tune the classifier to match their application requirements (e.g., suffering higher FP for achieving lower FN), and also the deployment environment (e.g., prioritizing a lower FP when input power is low).
8.4 Device Characterization
We characterized and compared the lifetime of our energy-harvesting, batteryless Camaroptera system with different battery-powered Camaroptera systems, as well as the Permamote [
34] system, in Table
2. Non-rechargeable batteries are a poor choice for a long-range, visual-sensing system like Camaroptera given their limited lifetimes of a few weeks. Rechargeable batteries allow Camaroptera to achieve longer lifetimes, but still require expensive battery replacements every 4–5 years. Permamote combines a rechargeable LTO battery with a non-rechargeable backup CR2477 battery for powering operation during periods of no input power (e.g., night time). While this does extend the operation to night time, we argue that this fits our Camaroptera system poorly for two reasons. First, the 5-year lifetime even with a rechargeable LTO battery requires expensive battery replacements, failing our requirement for a multi-decade lifetime. Second, a visual-sensing-based system like Camaroptera can capture useful images only when the scene is well-lit, rendering night-time operation unnecessary, especially in remote areas where artificial lighting will be rare.
We also characterized Camaroptera’s latency and energy for its main operations, with data in Figure
10. The data show that while inference takes the most time to complete, transmitting using LoRa consumes the most energy, because the radio system has much higher power consumption. With low input power, the high energy cost of transmitting requires a long recharging latency (Figure
2) despite the low transmission latency. With high input power, recharge times drop and the time spent computing exceeds the time spent transmitting
and recharging.