2.2 Existing Works
Most of the existing solutions fail to meet a significant fraction of the features described previously. A comprehensive summary is provided in the following. We refer to the three types of solutions—
camera-based,
sensor-based, and
RF-based, as briefed in Section
1.
Cost-effectiveness. There have been immense advancements in the design and development of lightweight object-detection strategies, such as YOLOv7 [
51] and YOLACT++ [
4], which are being used for efficient application of computer vision in vehicle detection and classification. These works [
4,
5,
13,
24,
36,
51] use
costly image capturing equipment for recording the vehicles. Some recent works [
6,
7,
27,
54] also exploit sophisticated stereo cameras for capturing 3D vision to achieve superior performance. Moreover, at minimum, they need a GPU system for analysis of the image-data on-the-fly or a costly high-end cloud service. Sensor-assisted solutions also mostly need to depend on the installation of
ccostly infrastructure [
26] for the appropriate functioning of the sensors. In principle, RF-based solutions ideally do not need any such costly equipment [
2,
18,
23]. However, the way RF has been employed so far involves a high cost per unit for the necessary setup to achieve controlled measurement of the PHY parameters (e.g.,
RSS) [
44,
49,
52,
53]. The use of ML/AI-based on-the-fly analysis of the collected data, either by cloud or high-end computing facility, also adds to the overall cost. This results in high-cost algorithms for their analysis. In contrast, in our current work, we use low-power RF communication between off-the-shelf low-cost IoT-devices and in-device recording, as well as lightweight analysis of the data, which by default minimizes the cost per unit.
Flexibility/Infrastructure. Unlike a friendly indoor environment, outdoor space is considered to be quite hostile and dynamic. Thus, flexibility and easy re-installation support for a solution are very important. Camera-based solutions need special installation of the costly cameras and their proper maintenance too. Sensor-assisted solutions also need specialized installation of the sensors. For instance, intrusive sensors such as the acoustic sensor [
40], light sensor [
46], magnetometer [
47,
48], loop detector [
20], piezoelectric sensor [
35], and vibration [
37,
45] first need to be appropriately installed on the pavement surfaces. High installation costs, traffic disruption during installation, maintenance, and so forth are some of the disadvantages of this approach. In contrast, in our current work, we use off-the-shelf, battery-operated, small-size IoT devices to serve the purpose. The IoT device can be installed at any suitable location beside the road, such as roadside trees and lampposts. It does not need any specialized installation support and hence is highly portable.
Energy Requirement/Power Supply. In general, all the existing works assume the availability of an adequate and uninterrupted source of power (e.g., wall power) for carrying out the job [
5,
13,
24]. In contrast, our design exploits low-power IoT-systems and hence largely relaxes this requirement. In particular, our compact and efficient communication protocol enables the solution to sustain itself for a substantial amount of time solely based on battery power.
Moreover, the system can be appropriately tuned to run over battery power for a much longer duration.
Cloud Assistance/Computation Cost. The camera-based solutions [
4,
5,
13,
24,
33,
36,
51] need to run complex algorithms in the back-end for processing the image and video data for which constant cloud connectivity is inevitable. Frequent communication with the cloud with large amounts of data creates a serious bottleneck in the mass application of such solutions over a large area. To address this common issue, the recent design trend is to minimize the interaction between the cloud and the edge [
55]. The IoT-assisted approach proposed in this work uses very simple strategies and ensures that the low-power small-sized IoT devices can carry out all the required analysis tasks without explicit support from the cloud.
Wide-Area Synchronized Measurement. Existing works deal with the detection and classification of the vehicles at a single place [
2,
52]. However, in the context of smart city/intelligent transportation, the solution needs to be replicated at multiple locations, and the measurements collected from these locations need to be time-correlated for a system-wide behavioral study. None of the existing camera-based, sensor-based, or even RF-based solutions talks about this issue. Our proposed IoT-based framework, apart from being cost-effective and easy to use, takes care of these vital issues, which are largely missing in the existing works. Intrinsically, our proposed framework does not depend on the conventional 3G/4G/LTE network. Rather, it is designed as an ST-based stand-alone IoT system that connects all its base units using low-power communication technology and supports seamless time correlation among them.
System-Wide Analysis. The primary goal of wide-area detection and classification is to enable on-the-fly behavioral study, anomaly detection, and quick decision-making in traffic management and control. Most of the existing solutions depend on the computation and analysis of the raw data with the help of a cloud server. However, conveying data to the cloud at every epoch from every
Measurement Unit (MU) installed over a large area naturally consumes a huge bandwidth, causing network congestion as well as performance degradation [
2,
49,
52]. The proposed IoT-based solution in our current work, in contrast, by the virtue of the existing rich IoT-protocol base [
15,
21], can carry out in-network data processing to support on-the-fly end-to-end collaborative decision-making in real-time without any active help from the cloud.
Privacy. With the increasing use of technology, the privacy of users has become one of the prime concerns. Camera-based solutions are prone to breaches of privacy and hence are not suitable for all applications [
19]. BLE/WiFi-based solutions can also work in a privacy-preserving fashion but need a prior coupling between vehicles and the detection/classification system—for example, the transmission of any special signal from the vehicles for their detection and classification [
30]. Similarly, sensor-based solutions are also mostly not prone to any such breach of privacy [
28,
45]. However, the requirement for special infrastructure and dependence on the cloud for necessary analysis make their ubiquitous and widespread use difficult. In our current work, we fundamentally use a low-power RF transceiver as a special type of sensor that directly categorizes a vehicle under a class without the requirement of any further detailing of the actual vehicle and hence maintains privacy.
Discussion. RF-based solutions tend to meet many of the expected features which are quite hard by the other two possible approaches. However, the existing works exploit RF in a mostly brute-force way. The role of an efficient protocol for supervising the whole process is extremely important to balance all the goals together, which is missing in the existing works. The approach taken in the work of Sliwa et al. [
44] is the closest to that of ours. Swila et al. [44] use an AT-based protocol similar to token ring [
14] where six active devices exchange RF packets across the road at a very high rate. The protocol measures the RSSI values at the receiving devices and checks for any attenuation due to the passage of the vehicles.
Support Vector Machine (SVM) is used for on-the-fly detection and classification. However, the use of uncontrolled communication among the active devices makes this strategy highly energy-hungry. Moreover, the solution acts blindly—that is, it does not adapt based on the traffic and hence constantly drains the energy from the low-power devices at a very high rate. In addition, the work is quite ignorant about the issue of efficient wide-area coverage. In contrast, our current work, LiVeR, demonstrates an adaptive mechanism that achieves the goal of spending approximately one-third of the RF packets compared to the work of Swila et al. [
44] without compromising accuracy. Section
8 provides a detailed comparison.
Table
2 summarizes the existing works w.r.t. the expected features (in the columns). The advantages of a solution are highlighted with a gray background.