Examining spatiotemporal crowdsensing and caching for population-dynamic OTT content delivery

Kim, Hee Soo; Jang, Yumi; Choi, Yun Jae; Kim, Hong Ki; Kim, Seongcheol; Lee, Sang Hyun

doi:10.1038/s41598-024-64589-1

Download PDF

Article
Open access
Published: 14 June 2024

Examining spatiotemporal crowdsensing and caching for population-dynamic OTT content delivery

Hee Soo Kim¹^Â na1,
Yumi Jang²^Â na1,
Yun Jae Choi¹,
Hong Ki Kim¹,
Seongcheol Kim³ &
â¦
Sang Hyun Lee¹Â

Scientific Reports volumeÂ 14, ArticleÂ number:Â 13783 (2024) Cite this article

523 Accesses
Metrics details

Subjects

Abstract

This study proposes a novel spatiotemporal crowdsensing and caching (SCAC) framework to address the surging demands of urban wireless network traffic. In the context of rampant urbanization and ubiquitous digitization in cities, effective data traffic management is crucial for maintaining a dynamic urban ecosystem. Leveraging user mobility patterns and content preferences, this study formulates an offloading policy to alleviate congestion across urban areas. Our approach uses an AI-based method at the cell level, providing a practical and scalable solution that can be readily adapted to bustling metropolitan areas. The implementation of our model demonstrated its effectiveness in reflecting real-world urban dynamics, resulting in significant reductions in peak-hour traffic and robust performance across diverse urban settings. The deployment strategy initiates from densely populated transportation hubs, gradually expanding to broader urban areas. This systematic expansion adheres to a policy framework that emphasizes data privacy and sustainable urban development, ensuring alignment with societal needs and regulatory frameworks. By addressing technological efficacy and societal impact, this study enhances the understanding of urban wireless traffic management. It offers mobile network operators, policymakers, and urban planners a comprehensive strategy to harness the potential of spatiotemporal technology, thereby ensuring that cities remain dynamic, efficient, and well-prepared for the future of digital connectivity.

Equality of access and resilience in urban population-facility networks

Article Open access 31 March 2022

Addressing the âminimum parkingâ problem for on-demand mobility

Article Open access 08 October 2020

A universal framework for inclusive 15-minute cities

Article 16 September 2024

Introduction

The internetâs widespread availability has fundamentally transformed media consumption, fostering user expectations for instant access to online content¹. In recent years, mobile content streaming through over-the-top (OTT) platforms has gained significant traction². OTT subscribers now value the flexibility to switch seamlessly between devices, enjoying their favorite content indoors and outdoors on mobile devices. Historically, faster mobile networks and the diversification of OTT platforms have fulfilled these user expectations^1,3. However, current strategies employed by mobile network operators (MNOs) for access point deployment and network radio resource expansion are anticipated to incur higher infrastructure expenses and bandwidth limitations². Despite the active deployment of 5G networks by MNOs, the anticipated low-latency communication performance remains unrealized⁴.

Typically, OTT content consumption exhibits two distinctive patternsârepetitive requests from a sizable user base and spikes in demand at specific times. For these reasons, users often encounter difficulties in accessing high-quality video content via OTT services, especially in crowded outdoor areas or while moving on public transportation⁵. Additionally, since OTT platforms leverage the MNO-established infrastructure, such as broadband networks, the surge in OTT video streaming directly correlates with increased mobile network traffic. According to the Ericsson report⁶, there was a 36% increase in mobile network data traffic between the first quarter of 2022 and the first quarter of 2023, with video consumption identified as a primary driver behind this substantial increase.

Balancing traffic demand and supply presents a formidable challenge, even in regions with robust mobile network infrastructure⁷. Throughout the day, user traffic fluctuates, leading to pronounced differences between peak and off-peak hours⁸. The primary reason for the traffic imbalance between peak and off-peak hours is the demand for high-quality services^8,9. However, enhancing the speed and performance of mobile networks poses challenges due to physical and economic constraints². Traditional methods, such as the addition of base stations and the expansion of bandwidth, have become insufficient to keep up with demand due to rising infrastructure costs and limited resources¹⁰. Consequently, during peak periods, users often encounter difficulties in accessing mobile network services such as OTT due to congestion in cells served by a single base station, resulting from a higher volume of users¹¹.

It is serious enough that many users are currently facing challenges in ensuring the quality of service (QoS) for OTT over mobile networks, primarily due to limitations within the existing mobile infrastructures¹², but the deteriorating performance of mobile networks also poses a hindrance to economic growth and development¹³. For instance, a previous study highlighted that inadequate broadband access could impede the effectiveness and profitability of infrastructure development and investment initiatives, thereby limiting the emergence of new businesses¹⁴. This underscores the necessity of developing mobile network-related strategies considering both spatial and temporal aspects to effectively mitigate traffic imbalances¹¹. Given that user mobility shapes content consumption patterns, which in turn influence user content preference^11,15, user mobility patterns are the significant factor in establishing content placement strategy^15,16.

Wireless mobile caching is regarded as an attractive alternative telecommunication technology⁷. Wireless mobile caching can alleviate peak-hour traffic demands by locally storing frequently requested content on usersâ devices during off-peak periods and facilitating content sharing between proximate users, thereby significantly reducing duplicate content requests^17,18. The extent of traffic reduction depends on the storage allocated by users for this purpose¹⁹. As storage costs continue to decline and smartphones with capacities up to 1Â TB emerge, usersâ concerns about running out of storage are expected to diminish, enhancing the feasibility of this approach²⁰.

In the context of wireless mobile caching, storage and device-to-device (D2D) communication technologies assume pivotal roles²¹. Firstly, the quantity of stored contents directly corresponds to local storage capacity²². The ongoing decline in storage cost and rise in mobile device capacity enhance the cost-effectiveness of caching technology²⁰. Secondly, D2D enables direct content sharing between closely located user devices without the need for routing through a base station²³, effectively reducing radio access link loads. Particularly in densely populated areas, D2D links offer significant advantages for peer-to-peer content sharing^21,23. The frequent utilization of D2D sharing ultimately alleviates the burden on the radio access link, enhancing the accuracy of predicting usersâ content consumption patterns¹⁸.

To better align the stable capabilities of fixed mobile network infrastructure with the dynamic nature of user mobility and content preferences, it is imperative to embrace cooperative caching policies. These policies, which account for both temporal and spatial dimensions, serve to enhance performance²⁴. Hence, this study proposes the adoption of spatiotemporal crowdsensing and caching (SCAC) as a desirable cooperative wireless mobile caching policy. SCAC involves transferring frequently accessed content from congested cells via D2D communication, thereby lessening reliance solely on radio access links. Furthermore, SCAC underscores the importance of monitoring user population dynamics across extensive networks.

Previous research on mobile caching highlights the critical importance of achieving a high content-sharing hit-ratio, which is influenced by both user mobility and content preferences^16,18,21,22. However, pre-storing content to accommodate diverse user preferences encounters challenges due to the complexity of predicting individual behaviors and mobility patterns^25,26,27. Since users engage with mobile networks while on the move, user mobility significantly impacts content sharing, highlighting the need to anticipate user mobility patterns²⁵. Additionally, successful content placement depends not only on predicting densely populated user congregations or remote user locations but also on considering the frequency of interactions between users²⁸. The frequency of user interactions directly influences content-sharing attempts¹⁶. Therefore, optimizing the performance of wireless mobile caching networks requires strategically placing content to anticipate user movement toward densely populated areas or increased distances. These strategies must consider constraints such as data transmission speed and user distribution²⁹.

In wireless mobile caching for mobile OTT services, gathering comprehensive data on user location and content preference is crucial³⁰. This extensive data collection significantly enhances the quality of mobile OTT experiences and ensures uninterrupted content delivery, especially during peak hours³¹. Crowdsensing, which leverages the widespread availability of smartphones and devices with built-in sensors, automatically and effectively collects such data^32,33. Extracting mobility and OTT viewing data from mobile device sensors through crowdsensing enables adaptive media placement and boosts wireless mobile caching network performance, thus reducing reliance on fixed network infrastructure for smoother media consumption³⁴. However, while crowdsensing offers significant benefits for data collection and system optimization, it raises security and privacy concerns³⁵. Exposing user content preferences and mobility data to unauthorized parties poses a potential risk of personal information infringement. Therefore, crowdsensing strategies in media caching require robust security protocols and transparent privacy policies to mitigate risks.

Additionally, analyzing wireless mobile caching is complex because of the dual challenge of spaceâtime dependency in data traffic, involving high-speed links and massive connections. To ensure efficient network operation across diverse cell environments, a multi-cell strategy is needed that addresses both spatial and temporal dimensions of radio resource management³⁶. Given that existing techniques focus on enhancing user interactions and content management within single-cell environments, they fail to leverage user dynamics across multiple cells. This limitation hinders effective congestion management and resource allocation for caching. In this study, we advocate the SCAC principle, which leverages spatial and temporal user information to enhance data collection and caching strategies. This principle is designed to align mobile network infrastructure capabilities with the dynamic nature of user mobility and content preferences. SCAC reduces congestion and potential traffic in cells by facilitating collaborative caching in adjacent cells. Specifically, BSs in less congested cells cache content for users heading toward more congested areas, thereby mitigating traffic spikes. As users transition between cells, this proactive approach promotes a more uniform distribution of content, effectively preventing bottlenecks. Therefore, analyzing local user mobility patterns is essential for this strategy because it provides valuable insights into optimal content placement and its impacts on system-level traffic demands.

FigureÂ 1 illustrates the SCAC mechanism across the three cells. Each cell uses crowdsensing to capture unique OTT viewing patterns and user mobility, allowing for predicting future demand during periods of anticipated overload. When a cell predicts an overload, it preemptively caches content likely to be needed by incoming users, thereby directly addressing anticipated demand through targeted content delivery. This proactive approach facilitates a spatiotemporal distribution of traffic, efficiently preventing cell overloads without additional infrastructure. However, as the system scales up, content placement becomes more complex because of the increased computational burden on the centralized network coordinator. This challenge is compounded by the inefficiency of tracking individual user mobility and preference trajectories on a large scale. In addition, continuous real-time monitoring of users raises privacy concerns because it may disclose sensitive personal details, such as residence information³⁵.

To address privacy concerns, user trajectories can be managed as anonymized cell-level transition logs instead of detailed individual tracking. This approach captures inter-cell user transitions without the need for individual user identification within each cell, thereby effectively safeguarding user data privacy²⁶. The aggregated approach significantly influences how base stations interpret movement patterns. While individual users exhibit diverse schedules and behaviors, the base station that observes the collective population only perceives inter-cell transitions, effectively treating the user group as a whole. This perspective reduces the significance of individual patterns, emphasizing characteristics solely dependent on the current state of collective movement. In this context, the resultant user mobility is modeled as a cell-level transition model. The collective inter-cell population flow is predicted and exchanged across the multi-cell network. Establishing a collaborative approach through inter-cell cooperation is appealing for practical deployment and reliable operation.

Based on the discussion above, this study aims to empirically evaluate the efficacy of SCAC in addressing mobile traffic issues using a simulation platform that captures real-world mobility patterns, preferences, and population dynamics. By employing simulations enriched with real-world mobilities and preference data, this study tries to provide empirical evidence supporting the efficacy of SCAC in alleviating cell-request overloads and related costs. Such findings will provide tangible benefits for both MNOs and users. Thus, this study proposed the following research questions:

RQ1. Is SCAC a reliable solution for addressing mobile traffic issues?
RQ2. Under what conditions or environments can effective implementation of SCAC be achieved?

Results

The effectiveness of SCAC in reducing mobile network traffic and its impact on the network environment were assessed in a simulation in an urban area with a user population of 400,000.

Assessing SCACâs reliability in addressing traffic issues (RQ1)

FigureÂ 2 shows the effectiveness of the SCAC strategy in terms of offloading gain: traffic demand was relieved after content sharing. Data traffic and user populations are normalized to the highest value in Cell 1, while the offloading ratio is expressed as the percentage of demand alleviated through sharing among users. Color coding in each cell represents the amount of traffic demand before and after the caching operation. The color bar shows the traffic demand volume, with red shades indicating high demand and blue shades indicating low demand. The coloring standard for each region is based on the highest value of peak-hour traffic demand. The three cells with the highest average daily demands are highlighted with white borders to illustrate congestion hotspots. Cell 1 experienced peak traffic between 12:00 and 14:00, while other periods were considered to be regular hours. The SCAC strategy demonstrably reduced traffic during both periods. During regular hours, the strategy led to significant reductions in load on cells, evidenced by many blue-demand areas indicating lower traffic. While the offloading gain was slightly lower during peak hours due to high user demand exceeding caching capabilities, the amount of traffic borne by the corresponding cell was still reduced. Furthermore, all congested cells shared a common characteristic: they are located near transportation hubs, which significantly contributes to the higher traffic in these locations.

FigureÂ 3 shows the performance of the cell-level caching strategy, specifically focusing on the offloading gain in three congested cells with the highest average daily traffic demand. It presents data traffic, user population, and offloading gain for each cell at hourly intervals during the day. Data traffic and user populations are normalized to the highest value in cell 1, and the offloading gain is expressed as the proportion of demand alleviated by sharing among users. The distributed offloading strategy at the three congested cells reduces traffic demands by 31.0% during regular hours and 26.4% during peak hours with respect to the original traffic demands. It is observed that traffic demands exhibit patterns similar to hourly population transitions whereas daily patterns of offloading gains are distinct.

Cell 1 experiences a gradual rise in traffic requests in the morning, with peak hour data traffic exceeding double that of regular hours. The SCAC framework anticipates this surge and proactively caches the content expected to be requested during peak hours. This allows users who will be in cell 1 during peak hours to have the content stored even before the peak period, facilitating efficient sharing with the visiting population. This proactive approach enables the early implementation of offloading strategies, effectively securing necessary resources before the peak hour surge. In contrast, cells 2 and 3 experience consistent traffic patterns, resulting in sustained offloading performance due to ongoing collaboration with adjacent cells. As data traffic patterns fluctuate in tandem with hourly population changes, the ability to predict future congested cells both spatially and temporally is facilitated by specifically tracking inter-cell mobility through crowdsensing. This predictive approach to content caching ensures that mobile users heading to high-demand areas can seamlessly share their data with neighboring users, either before reaching their destination or immediately upon arrival.

Optimal conditions or environments for effective implementation of SCAC (RQ2)

FigureÂ 4(a) presents a comparison of the weekly variations in gross content consumption and offloading gain, evaluated in gigabytes (GB). Monday through Thursday are classified as weekdays, and Friday through Sunday as weekends, reflecting user behavior patterns. On Fridays, people often leave work early and prepare for the weekend, resulting in demand patterns that differ from typical weekdays. Demand volume (yellow line) is measured on the right y-axis, and offloading gain is measured on the left y-axis. The total demand volume remains steady throughout the day with regular drops late at night, while the offloading gain exhibits minimal daily variation. The volume of content shared with users through content placement amounts to 818.27Â GB on weekdays and 543.26Â GB on weekends per hourly regular operation interval. Regarding content preference patterns, the local preferences observed on weekdays closely mirror the global preferences. This observation suggests that during weekdays, individual content preferences tend to synchronize as users move and interact socially³⁷. To reflect this difference in preference patterns, the content propagation degree \(\theta\) is set to 5â10 on weekdays and 10â20 on weekends for simulation under various environments. The strategic placement of popular content results in enhanced sharing performance on weekdays compared with weekends. Although the offloading gain decreases slightly when \(\theta\) is higher on weekends, it remains effective. This is because more diverse content is stored to support a wider range of preferences during weekends.

This framework ensures consistent offloading gains by adapting to variations in traffic and user preferences. The uneven regional distribution of traffic demands causes congestion to concentrate in no more than 10 cells. To balance data traffic, the trained SCAC principle cooperatively attempts to spread traffic demands over adjacent cells. Thus, the sum of the offloading gain of these cells covers half of the total offloading gain. The offloading gain indirectly shows bandwidth savings, as it reduces the load on network infrastructures and improves QoS by utilizing saved resources. Additionally, localized caching reduces the distance data must travel, thereby improving latency and enhancing overall network efficiency.

FigureÂ 4(b) depicts the impact of the availability of user mobility information under various data traffic configurations. Real-world systems typically lack complete user mobility due to the inherent difficulty of continuously pinpointing the userâs instantaneous location. The completeness of the collection about user mobility can be categorized as known or unknown. To characterize this, the incompleteness parameter, denoted by \(\delta\), is introduced as the percentage of users with unknown mobility patterns. The destinations of users of unknown mobility are chosen uniformly toward adjacent cells. For the users comprising the \(\delta\) proportion with unknown mobility, their destinations are chosen uniformly among adjacent cells. Two categories of data traffic configurations were assessed: regular versus peak hours and weekdays versus weekends. Each is characterized by four different values of the content propagation degree \(\theta\)= 5, 10, 15, and 20. With complete user mobility information \((\delta =0)\), sharing gains are maximized. With increasing \(\delta\) (incompleteness parameter), the proportion of content volumes determined by random placement increases, resulting in incomplete content sharing and a lower hit ratio in the destination cells of user mobility. For comparison, the sharing results outperform the geographic caching solution, which only considers the state within a cell without taking into account spatiotemporal mobility information. This demonstrates that even the utilization of inaccurate spatiotemporal information is effective in enhancing content accessibility.

Discussion

In urban regions, user mobility varies between densely and sparsely populated areas, often linked to transportation hubs, leading to imbalances in mobile network traffic⁹. Addressing these imbalances requires a flexible caching strategy that considers both spatial and temporal factors. This study aimed to achieve two objectives: first, to assess the effectiveness of the SCAC framework in decreasing bandwidth demands, particularly within congested cells, through collaboration with neighboring cells; and second, to investigate the commercial viability and explore the policy support mechanisms required to expedite the adoption and scaling of the SCAC principle in transportation hub areas.

Transitioning from a controlled simulation environment to real-world applications requires considering scalability in diverse and larger urban environments. Real urban environments can introduce new preferences and mobility patterns not previously encountered. For instance, sports stadiums may exhibit patterns where content related to the game is consumed intensively. Therefore, it is advisable to implement SCAC in pilot projects focusing on specific cohorts (e.g., university campuses, sports stadiums, and densely populated transportation hubs) to evaluate the potential of these strategies in alleviating data traffic volumes and ensuring a smooth transition to larger scales.

Considering the minimal base station transitions observed at the macrocell level⁸, it may be reasonable to confine the cooperative cells in the pilot to adjacent cells. Restricting crowdsensing activities to pre-peak hours could optimize traffic distribution during congested periods. Performance benchmarks demonstrate that the pilot operates effectively with only 30% of mobility data, eliminating the need to capture all user patterns. Evaluating the pilot project in transportation hubs will reveal the potential for traffic reduction with complete mobility information. Insights garnered from this evaluation will guide the expansion to other high-traffic areas, validate the technologyâs scalability, and establish infrastructure guidelines for collaborative cells. This approach facilitates a cascading expansion to effectively support broader urban zones.

Cell-level user mobility is tracked through handover technologies, which monitor population movements within a cell and to adjacent cells. In South Korea, for example, KT collects and shares mobility data based on LTE and 5G signals at the district and traffic polygon levels in hourly intervals³⁸. This means that SCAC can sufficiently track cell-level user mobility using existing infrastructure during the operational interval. While individual tracking might encounter issues like the ping-pong effect, SCAC estimates mobility at the base station level, mitigating short-term inaccuracies. Implementing SCAC requires initial investments in data collection, processing, and operating trained neural networks, with long-term costs such as maintenance and data storage analyzed post-pilot. However, SCACâs spatial and temporal distribution of network resources is expected to reduce the need for additional infrastructure to meet growing demand, offering potential cost-saving benefits.

Traditionally, MNOs have addressed the surge in mobile traffic by enhancing macrocells through the incorporation of additional cells, antennas, and network automation. However, with the available spectrum in commonly used frequency bands nearing full utilization, a constraint exists on the capacity expansion of macrocells. Consequently, a shift toward higher frequency bands such as the 26Â GHz has become necessary. While this higher frequency band holds promise for increased capacity, it exhibits limited propagation distance and is better suited for compact, street-level small cells rather than large macro cells^9,39. The transition from macrocells to small cells in mobile networks has become essential because of the limitations posed by full spectrum utilization in commonly used frequency bands. Smaller cells, which inherently cover smaller geographical areas, exhibit lower user density than macrocells. Consequently, the adoption of small cells reduces the crowdsensing burden per cell. Incorporating SCACâs content placement strategy into small cell infrastructure improves operational efficiency and addresses the critical need for sustainable and adaptable traffic management in densely populated urban areas.

Furthermore, SCACâs adaptability extends to incorporating mobile edge computing (MEC). MEC brings computation and storage resources closer to the user, reducing latency and improving response times^40,41. By integrating MEC with SCAC, we can further enhance content caching and delivery, ensuring the framework remains effective even as network architectures evolve. Additionally, the introduction of local storage-equipped helpers can enhance SCAC's local sharing performance⁴². These helpers, which may include fixed devices or mobile entities like vehicles, improve the overall efficiency of content distribution. This evolution in network management illustrates a significant shift toward more user-centric and demand-responsive telecommunication policies. By prioritizing efficient data distribution and network utilization, SCAC underscores the potential for a more balanced digital ecosystem. Given the ongoing complexities of urban data traffic and user demands, the integration of such forward-thinking strategies is imperative for fostering a more equitable and efficient digital landscape.

The efficacy of SCAC can be further improved by incorporating more detailed user data, including content preferences and viewing history, into the user mobility information. However, the collection and handling of such personal information can lead to sensitive concerns. Users are growing more cautious about granting consent for location data collection, especially considering the COVID-19 pandemic⁴³. This study, which conducted a systematic literature review of data privacy issues with contact tracing applications, highlighted significant privacy concerns regarding the collection of personal data, including location information. It emphasizes the need for clear privacy policies and transparency to address public concerns. This heightened sensitivity arises from concerns related to tracking the locations of infected individuals^43,44. The risk of MNOs collecting an excessive volume of mobility data emphasizes the importance of institutional measures. Stringent legislation should be enacted to clearly define the permissible boundaries of data collection. These measures, enforced with penalties for violations, should extend beyond mobility data to encompass other personal datasets, such as OTT viewing metrics. While MNOs currently have access to mobility data, OTT platforms possess unique datasets related to viewing habits, albeit not necessarily location information. The convergence of these two types of information, which are crucial for maximizing the effectiveness of SCAC, indicates that collaborative efforts between MNOs and OTT platforms could be pivotal in genuinely commercializing SCAC.

Ongoing global disputes between MNOs and OTT platforms revolve around network usage fees related to the traffic generated by OTT content streaming. These disputes highlight persistent concerns regarding network neutrality and the equitable distribution of network maintenance costs⁴⁵. Moreover, South Koreaâs representative MNOs failed to meet the performance conditions for the allocation of 5G frequency spectrum. These conditions included deploying a sufficient number of cell sites for 5G services and investing adequately in delivering 5G services over the 28Â GHz band, which were commitments made by the MNOs at the inception of their operations. This failure led to the cancelation of their 28Â GHz licenses, impeding the fulfillment of user demands for mobile network services, including those of the OTT service. Given these difficulties, implementing SCAC presents a mutually beneficial solution for MNOs and OTT platforms, alleviating financial burdens through reduced infrastructure and data transmission costs. In addition, the adoption of SCAC opens up possibilities for innovative partnerships. Through collaborations, SCAC collaborative efforts leading to improvements in OTT quality and the introduction of new mobile data plans not only expand the user base and improve market appeal but also strengthen the presence of both MNOs and OTT platforms.

However, the reliance of SCAC on pre-allocated user storage inherently requires active user engagement. This approach may encounter resistance from users who are cautious about sharing device storage or personal data. Research on usersâ psychological barriers to mobile caching suggests that potential user resistance may not be as great as feared. Especially when the benefits of participation, such as an improved content viewing experience, are effectively communicated to users⁴⁶. Specifically, the simulation settings of the SCAC framework, which limit data usage to 3Â GB, are reasonable and reflect realistic usage scenarios that are palatable to users. Additionally, providing appealing incentives as part of a user-centric approach can further mitigate these concerns and encourage participation^47,48. Incentive strategies can facilitate the establishment of a harmonious relationship among MNOs, OTT platforms, and users, thus contributing to the creation of a mutually beneficial digital landscape for all parties involved.

Moreover, measuring user satisfaction with the SCAC framework would ideally be conducted post-deployment. Preliminary research indicates that user acceptance tends to increase when they are aware of the limitations of existing infrastructure and the potential improvements offered by new technologies like mobile caching⁴⁹. Given the rising number of video streaming service users and the growing data demand, the user-centric approach of the SCAC framework, utilizing user preferences and mobility data, is likely to be well-received.

The limitations of the research and suggestions for future work are also presented. Implementing the SCAC principle involves forecasting collective statistics related to user mobility and content preferences. Dense public transportation networks characterized by regular schedules and established routes alleviate the need for precise predictions of collective user dynamics and temporal variations. However, tailoring SCAC strategies to align with the distinctive dynamics of user populations using public transportation can yield significant caching benefits. Consequently, further research examining the dynamics of public transportation mobility and analyzing content consumption patterns during public transportation usage is essential to enhance the efficacy of SCAC strategies.

Although SCAC strategies demonstrate potential in enhancing overall performance at the cell level, a more nuanced investigation is required to validate how content is shared among individual users and to ensure equitable content distribution among all users. The unpredictable nature of user behaviors and fluctuating network conditions pose challenges in achieving fair and efficient content distribution. Addressing these challenges necessitates the development of a robust content-sharing protocol that accommodates this variability. This protocol must ensure reliable and efficient content sharing, even under non-ideal conditions, such as weak device connections or sporadic user cooperation. Adapting the protocol to consider the actual availability of content sharing under diverse real-world conditions significantly improves the efficiency of content distribution among users.

Spaceâtime considerations offer new avenues for collaboration between MNOs and OTT platforms in developing cooperative management strategies that optimize network resources without substantial infrastructure investments. Consequently, the development and implementation of innovative business strategies in this realm are crucial areas for further investigation to ensure the economic viability and market acceptance of these technological advancements. However, for these possibilities to materialize into actual business ventures, empirical exploration is essential to determine the optimal conditions under which two service providers can form a strategic alliance and understand the resulting consequences.

Methods

This section details the design of a testbed that leverages datasets comprising user mobility and content preferences. An AI-based framework that employs neural parameter optimization to enhance content delivery efficiency is proposed. Specifically designed to optimize the selection of content stored on the userâs device, this framework enhances the efficiency and accuracy of content delivery at the cell level. This integrated approach leverages advanced data analysis and adaptive caching strategies to enhance the user mobile OTT experience and network performance.

Urban virtual testbed environment

The testbed leverages a dataset comprising user mobility and content preference to capture transition behaviors and collective movement patterns. FigureÂ 5 shows the data collection process. User mobility was tracked using an open-source traffic simulator, simulation of urban mobility (SUMO⁵⁰), within a 5âÃâ5Â km urban area near the Korea University campus in Seoul. Fifty real macrocells were selected based on geographical information, and a population of pedestrians and vehicles was distributed proportionally to the actual demography. Daily dynamic patterns are tracked from 8 a.m. to 8 p.m., reflecting that on-the-go user content consumption is concentrated during the daytime. Individual trips along the streets are headed to destinations chosen according to mobility patterns. To accurately depict mass mobility patterns in large-scale simulations, it is essential to begin with the meticulous modeling of individual mobility patterns, drawn from realistic data. The simulation monitors the instantaneous measurement of data traffic demand, inter-cell mobility, D2D link status, and content preference.

Preference and mobility collection

User content preferences serve as authentic reflections of user behavior. To enhance the accuracy of our analysis, we adopt preference models from previous studies. Leveraging a content library comprising the top 100 ranked content, we categorize users into two groups: regular and heavy content consumers. The viewing patterns for content depend on temporal preference patterns, which exhibit regional variations. Previous studies have explored the use of a shuffled content library to represent geographically heterogeneous preferences⁵¹. Building on this concept, we developed a model that gradually propagates regional preference patterns, initiating from densely populated areas. Geographical propagation of preferences can be characterized in a one-dimensional manner by introducing the propagation degree, denoted as \(\theta\). This degree represents the norm of the difference in popularity rank vectors. For example, an increase in \(\theta\) value signifies more pronounced regional preference characteristics, highlighting clearer distinctions in content preferences across different regions. This variability emphasizes the strategic importance of content placement from a network perspective. At the cellular-level, user mobility is tracked through inter-cell handover technology, which monitors both the population within their range and those transitioning to neighboring cells⁵². The regular collection of mobility and local preferences occurs at the closest cell, providing insights into level user behavior. Meanwhile, global preferences are aggregated across the network to identify broader patterns.

To enhance content placement intelligence, inter-contact model parameters²⁸, such as average user dwell time and average user interaction time, are explicitly calculated and incorporated as input data. Hourly traffic demands represent the total volume of content requested within each cell per hour. The supply of each cell is set as the average of its demands over the entire period. Furthermore, link configurations for content sharing are subject to the wireless characteristics of urban-model channel propagation, with link status modeled by the probability of users coexisting in a cell^16,28. For our simulation, D2D communication is generally restricted to devices with 3Â GB storage within a 20Â m distance. This general setup reflects typical interactions within the wireless caching network, offering a standardized environment for analyzing network dynamics and content sharing. The simulation mobility patterns mirror real-world dynamics: (i) limited mobility: over half of the population remains within a single cell. This result confirms a previous study on mobility⁸. (ii) Hub concentration: users congregate at transportation hubs reflecting high-traffic areas.

Neural parameter optimization

A distributed solution for content placement inherently involves optimization for the dynamic nature of user mobility and content preference. Various network optimization techniques have achieved success, including a recent AI-based approach that effectively leverages temporal patterns revealed from historical input records^53,54. In this subsection, we explore a neural network model that processes spatiotemporal characteristics of urban environment datasets.

FigureÂ 6(a) illustrates the process of aggregating hourly records from the mobility and preference domains into daily records, which are subsequently stored as data collections in a dataset queue. The record that cell \(i\) processes at the \(j\)-th hourly interval of the \(n\)-th day is denoted by \({T}^{n}\left(i,j\right)\). The resulting daily records \({D}_{i}^{n}\) encapsulate time-varying measurements related to data traffic, content preference, and mobility patterns within the cell-level operating cycle. Over extended periods, such as 10Â days, these daily records, which are stored as data collections in a dataset queue, serve as input batches for training the content placement strategy.

FigureÂ 6(b) presents the structure of a neural network model that constructs computation rules for spatiotemporal offloading without requiring exact knowledge of global network states. The model comprises two major components: a data encoder and a solution decoder. The data encoder includes an input layer, a convolutional layer, and a permutation layer, whereas the solution decoder contains several layers of dense neural networks and an output layer. Data collections stored in the dataset queue are fed into the input layer as training datasets. These collections enable training for time-adaptive operations that adapt to weekly and monthly user behavior patterns. Batch normalization was applied to avoid issues caused by significant regional differences in data traffic and population density. The convolutional layer extracts the spaceâtime characteristics of the input records. The extracted geometric feature constructs a latent space for location-specific traffic and preference records. The permutation layer associates the extracted local patterns with the adjacency information of the real-map geography and encodes traffic and preference flows. Given that user population and data traffic exchanges are infrequent between distant cells, these geographical features can be encoded as an adjacency map. This map is masked to the convolution layer output to calculate additional convolution only among cells with user and traffic exchanges. In addition, a permuted tensor is applied to provide equivariance for input ordering. In contrast to image processing, where pixel order matters, this feature ensures that the optimization solution remains independent of the location information of cells. Consequently, the developed model encodes network dynamics features for cell-specific information independent of cell-specific geography. The solution decoder comprises several layers of dense neural networks and an output layer. In forward-pass computations, the decoder network calculates the offloading gain of individual cells. It is defined as the total sum of the remaining demand after offloading. Furthermore, backpropagation computations update parameter sets via the RMSProp optimizer⁵⁵. The output layer yields a list of content volumes placed on the user population residing in the cell.

Data availability

Sample files of SCAC mobility data were uploaded in the repository. They can be found at: https://github.com/happywater12/SCAC

References

Mulla, T. Assessing the factors influencing the adoption of over-the-top streaming platforms: A literature review from 2007 to 2021. Telemat. Inform. 69, 101797 (2022).
ArticleÂ Google ScholarÂ
Mohajer, S., Bergel, I. & Caire, G. Cooperative wireless mobile caching: A signal processing perspective. IEEE Signal Process. Mag. 37, 18â38 (2020).
ArticleÂ Google ScholarÂ
Farooq, M. & Raju, V. Impact of over-the-top (OTT) services on the telecom companies in the era of transformative marketing. Glob. J. Flex. Syst. Manag. 20(2), 177â188 (2019).
ArticleÂ Google ScholarÂ
Sefati, S. S. & Halunga, S. Ultra-reliability and low-latency communications on the internet of things based on 5G network: Literature review, classification, and future research view. Trans. Emerg. Telecommun. Technol. 34(6), e4770 (2023).
ArticleÂ Google ScholarÂ
Choi, Y. W. & Lee, C. Time-of-Day and day-of-week effects on TV and OTT media choices: Evidence from South Korea. J. Theor. Appl. Electron. Commer. Res. 19(1), 1â19 (2023).
ArticleÂ Google ScholarÂ
Ericsson mobility report, https://www.ericsson.com/4ae12c/assets/local/reports-papers/mobility-report/documents/2023/ericsson-mobility-report-november-2023.pdf. (Accessed 1 Dec 2023).
Yao, J., Han, T. & Ansari, N. On mobile edge caching. IEEE Commun. Surv. Tutor. 21(3), 2525â2553 (2019).
ArticleÂ Google ScholarÂ
Paul, U., Subramanian, A. P., Buddhikot, M. M. & Das, S. R. Understanding traffic dynamics in cellular data networks. In 2011 Proceedings IEEE Infocom 882â890 (IEEE, 2011).
ChapterÂ Google ScholarÂ
Benseny, J., Lahteenmaki, J., Toyli, J. & Hammainen, H. Urban wireless traffic evolution: The role of new devices and the effect of policy. Telecommun. Policy 47(7), 102595 (2023).
ArticleÂ Google ScholarÂ
Nikandish, G., Staszewski, R. B. & Zhu, A. Breaking the bandwidth limit: A review of broadband doherty power amplifier design for 5G. IEEE Microw. Mag. 21(4), 57â75 (2020).
ArticleÂ Google ScholarÂ
Zhang, Y., Hossain, M. S., Ghoneim, A. & Guizani, M. Cocme: Content-oriented caching on the mobile edge for wireless communications. IEEE Wirel. Commun. 26, 26â31 (2019).
ArticleÂ CASÂ Google ScholarÂ
Sujata, J. et al. Impact of over the top (OTT) services on telecom service providers. Indian J. Sci. Technol. 8(S4), 145â160 (2015).
ArticleÂ ADSÂ Google ScholarÂ
Federal Communications Commission. 2020 Broadband Deployment Report. https://www.fcc.gov/reports-research/reports/broadband-progress-reports/2020-broadband-deployment-report. (Accessed 1 Oct 2023).
Deller, S., Whitacre, B. & Conroy, T. Rural broadband speeds and business startup rates. Am. J. Agric. Econ. 104(3), 999â1025 (2022).
ArticleÂ Google ScholarÂ
Yang, Z., Fu, Y., Liu, Y., Chen, Y. & Zhang, J. A new look at AI-driven noma-frans: Features extraction, cooperative caching, and cache-aided computing. IEEE Wirel. Commun. 29(3), 123â130 (2022).
ArticleÂ Google ScholarÂ
Song, J. & Choi, W. Mobility-aware content placement for device-to-device caching systems. IEEE Trans. Wirel. Commun. 18(7), 3658â3668 (2019).
ArticleÂ Google ScholarÂ
Liu, L. et al. Joint computation offloading and data caching in multi-access edge computing enabled internet of vehicles. IEEE Tran. Veh. Technol. 72(11), 14939â14954 (2023).
Xiao, A. et al. User preference aware resource management for wireless communication networks. IEEE Netw. 34(3), 78â85 (2020).
ArticleÂ Google ScholarÂ
He, S. et al. Cache-enabled coordinated mobile edge network: Opportunities and challenges. IEEE Wirel. Commun. 27(2), 204â211 (2020).
ArticleÂ Google ScholarÂ
Castellano, R. New Hard Disk Drive Technologies Offer Stiff Competition for NAND https://seekingalpha.com/article/4196050-new-hard-disk-drive-technologies-offer-stiff-competition-for-nand. (Accessed 1 Oct 2023).
Lee, M. C., Feng, H. & Molisch, A. F. Dynamic caching content replacement in base station assisted wireless D2D caching networks. IEEE Access 8, 33909â33925 (2020).
ArticleÂ Google ScholarÂ
He, S. et al. Cache-enabled coordinated mobile edge network: Opportunities and challenges. IEEE Wireless Commun. 27(2), 204â211 (2020).
ArticleÂ Google ScholarÂ
Ibrahim, A. M., Zewail, A. A. & Yener, A. Device-to-device coded-caching with distinct cache sizes. IEEE Trans. Commun. 68(5), 2748â2762 (2020).
ArticleÂ Google ScholarÂ
Wang, S. et al. An approach for spatial-temporal traffic modeling in mobile cellular networks. In 2015 27th International Teletraffic Congress 203â209 (IEEE, 2015).
ChapterÂ Google ScholarÂ
Chatzieleftheriou, L. E., Karaliopoulos, M. & Koutsopoulos, I. Caching-aware recommendations: Nudging user preferences towards better caching performance. In IEEE Infocom 2017-IEEE Conference on Computer Communications 1â9 (IEEE, 2017).
Google ScholarÂ
Yao, L., Chen, A., Deng, J., Wang, J. & Wu, G. A cooperative caching scheme based on mobility prediction in vehicular content centric networks. IEEE Trans. Veh. Technol. 67(6), 5435â5444 (2017).
ArticleÂ Google ScholarÂ
Poularakis, K. & Tassiulas, L. Exploiting user mobility for wireless content delivery. In 2013 IEEE International Symposium on Information Theory 1017â1021 (IEEE, 2013).
ChapterÂ Google ScholarÂ
Wang, R., Peng, X., Zhang, J. & Letaief, K. B. Mobility-aware caching for content-centric wireless networks: Modeling and methodology. IEEE Commun. Mag. 54(8), 77â83 (2016).
ArticleÂ Google ScholarÂ
Malak, D., Al-Shalash, M. & Andrews, J. G. Optimizing content caching to maximize the density of successful receptions in device-to-device networking. IEEE Trans. Commun. 64(10), 4365â4380 (2016).
Google ScholarÂ
Zhang, Y. et al. Efficient and robust certificateless signature for data crowdsensing in cloud-assisted industrial IoT. IEEE Trans. Ind. Inf. 15(9), 5099â5108 (2019).
ArticleÂ Google ScholarÂ
Zhao, C. et al. Data quality guarantee for credible caching device selection in mobile crowdsensing systems. IEEE Wirel. Commun. 25(3), 58â64 (2018).
ArticleÂ Google ScholarÂ
Liu, J. et al. A large-scale concurrent data anonymous batch verification scheme for mobile healthcare crowd sensing. IEEE Internet Things J. 6(2), 1321â1330 (2018).
ArticleÂ Google ScholarÂ
Liu, J., Shen, H., Narman, H. S., Chung, W. & Lin, Z. A survey of mobile crowdsensing techniques: A critical component for the internet of things. ACM Trans. Cyber Phys. Syst. 2(3), 1â26 (2018).
ArticleÂ Google ScholarÂ
Liu, Y., Kong, L. & Chen, G. Data-oriented mobile crowdsensing: A comprehensive survey. IEEE Commun. Surv. Tutor. 21(3), 2849â2885 (2019).
ArticleÂ Google ScholarÂ
Wu, F.-J. & Luo, T. Crowdprivacy: Publish more useful data with less privacy exposure in crowdsourced location-based services. ACM Trans. Priv. Secur. 23(1), 1â25 (2020).
ArticleÂ MathSciNetÂ Google ScholarÂ
Han, S., Xue, F., Yang, C., Liu, J. & Lin, F. Data-supported caching policy optimization for wireless D2D caching networks. IEEE Trans. Commun. 69(11), 7618â7630 (2021).
ArticleÂ Google ScholarÂ
Posfai, M. & Barabasi, A.-L. Network Science (Citeseer, 2016).
Google ScholarÂ
Korea open government license, Current status of mobility in Seoul https://data.seoul.go.kr/dataVisual/seoul/seoulLivingMigration.do. (Accessed 1 Jan 2024).
Busari, S. A., Mumtaz, S., Al-Rubaye, S. & Rodriguez, J. 5G millimeter-wave mobile broadband: Performance and challenges. IEEE Commun. Mag. 56(6), 137â143 (2018).
ArticleÂ Google ScholarÂ
Spinelli, F. & Mancuso, V. Toward enabled industrial verticals in 5G: A survey on MEC-based approaches to provisioning and flexibility. IEEE Commun. Surv. Tutor. 23(1), 596â630 (2020).
ArticleÂ Google ScholarÂ
Yang, S. et al. Caching-enabled computation offloading in multi-region MEC network via deep reinforcement learning. IEEE Internet Things J. 9(21), 21086â21098 (2022).
ArticleÂ MathSciNetÂ Google ScholarÂ
Rim, M. & Kang, C. G. Content prefetching of mobile caching devices in cooperative D2D communication systems. IEEE Access 8, 141331â141341 (2020).
ArticleÂ Google ScholarÂ
Hu, T. et al. Human mobility data in the covid-19 pandemic: Characteristics, applications, and challenges. Int. J. Digit. Earth. 14(9), 1126â1147 (2021).
ArticleÂ ADSÂ Google ScholarÂ
Kondor, D., Hashemian, B., de Montjoye, Y.-A. & Ratti, C. Towards matching user mobility traces in large-scale datasets. IEEE Trans. Big Data. 6(4), 714â726 (2018).
ArticleÂ Google ScholarÂ
Bauner, C. & Espin, A. Do subscribers of mobile networks care about data throttling?. Telecommun. Policy 47(10), 102665 (2023).
ArticleÂ Google ScholarÂ
Jang, Y. & Kim, S. Understanding mobile OTT service usersâ resistance to participation in wireless D2D caching networks. Behav. Sci. 14(3), 158 (2024).
ArticleÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Luthans, F. & Stajkovic, A. D. Provide recognition for performance improvement. In Handbook of Principles of Organizational Behavior: Indispensable Knowledge for Evidence-Based Management 239â253 (Wiley, 2012).
ChapterÂ Google ScholarÂ
Zhou, H., Wu, T., Zhang, H. & Wu, J. Incentive-driven deep reinforcement learning for content caching and D2D offloading. IEEE J. Sel. Areas Commun. 39(8), 2445â2460 (2021).
ArticleÂ Google ScholarÂ
Jang, Y. & Kim, S. A decision model for OTT service users to adopt wireless D2D caching networks: Exploring the Korean case. Telecommun. Policy https://doi.org/10.1016/j.telpol.2024.102793 (2024).
ArticleÂ Google ScholarÂ
Krajzewicz, D., Erdmann, J., Behrisch, M. & Bieker-Walz, L. Recent development and applications of SUMO-simulation of urban mobility. Int. J. Adv. Syst. Measure. 5, 128â138 (2012).
Google ScholarÂ
Liu, D. & Yang, C. Caching at base stations with heterogeneous user demands and spatial locality. IEEE Trans. Commun. 67(2), 1554â1569 (2019).
ArticleÂ ADSÂ Google ScholarÂ
Kumar, P. P. & Sagar, K. A relative survey on handover techniques in mobility management. In IOP Conference Series: Materials Science and Engineering 012027 (IOP Publishing, 2019).
Google ScholarÂ
Sutton, R. S. Learning to predict by the methods of temporal differences. Mach. learn. 3, 9â44 (1988).
ArticleÂ Google ScholarÂ
Gao, J., Galley, M. & Li, L. Neural approaches to conversational AI. In The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval. 1371â1374. (SIGIR, 2018).
Goodfellow, I.J., Vinyals, O. & Saxe, A. M. Qualitatively characterizing neural network optimization problems https://arxiv.org/abs/1412.6544 (2015).

Download references

Acknowledgements

This research was supported by the MSIT (Ministry of Science and ICT), Korea, under the ITRC (Information Technology Research Center) support program (IITP-2024-2020-0-01749) supervised by the IITP (Institute for Information & Communications Technology Planning & Evaluation).

Funding

This work was supported by Ministry of Science and ICT, South Korea, IITP-2024-2020-0-01749.

Author information

These authors contributed equally: Hee Soo Kim and Yumi Jang.

Authors and Affiliations

School of Electrical Engineering, Korea University, Seoul, Republic of Korea
Hee Soo Kim,Â Yun Jae Choi,Â Hong Ki KimÂ &Â Sang Hyun Lee
Smart Media Service Research Center, Korea University, Seoul, Republic of Korea
Yumi Jang
School of Media and Communication, Korea University, Seoul, Republic of Korea
Seongcheol Kim

Authors

Hee Soo Kim
View author publications
You can also search for this author in PubMedÂ Google Scholar
Yumi Jang
View author publications
You can also search for this author in PubMedÂ Google Scholar
Yun Jae Choi
View author publications
You can also search for this author in PubMedÂ Google Scholar
Hong Ki Kim
View author publications
You can also search for this author in PubMedÂ Google Scholar
Seongcheol Kim
View author publications
You can also search for this author in PubMedÂ Google Scholar
Sang Hyun Lee
View author publications
You can also search for this author in PubMedÂ Google Scholar

Contributions

H.K. and S.H.L. led the research program and proposed the initial concept of the SCAC framework. S.H.L. provided the technical approach in SCAC deployment. H.K., Y.J., and S.H.L. participated in literature search and writing. H.K. and Y.J. constructed a testbed based on urban environments and collected data. H.K.K. and Y.J.C. developed AI-based algorithms and processed data generated from the testbed. H.K., Y.J., H.K.K. and Y.J.C. performed simulation tests and analyzed the results. H.K. and Y.J. interpreted and visualized the results. Y.J. and S.K. reviewed the commercialization strategy for the technology. H.K. and Y.J. contributed equally to this work. S.K. and S.H.L. take responsibility for the organization of the overall paper and approved the submission. All authors reviewed the manuscript.

Corresponding authors

Correspondence to Seongcheol Kim or Sang Hyun Lee.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Kim, H.S., Jang, Y., Choi, Y.J. et al. Examining spatiotemporal crowdsensing and caching for population-dynamic OTT content delivery. Sci Rep 14, 13783 (2024). https://doi.org/10.1038/s41598-024-64589-1

Download citation

Received: 25 February 2024
Accepted: 11 June 2024
Published: 14 June 2024
DOI: https://doi.org/10.1038/s41598-024-64589-1