Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2831296.2831338acmotherconferencesArticle/Chapter ViewAbstractPublication PagesmobicomConference Proceedingsconference-collections
short-paper

A Reinforcement Learning-based Data-Link Protocol for Underwater Acoustic Communications

Published: 22 October 2015 Publication History

Abstract

We consider an underwater acoustic link where a sender transmits a flow of packets to a receiver through a channel with time varying quality. We address the problem of scheduling packets transmission, forward error correction (FEC) code selection, and channel probing to achieve the best trade-off between energy consumption and latency. Unlike previous works, which assume complete knowledge of the statistics of the underwater acoustic environment, we make the protocol learn the optimal behavior based on experience, without relying on any prior knowledge on the environment. We design a Reinforcement-Learning (RL)-based protocol which learns how to minimize a cost function which is a combination of delay and energy consumption, at the same time ensuring packet delivery. Starting from a basic Q-learning strategy, we design two learning algorithms to speed up learning time, and compare the performance of the proposed solutions with the Q-learning-based strategy and with an aggressive strategy which always transmits all the packets in the buffer. The results show that the proposed techniques outperform the aggressive policy and Q-learning, and are successful in achieving good tradeoffs between energy consumption and packet delivery latency (PDL).

References

[1]
M. Chitre and K. Pelekanakis. "Channel variability measurements in an underwater acoustic network." Underwater Communications and Networking (UComms), 2014. IEEE, 2014.
[2]
P. Casari, M. Rossi, M. Zorzi, Towards optimal broadcasting policies for HARQ based on fountain codes in underwater networks, in: Proc. IEEE/IFIP WONS, 2008.
[3]
. Tomasi et al., Redundancy allocation in time-varying channels with long propagation delays, Ad Hoc Netw. (2015), http://dx.doi.org/10.1016/j.adhoc.2015.01.009
[4]
B. Tomasi et al., Cross-layer analysis via Markov models of incremental redundancy hybrid ARQ over underwater acoustic channels, in press, Ad Hoc Netw. (2014), http://dx.doi.org/10.1016/j.adhoc.2014.07.013
[5]
R. S. Sutton and A. G. Barto. Reinforcement learning: An introduction. Cambridge: MIT press, 1998.
[6]
N. Mastronarde and M. van der Schaar. "Joint physical-layer and system-level power management for delay-sensitive wireless communications." IEEE Trans. Mob. Comp. 12.4 (2013): 694-709.
[7]
T. Hu and Y. Fei, QELAR: A Machine-Learning-Based Adaptive Routing Protocol for Energy-Efficient and Lifetime-Extended Underwater Sensor Networks, IEEE Trans. Mob. Comp, Vol. 9, No. 6, pp. 796--808, JUNE 2010.
[8]
R. Plate, C. Wakayama, Utilizing kinematics and selective sweeping in reinforcement learning-based routing algorithms for underwater networks, in press, Ad Hoc Netw. (2014), http://dx.doi.org/10.1016/j.adhoc.2014.09.012.
[9]
B. Tomasi, et al. "On modeling JANUS packet errors over a shallow water acoustic channel using Markov and hidden Markov models." Proc. 2010 IEEE Military Communications Conference (MILCOM 2010), San Jose, Ca, USA, Oct. 31 - Nov. 3, 2010.
[10]
M. L. Puterman, "Markov decision processes: discrete stochastic dynamic programming", Wiley, 2019.

Cited By

View all
  • (2024)Joint Link Scheduling and Power Allocation in Imperfect and Energy-Constrained Underwater Wireless Sensor NetworksIEEE Transactions on Mobile Computing10.1109/TMC.2024.336842523:10(9863-9880)Online publication date: Oct-2024
  • (2024)Energy-Efficient and Reliable Deployment Models for Hybrid Underwater Acoustic Sensor Networks with a Mobile GatewayJournal of Marine Science and Application10.1007/s11804-024-00444-zOnline publication date: 8-Jul-2024
  • (2023)A Biologically Inspired Self-Organizing Underwater Sensor NetworkApplied Sciences10.3390/app1307433013:7(4330)Online publication date: 29-Mar-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
WUWNet '15: Proceedings of the 10th International Conference on Underwater Networks & Systems
October 2015
228 pages
ISBN:9781450340366
DOI:10.1145/2831296
  • General Chairs:
  • Scott Midkiff,
  • Xiaoli Ma,
  • Publications Chair:
  • Zheng Peng
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

  • ASA: American Statistical Association
  • ONRGlobal: U.S. Office of Naval Research Global

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 22 October 2015

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Underwater communications
  2. adaptive protocols
  3. reinforcement learning
  4. underwater networks

Qualifiers

  • Short-paper
  • Research
  • Refereed limited

Conference

WUWNET '15
Sponsor:
  • ASA
  • ONRGlobal

Acceptance Rates

Overall Acceptance Rate 84 of 180 submissions, 47%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)14
  • Downloads (Last 6 weeks)4
Reflects downloads up to 13 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Joint Link Scheduling and Power Allocation in Imperfect and Energy-Constrained Underwater Wireless Sensor NetworksIEEE Transactions on Mobile Computing10.1109/TMC.2024.336842523:10(9863-9880)Online publication date: Oct-2024
  • (2024)Energy-Efficient and Reliable Deployment Models for Hybrid Underwater Acoustic Sensor Networks with a Mobile GatewayJournal of Marine Science and Application10.1007/s11804-024-00444-zOnline publication date: 8-Jul-2024
  • (2023)A Biologically Inspired Self-Organizing Underwater Sensor NetworkApplied Sciences10.3390/app1307433013:7(4330)Online publication date: 29-Mar-2023
  • (2022)Machine Learning for Underwater Acoustic CommunicationsIEEE Wireless Communications10.1109/MWC.2020.200028429:3(102-108)Online publication date: Jun-2022
  • (2022)TSV-MAC: Time Slot Variable MAC Protocol Based on Deep Reinforcement Learning for UASNsWireless Algorithms, Systems, and Applications10.1007/978-3-031-19211-1_19(225-237)Online publication date: 24-Nov-2022
  • (2021)Systematic Review of Fault Tolerant Techniques in Underwater Sensor NetworksSensors10.3390/s2109326421:9(3264)Online publication date: 8-May-2021
  • (2021)Double-Scale Adaptive Transmission in Time-Varying Channel for Underwater Acoustic Sensor NetworksSensors10.3390/s2106225221:6(2252)Online publication date: 23-Mar-2021
  • (2021)Energy-Efficient Collision Avoidance MAC Protocols for Underwater Sensor Networks: Survey and ChallengesJournal of Marine Science and Engineering10.3390/jmse90707419:7(741)Online publication date: 4-Jul-2021
  • (2021)Active Queue-Management Policies for Undersea Networking via Deep Reinforcement LearningOCEANS 2021: San Diego – Porto10.23919/OCEANS44145.2021.9706025(1-8)Online publication date: 20-Sep-2021
  • (2021)Reinforcement Learning-Based Routing in Underwater Acoustic Sensor NetworksWireless Personal Communications: An International Journal10.1007/s11277-021-08467-3120:1(419-446)Online publication date: 1-Sep-2021
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media