Search | arXiv e-print repository

Maximizing Information Gain in Privacy-Aware Active Learning of Email Anomalies

Authors: Mu-Huan Miles Chung, Sharon Li, Jaturong Kongmanee, Lu Wang, Yuhong Yang, Calvin Giang, Khilan Jerath, Abhay Raman, David Lie, Mark Chignell

Abstract: Redacted emails satisfy most privacy requirements but they make it more difficult to detect anomalous emails that may be indicative of data exfiltration. In this paper we develop an enhanced method of Active Learning using an information gain maximizing heuristic, and we evaluate its effectiveness in a real world setting where only redacted versions of email could be labeled by human analysts due to privacy concerns. In the first case study we examined how Active Learning should be carried out. We found that model performance was best when a single highly skilled (in terms of the labelling task) analyst provided the labels. In the second case study we used confidence ratings to estimate the labeling uncertainty of analysts and then prioritized instances for labeling based on the expected information gain (the difference between model uncertainty and analyst uncertainty) that would be provided by labelling each instance. We found that the information maximization gain heuristic improved model performance over existing sampling methods for Active Learning. Based on the results obtained, we recommend that analysts should be screened, and possibly trained, prior to implementation of Active Learning in cybersecurity applications. We also recommend that the information gain maximizing sample method (based on expert confidence) should be used in early stages of Active Learning, providing that well-calibrated confidence can be obtained. We also note that the expertise of analysts should be assessed prior to Active Learning, as we found that analysts with lower labelling skill had poorly calibrated (over-) confidence in their labels.
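The abstract above describes the sampling score only as "the difference between model uncertainty and analyst uncertainty". A minimal sketch of one way such a score could be computed follows. It assumes, purely for illustration, that both uncertainties are measured as binary Shannon entropy and that an analyst's confidence is available as a probability; the function names and the example values are hypothetical and are not taken from the paper.

```python
import math


def binary_entropy(p: float) -> float:
    """Shannon entropy in bits of a Bernoulli(p) variable; 0 when p is 0 or 1."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -(p * math.log2(p) + (1.0 - p) * math.log2(1.0 - p))


def expected_information_gain(model_prob: float, analyst_confidence: float) -> float:
    """Model uncertainty minus analyst uncertainty (both as entropy, by assumption)."""
    return binary_entropy(model_prob) - binary_entropy(analyst_confidence)


def rank_for_labeling(candidates):
    """Sort (id, model_prob, analyst_confidence) tuples by descending gain."""
    return sorted(
        candidates,
        key=lambda c: expected_information_gain(c[1], c[2]),
        reverse=True,
    )


# Hypothetical candidates: (email id, model P(anomalous), analyst confidence).
candidates = [("a", 0.5, 0.6), ("b", 0.5, 0.95), ("c", 0.9, 0.95)]
ranked_ids = [c[0] for c in rank_for_labeling(candidates)]
```

Under these assumptions, an instance scores highest when the model is unsure and the analyst is confident, which is the case where a label is most likely to be both informative and reliable.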

Submitted 12 May, 2024; originally announced May 2024.

Comments: arXiv admin note: substantial text overlap with arXiv:2303.00870

arXiv:2404.03048 [pdf, other]

Decentralised Moderation for Interoperable Social Networks: A Conversation-based Approach for Pleroma and the Fediverse

Authors: Vibhor Agarwal, Aravindh Raman, Nishanth Sastry, Ahmed M. Abdelmoniem, Gareth Tyson, Ignacio Castro

Abstract: The recent development of decentralised and interoperable social networks (such as the "fediverse") creates new challenges for content moderators. This is because millions of posts generated on one server can easily "spread" to another, even if the recipient server has very different moderation policies. An obvious solution would be to leverage moderation tools to automatically tag (and filter) posts that contravene moderation policies, e.g. related to toxic speech. Recent work has exploited the conversational context of a post to improve this automatic tagging, e.g. using the replies to a post to help classify if it contains toxic speech. This has shown particular potential in environments with large training sets that contain complete conversations. This, however, creates challenges in a decentralised context, as a single conversation may be fragmented across multiple servers. Thus, each server only has a partial view of an entire conversation because conversations are often federated across servers in a non-synchronized fashion. To address this, we propose a decentralised conversation-aware content moderation approach suitable for the fediverse. Our approach employs a graph deep learning model (GraphNLI) trained locally on each server. The model exploits local data to train a model that combines post and conversational information captured through random walks to detect toxicity. We evaluate our approach with data from Pleroma, a major decentralised and interoperable micro-blogging network containing 2 million conversations. Our model effectively detects toxicity on larger instances, exclusively trained using their local post information (0.8837 macro-F1). Our approach has considerable scope to improve moderation in decentralised and interoperable social networks such as Pleroma or Mastodon.

Submitted 16 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

Comments: Accepted at International AAAI Conference on Web and Social Media (ICWSM) 2024. Please cite accordingly!

arXiv:2404.00167 [pdf]

Electrical double layer and capacitance of TiO2 electrolyte interfaces from first principles simulations

Authors: Chunyi Zhang, Marcos Calegari Andrade, Zachary K. Goldsmith, Abhinav S. Raman, Yifan Li, Pablo Piaggi, Xifan Wu, Roberto Car, Annabella Selloni

Abstract: The electrical double layer (EDL) at aqueous solution-metal oxide interfaces critically affects many fundamental processes in electrochemistry, geology and biology, yet understanding its microscopic structure is challenging for both theory and experiments. Here we employ ab initio-based machine learning potentials including long-range electrostatics in large-scale atomistic simulations of the EDL at the TiO2-electrolyte interface. Our simulations provide a molecular-scale picture of the EDL that demonstrates the limitations of standard mean-field models. We further develop a method to accurately calculate the electrostatic potential drop at the interface. The computed capacitance originating from the adsorbed charges and the potential drop agrees with experiments, supporting the reliability of our description of the EDL. The larger interfacial capacitance of basic relative to acidic solutions originates from the higher affinity of the cations for the oxide surface and gives rise to distinct charging mechanisms on negative and positive surfaces.

Submitted 29 March, 2024; originally announced April 2024.

arXiv:2402.09392 [pdf, other]

LL-GABR: Energy Efficient Live Video Streaming Using Reinforcement Learning

Authors: Adithya Raman, Bekir Turkkan, Tevfik Kosar

Abstract: Over the recent years, research and development in adaptive bitrate (ABR) algorithms for live video streaming have been successful in improving users' quality of experience (QoE) by reducing latency to near real-time levels while delivering higher bitrate videos with minimal rebuffering time. However, the QoE models used by these ABR algorithms do not take into account that a large portion of live video streaming clients use mobile devices where a higher bitrate does not necessarily translate into higher perceived quality. Ignoring perceived quality results in playing videos at higher bitrates without a significant increase in perceptual video quality and becomes a burden for battery-constrained mobile devices due to higher energy consumption. In this paper, we propose LL-GABR, a deep reinforcement learning approach that models the QoE using perceived video quality instead of bitrate and uses energy consumption along with other metrics like latency, rebuffering events, and smoothness. LL-GABR makes no assumptions about the underlying video, environment, or network settings and can operate flexibly on different video titles, each having a different bitrate encoding ladder without additional re-training, unlike existing learning-based ABRs. Trace-driven experimental results show that LL-GABR outperforms the state-of-the-art approaches by up to 44% in terms of perceptual QoE and a 73% increase in energy efficiency as a result of reducing net energy consumption by 11%.

Submitted 14 February, 2024; originally announced February 2024.

Comments: 10 pages, 3 figures, 3 Tables

arXiv:2401.08821 [pdf]

Surface-Enhanced Raman Spectroscopy and Transfer Learning Toward Accurate Reconstruction of the Surgical Zone

Authors: Ashutosh Raman, Ren A. Odion, Kent K. Yamamoto, Weston Ross, Tuan Vo-Dinh, Patrick J. Codd

Abstract: Raman spectroscopy, a photonic modality based on the inelastic backscattering of coherent light, is a valuable asset to the intraoperative sensing space, offering non-ionizing potential and highly-specific molecular fingerprint-like spectroscopic signatures that can be used for diagnosis of pathological tissue in the dynamic surgical field. Though Raman suffers from weakness in intensity, Surface-Enhanced Raman Spectroscopy (SERS), which uses metal nanostructures to amplify Raman signals, can achieve detection sensitivities that rival traditional photonic modalities. In this study, we outline a robotic Raman system that can reliably pinpoint the location and boundaries of a tumor embedded in healthy tissue, modeled here as a tissue-mimicking phantom with selectively infused Gold Nanostar regions. Further, due to the relative dearth of collected biological SERS or Raman data, we implement transfer learning to achieve 100% validation classification accuracy for Gold Nanostars compared to Control Agarose, thus providing a proof-of-concept for Raman-based deep learning training pipelines. We reconstruct a surgical field of 30x60mm in 10.2 minutes, and achieve 98.2% accuracy, preserving relative measurements between features in the phantom. We also achieve an 84.3% Intersection-over-Union score, which is the extent of overlap between the ground truth and predicted reconstructions. Lastly, we also demonstrate that the Raman system and classification algorithm do not discern based on sample color, but instead on presence of SERS agents. This study provides a crucial step in the translation of intelligent Raman systems in intraoperative oncological spaces.
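The Intersection-over-Union score mentioned in the abstract above can be illustrated with a short sketch. It assumes the ground-truth and predicted reconstructions are available as equally sized binary grids; that representation, the function name, and the toy masks are illustrative assumptions and do not come from the paper.

```python
def intersection_over_union(truth, predicted):
    """IoU of two equally sized binary masks given as nested lists of 0/1."""
    if len(truth) != len(predicted):
        raise ValueError("masks must have the same number of rows")
    intersection = 0
    union = 0
    for row_t, row_p in zip(truth, predicted):
        if len(row_t) != len(row_p):
            raise ValueError("mask rows must have the same length")
        for t, p in zip(row_t, row_p):
            if t and p:
                intersection += 1  # cell marked in both masks
            if t or p:
                union += 1  # cell marked in at least one mask
    # Two empty masks overlap perfectly by convention (an assumption here).
    return intersection / union if union else 1.0


# Toy 2x3 masks: 2 cells agree, 4 cells are marked in at least one mask.
truth_mask = [[1, 1, 0], [0, 1, 0]]
predicted_mask = [[1, 0, 0], [0, 1, 1]]
iou = intersection_over_union(truth_mask, predicted_mask)
```

For these toy masks the ratio is 2 / 4 = 0.5.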

Submitted 16 January, 2024; originally announced January 2024.

Comments: Accepted to Hamlyn Symposium on Medical Robotics, 2023

arXiv:2310.19064 [pdf, other]

Apple Tasting: Combinatorial Dimensions and Minimax Rates

Authors: Vinod Raman, Unique Subedi, Ananth Raman, Ambuj Tewari

Abstract: In online binary classification under \emph{apple tasting} feedback, the learner only observes the true label if it predicts ``1". First studied by \cite{helmbold2000apple}, we revisit this classical partial-feedback setting and study online learnability from a combinatorial perspective. We show that the Littlestone dimension continues to provide a tight quantitative characterization of apple tasting in the agnostic setting, closing an open question posed by \cite{helmbold2000apple}. In addition, we give a new combinatorial parameter, called the Effective width, that tightly quantifies the minimax expected mistakes in the realizable setting. As a corollary, we use the Effective width to establish a \emph{trichotomy} of the minimax expected number of mistakes in the realizable setting. In particular, we show that in the realizable setting, the expected number of mistakes of any learner, under apple tasting feedback, can be $Θ(1), Θ(\sqrt{T})$, or $Θ(T)$. This is in contrast to the full-information realizable setting where only $Θ(1)$ and $Θ(T)$ are possible.
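The feedback rule stated in the abstract above (the true label is revealed only when the learner predicts 1) can be sketched as a simple interaction loop. The learner interface, the two trivial learners, and the example stream are hypothetical and serve only to show the protocol; they are not the paper's algorithms.

```python
def run_apple_tasting(predict, update, stream):
    """Play the apple tasting protocol over (x, y) pairs; return mistake count.

    The learner is charged for every mistake, but `update` receives the true
    label only on rounds where it predicted 1 (otherwise it receives None).
    """
    mistakes = 0
    for x, y in stream:
        y_hat = predict(x)
        if y_hat != y:
            mistakes += 1
        feedback = y if y_hat == 1 else None
        update(x, y_hat, feedback)
    return mistakes


# Hypothetical stream of (instance, true label) pairs.
stream = [(0, 1), (1, 0), (2, 1)]

# A learner that always predicts 1 sees every label.
seen_by_always_one = []
mistakes_always_one = run_apple_tasting(
    lambda x: 1, lambda x, y_hat, fb: seen_by_always_one.append(fb), stream
)

# A learner that always predicts 0 never sees a label.
seen_by_always_zero = []
mistakes_always_zero = run_apple_tasting(
    lambda x: 0, lambda x, y_hat, fb: seen_by_always_zero.append(fb), stream
)
```

The contrast between the two learners shows the trade-off the setting creates: predicting 1 buys information at the risk of false positives, while predicting 0 yields no feedback at all.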

Submitted 18 June, 2024; v1 submitted 29 October, 2023; originally announced October 2023.

Comments: 21 pages, COLT 2024 Camera Ready

arXiv:2310.15808 [pdf, other]

Dissecting the Performance of Satellite Network Operators

Authors: Aravindh Raman, Matteo Varvello, Hyunseok Chang, Nishanth Sastry, Yasir Zaki

Abstract: The rapid growth of satellite network operators (SNOs) has revolutionized broadband communications, enabling global connectivity and bridging the digital divide. As these networks expand, it is important to evaluate their performance and efficiency. This paper presents the first comprehensive study of SNOs. We take an opportunistic approach and devise a methodology which allows to identify public network measurements performed via SNOs. We apply this methodology to both M-Lab and RIPE public datasets which allowed us to characterize low level performance and footprint of up to 18 SNOs operating in different orbits. Finally, we identify and recruit paid testers on three popular SNOs (Starlink, HughesNet, and ViaSat) to evaluate the performance of popular applications like web browsing and video streaming.

Submitted 16 November, 2023; v1 submitted 24 October, 2023; originally announced October 2023.

Comments: Published at International Conference on emerging Networking EXperiments and Technologies (CoNEXT 2023). Please cite the CoNEXT version

arXiv:2310.11835 [pdf, other]

T3P: Demystifying Low-Earth Orbit Satellite Broadband

Authors: Shubham Tiwari, Saksham Bhushan, Aryan Taneja, Mohamed Kassem, Cheng Luo, Cong Zhou, Zhiyuan He, Aravindh Raman, Nishanth Sastry, Lili Qiu, Debopam Bhattacherjee

Abstract: The Internet is going through a massive infrastructural revolution with the advent of low-flying satellite networks, 5/6G, WiFi7, and hollow-core fiber deployments. While these networks could unleash enhanced connectivity and new capabilities, it is critical to understand the performance characteristics to efficiently drive applications over them. Low-Earth orbit (LEO) satellite mega-constellations like SpaceX Starlink aim to offer broad coverage and low latencies at the expense of high orbital dynamics leading to continuous latency changes and frequent satellite hand-offs. This paper aims to quantify Starlink's latency and its variations and components using a real testbed spanning multiple latitudes from the North to the South of Europe. We identify tail latencies as a problem. We develop predictors for latency and throughput and show their utility in improving application performance by up to 25%. We also explore how transport protocols can be optimized for LEO networks and show that this can improve throughput by up to 115% (with only a 5% increase in latency). Also, our measurement testbed with a footprint across multiple locations offers unique trigger-based scheduling capabilities that are necessary to quantify the impact of LEO dynamics.

Submitted 18 October, 2023; originally announced October 2023.

Comments: 16 pages

arXiv:2308.04620 [pdf, other]

Multiclass Online Learnability under Bandit Feedback

Authors: Ananth Raman, Vinod Raman, Unique Subedi, Idan Mehalel, Ambuj Tewari

Abstract: We study online multiclass classification under bandit feedback. We extend the results of Daniely and Helbertal [2013] by showing that the finiteness of the Bandit Littlestone dimension is necessary and sufficient for bandit online learnability even when the label space is unbounded. Moreover, we show that, unlike the full-information setting, sequential uniform convergence is necessary but not sufficient for bandit online learnability. Our result complements the recent work by Hanneke, Moran, Raman, Subedi, and Tewari [2023] who show that the Littlestone dimension characterizes online multiclass learnability in the full-information setting even when the label space is unbounded.

Submitted 20 January, 2024; v1 submitted 8 August, 2023; originally announced August 2023.

Comments: 16 pages, ALT 2024 Camera Ready

arXiv:2307.08782 [pdf, other]

Unsupervised Learning of Distributional Properties can Supplement Human Labeling and Increase Active Learning Efficiency in Anomaly Detection

Authors: Jaturong Kongmanee, Mark Chignell, Khilan Jerath, Abhay Raman

Abstract: Exfiltration of data via email is a serious cybersecurity threat for many organizations. Detecting data exfiltration (anomaly) patterns typically requires labeling, most often done by a human annotator, to reduce the high number of false alarms. Active Learning (AL) is a promising approach for labeling data efficiently, but it needs to choose an efficient order in which cases are to be labeled, and there are uncertainties as to what scoring procedure should be used to prioritize cases for labeling, especially when detecting rare cases of interest is crucial. We propose an adaptive AL sampling strategy that leverages the underlying prior data distribution, as well as model uncertainty, to produce batches of cases to be labeled that contain instances of rare anomalies. We show that (1) the classifier benefits from a batch of representative and informative instances of both normal and anomalous examples, (2) unsupervised anomaly detection plays a useful role in building the classifier in the early stages of training when relatively little labeling has been done thus far. Our approach to AL for anomaly detection outperformed existing AL approaches on three highly unbalanced UCI benchmarks and on one real-world redacted email data set.

Submitted 13 July, 2023; originally announced July 2023.

arXiv:2306.05405 [pdf, other]

Resonant Anti-Reflection Metasurface for Infrared Transmission Optics

Authors: John Brewer, Sachin Kulkarni, Aaswath P. Raman

Abstract: A fundamental capability for any transmissive optical component is anti-reflection, yet this capability is challenging to achieve in a cost-efficient manner over longer infrared wavelengths. We demonstrate that Mie resonant nanophotonic structures enhance transmission in Silicon, allowing it to function as an effective optical material over long-wave infrared wavelengths. This approach enables a window optic with up to 40\% greater transmission than equal thickness unpatterned Si. Imaging comparisons with unpatterned silicon and off-the-shelf Germanium optics are shown, as well as basic broadband slant edge MTF measurements. Overall, we demonstrate how Mie-resonant structures can be used to improve optical transmission through window optics of arbitrary lithographically patternable optical media, and highlight their possible use in imaging applications.

Submitted 8 June, 2023; originally announced June 2023.

Comments: 18 Pages, 4 figures

arXiv:2305.15760 [pdf, other]

Svarah: Evaluating English ASR Systems on Indian Accents

Authors: Tahir Javed, Sakshi Joshi, Vignesh Nagarajan, Sai Sundaresan, Janki Nawale, Abhigyan Raman, Kaushal Bhogale, Pratyush Kumar, Mitesh M. Khapra

Abstract: India is the second largest English-speaking country in the world with a speaker base of roughly 130 million. Thus, it is imperative that automatic speech recognition (ASR) systems for English should be evaluated on Indian accents. Unfortunately, Indian speakers find a very poor representation in existing English ASR benchmarks such as LibriSpeech, Switchboard, Speech Accent Archive, etc. In this work, we address this gap by creating Svarah, a benchmark that contains 9.6 hours of transcribed English audio from 117 speakers across 65 geographic locations throughout India, resulting in a diverse range of accents. Svarah comprises both read speech and spontaneous conversational data, covering various domains, such as history, culture, tourism, etc., ensuring a diverse vocabulary. We evaluate 6 open source ASR models and 2 commercial ASR systems on Svarah and show that there is clear scope for improvement on Indian accents. Svarah as well as all our code will be publicly available.

Submitted 25 May, 2023; originally announced May 2023.

arXiv:2305.15386 [pdf, other]

Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR

Authors: Kaushal Santosh Bhogale, Sai Sundaresan, Abhigyan Raman, Tahir Javed, Mitesh M. Khapra, Pratyush Kumar

Abstract: Improving ASR systems is necessary to make new LLM-based use-cases accessible to people across the globe. In this paper, we focus on Indian languages, and make the case that diverse benchmarks are required to evaluate and improve ASR systems for Indian languages. To address this, we collate Vistaar as a set of 59 benchmarks across various language and domain combinations, on which we evaluate 3 publicly available ASR systems and 2 commercial systems. We also train IndicWhisper models by fine-tuning the Whisper models on publicly available training datasets across 12 Indian languages totalling to 10.7K hours. We show that IndicWhisper significantly improves on considered ASR systems on the Vistaar benchmark. Indeed, IndicWhisper has the lowest WER in 39 out of the 59 benchmarks, with an average reduction of 4.1 WER. We open-source all datasets, code and models.

Submitted 2 August, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

Comments: Accepted in INTERSPEECH 2023

arXiv:2305.09670 [pdf, ps, other]

On the zeros of Riemann's Xi Function

Authors: Akhila Raman

Abstract: We consider Riemann's Xi function $ξ(s)$ which is evaluated at $s = \frac{1}{2} + σ+ i ω$, given by $ξ(\frac{1}{2} + σ+ i ω)= E_{pω}(ω)$, where $σ, ω$ are real and compute its inverse Fourier transform given by $E_p(t)$. We study the properties of $E_p(t)$ and a promising new method is presented which could be used to show that the Fourier Transform of $E_p(t)$ given by $E_{pω}(ω) = ξ(\frac{1}{2} + σ+ i ω)$ does not have zeros for finite and real $ω$ when $0 < |σ| < \frac{1}{2}$, corresponding to the critical strip excluding the critical line.

Submitted 29 December, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

Comments: Added more detailed explanations

arXiv:2303.15047 [pdf]

doi 10.1017/jfm.2023.542

Bubble nucleation and jetting inside a millimetric droplet

Authors: Juan Manuel Rosselló, Hendrik Reese, K. Ashoke Raman, Claus-Dieter Ohl

Abstract: In this work, we present experiments and simulations on the nucleation and successive dynamics of laser-induced bubbles inside liquid droplets in free-fall motion, i.e. a case with a free boundary in all directions. The droplets of a millimetric size have a nearly spherical shape by the moment the bubble is nucleated. We have investigated the nucleation of secondary bubbles induced by the rarefaction wave that is produced when the shock wave emitted by the laser-induced plasma reflects at the drop surface. Interestingly, three-dimensional clusters of cavitation bubbles are observed. Their shape is compared with the negative pressure distribution computed with a CFD model and allows us to estimate a cavitation threshold value. High-speed recordings of the drop/bubble dynamics are complemented by the velocity and pressure fields simulated for the same initial conditions. The effect of the proximity of a curved free surface on the jetting dynamics of the bubbles was qualitatively assessed by classifying the cavitation events using a non-dimensional stand-off parameter which depends on the drop size, the bubble maximum radius and the relative position of the bubble inside the drop. Additionally, we studied the role of the drop's curvature by implementing a structural similarity algorithm to compare cases with bubbles produced near a flat surface to the bubbles inside the drop. This quantitative comparison method indicated the existence of equivalent stand-off distances at which bubbles influenced by different boundaries behave in a very similar way. The oscillation of the laser-induced bubbles promote the onset of Rayleigh-Taylor and Rayleigh-Plateau instabilities, observed on the drop's surface. This phenomenon was studied by varying the ratio of the maximum radii of the bubble and the drop. The specific mechanisms leading to the destabilisation of the droplet surface were identified.

Submitted 5 June, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

arXiv:2303.00870 [pdf, other]

Implementing Active Learning in Cybersecurity: Detecting Anomalies in Redacted Emails

Authors: Mu-Huan Chung, Lu Wang, Sharon Li, Yuhong Yang, Calvin Giang, Khilan Jerath, Abhay Raman, David Lie, Mark Chignell

Abstract: Research on email anomaly detection has typically relied on specially prepared datasets that may not adequately reflect the type of data that occurs in industry settings. In our research, at a major financial services company, privacy concerns prevented inspection of the bodies of emails and attachment details (although subject headings and attachment filenames were available). This made labeling possible anomalies in the resulting redacted emails more difficult. Another source of difficulty is the high volume of emails combined with the scarcity of resources making machine learning (ML) a necessity, but also creating a need for more efficient human training of ML models. Active learning (AL) has been proposed as a way to make human training of ML models more efficient. However, the implementation of Active Learning methods is a human-centered AI challenge due to potential human analyst uncertainty, and the labeling task can be further complicated in domains such as the cybersecurity domain (or healthcare, aviation, etc.) where mistakes in labeling can have highly adverse consequences. In this paper we present research results concerning the application of Active Learning to anomaly detection in redacted emails, comparing the utility of different methods for implementing active learning in this context. We evaluate different AL strategies and their impact on resulting model performance. We also examine how ratings of confidence that experts have in their labels can inform AL. The results obtained are discussed in terms of their implications for AL methodology and for the role of experts in model-assisted email anomaly screening.

Submitted 2 March, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

arXiv:2302.14294 [pdf, other]

Flocking to Mastodon: Tracking the Great Twitter Migration

Authors: Haris Bin Zia, Jiahui He, Aravindh Raman, Ignacio Castro, Nishanth Sastry, Gareth Tyson

Abstract: The acquisition of Twitter by Elon Musk has spurred controversy and uncertainty among Twitter users. The move raised as many praises as concerns, particularly regarding Musk's views on free speech. As a result, a large number of Twitter users have looked for alternatives to Twitter. Mastodon, a decentralized micro-blogging social network, has attracted the attention of many users and the general media. In this paper, we track and analyze the migration of 136,009 users from Twitter to Mastodon. Our analysis sheds light on the user-driven pressure towards centralization in a decentralized ecosystem and identifies the strong influence of the social network in platform migration. We also characterize the activity of migrated users on both Twitter and Mastodon.

Submitted 27 February, 2023; originally announced February 2023.

arXiv:2302.05915 [pdf, other]

doi 10.1145/3543507.3583487

Will Admins Cope? Decentralized Moderation in the Fediverse

Authors: Ishaku Hassan Anaobi, Aravindh Raman, Ignacio Castro, Haris Bin Zia, Dami Ibosiola, Gareth Tyson

Abstract: As an alternative to Twitter and other centralized social networks, the Fediverse is growing in popularity. The recent, and polemical, takeover of Twitter by Elon Musk has exacerbated this trend. The Fediverse includes a growing number of decentralized social networks, such as Pleroma or Mastodon, that share the same subscription protocol (ActivityPub). Each of these decentralized social networks is composed of independent instances that are run by different administrators. Users, however, can interact with other users across the Fediverse regardless of the instance they are signed up to. The growing user base of the Fediverse creates key challenges for the administrators, who may experience a growing burden. In this paper, we explore how large that overhead is, and whether there are solutions to alleviate the burden. We study the overhead of moderation on the administrators. We observe a diversity of administrator strategies, with evidence that administrators on larger instances struggle to find sufficient resources. We then propose a tool, WatchGen, to semi-automate the process.

Submitted 12 February, 2023; originally announced February 2023.

arXiv:2301.13618 [pdf, other]

Scheduling Inference Workloads on Distributed Edge Clusters with Reinforcement Learning

Authors: Gabriele Castellano, Juan-José Nieto, Jordi Luque, Ferrán Diego, Carlos Segura, Diego Perino, Flavio Esposito, Fulvio Risso, Aravindh Raman

Abstract: Many real-time applications (e.g., Augmented/Virtual Reality, cognitive assistance) rely on Deep Neural Networks (DNNs) to process inference tasks. Edge computing is considered a key infrastructure to deploy such applications, as moving computation close to the data sources enables us to meet stringent latency and throughput requirements. However, the constrained nature of edge networks poses several additional challenges to the management of inference workloads: edge clusters cannot provide unlimited processing power to DNN models, and often a trade-off between network and processing time should be considered when it comes to end-to-end delay requirements. In this paper, we focus on the problem of scheduling inference queries on DNN models in edge networks at short timescales (i.e., a few milliseconds). By means of simulations, we analyze several policies in the realistic network settings and workloads of a large ISP, highlighting the need for a dynamic scheduling policy that can adapt to network conditions and workloads. We therefore design ASET, a Reinforcement Learning based scheduling algorithm able to adapt its decisions according to the system conditions. Our results show that ASET effectively provides the best performance compared to static policies when scheduling over a distributed pool of edge resources.

Submitted 31 January, 2023; originally announced January 2023.

arXiv:2212.14112 [pdf]

Simultaneous control of spectral and directional emissivity with gradient epsilon-near-zero InAs photonic structures

Authors: Jae Seung Hwang, Jin Xu, Aaswath P. Raman

Abstract: Controlling both the spectral bandwidth and directional range of emitted thermal radiation is a fundamental challenge in modern photonics and materials research. Recent work has shown that materials with a spatial gradient in their epsilon near zero response can support broad spectrum directionality in their emissivity, enabling high radiance to specific angles of incidence. However, this capability has been limited spectrally and directionally by the availability of materials supporting phonon-polariton resonances over long-wave infrared wavelengths. Here, we design and experimentally demonstrate an approach using doped III-V semiconductors that can simultaneously tailor spectral peak, bandwidth and directionality of infrared emissivity. We epitaxially grow and characterize InAs-based gradient ENZ photonic structures that exhibit broadband directional emission with varying spectral bandwidths and peak directions as a function of their doping concentration profile and thickness. Due to its easy-to-fabricate geometry we believe this approach provides a versatile photonic platform to dynamically control broadband spectral and directional emissivity for a range of emerging applications.

Submitted 16 December, 2022; originally announced December 2022.

Comments: 15 pages, 5 figures

arXiv:2211.11765 [pdf, other]

GalaxyFlow: Upsampling Hydrodynamical Simulations for Realistic Gaia Mock Catalogs

Authors: Sung Hak Lim, Kailash A. Raman, Matthew R. Buckley, David Shih

Abstract: Cosmological N-body simulations of galaxies operate at the level of "star particles" with a mass resolution on the scale of thousands of solar masses. Turning these simulations into stellar mock catalogs requires "upsampling" the star particles into individual stars following the same phase-space density. In this paper, we demonstrate that normalizing flows provide a viable upsampling method that greatly improves on conventionally-used kernel smoothing algorithms such as EnBiD. We demonstrate our flow-based upsampling technique, dubbed GalaxyFlow, on a neighborhood of the Solar location in two simulated galaxies: Auriga 6 and h277. By eye, GalaxyFlow produces stellar distributions that are smoother than EnBiD-based methods and more closely match the Gaia DR3 catalog. For a quantitative comparison of generative model performance, we introduce a novel multi-model classifier test. Using this classifier test, we show that GalaxyFlow more accurately estimates the density of the underlying star particles than previous methods.

Submitted 21 November, 2022; originally announced November 2022.

Comments: 17 pages, 11 figures

arXiv:2210.11697 [pdf, other]

Optimal Pose Estimation and Covariance Analysis with Simultaneous Localization and Mapping Applications

Authors: Saeed Maleki, Adhiti Raman, Yang Cheng, John Crassidis, Matthias Schmid

Abstract: This work provides a theoretical analysis for optimally solving the pose estimation problem using total least squares for vector observations from landmark features, which is central to applications involving simultaneous localization and mapping. First, the optimization process is formulated with observation vectors extracted from point-cloud features. Then, error-covariance expressions are derived. The attitude and position estimates obtained via the derived optimization process are proven to reach the bounds defined by the Cramér-Rao lower bound under the small-angle approximation of attitude errors. A fully populated observation noise-covariance matrix is assumed as the weight in the cost function to cover the most general case of the sensor uncertainty. This includes more generic correlations in the errors than previous cases involving an isotropic noise assumption. The proposed solution is verified using Monte Carlo simulations and an experiment with an actual LIDAR to validate the error-covariance analysis.

Submitted 20 October, 2022; originally announced October 2022.

Comments: arXiv admin note: substantial text overlap with arXiv:2106.11522

arXiv:2209.15434 [pdf]

DeepAdjoint: An All-in-One Photonic Inverse Design Framework Integrating Data-Driven Machine Learning with Optimization Algorithms

Authors: Christopher Yeung, Benjamin Pham, Ryan Tsai, Katherine T. Fountaine, Aaswath P. Raman

Abstract: In recent years, hybrid design strategies combining machine learning (ML) with electromagnetic optimization algorithms have emerged as a new paradigm for the inverse design of photonic structures and devices. While a trained, data-driven neural network can rapidly identify solutions near the global optimum with a given dataset's design space, an iterative optimization algorithm can further refine the solution and overcome dataset limitations. Furthermore, such hybrid ML-optimization methodologies can reduce computational costs and expedite the discovery of novel electromagnetic components. However, existing hybrid ML-optimization methods have yet to optimize across both materials and geometries in a single integrated and user-friendly environment. In addition, due to the challenge of acquiring large datasets for ML, as well as the exponential growth of isolated models being trained for photonics design, there is a need to standardize the ML-optimization workflow while making the pre-trained models easily accessible. Motivated by these challenges, here we introduce DeepAdjoint, a general-purpose, open-source, and multi-objective "all-in-one" global photonics inverse design application framework which integrates pre-trained deep generative networks with state-of-the-art electromagnetic optimization algorithms such as the adjoint variables method. DeepAdjoint allows a designer to specify an arbitrary optical design target, then obtain a photonic structure that is robust to fabrication tolerances and possesses the desired optical properties - all within a single user-guided application interface. Our framework thus paves a path towards the systematic unification of ML and optimization algorithms for photonic inverse design.

Submitted 28 September, 2022; originally announced September 2022.

arXiv:2209.08303 [pdf, ps, other]

Limitations to Realize Quantum Zeno Effect in Beam Splitter Array -- a Monte Carlo Wavefunction Analysis

Authors: Nilakantha Meher, Akhil Raman, S. Sivakumar

Abstract: Effects of non-ideal optical components in realizing quantum Zeno effect in an all-optical setup are analyzed. Beam splitters are the important components in this experimental configuration. Nonuniform transmission coefficient, photon absorption and thermal noise are considered. Numerical simulation of the experiment is performed using the Monte Carlo wavefunction method. It is argued that there is an optimal number of beam splitters to be used for maximizing the expected output in the experiment.

Submitted 17 September, 2022; originally announced September 2022.

Comments: To be published in the Journal of the Physical Society of Japan

arXiv:2209.04447 [pdf]

Hybrid Supervised and Reinforcement Learning for the Design and Optimization of Nanophotonic Structures

Authors: Christopher Yeung, Benjamin Pham, Zihan Zhang, Katherine T. Fountaine, Aaswath P. Raman

Abstract: From higher computational efficiency to enabling the discovery of novel and complex structures, deep learning has emerged as a powerful framework for the design and optimization of nanophotonic circuits and components. However, both data-driven and exploration-based machine learning strategies have limitations in their effectiveness for nanophotonic inverse design. Supervised machine learning approaches require large quantities of training data to produce high-performance models and have difficulty generalizing beyond training data given the complexity of the design space. Unsupervised and reinforcement learning-based approaches on the other hand can have very lengthy training or optimization times associated with them. Here we demonstrate a hybrid supervised learning and reinforcement learning approach to the inverse design of nanophotonic structures and show this approach can reduce training data dependence, improve the generalizability of model predictions, and shorten exploratory training times by orders of magnitude. The presented strategy thus addresses a number of contemporary deep learning-based challenges, while opening the door for new design methodologies that leverage multiple classes of machine learning algorithms to produce more effective and practical solutions for photonic design.

Submitted 8 September, 2022; originally announced September 2022.

arXiv:2209.03951 [pdf, other]

doi 10.1103/PhysRevApplied.19.034037

Temporal coupled-mode theory for thermal emission from multiple arbitrarily coupled resonators

Authors: Xin Huang, Christopher Yeung, Aaswath P. Raman

Abstract: Controlling the spectral response of thermal emitters has become increasingly important for a range of energy and sensing applications. Conventional approaches to achieving arbitrary spectrum selectivity in photonic systems have entailed combining multiple resonantly emissive elements together to achieve a range of spectral profiles through numerical optimization, with a universal theoretical framework lacking. Here, we develop a temporal coupled mode theory for thermal emission from multiple, arbitrarily coupled resonators. We validate our theory against numerical simulations of complex two- and three-dimensional nanophotonic thermal emitters, highlighting the anomalous thermal emission spectra that can emerge when multiple resonators with arbitrary properties couple to each other with varying strengths.

Submitted 8 September, 2022; originally announced September 2022.

arXiv:2209.01260 [pdf, other]

A Failure Identification and Recovery Framework for a Planar Reconfigurable Cable Driven Parallel Robot

Authors: Adhiti Raman, Ian Walker, Venkat Krovi, Matthias Schmid

Abstract: In cable driven parallel robots (CDPRs), a single cable malfunction usually induces complete failure of the entire robot. However, the lost static workspace (due to failure) can often be recovered through reconfiguration of the cable attachment points on the frame. This capability is introduced by adding kinematic redundancies to the robot in the form of moving linear sliders that are manipulated in a real-time redundancy resolution controller. The presented work combines this controller with an online failure detection framework to develop a complete fault tolerant control scheme for automatic task recovery. This solution provides robustness by combining pose estimation of the end-effector with the failure detection through the application of an Interactive Multiple Model (IMM) algorithm relying only on end-effector information. The failure and pose estimation scheme is then tied into the redundancy resolution approach to produce a seamless automatic task (trajectory) recovery approach for cable failures.

Submitted 2 September, 2022; originally announced September 2022.

arXiv:2208.12666 [pdf, other]

Effectiveness of Mining Audio and Text Pairs from Public Data for Improving ASR Systems for Low-Resource Languages

Authors: Kaushal Santosh Bhogale, Abhigyan Raman, Tahir Javed, Sumanth Doddapaneni, Anoop Kunchukuttan, Pratyush Kumar, Mitesh M. Khapra

Abstract: End-to-end (E2E) models have become the default choice for state-of-the-art speech recognition systems. Such models are trained on large amounts of labelled data, which are often not available for low-resource languages. Techniques such as self-supervised learning and transfer learning hold promise, but have not yet been effective in training accurate models. On the other hand, collecting labelled datasets on a diverse set of domains and speakers is very expensive. In this work, we demonstrate an inexpensive and effective alternative to these approaches by "mining" text and audio pairs for Indian languages from public sources, specifically from the public archives of All India Radio. As a key component, we adapt the Needleman-Wunsch algorithm to align sentences with corresponding audio segments given a long audio and a PDF of its transcript, while being robust to errors due to OCR, extraneous text, and non-transcribed speech. We thus create Shrutilipi, a dataset which contains over 6,400 hours of labelled audio across 12 Indian languages totalling 4.95M sentences. On average, Shrutilipi results in a 2.3x increase over publicly available labelled data. We establish the quality of Shrutilipi with 21 human evaluators across the 12 languages. We also establish the diversity of Shrutilipi in terms of represented regions, speakers, and mentioned named entities. Significantly, we show that adding Shrutilipi to the training set of Wav2Vec models leads to an average decrease in WER of 5.8% for 7 languages on the IndicSUPERB benchmark. For Hindi, which has the most benchmarks (7), the average WER falls from 18.8% to 13.5%. This improvement extends to efficient models: We show a 2.3% drop in WER for a Conformer model (10x smaller than Wav2Vec). Finally, we demonstrate the diversity of Shrutilipi by showing that the model trained with it is more robust to noisy input.
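Illustrative note (not part of the listed abstract): the abstract above names the Needleman-Wunsch algorithm as the basis of its sentence-to-audio alignment. The block below is only a minimal sketch of the textbook global-alignment score over two token sequences. It is not the authors' adapted variant, and the function name `needleman_wunsch_score` and the scoring values (`match`, `mismatch`, `gap`) are assumptions chosen for illustration.

```python
def needleman_wunsch_score(a, b, match=1, mismatch=-1, gap=-1):
    """Return the optimal global alignment score of sequences a and b.

    Textbook dynamic programme; the scoring values are illustrative
    assumptions, not taken from the listed paper.
    """
    rows, cols = len(a) + 1, len(b) + 1
    # score[i][j] is the best score for aligning a[:i] with b[:j].
    score = [[0] * cols for _ in range(rows)]

    # Aligning a prefix against an empty sequence costs one gap per item.
    for i in range(1, rows):
        score[i][0] = i * gap
    for j in range(1, cols):
        score[0][j] = j * gap

    for i in range(1, rows):
        for j in range(1, cols):
            pair = match if a[i - 1] == b[j - 1] else mismatch
            diagonal = score[i - 1][j - 1] + pair  # align a[i-1] with b[j-1]
            up = score[i - 1][j] + gap             # gap inserted in b
            left = score[i][j - 1] + gap           # gap inserted in a
            score[i][j] = max(diagonal, up, left)

    return score[-1][-1]


if __name__ == "__main__":
    transcript = ["the", "news", "at", "nine"]
    recognised = ["the", "news", "nine"]
    print(needleman_wunsch_score(transcript, recognised))
```

Because the function accepts any indexable sequence, the same sketch works on words, characters, or sentence identifiers; recovering the alignment itself would additionally require a traceback through the score table.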

Submitted 26 August, 2022; originally announced August 2022.

arXiv:2208.11761 [pdf, other]

IndicSUPERB: A Speech Processing Universal Performance Benchmark for Indian languages

Authors: Tahir Javed, Kaushal Santosh Bhogale, Abhigyan Raman, Anoop Kunchukuttan, Pratyush Kumar, Mitesh M. Khapra

Abstract: A cornerstone in AI research has been the creation and adoption of standardized training and test datasets to earmark the progress of state-of-the-art models. A particularly successful example is the GLUE dataset for training and evaluating Natural Language Understanding (NLU) models for English. The large body of research around self-supervised BERT-based language models revolved around performance improvements on NLU tasks in GLUE. To evaluate language models in other languages, several language-specific GLUE datasets were created. The area of speech language understanding (SLU) has followed a similar trajectory. The success of large self-supervised models such as wav2vec2 enables the creation of speech models with relatively easy to access unlabelled data. These models can then be evaluated on SLU tasks, such as the SUPERB benchmark. In this work, we extend this to Indic languages by releasing the IndicSUPERB benchmark. Specifically, we make the following three contributions. (i) We collect Kathbath containing 1,684 hours of labelled speech data across 12 Indian languages from 1,218 contributors located in 203 districts in India. (ii) Using Kathbath, we create benchmarks across 6 speech tasks: Automatic Speech Recognition, Speaker Verification, Speaker Identification (mono/multi), Language Identification, Query By Example, and Keyword Spotting for 12 languages. (iii) On the released benchmarks, we train and evaluate different self-supervised models alongside a commonly used baseline FBANK. We show that language-specific fine-tuned models are more accurate than baseline on most of the tasks, including a large gap of 76% for the Language Identification task. However, for speaker identification, self-supervised models trained on large datasets demonstrate an advantage. We hope IndicSUPERB contributes to the progress of developing speech language understanding models for Indian languages.

Submitted 15 December, 2022; v1 submitted 24 August, 2022; originally announced August 2022.

arXiv:2208.05877 [pdf, other]

doi 10.1145/3544216.3544232

Design and Evaluation of IPFS: A Storage Layer for the Decentralized Web

Authors: Dennis Trautwein, Aravindh Raman, Gareth Tyson, Ignacio Castro, Will Scott, Moritz Schubotz, Bela Gipp, Yiannis Psaras

Abstract: Recent years have witnessed growing consolidation of web operations. For example, the majority of web traffic now originates from a few organizations, and even micro-websites often choose to host on large pre-existing cloud infrastructures. In response to this, the "Decentralized Web" attempts to distribute ownership and operation of web services more evenly. This paper describes the design and implementation of the largest and most widely used Decentralized Web platform - the InterPlanetary File System (IPFS) - an open-source, content-addressable peer-to-peer network that provides distributed data storage and delivery. IPFS has millions of daily content retrievals and already underpins dozens of third-party applications. This paper evaluates the performance of IPFS by introducing a set of measurement methodologies that allow us to uncover the characteristics of peers in the IPFS network. We reveal presence in more than 2700 Autonomous Systems and 152 countries, the majority of which operate outside large central cloud providers like Amazon or Azure. We further evaluate IPFS performance, showing that both publication and retrieval delays are acceptable for a wide range of use cases. Finally, we share our datasets, experiences and lessons learned.

Submitted 11 August, 2022; originally announced August 2022.

Comments: 14 pages, 11 figures

ACM Class: C.2.2; C.2.1

Journal ref: SIGCOMM '22, August 22-26, 2022, Amsterdam, Netherlands

arXiv:2208.05381 [pdf, other]

doi 10.1109/TNSM.2022.3153452

DSM-MoC as Baseline: Reliability Assurance via Redundant Cellular Connectivity in Connected Cars

Authors: Emeka Obiodu, Aravindh Raman, Abdullahi Abubakar, Simone Mangiante, Nishanth Sastry, Hamid Aghvami

Abstract: Connected Cars (CCs) and vehicle-to-everything (V2X) use cases require stringent reliability for safety and non-safety uses. With increasing network softwarisation, it has become easier to use multiple, redundant connectivity options instead of relying on a single network connectivity. But where should these redundant connections be managed? Is it at a network provider's core network - i.e. supply side managed (SSM) - or at the CC - i.e. demand side managed (DSM)? In our work, we investigate the use of SSM and DSM for CCs on four separate days and across 800 kilometers of major / minor roads in South East England. For Day 1, we captured performance indicators, and determined hypothetical multi-operator configurations for four UK providers and a global Universal SIM. For Days 2, 3 & 4, we built and deployed a test-bed to actually implement network switching and understand performance (incl. for TCP & UDP) either on the road or in a stationary location. Based on our results, we make three contributions. First, we show that DSM can deliver superior performance for CCs more than any individual network (up to 28 percentage points in a hypothetical scenario), or SSM which had up to 4.8x longer page load times. Second, unlike other smartphone-only studies, our system-level study demonstrates that improvements of at least 12% can be obtained in a practical DSM field implementation for a CC. Third, we confirm that the advantage of DSM in a field implementation is higher for UDP traffic (23% better latency) compared to TCP (13%).

Submitted 10 August, 2022; originally announced August 2022.

Journal ref: IEEE Transactions on Network and Service Management, 2022

arXiv:2207.10169 [pdf]

Pediatric Bone Age Assessment using Deep Learning Models

Authors: Aravinda Raman, Sameena Pathan, Tanweer Ali

Abstract: Bone age assessment (BAA) is a standard method for determining the age difference between skeletal and chronological age. Manual processes are complicated and require specialist expertise. This is where deep learning comes into play. In this study, pre-trained models like VGG-16, InceptionV3, XceptionNet, and MobileNet are used to assess the bone age of the input data, and their mean average errors are compared and evaluated to see which model predicts the best.

Submitted 20 July, 2022; originally announced July 2022.

Comments: 18 pages, 28 figures, 1 table

arXiv:2204.12709 [pdf, other]

Toxicity in the Decentralized Web and the Potential for Model Sharing

Authors: Haris Bin Zia, Aravindh Raman, Ignacio Castro, Ishaku Hassan Anaobi, Emiliano De Cristofaro, Nishanth Sastry, Gareth Tyson

Abstract: The "Decentralised Web" (DW) is an evolving concept, which encompasses technologies aimed at providing greater transparency and openness on the web. The DW relies on independent servers (aka instances) that mesh together in a peer-to-peer fashion to deliver a range of services (e.g. micro-blogs, image sharing, video streaming). However, toxic content moderation in this decentralised context is challenging. This is because there is no central entity that can define toxicity, nor a large central pool of data that can be used to build universal classifiers. It is therefore unsurprising that there have been several high-profile cases of the DW being misused to coordinate and disseminate harmful material. Using a dataset of 9.9M posts from 117K users on Pleroma (a popular DW microblogging service), we quantify the presence of toxic content. We find that toxic content is prevalent and spreads rapidly between instances. We show that automating per-instance content moderation is challenging due to the lack of sufficient training data available and the effort required in labelling. We therefore propose and evaluate ModPair, a model sharing system that effectively detects toxic content, achieving an average per-instance macro-F1 score of 0.89.

Submitted 27 April, 2022; originally announced April 2022.

Journal ref: Published in the Proceedings of the 2022 ACM International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS'22). Please cite accordingly

arXiv:2202.04185 [pdf, other]

OSM-tree: A Sortedness-Aware Index

Authors: Aneesh Raman, Subhadeep Sarkar, Matthaios Olma, Manos Athanassoulis

Abstract: Indexes facilitate efficient querying when the selection predicate is on an indexed key. As a result, when loading data, if we anticipate future selective (point or range) queries, we typically maintain an index that is gradually populated as new data is ingested. In that respect, indexing can be perceived as the process of adding structure to an incoming, otherwise unsorted, data collection. The process of adding structure comes at a cost, as instead of simply appending incoming data, every new entry is inserted into the index. If the data ingestion order matches the indexed attribute order, the ingestion cost is entirely redundant and can be avoided (e.g., via bulk loading in a B+-tree). However, state-of-the-art index designs do not benefit when data is ingested in an order that is close to being sorted but not fully sorted. In this paper, we study how indexes can benefit from partial data sortedness or near-sortedness, and we propose an ensemble of techniques that combine bulk loading, index appends, variable node fill/split factor, and buffering, to optimize the ingestion cost of a tree index in presence of partial data sortedness. We further augment the proposed design with necessary metadata structures to ensure competitive read performance. We apply the proposed design paradigm on a state-of-the-art B+-tree, and we propose the Ordered Sort-Merge tree (OSM-tree). OSM-tree outperforms the state of the art by up to 8.8x in ingestion performance in the presence of sortedness, while falling back to a B+-tree's ingestion performance when data is scrambled. OSM-tree offers competitive query performance, leading to performance benefits between 28% and 5x for mixed read/write workloads.

Submitted 8 February, 2022; originally announced February 2022.

arXiv:2201.00286 [pdf, ps, other]

Reinforcement Learning for Task Specifications with Action-Constraints

Authors: Arun Raman, Keerthan Shagrithaya, Shalabh Bhatnagar

Abstract: In this paper, we use concepts from supervisory control theory of discrete event systems to propose a method to learn optimal control policies for a finite-state Markov Decision Process (MDP) in which (only) certain sequences of actions are deemed unsafe (respectively safe). We assume that the set of action sequences that are deemed unsafe and/or safe are given in terms of a finite-state automaton; and propose a supervisor that disables a subset of actions at every state of the MDP so that the constraints on action sequence are satisfied. Then we present a version of the Q-learning algorithm for learning optimal policies in the presence of non-Markovian action-sequence and state constraints, where we use the development of reward machines to handle the state constraints. We illustrate the method using an example that captures the utility of automata-based methods for non-Markovian state and action specifications for reinforcement learning and show the results of simulations in this setting.

Submitted 1 January, 2022; originally announced January 2022.

arXiv:2111.03945 [pdf, other]

Towards Building ASR Systems for the Next Billion Users

Authors: Tahir Javed, Sumanth Doddapaneni, Abhigyan Raman, Kaushal Santosh Bhogale, Gowtham Ramesh, Anoop Kunchukuttan, Pratyush Kumar, Mitesh M. Khapra

Abstract: Recent methods in speech and language technology pretrain very LARGE models which are fine-tuned for specific tasks. However, the benefits of such LARGE models are often limited to a few resource rich languages of the world. In this work, we make multiple contributions towards building ASR systems for low resource languages from the Indian subcontinent. First, we curate 17,000 hours of raw speech data for 40 Indian languages from a wide variety of domains including education, news, technology, and finance. Second, using this raw speech data we pretrain several variants of wav2vec style models for 40 Indian languages. Third, we analyze the pretrained models to find key features: codebook vectors of similar sounding phonemes are shared across languages, representations across layers are discriminative of the language family, and attention heads often pay attention within small local windows. Fourth, we fine-tune this model for downstream ASR for 9 languages and obtain state-of-the-art results on 3 public datasets, including on very low-resource languages such as Sinhala and Nepali. Our work establishes that multilingual pretraining is an effective strategy for building ASR systems for the linguistically diverse speakers of the Indian subcontinent. Our code, data and models are available publicly at https://indicnlp.ai4bharat.org/indicwav2vec/ and we hope they will help advance research in ASR for Indic languages.

Submitted 22 December, 2021; v1 submitted 6 November, 2021; originally announced November 2021.

arXiv:2110.13500 [pdf, other]

Exploring Content Moderation in the Decentralised Web: The Pleroma Case

Authors: Anaobi Ishaku Hassan, Aravindh Raman, Ignacio Castro, Haris Bin Zia, Emiliano De Cristofaro, Nishanth Sastry, Gareth Tyson

Abstract: Decentralising the Web is a desirable but challenging goal. One particular challenge is achieving decentralised content moderation in the face of various adversaries (e.g. trolls). To overcome this challenge, many Decentralised Web (DW) implementations rely on federation policies. Administrators use these policies to create rules that ban or modify content that matches specific rules. This, however, can have unintended consequences for many users. In this paper, we present the first study of federation policies on the DW, their in-the-wild usage, and their impact on users. We identify how these policies may negatively impact "innocent" users and outline possible solutions to avoid this problem in the future.

Submitted 30 October, 2021; v1 submitted 26 October, 2021; originally announced October 2021.

Journal ref: Proceedings of the 17th International Conference on emerging Networking EXperiments and Technologies (ACM CoNext 2021)

arXiv:2109.14886 [pdf]

doi 10.1021/acsphotonics.1c01636

Enhancing Adjoint Optimization-based Photonics Inverse Design with Explainable Machine Learning

Authors: Christopher Yeung, David Ho, Benjamin Pham, Katherine T. Fountaine, Aaswath P. Raman

Abstract: A fundamental challenge in the design of photonic devices, and electromagnetic structures more generally, is the optimization of their overall architecture to achieve a desired response. To this end, topology or shape optimizers based on the adjoint variables method have been widely adopted due to their high computational efficiency and ability to create complex freeform geometries. However, the functional understanding of such freeform structures remains a black box. Moreover, unless a design space of high-performance devices is known in advance, such gradient-based optimizers can get trapped in local minima valleys or saddle points, which limits performance achievable through this inverse design process. To elucidate the relationships between device performance and nanoscale structuring while mitigating the effects of local minima trapping, we present an inverse design framework that combines adjoint optimization, automated machine learning (AutoML), and explainable artificial intelligence (XAI). Integrated with a numerical electromagnetics simulation method, our framework reveals structural contributions towards a figure-of-merit (FOM) of interest. Through an explanation-based reoptimization process, this information is then leveraged to minimize the FOM further than that obtained through adjoint optimization alone, thus overcoming the optimization's local minima.
We demonstrate our framework in the context of waveguide design and achieve between 39% and 74% increases in device performance relative to state-of-the-art adjoint optimization-based inverse design across a range of telecom wavelengths. Results of this work therefore highlight machine learning strategies that can substantially extend and enhance the capabilities of a conventional, optimization-based inverse design algorithm while revealing deeper insights into the algorithm's designs.

Submitted 25 April, 2022; v1 submitted 30 September, 2021; originally announced September 2021.

arXiv:2108.07952 [pdf]

Accurately Quantifying Radiative Cooling Potentials: A Temperature-correction to the Transmittance-based approximation

Authors: Jyotirmoy Mandal, Xin Huang, Aaswath P. Raman

Abstract: Theoretical calculations of the cooling potential of radiative cooling materials are crucial for determining their cooling capability under different meteorological conditions and evaluating their performance. To enable these calculations, accurate models of long-wave infrared downwelling atmospheric irradiance are needed. However, the transmittance-based cosine approximation, which is widely used to determine radiative cooling potentials, does not account for the cooling potential arising from heat loss to the colder reaches of the atmosphere itself. Here, we show that use of the approximation can lead to > 10% underestimation of the cooling potential relative to MODTRAN 6 outputs. We propose a temperature correction to the transmittance-based approximation which accounts for heat loss to the cold upper atmosphere, and significantly reduces this underestimation, while retaining the advantages of the original model. In light of the widespread and continued use of the transmittance-based model, our results highlight an important source of potential errors and a means to correct for them.

Submitted 17 August, 2021; originally announced August 2021.

arXiv:2106.14328 [pdf]

Nanostructured Plasmonic Metal Surfaces as Optical Components for Infrared Imaging and Sensing

Authors: Jyotirmoy Mandal, John Brewer, Sagar Mandal, Aaswath P. Raman

Abstract: Thermal imaging and sensing technologies offer critical information about our thermally radiant world, and in recent years, have seen dramatic increases in usage for a range of applications. However, the cost and technical finesse of manufacturing infrared optical components remain a major barrier towards the democratization of these technologies. In this report, we present a solution-processed plasmonic reflective filter (PRF) as a scalable and inexpensive thermal infrared optic. The PRF selectively absorbs sunlight and specularly reflects thermal infrared (TIR) wavelengths with performance comparable to state-of-the-art TIR optics made of materials like Germanium. Unlike traditional infrared optical components, however, the PRF can be conveniently fabricated using inexpensive materials and a dip-and-dry chemical synthesis technique, and crucially, has manufacturing costs that are orders of magnitude lower. We experimentally demonstrate the core optical functionality of the PRF, as well as its integration into infrared imaging and sensing systems without compromising their thermographic or radiometric capabilities. From a practical standpoint, the inexpensive and convenient fabricability of the PRF represents a significant advance towards making the benefits of thermal imaging and sensing systems more affordable and accessible. Scientifically, our work demonstrates a previously unexplored optical functionality and a new direction for versatile chemical synthesis in designing optical components.

Submitted 27 June, 2021; originally announced June 2021.

arXiv:2106.05184 [pdf, other]

Jettisoning Junk Messaging in the Era of End-to-End Encryption: A Case Study of WhatsApp

Authors: Pushkal Agarwal, Aravindh Raman, Damilola Ibosiola, Gareth Tyson, Nishanth Sastry, Kiran Garimella

Abstract: WhatsApp is a popular messaging app used by over a billion users around the globe. Due to this popularity, understanding misbehavior on WhatsApp is an important issue. The sending of unwanted junk messages by unknown contacts via WhatsApp remains understudied by researchers, in part because of the end-to-end encryption offered by the platform. We address this gap by studying junk messaging on a multilingual dataset of 2.6M messages sent to 5K public WhatsApp groups in India. We characterise both junk content and senders. We find that nearly 1 in 10 messages is unwanted content sent by junk senders, and a number of unique strategies are employed to reflect challenges faced on WhatsApp, e.g., the need to change phone numbers regularly. We finally experiment with on-device classification to automate the detection of junk, whilst respecting end-to-end encryption.

Submitted 12 February, 2022; v1 submitted 8 June, 2021; originally announced June 2021.

Comments: A preprint of an accepted publication at The Web Conference (WWW) 2022

arXiv:2106.03558 [pdf, other]

doi 10.1021/acs.nanolett.1c03273

Multi-scale photonic emissivity engineering for relativistic lightsail thermal regulation

Authors: John Brewer, Matthew F. Campbell, Pawan Kumar, Sachin Kulkarni, Deep Jariwala, Igor Bargatin, Aaswath P. Raman

Abstract: The Breakthrough Starshot Initiative aims to send a gram-scale probe to Proxima Centauri b using a laser-accelerated lightsail traveling at relativistic speeds. Thermal management is a key lightsail design objective because of the intense laser powers required but has generally been considered secondary to accelerative performance. Here, we demonstrate nanophotonic photonic crystal slab reflectors composed of 2H-phase molybdenum disulfide and crystalline silicon nitride, highlight the inverse relationship between the thermal band extinction coefficient and the lightsail's maximum temperature, and examine the trade-off between the acceleration distance and setting realistic sail thermal limits, ultimately realizing a thermally endurable acceleration minimum distance of 16.3 Gm. We additionally demonstrate multi-scale photonic structures featuring thermal-wavelength-scale Mie resonant geometries, and characterize their broadband Mie resonance-driven emissivity enhancement and acceleration distance reduction. Our results highlight new possibilities in simultaneously controlling optical and thermal response over broad wavelength ranges in ultralight nanophotonic structures.

Submitted 14 September, 2021; v1 submitted 3 June, 2021; originally announced June 2021.

Comments: 15 pages, 4 figures

arXiv:2105.10849 [pdf, other]

doi 10.1021/acs.nanolett.1c03272

Relativistic light sails need to billow

Authors: Matthew F. Campbell, John Brewer, Deep Jariwala, Aaswath Raman, Igor Bargatin

Abstract: We argue that light sails that are rapidly accelerated to relativistic velocities by lasers must be significantly curved in order to reduce their mechanical stresses and avoid tears. Using an integrated opto-thermo-mechanical model, we show that the diameter and radius of curvature of a circular light sail should be comparable in magnitude, both on the order of a few meters in optimal designs for gram-scale payloads. Moreover, when sufficient laser power is available, a sail's acceleration length decreases and its chip payload capacity increases as its curvature increases. Our findings provide guidance for emerging light sail design programs, which herald a new era of interstellar space exploration.

Submitted 22 May, 2021; originally announced May 2021.

Comments: 8 pages, 4 figures

arXiv:2105.03611 [pdf, other]

doi 10.1145/3458306.3460998

360NorVic: 360-Degree Video Classification from Mobile Encrypted Video Traffic

Authors: Chamara Kattadige, Aravindh Raman, Kanchana Thilakarathna, Andra Lutu, Diego Perino

Abstract: Streaming 360° video demands high bandwidth and low latency, and poses significant challenges to Internet Service Providers (ISPs) and Mobile Network Operators (MNOs). The identification of 360° video traffic can therefore help fixed and mobile carriers optimize their networks and provide better Quality of Experience (QoE) to the user. However, end-to-end encryption of network traffic has obstructed distinguishing 360° videos from regular videos. As a solution, this paper presents 360NorVic, a near-realtime and offline Machine Learning (ML) classification engine to distinguish 360° videos from regular videos when streamed from mobile devices. We collect packet and flow level data for over 800 video traces from YouTube & Facebook accounting for 200 unique videos under varying streaming conditions. Our results show that for near-realtime and offline classification at packet level, average accuracy exceeds 95%, and that for flow level, 360NorVic achieves more than 92% average accuracy. Finally, we pilot our solution in the commercial network of a large MNO showing the feasibility and effectiveness of 360NorVic in production settings.

Submitted 8 May, 2021; originally announced May 2021.

Comments: 7 pages, 15 figures, accepted in Workshop on Network and Operating System Support for Digital Audio and Video (NOSSDAV 21)

arXiv:2012.15790 [pdf]

doi 10.1002/adom.202100548

Global Inverse Design Across Multiple Photonic Structure Classes Using Generative Deep Learning

Authors: Christopher Yeung, Ryan Tsai, Benjamin Pham, Brian King, Yusaku Kawagoe, David Ho, Julia Liang, Mark W. Knight, Aaswath P. Raman

Abstract: Understanding how nano- or micro-scale structures and material properties can be optimally configured to attain specific functionalities remains a fundamental challenge. Photonic metasurfaces, for instance, can be spectrally tuned through material choice and structural geometry to achieve unique optical responses. However, existing numerical design methods require prior identification of specific material-structure combinations, or device classes, as the starting point for optimization. As such, a unified solution that simultaneously optimizes across materials and geometries has yet to be realized. To overcome these challenges, we present a global deep learning-based inverse design framework, where a conditional deep convolutional generative adversarial network is trained on colored images encoded with a range of material and structural parameters, including refractive index, plasma frequency, and geometric design. We demonstrate that, in response to target absorption spectra, the network can identify an effective metasurface in terms of its class, materials properties, and overall shape. Furthermore, the model can arrive at multiple design variants with distinct materials and structures that present nearly identical absorption spectra. Our proposed framework is thus an important step towards global photonics and materials design strategies that can identify combinations of device categories, material properties, and geometric parameters which algorithmically deliver a sought functionality.

Submitted 19 July, 2021; v1 submitted 31 December, 2020; originally announced December 2020.

arXiv:2009.08044 [pdf, other]

doi 10.1109/BigData50022.2020.9378270

Large-Scale Intelligent Microservices

Authors: Mark Hamilton, Nick Gonsalves, Christina Lee, Anand Raman, Brendan Walsh, Siddhartha Prasad, Dalitso Banda, Lucy Zhang, Mei Gao, Lei Zhang, William T. Freeman

Abstract: Deploying Machine Learning (ML) algorithms within databases is a challenge due to the varied computational footprints of modern ML algorithms and the myriad of database technologies each with its own restrictive syntax. We introduce an Apache Spark-based micro-service orchestration framework that extends database operations to include web service primitives. Our system can orchestrate web services across hundreds of machines and takes full advantage of cluster, thread, and asynchronous parallelism. Using this framework, we provide large scale clients for intelligent services such as speech, vision, search, anomaly detection, and text analysis. This allows users to integrate ready-to-use intelligence into any datastore with an Apache Spark connector. To eliminate the majority of overhead from network communication, we also introduce a low-latency containerized version of our architecture. Finally, we demonstrate that the services we investigate are competitive on a variety of benchmarks, and present two applications of this framework to create intelligent search engines, and real-time auto race analytics systems.

Submitted 2 December, 2021; v1 submitted 16 September, 2020; originally announced September 2020.

arXiv:2008.12107 [pdf]

doi 10.1063/5.0022667

Modeling and optimization of radiative cooling based thermoelectric generators

Authors: Bin Zhao, Gang Pei, Aaswath P. Raman

Abstract: The possibility of night-time power generation has recently stimulated interest in using the radiative sky cooling mechanism with thermoelectric generators (TEG). These passive, low-temperature difference devices have been shown to generate electricity at night with no active input of heat needed, instead using the ambient air itself as the heat source. Here, we optimize both the geometry and operating conditions of radiative cooling driven thermoelectric (RC-TE) generators. We determine the optimal operating conditions, including maximum power point and maximum efficiency point, by developing a combined thermal and electrical model. Our results show that the optimal operating condition results in larger power output than was previously expected. Moreover, we show that maximum power density occurs when the area ratio between cooler and P or N element reaches an optimal value. Finally, we perform a parametric study that takes account of environmental and structural parameters to improve the performance of the RC-TE device, including enhancing heat transfer between the hot surface and ambient air, suppressing the cooling loss of the radiative cooler, and optimizing the geometry of individual thermocouples. Our work identifies how to maximize the output of RC-TE devices and provides comprehensive guidance on making use of this new passive power generation method.

Submitted 21 July, 2020; originally announced August 2020.

Comments: 17 pages, 5 figures

arXiv:2008.00587 [pdf]

doi 10.1515/nanoph-2020-0549

Multiplexed Supercell Metasurface Design and Optimization with Tandem Residual Networks

Authors: Christopher Yeung, Ju-Ming Tsai, Brian King, Benjamin Pham, David Ho, Julia Liang, Mark W. Knight, Aaswath P. Raman

Abstract: Complex nanophotonic structures hold the potential to deliver exquisitely tailored optical responses for a range of applications. Metal-insulator-metal (MIM) metasurfaces arranged in supercells, for instance, can be tailored by geometry and material choice to exhibit a variety of absorption properties and resonant wavelengths. With this flexibility, however, comes a vast space of design possibilities that classical design paradigms struggle to effectively navigate. To overcome this challenge, here we demonstrate a tandem residual network approach to efficiently generate multiplexed supercells through inverse design. By using a training dataset with several thousand full-wave electromagnetic simulations in a design space of over three trillion possible designs, the deep learning model can accurately generate a wide range of complex supercell designs given a spectral target. Beyond inverse design, the presented approach can also be used to explore the structure-property relationships of broadband absorption and emission in such supercell configurations. Thus, this study demonstrates the feasibility of high-dimensional supercell inverse design with deep neural networks that is applicable to complex nanophotonic structures composed of multiple subunit elements that may exhibit coupling.

Submitted 14 December, 2020; v1 submitted 2 August, 2020; originally announced August 2020.

arXiv:2007.15406 [pdf]

doi 10.1109/MCOMSTD.2018.1800024

The Making of 5G: Building an End-to-End 5G-Enabled System

Authors: Idelkys Quintana-Ramirez, Anthony Tsiopoulos, Maria A Lema, Fragkiskos Sardis, Luis Sequeira, James Arias, Aravindh Raman, Ali Azam, Mischa Dohler

Abstract: This article documents one of the world's first standards-compliant pre-commercial end-to-end 5th generation (5G) systems. The focus is on a standardized 5G architecture which includes the underlying 3GPP components but also the ETSI Network Function Virtualization (NFV) management and orchestration capabilities. The truly innovative character of 5G enabling fundamental changes to architecture and implementation is discussed, along with details of monitoring and orchestration approaches that are deemed instrumental in unlocking the full potential of 5G. Finally, it is important to us to share the lessons learned which we hope are of use to industry and academia alike when building, deploying and testing emerging 5G systems.

Submitted 30 July, 2020; originally announced July 2020.

Journal ref: IEEE Communications Standards Magazine. Vol. 2, Issue 4, pp. 88-96, Dec. 2018

arXiv:2006.11931 [pdf]

Radiative Cooling and Thermoregulation in the Earth's Glow

Authors: Jyotirmoy Mandal, Sagar Mandal, John Brewer, Arvind Ramachandran, Aaswath Pattabhi Raman

Abstract: Passive radiative cooling involves a net radiative heat loss into the cold outer space through the atmospheric transmission windows. Due to its passive nature and net cooling effect, it is a promising alternative or complement to electrical cooling. For efficient radiative cooling of objects, an unimpeded view of the sky is ideal. However, the view of the sky is usually limited - for instance, the walls of buildings have >50% of their field of view subtended by the earth. Moreover, objects on earth become sources of heat under sunlight. Therefore, building walls with hot terrestrial objects in view experience reduced cooling or heating, even with materials optimized for heat loss into the sky. We show that by using materials with selective long-wavelength infrared (LWIR) emittances, vertical building facades experience higher cooling than achievable by using broadband thermal emitters like typical building envelopes. Intriguingly, this effect is pronounced in the summer and diminishes or even reverses during the winter, indicating a thermoregulation effect. The findings highlight a major opportunity to harness untapped energy savings in buildings.

Submitted 7 May, 2021; v1 submitted 21 June, 2020; originally announced June 2020.

Showing 1–50 of 75 results for author: Raman, A