-
The anonymization problem in social networks
Authors:
Rachel G. de Jong,
Mark P. J. van der Loo,
Frank W. Takes
Abstract:
In this paper we introduce a general version of the anonymization problem in social networks, in which the goal is to maximize the number of anonymous nodes by altering a given graph. We define three variants of this optimization problem, being full, partial and budgeted anonymization. In each, the objective is to maximize the number of k-anonymous nodes, i.e., nodes for which there are at least k…
▽ More
In this paper we introduce a general version of the anonymization problem in social networks, in which the goal is to maximize the number of anonymous nodes by altering a given graph. We define three variants of this optimization problem, being full, partial and budgeted anonymization. In each, the objective is to maximize the number of k-anonymous nodes, i.e., nodes for which there are at least k-1 equivalent nodes, according to a particular anonymity measure of structural node equivalence. We propose six new heuristic algorithms for solving the anonymization problem which we implement into the reusable ANO-NET computational framework. As a baseline, we use an edge sampling method introduced in previous work. Experiments on both graph models and 17 real-world network datasets result in three empirical findings. First, we demonstrate that edge deletion is the most effective graph alteration operation. Second, we compare four commonly used anonymity measures from the literature and highlight how the choice of anonymity measure has a tremendous effect on both the achieved anonymity as well as the difficulty of solving the anonymization problem. Third, we find that the proposed algorithms that preferentially delete edges with a larger effect on nodes at a structurally unique position consistently outperform heuristics solely based on network structure. With similar runtimes, our algorithms retain on average 17 times more edges, ensuring higher data utility after full anonymization. In the budgeted variant, they achieve 4.4 times more anonymous nodes than the baseline. This work lays important foundations for future development of algorithms for anonymizing social networks.
△ Less
Submitted 24 September, 2024;
originally announced September 2024.
-
Asking an AI for salary negotiation advice is a matter of concern: Controlled experimental perturbation of ChatGPT for protected and non-protected group discrimination on a contextual task with no clear ground truth answers
Authors:
R. Stuart Geiger,
Flynn O'Sullivan,
Elsie Wang,
Jonathan Lo
Abstract:
We conducted controlled experimental bias audits for four versions of ChatGPT, which we asked to recommend an opening offer in salary negotiations for a new hire. We submitted 98,800 prompts to each version, systematically varying the employee's gender, university, and major, and tested prompts in voice of each side of the negotiation: the employee versus employer. We find ChatGPT as a multi-model…
▽ More
We conducted controlled experimental bias audits for four versions of ChatGPT, which we asked to recommend an opening offer in salary negotiations for a new hire. We submitted 98,800 prompts to each version, systematically varying the employee's gender, university, and major, and tested prompts in voice of each side of the negotiation: the employee versus employer. We find ChatGPT as a multi-model platform is not robust and consistent enough to be trusted for such a task. We observed statistically significant salary offers when varying gender for all four models, although with smaller gaps than for other attributes tested. The largest gaps were different model versions and between the employee- vs employer-voiced prompts. We also observed substantial gaps when varying university and major, but many of the biases were not consistent across model versions. We tested for fictional and fraudulent universities and found wildly inconsistent results across cases and model versions. We make broader contributions to the AI/ML fairness literature. Our scenario and our experimental design differ from mainstream AI/ML auditing efforts in key ways. Bias audits typically test discrimination for protected classes like gender, which we contrast with testing non-protected classes of university and major. Asking for negotiation advice includes how aggressive one ought to be in a negotiation relative to known empirical salary distributions and scales, which is a deeply contextual and personalized task that has no objective ground truth to validate. These results raise concerns for the specific model versions we tested and ChatGPT as a multi-model platform in continuous development. Our epistemology does not permit us to definitively certify these models as either generally biased or unbiased on the attributes we test, but our study raises matters of concern for stakeholders to further investigate.
△ Less
Submitted 25 September, 2024; v1 submitted 23 September, 2024;
originally announced September 2024.
-
Experimental and computational study of ethanolamine ices at astrochemical conditions
Authors:
R Ramachandran,
Milan Sil,
Prasanta Gorai,
J K Meka,
S Pavithraa,
J -I Lo,
S -L Chou,
Y -J Wu,
P Janardhan,
B -M Cheng,
Anil Bhardwaj,
Vıctor M. Rivilla,
N J Mason,
B Sivaraman,
Ankan Das
Abstract:
Ethanolamine (NH2CH2CH2OH) has recently been identified in the molecular cloud G+0.693-0.027, situated in the SgrB2 complex in the Galactic center. However, its presence in other regions, and in particular in star-forming sites, is still elusive. Given its likely role as a precursor to simple amino acids, understanding its presence in the star-forming region is required. Here, we present the exper…
▽ More
Ethanolamine (NH2CH2CH2OH) has recently been identified in the molecular cloud G+0.693-0.027, situated in the SgrB2 complex in the Galactic center. However, its presence in other regions, and in particular in star-forming sites, is still elusive. Given its likely role as a precursor to simple amino acids, understanding its presence in the star-forming region is required. Here, we present the experimentally obtained temperature-dependent spectral features and morphological behavior of pure ethanolamine ices under astrochemical conditions in the 2 - 12 micro meter (MIR) and 120 - 230 nm (VUV) regions for the first time. These features would help in understanding its photochemical behavior. In addition, we present the first chemical models specifically dedicated to ethanolamine. These models include all the discussed chemical routes from the literature, along with the estimated binding energies and activation energies from quantum chemical calculations reported in this work. We have found that surface reactions: CH2OH + NH2CH2 --> NH2CH2CH2OH and NH2 + C2H4OH --> NH2CH2CH2OH in warmer regions (60-90 K) could play a significant role in the formation of ethanolamine. Our modeled abundance of ethanolamine complements the upper limit of ethanolamine column density estimated in earlier observations in hot core/corino regions. Furthermore, we provide a theoretical estimation of the rotational and distortional constants for various species (such as HNCCO, NH2CHCO, and NH2CH2CO) related to ethanolamine that have not been studied in existing literature. This study could be valuable for identifying these species in the future.
△ Less
Submitted 2 September, 2024;
originally announced September 2024.
-
Variational Potential Flow: A Novel Probabilistic Framework for Energy-Based Generative Modelling
Authors:
Junn Yong Loo,
Michelle Adeline,
Arghya Pal,
Vishnu Monn Baskaran,
Chee-Ming Ting,
Raphael C. -W. Phan
Abstract:
Energy based models (EBMs) are appealing for their generality and simplicity in data likelihood modeling, but have conventionally been difficult to train due to the unstable and time-consuming implicit MCMC sampling during contrastive divergence training. In this paper, we present a novel energy-based generative framework, Variational Potential Flow (VAPO), that entirely dispenses with implicit MC…
▽ More
Energy based models (EBMs) are appealing for their generality and simplicity in data likelihood modeling, but have conventionally been difficult to train due to the unstable and time-consuming implicit MCMC sampling during contrastive divergence training. In this paper, we present a novel energy-based generative framework, Variational Potential Flow (VAPO), that entirely dispenses with implicit MCMC sampling and does not rely on complementary latent models or cooperative training. The VAPO framework aims to learn a potential energy function whose gradient (flow) guides the prior samples, so that their density evolution closely follows an approximate data likelihood homotopy. An energy loss function is then formulated to minimize the Kullback-Leibler divergence between density evolution of the flow-driven prior and the data likelihood homotopy. Images can be generated after training the potential energy, by initializing the samples from Gaussian prior and solving the ODE governing the potential flow on a fixed time interval using generic ODE solvers. Experiment results show that the proposed VAPO framework is capable of generating realistic images on various image datasets. In particular, our proposed framework achieves competitive FID scores for unconditional image generation on the CIFAR-10 and CelebA datasets.
△ Less
Submitted 21 July, 2024;
originally announced July 2024.
-
IntentionNet: Map-Lite Visual Navigation at the Kilometre Scale
Authors:
Wei Gao,
Bo Ai,
Joel Loo,
Vinay,
David Hsu
Abstract:
This work explores the challenges of creating a scalable and robust robot navigation system that can traverse both indoor and outdoor environments to reach distant goals. We propose a navigation system architecture called IntentionNet that employs a monolithic neural network as the low-level planner/controller, and uses a general interface that we call intentions to steer the controller. The paper…
▽ More
This work explores the challenges of creating a scalable and robust robot navigation system that can traverse both indoor and outdoor environments to reach distant goals. We propose a navigation system architecture called IntentionNet that employs a monolithic neural network as the low-level planner/controller, and uses a general interface that we call intentions to steer the controller. The paper proposes two types of intentions, Local Path and Environment (LPE) and Discretised Local Move (DLM), and shows that DLM is robust to significant metric positioning and mapping errors. The paper also presents Kilo-IntentionNet, an instance of the IntentionNet system using the DLM intention that is deployed on a Boston Dynamics Spot robot, and which successfully navigates through complex indoor and outdoor environments over distances of up to a kilometre with only noisy odometry.
△ Less
Submitted 3 July, 2024;
originally announced July 2024.
-
Open Scene Graphs for Open World Object-Goal Navigation
Authors:
Joel Loo,
Zhanxin Wu,
David Hsu
Abstract:
How can we build robots for open-world semantic navigation tasks, like searching for target objects in novel scenes? While foundation models have the rich knowledge and generalisation needed for these tasks, a suitable scene representation is needed to connect them into a complete robot system. We address this with Open Scene Graphs (OSGs), a topo-semantic representation that retains and organises…
▽ More
How can we build robots for open-world semantic navigation tasks, like searching for target objects in novel scenes? While foundation models have the rich knowledge and generalisation needed for these tasks, a suitable scene representation is needed to connect them into a complete robot system. We address this with Open Scene Graphs (OSGs), a topo-semantic representation that retains and organises open-set scene information for these models, and has a structure that can be configured for different environment types. We integrate foundation models and OSGs into the OpenSearch system for Open World Object-Goal Navigation, which is capable of searching for open-set objects specified in natural language, while generalising zero-shot across diverse environments and embodiments. Our OSGs enhance reasoning with Large Language Models (LLM), enabling robust object-goal navigation outperforming existing LLM approaches. Through simulation and real-world experiments, we validate OpenSearch's generalisation across varied environments, robots and novel instructions.
△ Less
Submitted 2 July, 2024;
originally announced July 2024.
-
A systematic comparison of measures for k-anonymity in networks
Authors:
Rachel G. de Jong,
Mark P. J. van der Loo,
Frank W. Takes
Abstract:
Privacy-aware sharing of network data is a difficult task due to the interconnectedness of individuals in networks. An important part of this problem is the inherently difficult question of how in a particular situation the privacy of an individual node should be measured. To that end, in this paper we propose a set of aspects that one should consider when choosing a measure for privacy. These asp…
▽ More
Privacy-aware sharing of network data is a difficult task due to the interconnectedness of individuals in networks. An important part of this problem is the inherently difficult question of how in a particular situation the privacy of an individual node should be measured. To that end, in this paper we propose a set of aspects that one should consider when choosing a measure for privacy. These aspects include the type of desired privacy and attacker scenario against which the measure protects, utility of the data, the type of desired output, and the computational complexity of the chosen measure. Based on these aspects, we provide a systematic overview of existing approaches in the literature. We then focus on a set of measures that ultimately enables our objective: sharing the anonymized full network dataset with limited disclosure risk. The considered measures, each based on the concept of k-anonymity, account for the structure of the surroundings of a certain node and differ in completeness and reach of the structural information taken into account. We present a comprehensive theoretical characterization as well as comparative empirical experiments on a wide range of real-world network datasets with up to millions of edges. We find that the choice of the measure has an enormous effect on aforementioned aspects. Most interestingly, we find that the most effective measures consider a greater node vicinity, yet utilize minimal structural information and thus use minimal computational resources. This finding has important implications for researchers and practitioners, who may, based on the recommendations given in this paper, make an informed choice on how to safely share large-scale network data in a privacy-aware manner.
△ Less
Submitted 2 July, 2024;
originally announced July 2024.
-
MDHA: Multi-Scale Deformable Transformer with Hybrid Anchors for Multi-View 3D Object Detection
Authors:
Michelle Adeline,
Junn Yong Loo,
Vishnu Monn Baskaran
Abstract:
Multi-view 3D object detection is a crucial component of autonomous driving systems. Contemporary query-based methods primarily depend either on dataset-specific initialization of 3D anchors, introducing bias, or utilize dense attention mechanisms, which are computationally inefficient and unscalable. To overcome these issues, we present MDHA, a novel sparse query-based framework, which constructs…
▽ More
Multi-view 3D object detection is a crucial component of autonomous driving systems. Contemporary query-based methods primarily depend either on dataset-specific initialization of 3D anchors, introducing bias, or utilize dense attention mechanisms, which are computationally inefficient and unscalable. To overcome these issues, we present MDHA, a novel sparse query-based framework, which constructs adaptive 3D output proposals using hybrid anchors from multi-view, multi-scale input. Fixed 2D anchors are combined with depth predictions to form 2.5D anchors, which are projected to obtain 3D proposals. To ensure high efficiency, our proposed Anchor Encoder performs sparse refinement and selects the top-k anchors and features. Moreover, while existing multi-view attention mechanisms rely on projecting reference points to multiple images, our novel Circular Deformable Attention mechanism only projects to a single image but allows reference points to seamlessly attend to adjacent images, improving efficiency without compromising on performance. On the nuScenes val set, it achieves 46.4% mAP and 55.0% NDS with a ResNet101 backbone. MDHA significantly outperforms the baseline, where anchor proposals are modelled as learnable embeddings.
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
Split-Apply-Combine with Dynamic Grouping
Authors:
Mark P. J. van der Loo
Abstract:
Partitioning a data set by one or more of its attributes and computing an aggregate for each part is one of the most common operations in data analyses. There are use cases where the partitioning is determined dynamically by collapsing smaller subsets into larger ones, to ensure sufficient support for the computed aggregate. These use cases are not supported by software implementing split-apply-co…
▽ More
Partitioning a data set by one or more of its attributes and computing an aggregate for each part is one of the most common operations in data analyses. There are use cases where the partitioning is determined dynamically by collapsing smaller subsets into larger ones, to ensure sufficient support for the computed aggregate. These use cases are not supported by software implementing split-apply-combine types of operations. This paper presents the \texttt{R} package \texttt{accumulate} that offers convenient interfaces for defining grouped aggregation where the grouping itself is dynamically determined, based on user-defined conditions on subsets, and a user-defined subset collapsing scheme. The formal underlying algorithm is described and analyzed as well.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
FPN-IAIA-BL: A Multi-Scale Interpretable Deep Learning Model for Classification of Mass Margins in Digital Mammography
Authors:
Julia Yang,
Alina Jade Barnett,
Jon Donnelly,
Satvik Kishore,
Jerry Fang,
Fides Regina Schwartz,
Chaofan Chen,
Joseph Y. Lo,
Cynthia Rudin
Abstract:
Digital mammography is essential to breast cancer detection, and deep learning offers promising tools for faster and more accurate mammogram analysis. In radiology and other high-stakes environments, uninterpretable ("black box") deep learning models are unsuitable and there is a call in these fields to make interpretable models. Recent work in interpretable computer vision provides transparency t…
▽ More
Digital mammography is essential to breast cancer detection, and deep learning offers promising tools for faster and more accurate mammogram analysis. In radiology and other high-stakes environments, uninterpretable ("black box") deep learning models are unsuitable and there is a call in these fields to make interpretable models. Recent work in interpretable computer vision provides transparency to these formerly black boxes by utilizing prototypes for case-based explanations, achieving high accuracy in applications including mammography. However, these models struggle with precise feature localization, reasoning on large portions of an image when only a small part is relevant. This paper addresses this gap by proposing a novel multi-scale interpretable deep learning model for mammographic mass margin classification. Our contribution not only offers an interpretable model with reasoning aligned with radiologist practices, but also provides a general architecture for computer vision with user-configurable prototypes from coarse- to fine-grained prototypes.
△ Less
Submitted 10 June, 2024;
originally announced June 2024.
-
XCAT-3.0: A Comprehensive Library of Personalized Digital Twins Derived from CT Scans
Authors:
Lavsen Dahal,
Mobina Ghojoghnejad,
Dhrubajyoti Ghosh,
Yubraj Bhandari,
David Kim,
Fong Chi Ho,
Fakrul Islam Tushar,
Sheng Luoa,
Kyle J. Lafata,
Ehsan Abadi,
Ehsan Samei,
Joseph Y. Lo,
W. Paul Segars
Abstract:
Virtual Imaging Trials (VIT) offer a cost-effective and scalable approach for evaluating medical imaging technologies. Computational phantoms, which mimic real patient anatomy and physiology, play a central role in VITs. However, the current libraries of computational phantoms face limitations, particularly in terms of sample size and diversity. Insufficient representation of the population hamper…
▽ More
Virtual Imaging Trials (VIT) offer a cost-effective and scalable approach for evaluating medical imaging technologies. Computational phantoms, which mimic real patient anatomy and physiology, play a central role in VITs. However, the current libraries of computational phantoms face limitations, particularly in terms of sample size and diversity. Insufficient representation of the population hampers accurate assessment of imaging technologies across different patient groups. Traditionally, the more realistic computational phantoms were created by manual segmentation, which is a laborious and time-consuming task, impeding the expansion of phantom libraries. This study presents a framework for creating realistic computational phantoms using a suite of automatic segmentation models and performing three forms of automated quality control on the segmented organ masks. The result is the release of over 2500 new computational phantoms, so-named XCAT3.0 after the ubiquitous XCAT computational construct. This new formation embodies 140 structures and represents a comprehensive approach to detailed anatomical modeling. The developed computational phantoms are formatted in both voxelized and surface mesh formats. The framework is combined with an in-house CT scanner simulator to produce realistic CT images. The framework has the potential to advance virtual imaging trials, facilitating comprehensive and reliable evaluations of medical imaging technologies. Phantoms may be requested at https://cvit.duke.edu/resources/. Code, model weights, and sample CT images are available at https://xcat-3.github.io/.
△ Less
Submitted 9 September, 2024; v1 submitted 17 May, 2024;
originally announced May 2024.
-
Scene Action Maps: Behavioural Maps for Navigation without Metric Information
Authors:
Joel Loo,
David Hsu
Abstract:
Humans are remarkable in their ability to navigate without metric information. We can read abstract 2D maps, such as floor-plans or hand-drawn sketches, and use them to navigate in unseen rich 3D environments, without requiring prior traversals to map out these scenes in detail. We posit that this is enabled by the ability to represent the environment abstractly as interconnected navigational beha…
▽ More
Humans are remarkable in their ability to navigate without metric information. We can read abstract 2D maps, such as floor-plans or hand-drawn sketches, and use them to navigate in unseen rich 3D environments, without requiring prior traversals to map out these scenes in detail. We posit that this is enabled by the ability to represent the environment abstractly as interconnected navigational behaviours, e.g., "follow the corridor" or "turn right", while avoiding detailed, accurate spatial information at the metric level. We introduce the Scene Action Map (SAM), a behavioural topological graph, and propose a learnable map-reading method, which parses a variety of 2D maps into SAMs. Map-reading extracts salient information about navigational behaviours from the overlooked wealth of pre-existing, abstract and inaccurate maps, ranging from floor-plans to sketches. We evaluate the performance of SAMs for navigation, by building and deploying a behavioural navigation stack on a quadrupedal robot. Videos and more information is available at: https://scene-action-maps.github.io.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
Proceedings Virtual Imaging Trials in Medicine 2024
Authors:
Ehsan Abadi,
Aldo Badano,
Predrag Bakic,
Kristina Bliznakova,
Hilde Bosmans,
Ann-Katherine Carton,
Alejandro Frangi,
Stephen Glick,
Paul Kinahan,
Joseph Lo,
Andrew Maidment,
Francesco Ria,
Ehsan Samei,
Ioannis Sechopoulos,
Paul Segars,
Rie Tanaka,
Liesbeth Vancoillie
Abstract:
This submission comprises the proceedings of the 1st Virtual Imaging Trials in Medicine conference, organized by Duke University on April 22-24, 2024. The listed authors serve as the program directors for this conference. The VITM conference is a pioneering summit uniting experts from academia, industry and government in the fields of medical imaging and therapy to explore the transformative poten…
▽ More
This submission comprises the proceedings of the 1st Virtual Imaging Trials in Medicine conference, organized by Duke University on April 22-24, 2024. The listed authors serve as the program directors for this conference. The VITM conference is a pioneering summit uniting experts from academia, industry and government in the fields of medical imaging and therapy to explore the transformative potential of in silico virtual trials and digital twins in revolutionizing healthcare. The proceedings are categorized by the respective days of the conference: Monday presentations, Tuesday presentations, Wednesday presentations, followed by the abstracts for the posters presented on Monday and Tuesday.
△ Less
Submitted 8 May, 2024;
originally announced May 2024.
-
AI in Lung Health: Benchmarking Detection and Diagnostic Models Across Multiple CT Scan Datasets
Authors:
Fakrul Islam Tushar,
Avivah Wang,
Lavsen Dahal,
Michael R. Harowicz,
Kyle J. Lafata,
Tina D. Tailor,
Joseph Y. Lo
Abstract:
Lung cancer's high mortality rate can be mitigated by early detection, increasingly reliant on AI for diagnostic imaging. However, AI model performance depends on training and validation datasets. This study develops and validates AI models for both nodule detection and cancer classification tasks. For detection, two models (DLCSD-mD and LUNA16-mD) were developed using the Duke Lung Cancer Screeni…
▽ More
Lung cancer's high mortality rate can be mitigated by early detection, increasingly reliant on AI for diagnostic imaging. However, AI model performance depends on training and validation datasets. This study develops and validates AI models for both nodule detection and cancer classification tasks. For detection, two models (DLCSD-mD and LUNA16-mD) were developed using the Duke Lung Cancer Screening Dataset (DLCSD), with over 2,000 CT scans from 1,613 patients and more than 3,000 annotations. These models were evaluated on internal (DLCSD) and external datasets, including LUNA16 (601 patients, 1186 nodules) and NLST (969 patients, 1192 nodules), using FROC analysis and AUC metrics. For classification, five models were developed and tested: a randomly initialized 3D ResNet50, Genesis, MedNet3D, an enhanced ResNet50 using Strategic Warm-Start++ (SWS++), and a linear classifier analyzing features from the Foundation Model for Cancer Biomarkers (FMCB). These models were trained to distinguish between benign and malignant nodules and evaluated using AUC analysis on internal (DLCSD) and external datasets, including LUNA16 (433 patients, 677 nodules) and NLST. The DLCSD-mD model achieved an AUC of 0.93 (95% CI: 0.91-0.94) on the internal DLCSD dataset. External validation results were 0.97 (95% CI: 0.96-0.98) on LUNA16 and 0.75 (95% CI: 0.73-0.76) on NLST. For classification, the ResNet50-SWS++ model recorded AUCs of 0.71 (95% CI: 0.61-0.81) on DLCSD, 0.90 (95% CI: 0.87-0.93) on LUNA16, and 0.81 (95% CI: 0.79-0.82) on NLST. Other models showed varying performance across datasets, underscoring the importance of diverse model approaches. This benchmarking establishes DLCSD as a reliable resource for lung cancer AI research.
△ Less
Submitted 12 June, 2024; v1 submitted 7 May, 2024;
originally announced May 2024.
-
Virtual Lung Screening Trial (VLST): An In Silico Replica of the National Lung Screening Trial for Lung Cancer Detection
Authors:
Fakrul Islam Tushar,
Liesbeth Vancoillie,
Cindy McCabe,
Amareswararao Kavuri,
Lavsen Dahal,
Brian Harrawood,
Milo Fryling,
Mojtaba Zarei,
Saman Sotoudeh-Paima,
Fong Chi Ho,
Dhrubajyoti Ghosh,
Sheng Luo,
W. Paul Segars,
Ehsan Abadi,
Kyle J. Lafata,
Ehsan Samei,
Joseph Y. Lo
Abstract:
Importance: Clinical imaging trials are crucial for definitive evaluation of medical innovations, but the process is inefficient, expensive, and ethically-constrained. Virtual imaging trial (VIT) approach address these limitations by emulating the components of a clinical trial. An in silico rendition of the National Lung Screening Trial (NCLS) via Virtual Lung Screening Trial (VLST) demonstrates…
▽ More
Importance: Clinical imaging trials are crucial for definitive evaluation of medical innovations, but the process is inefficient, expensive, and ethically-constrained. Virtual imaging trial (VIT) approach address these limitations by emulating the components of a clinical trial. An in silico rendition of the National Lung Screening Trial (NCLS) via Virtual Lung Screening Trial (VLST) demonstrates the promise of VITs to expedite clinical trials, reduce risks to subjects, and facilitate the optimal use of imaging technologies in clinical settings.
Design, Setting, and Participants: A diverse virtual patient population of 294 subjects was created from human models (XCAT) emulating the characteristics of cases on NLST, with two types of simulated lung nodules. The cohort was assessed using simulated CT and CXR systems to generate images that reflect the NLST imaging technologies. Deep learning models trained for lesion detection in CXR and CT served as virtual readers.
Results: The study analyzed 294 CT and CXR simulated images from 294 virtual patients, with a lesion-level AUC of 0.81 (95% CI: 0.79-0.84) for CT and 0.56 (95% CI: 0.54-0.58) for CXR. At the patient level, CT demonstrated an AUC of 0.84 (95% CI: 0.80-0.89), compared to 0.52 (95% CI: 0.45-0.58) for CXR. Subgroup analyses on CT results indicated superior detection of homogeneous lesions (lesion-level AUC 0.97) than heterogeneous lesions (lesion-level AUC 0.72). Performance was particularly high for identifying larger nodules (AUC of 0.98 for nodules > 8 mm). The VLST results closely mirrored the NLST, particularly in size-based detection trends, with CT achieving high AUCs for nodules > 8 mm and similar challenges in detecting smaller nodules.
Conclusion and Relevance: The VIT results closely replicated those of the earlier NLST, underscoring its potential to replicate real clinical imaging trials.
△ Less
Submitted 24 September, 2024; v1 submitted 17 April, 2024;
originally announced April 2024.
-
High-throughput measurement of elastic moduli of microfibers by rope coiling
Authors:
Yuan Liu,
Jack Hau Yung Lo,
Janine K. Nunes,
Howard A. Stone,
Ho Cheung Shum
Abstract:
There are many fields where it is of interest to measure the elastic moduli of tiny fragile fibers, such as filamentous bacteria, actin filaments, DNA, carbon nanotubes, and functional microfibers. The elastic modulus is typically deduced from a sophisticated tensile test under a microscope, but the throughput is low and limited by the time-consuming and skill-intensive sample loading/unloading. H…
▽ More
There are many fields where it is of interest to measure the elastic moduli of tiny fragile fibers, such as filamentous bacteria, actin filaments, DNA, carbon nanotubes, and functional microfibers. The elastic modulus is typically deduced from a sophisticated tensile test under a microscope, but the throughput is low and limited by the time-consuming and skill-intensive sample loading/unloading. Here, we demonstrate a simple microfluidic method enabling the high-throughput measurement of the elastic moduli of microfibers by rope coiling using a localized compression, where sample loading/unloading are not needed between consecutive measurements. The rope coiling phenomenon occurs spontaneously when a microfiber flows from a small channel into a wide channel. The elastic modulus is determined by measuring either the buckling length or the coiling radius. The throughput of this method, currently 3,300 fibers per hour, is a thousand times higher than that of a tensile tester. We demonstrate the feasibility of the method by testing a nonuniform fiber with axially varying elastic modulus. We also demonstrate its capability for in situ inline measurement in a microfluidic production line. We envisage that high-throughput measurements may facilitate potential applications such as screening or sorting by mechanical properties and real-time control during production of microfibers.
△ Less
Submitted 18 March, 2024;
originally announced March 2024.
-
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
Authors:
Gemini Team,
Petko Georgiev,
Ving Ian Lei,
Ryan Burnell,
Libin Bai,
Anmol Gulati,
Garrett Tanzer,
Damien Vincent,
Zhufeng Pan,
Shibo Wang,
Soroosh Mariooryad,
Yifan Ding,
Xinyang Geng,
Fred Alcober,
Roy Frostig,
Mark Omernick,
Lexi Walker,
Cosmin Paduraru,
Christina Sorokin,
Andrea Tacchetti,
Colin Gaffney,
Samira Daruki,
Olcan Sercinoglu,
Zach Gleicher,
Juliette Love
, et al. (1110 additional authors not shown)
Abstract:
In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February…
▽ More
In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February version on the great majority of capabilities and benchmarks; (2) Gemini 1.5 Flash, a more lightweight variant designed for efficiency with minimal regression in quality. Gemini 1.5 models achieve near-perfect recall on long-context retrieval tasks across modalities, improve the state-of-the-art in long-document QA, long-video QA and long-context ASR, and match or surpass Gemini 1.0 Ultra's state-of-the-art performance across a broad set of benchmarks. Studying the limits of Gemini 1.5's long-context ability, we find continued improvement in next-token prediction and near-perfect retrieval (>99%) up to at least 10M tokens, a generational leap over existing models such as Claude 3.0 (200k) and GPT-4 Turbo (128k). Finally, we highlight real-world use cases, such as Gemini 1.5 collaborating with professionals on completing their tasks achieving 26 to 75% time savings across 10 different job categories, as well as surprising new capabilities of large language models at the frontier; when given a grammar manual for Kalamang, a language with fewer than 200 speakers worldwide, the model learns to translate English to Kalamang at a similar level to a person who learned from the same content.
△ Less
Submitted 8 August, 2024; v1 submitted 8 March, 2024;
originally announced March 2024.
-
What limits performance of weakly supervised deep learning for chest CT classification?
Authors:
Fakrul Islam Tushar,
Vincent M. D'Anniballe,
Geoffrey D. Rubin,
Joseph Y. Lo
Abstract:
Weakly supervised learning with noisy data has drawn attention in the medical imaging community due to the sparsity of high-quality disease labels. However, little is known about the limitations of such weakly supervised learning and the effect of these constraints on disease classification performance. In this paper, we test the effects of such weak supervision by examining model tolerance for th…
▽ More
Weakly supervised learning with noisy data has drawn attention in the medical imaging community due to the sparsity of high-quality disease labels. However, little is known about the limitations of such weakly supervised learning and the effect of these constraints on disease classification performance. In this paper, we test the effects of such weak supervision by examining model tolerance for three conditions. First, we examined model tolerance for noisy data by incrementally increasing error in the labels within the training data. Second, we assessed the impact of dataset size by varying the amount of training data. Third, we compared performance differences between binary and multi-label classification. Results demonstrated that the model could endure up to 10% added label error before experiencing a decline in disease classification performance. Disease classification performance steadily rose as the amount of training data was increased for all disease classes, before experiencing a plateau in performance at 75% of training data. Last, the binary model outperformed the multilabel model in every disease category. However, such interpretations may be misleading, as the binary model was heavily influenced by co-occurring diseases and may not have learned the specific features of the disease in the image. In conclusion, this study may help the medical imaging community understand the benefits and risks of weak supervision with noisy labels. Such studies demonstrate the need to build diverse, large-scale datasets and to develop explainable and responsible AI.
△ Less
Submitted 6 February, 2024;
originally announced February 2024.
-
Quantifying analogy of concepts via ologs and wiring diagrams
Authors:
Jason Lo
Abstract:
We build on the theory of ontology logs (ologs) created by Spivak and Kent, and define a notion of wiring diagrams. In this article, a wiring diagram is a finite directed labelled graph. The labels correspond to types in an olog; they can also be interpreted as readings of sensors in an autonomous system. As such, wiring diagrams can be used as a framework for an autonomous system to form abstract…
▽ More
We build on the theory of ontology logs (ologs) created by Spivak and Kent, and define a notion of wiring diagrams. In this article, a wiring diagram is a finite directed labelled graph. The labels correspond to types in an olog; they can also be interpreted as readings of sensors in an autonomous system. As such, wiring diagrams can be used as a framework for an autonomous system to form abstract concepts. We show that the graphs underlying skeleton wiring diagrams form a category. This allows skeleton wiring diagrams to be compared and manipulated using techniques from both graph theory and category theory. We also extend the usual definition of graph edit distance to the case of wiring diagrams by using operations only available to wiring diagrams, leading to a metric on the set of all skeleton wiring diagrams. In the end, we give an extended example on calculating the distance between two concepts represented by wiring diagrams, and explain how to apply our framework to any application domain.
△ Less
Submitted 1 February, 2024;
originally announced February 2024.
-
Characterising the take-off dynamics and energy efficiency in spring-driven jumping robots
Authors:
John Lo,
Ben Parslew
Abstract:
Previous design methodologies for spring-driven jumping robots focused on jump height optimization for specific tasks. In doing so, numerous designs have been proposed including using nonlinear spring-linkages to increase the elastic energy storage and jump height. However, these systems can never achieve their theoretical maximum jump height due to taking off before the spring energy is fully rel…
▽ More
Previous design methodologies for spring-driven jumping robots focused on jump height optimization for specific tasks. In doing so, numerous designs have been proposed including using nonlinear spring-linkages to increase the elastic energy storage and jump height. However, these systems can never achieve their theoretical maximum jump height due to taking off before the spring energy is fully released, resulting in an incomplete transfer of stored elastic energy to gravitational potential energy. This paper presents low-order models aimed at characterising the energy conversion during the acceleration phase of jumping. It also proposes practical solutions for increasing the energy efficiency of jumping robots. A dynamic analysis is conducted on a multibody system comprised of rotational links, which is experimentally validated using a physical demonstrator. The analysis reveals that inefficient energy conversion is attributed to inertial effects caused by rotational and unsprung masses. Since these masses cannot be entirely eliminated from a physical linkage, a practical approach to improving energy efficiency involves structural redesign to reduce structural mass and moments of inertia while maintaining compliance with structural strength and stiffness requirements.
△ Less
Submitted 10 January, 2024;
originally announced January 2024.
-
Elastic energy storage of spring-driven jumping robots
Authors:
John Lo,
Ben Parslew
Abstract:
Spring-driven jumping robots use an energised spring for propulsion, while the onboard motor only serves as a spring-charging source. A common mechanism in designing these robots is the rhomboidal linkage, which has been combined with linear springs (spring-linkage) to create a nonlinear spring, thereby increasing elastic energy storage and jump height for a given motor force. The effectiveness of…
▽ More
Spring-driven jumping robots use an energised spring for propulsion, while the onboard motor only serves as a spring-charging source. A common mechanism in designing these robots is the rhomboidal linkage, which has been combined with linear springs (spring-linkage) to create a nonlinear spring, thereby increasing elastic energy storage and jump height for a given motor force. The effectiveness of this spring-linkage has been examined for individual designs, but a generalised design theory of this class of system remains absent. This paper presents an energetics analysis of the spring-linkage and provides insight into designing an ideal constant force spring, which stores the maximum energy for a given motor force. A quasi-static analysis shows that the force-displacement relationship of the spring-linkage changes with the orientation and type of the spring, but is independent of the linkage scale. Combining different types and orientations of springs within the linkage enables higher elastic energy storage than using single springs. Placing two translational springs at the diagonals of the rhomboidal linkage creates an ideal spring that could increase the jump height of prior robots by 50-160%.
△ Less
Submitted 3 November, 2023;
originally announced November 2023.
-
Domain-specific optimization and diverse evaluation of self-supervised models for histopathology
Authors:
Jeremy Lai,
Faruk Ahmed,
Supriya Vijay,
Tiam Jaroensri,
Jessica Loo,
Saurabh Vyawahare,
Saloni Agarwal,
Fayaz Jamil,
Yossi Matias,
Greg S. Corrado,
Dale R. Webster,
Jonathan Krause,
Yun Liu,
Po-Hsuan Cameron Chen,
Ellery Wulczyn,
David F. Steiner
Abstract:
Task-specific deep learning models in histopathology offer promising opportunities for improving diagnosis, clinical research, and precision medicine. However, development of such models is often limited by availability of high-quality data. Foundation models in histopathology that learn general representations across a wide range of tissue types, diagnoses, and magnifications offer the potential…
▽ More
Task-specific deep learning models in histopathology offer promising opportunities for improving diagnosis, clinical research, and precision medicine. However, development of such models is often limited by availability of high-quality data. Foundation models in histopathology that learn general representations across a wide range of tissue types, diagnoses, and magnifications offer the potential to reduce the data, compute, and technical expertise necessary to develop task-specific deep learning models with the required level of model performance. In this work, we describe the development and evaluation of foundation models for histopathology via self-supervised learning (SSL). We first establish a diverse set of benchmark tasks involving 17 unique tissue types and 12 unique cancer types and spanning different optimal magnifications and task types. Next, we use this benchmark to explore and evaluate histopathology-specific SSL methods followed by further evaluation on held out patch-level and weakly supervised tasks. We found that standard SSL methods thoughtfully applied to histopathology images are performant across our benchmark tasks and that domain-specific methodological improvements can further increase performance. Our findings reinforce the value of using domain-specific SSL methods in pathology, and establish a set of high quality foundation models to enable further research across diverse applications.
△ Less
Submitted 19 October, 2023;
originally announced October 2023.
-
Characterizing Aqueous Foams by In-situ Viscosity Measurement in a Foam Column
Authors:
Wei Yu,
Jack Hau Yung Lo,
Mazen Yousef Kanj
Abstract:
Foam characterization is essential in many applications of foams, such as cleaning, food processing, cosmetics, and oil production, due to these applications diversified requirements. The standard characterization method, the foam column test, cannot provide sufficient information for in-depth studies. Hence, there have been many studies that incorporated different characterization methods into th…
▽ More
Foam characterization is essential in many applications of foams, such as cleaning, food processing, cosmetics, and oil production, due to these applications diversified requirements. The standard characterization method, the foam column test, cannot provide sufficient information for in-depth studies. Hence, there have been many studies that incorporated different characterization methods into the standard test. It should be enlightening and feasible to measure the foam viscosity, which is both of practical and fundamental interest, during the foam column test, but it has never been done before. Here, we demonstrate a method to characterize aqueous foams and their aging behaviors with simultaneous measurement of foam viscosity and foam height. Using a vibration viscometer, we integrate foam column experiments with in-situ foam viscosity measurements. We studied the correlation among the foam structure, foam height, and foam viscosity during the foam decay process. We found a drastic decrease in foam viscosity in the early foam decay while the foam height remained unchanged, which is explained by coarsening. This method is much more sensitive and time-efficient than conventional foam-height-based methods by comparing the half-life. This method successfully characterizes the stability of foams made of various combinations of surfactants and gases.
△ Less
Submitted 8 October, 2023;
originally announced October 2023.
-
Large Intestine 3D Shape Refinement Using Point Diffusion Models for Digital Phantom Generation
Authors:
Kaouther Mouheb,
Mobina Ghojogh Nejad,
Lavsen Dahal,
Ehsan Samei,
Kyle J. Lafata,
W. Paul Segars,
Joseph Y. Lo
Abstract:
Accurate 3D modeling of human organs plays a crucial role in building computational phantoms for virtual imaging trials. However, generating anatomically plausible reconstructions of organ surfaces from computed tomography scans remains challenging for many structures in the human body. This challenge is particularly evident when dealing with the large intestine. In this study, we leverage recent…
▽ More
Accurate 3D modeling of human organs plays a crucial role in building computational phantoms for virtual imaging trials. However, generating anatomically plausible reconstructions of organ surfaces from computed tomography scans remains challenging for many structures in the human body. This challenge is particularly evident when dealing with the large intestine. In this study, we leverage recent advancements in geometric deep learning and denoising diffusion probabilistic models to refine the segmentation results of the large intestine. We begin by representing the organ as point clouds sampled from the surface of the 3D segmentation mask. Subsequently, we employ a hierarchical variational autoencoder to obtain global and local latent representations of the organ's shape. We train two conditional denoising diffusion models in the hierarchical latent space to perform shape refinement. To further enhance our method, we incorporate a state-of-the-art surface reconstruction model, allowing us to generate smooth meshes from the obtained complete point clouds. Experimental results demonstrate the effectiveness of our approach in capturing both the global distribution of the organ's shape and its fine details. Our complete refinement pipeline demonstrates remarkable enhancements in surface representation compared to the initial segmentation, reducing the Chamfer distance by 70%, the Hausdorff distance by 32%, and the Earth Mover's distance by 6%. By combining geometric deep learning, denoising diffusion models, and advanced surface reconstruction techniques, our proposed method offers a promising solution for accurately modeling the large intestine's surface and can easily be extended to other anatomical structures.
△ Less
Submitted 20 May, 2024; v1 submitted 15 September, 2023;
originally announced September 2023.
-
Virtual imaging trials improved the transparency and reliability of AI systems in COVID-19 imaging
Authors:
Fakrul Islam Tushar,
Lavsen Dahal,
Saman Sotoudeh-Paima,
Ehsan Abadi,
W. Paul Segars,
Ehsan Samei,
Joseph Y. Lo
Abstract:
The credibility of AI models in medical imaging is often challenged by reproducibility issues and obscured clinical insights, a reality highlighted during the COVID-19 pandemic by many reports of near-perfect artificial intelligence (AI) models that all failed to generalize. To address these concerns, we propose a virtual imaging trial framework, employing a diverse collection of medical images th…
▽ More
The credibility of AI models in medical imaging is often challenged by reproducibility issues and obscured clinical insights, a reality highlighted during the COVID-19 pandemic by many reports of near-perfect artificial intelligence (AI) models that all failed to generalize. To address these concerns, we propose a virtual imaging trial framework, employing a diverse collection of medical images that are both clinical and simulated. In this study, COVID-19 serves as a case example to unveil the intrinsic and extrinsic factors influencing AI performance. Our findings underscore a significant impact of dataset characteristics on AI efficacy. Even when trained on large, diverse clinical datasets with thousands of patients, AI performance plummeted by up to 20% in generalization. However, virtual imaging trials offer a robust platform for objective assessment, unveiling nuanced insights into the relationships between patient- and physics-based factors and AI performance. For instance, disease extent markedly influenced AI efficacy, computed tomography (CT) out-performed chest radiography (CXR), while imaging dose exhibited minimal impact. Using COVID-19 as a case study, this virtual imaging trial study verified that radiology AI models often suffer from a reproducibility crisis. Virtual imaging trials not only offered a solution for objective performance assessment but also extracted several clinical insights. This study illuminates the path for leveraging virtual imaging to augment the reliability, transparency, and clinical relevance of AI in medical imaging.
△ Less
Submitted 31 March, 2024; v1 submitted 17 August, 2023;
originally announced August 2023.
-
Joint Computing Offloading and Resource Allocation for Classification Intelligent Tasks in MEC Systems
Authors:
Yuanpeng Zheng,
Tiankui Zhang,
Jonathan Loo,
Yapeng Wang,
Arumugam Nallanathan
Abstract:
Mobile edge computing (MEC) enables low-latency and high-bandwidth applications by bringing computation and data storage closer to end-users. Intelligent computing is an important application of MEC, where computing resources are used to solve intelligent task-related problems based on task requirements. However, efficiently offloading computing and allocating resources for intelligent tasks in ME…
▽ More
Mobile edge computing (MEC) enables low-latency and high-bandwidth applications by bringing computation and data storage closer to end-users. Intelligent computing is an important application of MEC, where computing resources are used to solve intelligent task-related problems based on task requirements. However, efficiently offloading computing and allocating resources for intelligent tasks in MEC systems is a challenging problem due to complex interactions between task requirements and MEC resources. To address this challenge, we investigate joint computing offloading and resource allocation for intelligent tasks in MEC systems. Our goal is to optimize system utility by jointly considering computing accuracy and task delay to achieve maximum system performance. We focus on classification intelligence tasks and formulate an optimization problem that considers both the accuracy requirements of tasks and the parallel computing capabilities of MEC systems. To solve the optimization problem, we decompose it into three subproblems: subcarrier allocation, computing capacity allocation, and compression offloading. We use convex optimization and successive convex approximation to derive closed-form expressions for the subcarrier allocation, offloading decisions, computing capacity, and compressed ratio. Based on our solutions, we design an efficient computing offloading and resource allocation algorithm for intelligent tasks in MEC systems. Our simulation results demonstrate that our proposed algorithm significantly improves the performance of intelligent tasks in MEC systems and achieves a flexible trade-off between system revenue and cost considering intelligent tasks compared with the benchmarks.
△ Less
Submitted 5 July, 2023;
originally announced July 2023.
-
Dynamic Multi-time Scale User Admission and Resource Allocation for Semantic Extraction in MEC Systems
Authors:
Yuanpeng Zheng,
Tiankui Zhang,
Jonathan Loo
Abstract:
This paper investigates the semantic extraction task-oriented dynamic multi-time scale user admission and resourceallocation in mobile edge computing (MEC) systems. Amid prevalence artifi cial intelligence applications in various industries,the offloading of semantic extraction tasks which are mainlycomposed of convolutional neural networks of computer vision isa great challenge for communication…
▽ More
This paper investigates the semantic extraction task-oriented dynamic multi-time scale user admission and resourceallocation in mobile edge computing (MEC) systems. Amid prevalence artifi cial intelligence applications in various industries,the offloading of semantic extraction tasks which are mainlycomposed of convolutional neural networks of computer vision isa great challenge for communication bandwidth and computing capacity allocation in MEC systems. Considering the stochasticnature of the semantic extraction tasks, we formulate a stochastic optimization problem by modeling it as the dynamic arrival of tasks in the temporal domain. We jointly optimize the system revenue and cost which are represented as user admission in the long term and resource allocation in the short term respectively. To handle the proposed stochastic optimization problem, we decompose it into short-time-scale subproblems and a long-time-scale subproblem by using the Lyapunov optimization technique. After that, the short-time-scale optimization variables of resource allocation, including user association, bandwidth allocation, and computing capacity allocation are obtained in closed form. The user admission optimization on long-time scales is solved by a heuristic iteration method. Then, the multi-time scale user admission and resource allocation algorithm is proposed for dynamic semantic extraction task computing in MEC systems. Simulation results demonstrate that, compared with the benchmarks, the proposed algorithm improves the performance of user admission and resource allocation efficiently and achieves a flexible trade-off between system revenue and cost at multi-time scales and considering semantic extraction tasks.
△ Less
Submitted 5 July, 2023;
originally announced July 2023.
-
The effect of distant connections on node anonymity in complex networks
Authors:
Rachel G. de Jong,
Mark P. J. van der Loo,
Frank W. Takes
Abstract:
Ensuring privacy of individuals is of paramount importance to social network analysis research. Previous work assessed anonymity in a network based on the non-uniqueness of a node's ego network. In this work, we show that this approach does not adequately account for the strong de-anonymizing effect of distant connections. We first propose the use of d-k-anonymity, a novel measure that takes knowl…
▽ More
Ensuring privacy of individuals is of paramount importance to social network analysis research. Previous work assessed anonymity in a network based on the non-uniqueness of a node's ego network. In this work, we show that this approach does not adequately account for the strong de-anonymizing effect of distant connections. We first propose the use of d-k-anonymity, a novel measure that takes knowledge up to distance d of a considered node into account. Second, we introduce anonymity-cascade, which exploits the so-called infectiousness of uniqueness: mere information about being connected to another unique node can make a given node uniquely identifiable. These two approaches, together with relevant "twin node" processing steps in the underlying graph structure, offer practitioners flexible solutions, tunable in precision and computation time. This enables the assessment of anonymity in large-scale networks with up to millions of nodes and edges. Experiments on graph models and a wide range of real-world networks show drastic decreases in anonymity when connections at distance 2 are considered. Moreover, extending the knowledge beyond the ego network with just one extra link often already decreases overall anonymity by over 50%. These findings have important implications for privacy-aware sharing of sensitive network data.
△ Less
Submitted 14 November, 2023; v1 submitted 23 June, 2023;
originally announced June 2023.
-
Sigma-point Kalman Filter with Nonlinear Unknown Input Estimation via Optimization and Data-driven Approach for Dynamic Systems
Authors:
Junn Yong Loo,
Ze Yang Ding,
Vishnu Monn Baskaran,
Surya Girinatha Nurzaman,
Chee Pin Tan
Abstract:
Most works on joint state and unknown input (UI) estimation require the assumption that the UIs are linear; this is potentially restrictive as it does not hold in many intelligent autonomous systems. To overcome this restriction and circumvent the need to linearize the system, we propose a derivative-free Unknown Input Sigma-point Kalman Filter (SPKF-nUI) where the SPKF is interconnected with a ge…
▽ More
Most works on joint state and unknown input (UI) estimation require the assumption that the UIs are linear; this is potentially restrictive as it does not hold in many intelligent autonomous systems. To overcome this restriction and circumvent the need to linearize the system, we propose a derivative-free Unknown Input Sigma-point Kalman Filter (SPKF-nUI) where the SPKF is interconnected with a general nonlinear UI estimator that can be implemented via nonlinear optimization and data-driven approaches. The nonlinear UI estimator uses the posterior state estimate which is less susceptible to state prediction error. In addition, we introduce a joint sigma-point transformation scheme to incorporate both the state and UI uncertainties in the estimation of SPKF-nUI. An in-depth stochastic stability analysis proves that the proposed SPKF-nUI yields exponentially converging estimation error bounds under reasonable assumptions. Finally, two case studies are carried out on a simulation-based rigid robot and a physical soft robot, i.e., robots made of soft materials with complex dynamics to validate effectiveness of the proposed filter on nonlinear dynamic systems. Our results demonstrate that the proposed SPKF-nUI achieves the lowest state and UI estimation errors when compared to the existing nonlinear state-UI filters.
△ Less
Submitted 24 June, 2024; v1 submitted 21 June, 2023;
originally announced June 2023.
-
Stability for Line Bundles and Deformed Hermitian-Yang-Mills Equation on Some Elliptic Surfaces
Authors:
Tristan C. Collins,
Jason Lo,
Yun Shi,
Shing-Tung Yau
Abstract:
We study the twisted ampleness criterion due to Collins, Jacob and Yau on surfaces, which is equivalent to the existence of solutions to the deformed Hermitian-Yang-Mills (dHYM) equation. When $X$ is a Weierstrass elliptic K3 surface, and $ω$ an ample class such that $ω$ lies in the span of a section class and the fiber class, we show that for a class of line bundles $L$ with fiber degree 1 and…
▽ More
We study the twisted ampleness criterion due to Collins, Jacob and Yau on surfaces, which is equivalent to the existence of solutions to the deformed Hermitian-Yang-Mills (dHYM) equation. When $X$ is a Weierstrass elliptic K3 surface, and $ω$ an ample class such that $ω$ lies in the span of a section class and the fiber class, we show that for a class of line bundles $L$ with fiber degree 1 and $ωc_1(L)>0$, the twisted ampleness of $L$ respect to $ω$, always implies the $σ_{ω, 0}$-stability (Bridgeland stability) of $L$. This answers a question by Collins and Yau for a class of examples.
△ Less
Submitted 20 September, 2024; v1 submitted 8 June, 2023;
originally announced June 2023.
-
Unsupervised Cross-Domain Soft Sensor Modelling via Deep Physics-Inspired Particle Flow Bayes
Authors:
Junn Yong Loo,
Ze Yang Ding,
Surya G. Nurzaman,
Chee-Ming Ting,
Vishnu Monn Baskaran,
Chee Pin Tan
Abstract:
Data-driven soft sensors are essential for achieving accurate perception through reliable state inference. However, developing representative soft sensor models is challenged by issues such as missing labels, domain adaptability, and temporal coherence in data. To address these challenges, we propose a deep Particle Flow Bayes (DPFB) framework for cross-domain soft sensor modeling in the absence o…
▽ More
Data-driven soft sensors are essential for achieving accurate perception through reliable state inference. However, developing representative soft sensor models is challenged by issues such as missing labels, domain adaptability, and temporal coherence in data. To address these challenges, we propose a deep Particle Flow Bayes (DPFB) framework for cross-domain soft sensor modeling in the absence of target state labels. In particular, a sequential Bayes objective is first formulated to perform the maximum likelihood estimation underlying the cross-domain soft sensing problem. At the core of the framework, we incorporate a physics-inspired particle flow that optimizes the sequential Bayes objective to perform an exact Bayes update of the model extracted latent and hidden features. As a result, these contributions enable the proposed framework to learn a rich approximate posterior feature representation capable of characterizing complex cross-domain system dynamics and performing effective time series unsupervised domain adaptation (UDA). Finally, we validate the framework on a complex industrial multiphase flow process system with complex dynamics and multiple operating conditions. The results demonstrate that the DPFB framework achieves superior cross-domain soft sensing performance, outperforming state-of-the-art deep UDA and normalizing flow approaches.
△ Less
Submitted 8 July, 2023; v1 submitted 7 June, 2023;
originally announced June 2023.
-
A Data-Driven Computational Model for Engineered Cardiac Microtissues
Authors:
Javiera Jilberto,
Samuel J. DePalma,
Jason Lo,
Hiba Kobeissi,
Lani Quach,
Emma Lejeune,
Brendon M. Baker,
David Nordsletten
Abstract:
Engineered heart tissues (EHTs) present a potential solution to some of the current challenges in the treatment of heart disease; however, the development of mature, adult-like cardiac tissues remains elusive. Mechanical stimuli have been observed to improve whole-tissue function and cardiomyocyte (CM) maturation, although our ability to fully utilize these mechanisms is hampered, in part, by our…
▽ More
Engineered heart tissues (EHTs) present a potential solution to some of the current challenges in the treatment of heart disease; however, the development of mature, adult-like cardiac tissues remains elusive. Mechanical stimuli have been observed to improve whole-tissue function and cardiomyocyte (CM) maturation, although our ability to fully utilize these mechanisms is hampered, in part, by our incomplete understanding of the mechanobiology of EHTs. In this work, we leverage the experimental data produced by a mechanically tunable experimental setup to generate tissue-specific computational models of EHTs. Using imaging and functional data, our modeling pipeline generates models with tissue-specific ECM and myofibril structure, allowing us to estimate CM active stress. We use this experimental and modeling pipeline to study different mechanical environments, where we contrast the force output of the tissue with the computed active stress of CMs. We show that the significant differences in measured experimental forces can largely be explained by the levels of myofibril formation achieved by the CMs in the distinct mechanical environments, with active stress showing more muted variations across conditions. The presented model also enables us to dissect the relative contributions of myofibrils and extracellular matrix to tissue force output, a task difficult to address experimentally. These results highlight the importance of tissue-specific modeling to augment EHT experiments, providing deeper insights into the mechanobiology driving EHT function.
△ Less
Submitted 31 May, 2023;
originally announced June 2023.
-
SPARTA: Spatial Acceleration for Efficient and Scalable Horizontal Diffusion Weather Stencil Computation
Authors:
Gagandeep Singh,
Alireza Khodamoradi,
Kristof Denolf,
Jack Lo,
Juan Gómez-Luna,
Joseph Melber,
Andra Bisca,
Henk Corporaal,
Onur Mutlu
Abstract:
Fast and accurate climate simulations and weather predictions are critical for understanding and preparing for the impact of climate change. Real-world weather and climate modeling consist of complex compound stencil kernels that do not perform well on conventional architectures. Horizontal diffusion is one such important compound stencil found in many climate and weather prediction models. Recent…
▽ More
Fast and accurate climate simulations and weather predictions are critical for understanding and preparing for the impact of climate change. Real-world weather and climate modeling consist of complex compound stencil kernels that do not perform well on conventional architectures. Horizontal diffusion is one such important compound stencil found in many climate and weather prediction models. Recent works propose using FPGAs as an alternative to traditional CPU and GPU-based systems to accelerate compound stencil kernels. However, we observe that compound stencil computations cannot leverage the bit-level flexibility available on an FPGA because of its complex memory access patterns, leading to high hardware resource utilization and low peak performance. We introduce SPARTA, a novel spatial accelerator for horizontal diffusion weather stencil computation. We exploit the two-dimensional spatial architecture to efficiently accelerate horizontal diffusion stencil by designing the first scaled-out spatial accelerator using MLIR (Multi-Level Intermediate Representation) compiler framework. We evaluate its performance on a real cutting-edge AMD-Xilinx Versal AI Engine spatial architecture. Our real-system evaluation results demonstrate that SPARTA outperforms the state-of-the-art CPU, GPU, and FPGA implementations by 17.1x, 1.2x, and 2.1x, respectively. Our results reveal that balancing workload across the available processing resources is crucial in achieving high performance on spatial architectures. We also implement and evaluate five elementary stencils that are commonly used as benchmarks for stencil computation research. We freely open-source all our implementations to aid future research in stencil computation and spatial computing systems at https://github.com/CMU-SAFARI/SPARTA.
△ Less
Submitted 9 May, 2023; v1 submitted 6 March, 2023;
originally announced March 2023.
-
Cross-domain Transfer Learning and State Inference for Soft Robots via a Semi-supervised Sequential Variational Bayes Framework
Authors:
Shageenderan Sapai,
Junn Yong Loo,
Ze Yang Ding,
Chee Pin Tan,
Raphael CW Phan,
Vishnu Monn Baskaran,
Surya Girinatha Nurzaman
Abstract:
Recently, data-driven models such as deep neural networks have shown to be promising tools for modelling and state inference in soft robots. However, voluminous amounts of data are necessary for deep models to perform effectively, which requires exhaustive and quality data collection, particularly of state labels. Consequently, obtaining labelled state data for soft robotic systems is challenged f…
▽ More
Recently, data-driven models such as deep neural networks have shown to be promising tools for modelling and state inference in soft robots. However, voluminous amounts of data are necessary for deep models to perform effectively, which requires exhaustive and quality data collection, particularly of state labels. Consequently, obtaining labelled state data for soft robotic systems is challenged for various reasons, including difficulty in the sensorization of soft robots and the inconvenience of collecting data in unstructured environments. To address this challenge, in this paper, we propose a semi-supervised sequential variational Bayes (DSVB) framework for transfer learning and state inference in soft robots with missing state labels on certain robot configurations. Considering that soft robots may exhibit distinct dynamics under different robot configurations, a feature space transfer strategy is also incorporated to promote the adaptation of latent features across multiple configurations. Unlike existing transfer learning approaches, our proposed DSVB employs a recurrent neural network to model the nonlinear dynamics and temporal coherence in soft robot data. The proposed framework is validated on multiple setup configurations of a pneumatic-based soft robot finger. Experimental results on four transfer scenarios demonstrate that DSVB performs effective transfer learning and accurate state inference amidst missing state labels. The data and code are available at https://github.com/shageenderan/DSVB.
△ Less
Submitted 25 August, 2023; v1 submitted 2 March, 2023;
originally announced March 2023.
-
A Deep Probabilistic Spatiotemporal Framework for Dynamic Graph Representation Learning with Application to Brain Disorder Identification
Authors:
Sin-Yee Yap,
Junn Yong Loo,
Chee-Ming Ting,
Fuad Noman,
Raphael C. -W. Phan,
Adeel Razi,
David L. Dowe
Abstract:
Recent applications of pattern recognition techniques on brain connectome classification using functional connectivity (FC) are shifting towards acknowledging the non-Euclidean topology and causal dynamics of brain connectivity across time. In this paper, a deep spatiotemporal variational Bayes (DSVB) framework is proposed to learn time-varying topological structures in dynamic FC networks for ide…
▽ More
Recent applications of pattern recognition techniques on brain connectome classification using functional connectivity (FC) are shifting towards acknowledging the non-Euclidean topology and causal dynamics of brain connectivity across time. In this paper, a deep spatiotemporal variational Bayes (DSVB) framework is proposed to learn time-varying topological structures in dynamic FC networks for identifying autism spectrum disorder (ASD) in human participants. The framework incorporates a spatial-aware recurrent neural network with an attention-based message passing scheme to capture rich spatiotemporal patterns across dynamic FC networks. To overcome model overfitting on limited training datasets, an adversarial training strategy is introduced to learn graph embedding models that generalize well to unseen brain networks. Evaluation on the ABIDE resting-state functional magnetic resonance imaging dataset shows that our proposed framework substantially outperforms state-of-the-art methods in identifying patients with ASD. Dynamic FC analyses with DSVB-learned embeddings reveal apparent group differences between ASD and healthy controls in brain network connectivity patterns and switching dynamics of brain states.
△ Less
Submitted 13 May, 2024; v1 submitted 14 February, 2023;
originally announced February 2023.
-
Development of a Hardware-in-the-loop Testbed for Laboratory Performance Verification of Flexible Building Equipment in Typical Commercial Buildings
Authors:
Zhelun Chen,
Jin Wen,
Steven T. Bushby,
L. James Lo,
Zheng O'Neill,
W. Vance Payne,
Amanda Pertzborn,
Caleb Calfa,
Yangyang Fu,
Gabriel Grajewski,
Yicheng Li,
Zhiyao Yang
Abstract:
The goals of reducing energy costs, shifting electricity peaks, increasing the use of renewable energy, and enhancing the stability of the electric grid can be met in part by fully exploiting the energy flexibility potential of buildings and building equipment. The development of strategies that exploit these flexibilities could be facilitated by publicly available high-resolution datasets illustr…
▽ More
The goals of reducing energy costs, shifting electricity peaks, increasing the use of renewable energy, and enhancing the stability of the electric grid can be met in part by fully exploiting the energy flexibility potential of buildings and building equipment. The development of strategies that exploit these flexibilities could be facilitated by publicly available high-resolution datasets illustrating how control of HVAC systems in commercial buildings can be used in different climate zones to shape the energy use profile of a building for grid needs. This article presents the development and integration of a Hardware-In-the-Loop Flexible load Testbed (HILFT) that integrates physical HVAC systems with a simulated building model and simulated occupants with the goal of generating datasets to verify load flexibility of typical commercial buildings. Compared to simulation-only experiments, the hardware-in-the-loop approach captures the dynamics of the physical systems while also allowing efficient testing of various boundary conditions. The HILFT integration in this article is achieved through the co-simulation among various software environments including LabVIEW, MATLAB, and EnergyPlus. Although theoretically viable, such integration has encountered many real-world challenges, such as: 1) how to design the overall data infrastructure to ensure effective, robust, and efficient integration; 2) how to avoid closed-loop hunting between simulated and emulated variables; 3) how to quantify system response times and minimize system delays; and 4) how to assess the overall integration quality. Lessons-learned using the examples of an AHU-VAV system, an air-source heat pump system, and a water-source heat pump system are presented.
△ Less
Submitted 5 February, 2023; v1 submitted 31 January, 2023;
originally announced January 2023.
-
Hölder Regularity of the $\bar\partial-$equation on the Polydisc
Authors:
Yu Jun Loo,
Alexander Tumanov
Abstract:
In this note, we show that the canonical solution operator to the $\bar\partial-$equation in the polydisc preserves Hölder regularity. It is a well-known fact that such solution operators do not improve Hölder regularity, and as such, our solution operator is optimal in this regard.
In this note, we show that the canonical solution operator to the $\bar\partial-$equation in the polydisc preserves Hölder regularity. It is a well-known fact that such solution operators do not improve Hölder regularity, and as such, our solution operator is optimal in this regard.
△ Less
Submitted 11 January, 2023;
originally announced January 2023.
-
CHARM: Composing Heterogeneous Accelerators for Matrix Multiply on Versal ACAP Architecture
Authors:
Jinming Zhuang,
Jason Lau,
Hanchen Ye,
Zhuoping Yang,
Yubo Du,
Jack Lo,
Kristof Denolf,
Stephen Neuendorffer,
Alex Jones,
Jingtong Hu,
Deming Chen,
Jason Cong,
Peipei Zhou
Abstract:
Dense matrix multiply (MM) serves as one of the most heavily used kernels in deep learning applications. To cope with the high computation demands of these applications, heterogeneous architectures featuring both FPGA and dedicated ASIC accelerators have emerged as promising platforms. For example, the AMD/Xilinx Versal ACAP architecture combines general-purpose CPU cores and programmable logic wi…
▽ More
Dense matrix multiply (MM) serves as one of the most heavily used kernels in deep learning applications. To cope with the high computation demands of these applications, heterogeneous architectures featuring both FPGA and dedicated ASIC accelerators have emerged as promising platforms. For example, the AMD/Xilinx Versal ACAP architecture combines general-purpose CPU cores and programmable logic with AI Engine processors optimized for AI/ML. With 400 AIEs, it provides up to 6.4 TFLOPs performance for 32-bit floating-point data. However, machine learning models often contain both large and small MM operations. While large MM operations can be parallelized efficiently across many cores, small MM operations typically cannot. We observe that executing some small MM layers from the BERT natural language processing model on a large, monolithic MM accelerator in Versal ACAP achieved less than 5% of the theoretical peak performance. Therefore, one key question arises: How can we design accelerators to fully use the abundant computation resources under limited communication bandwidth for applications with multiple MM layers of diverse sizes? We identify the biggest system throughput bottleneck resulting from the mismatch of massive computation resources of one monolithic accelerator and the various MM layers of small sizes in the application. To resolve this problem, we propose the CHARM framework to compose multiple diverse MM accelerator architectures working concurrently towards different layers in one application. We deploy the CHARM framework for four different applications, including BERT, ViT, NCF, MLP, on the AMD Versal ACAP VCK190 evaluation board. Our experiments show that we achieve 1.46 TFLOPs, 1.61 TFLOPs, 1.74 TFLOPs, and 2.94 TFLOPs inference throughput for BERT, ViT, NCF and MLP, which obtain 5.40x, 32.51x, 1.00x and 1.00x throughput gains compared to one monolithic accelerator.
△ Less
Submitted 5 January, 2023;
originally announced January 2023.
-
Mechanosensitive bonds induced complex cell motility patterns
Authors:
Jen-Yu Lo,
Yuan-Heng Tseng,
Hsuan-Yi Chen
Abstract:
The one-dimensional crawling movement of a cell is considered in this theoretical study. Our active gel model shows that for a cell with weakly mechanosensitive adhesion complexes, as myosin contractility increases, a cell starts to move at a constant velocity. As the mechanosensitivity of the adhesion complexes increases, a cell can exhibit stick-slip motion. Finally, a cell with highly mechanose…
▽ More
The one-dimensional crawling movement of a cell is considered in this theoretical study. Our active gel model shows that for a cell with weakly mechanosensitive adhesion complexes, as myosin contractility increases, a cell starts to move at a constant velocity. As the mechanosensitivity of the adhesion complexes increases, a cell can exhibit stick-slip motion. Finally, a cell with highly mechanosensitive adhesion complexes exhibits periodic back-and-forth migration. A simplified model which assumes that the cell crawling dynamics are controlled by the evolution of the myosin density dipole and the asymmetry of adhesion complex distribution captures the motility behaviors of crawling cells qualitatively. It suggests that the complex cell crawling behaviors observed in the experiments could result from the interplay between the distribution of contractile force and mechanosensitive bonds.
△ Less
Submitted 4 January, 2023;
originally announced January 2023.
-
Diffusion-Dominated Pinch-Off of Ultralow Surface Tension Fluids
Authors:
Jack Hau Yung Lo,
Yuan Liu,
Sze Yi Mak,
Zhuo Xu,
Youchuang Chao,
Kaye Jiale Li,
Ho Cheung Shum,
Lei Xu
Abstract:
We study the breakup of a liquid thread inside another liquid at different surface tensions. In general, the pinch-off of a liquid thread is governed by the dynamics of fluid flow. However, when the interfacial tension is ultralow (2 to 3 orders lower than normal liquids), we find that the pinch-off dynamics can be governed by bulk diffusion. By studying the velocity and the profile of the pinch-o…
▽ More
We study the breakup of a liquid thread inside another liquid at different surface tensions. In general, the pinch-off of a liquid thread is governed by the dynamics of fluid flow. However, when the interfacial tension is ultralow (2 to 3 orders lower than normal liquids), we find that the pinch-off dynamics can be governed by bulk diffusion. By studying the velocity and the profile of the pinch-off, we explain why the diffusion-dominated pinch-off takes over the conventional breakup at ultralow surface tensions.
△ Less
Submitted 28 November, 2022;
originally announced November 2022.
-
The role of drop shape in impact and splash
Authors:
Qingzhe Liu,
Jack Hau Yung Lo,
Ye Li,
Yuan Liu,
Jinyu Zhao,
Lei Xu
Abstract:
The impact and splash of liquid drops on solid substrates are ubiquitous in many important fields. However, previous studies have mainly focused on spherical drops while the non-spherical situations, such as raindrops, charged drops, oscillating drops, and drops affected by electromagnetic field, remain largely unexplored. Using ferrofluid, we realize various drop shapes and illustrate the fundame…
▽ More
The impact and splash of liquid drops on solid substrates are ubiquitous in many important fields. However, previous studies have mainly focused on spherical drops while the non-spherical situations, such as raindrops, charged drops, oscillating drops, and drops affected by electromagnetic field, remain largely unexplored. Using ferrofluid, we realize various drop shapes and illustrate the fundamental role of shape in impact and splash. Experiments show that different drop shapes produce large variations in spreading dynamics, splash onset, and splash amount. However, underlying all these variations we discover universal mechanisms across various drop shapes: the impact dynamics is governed by the superellipse model, the splash onset is triggered by the Kelvin-Helmholtz instability, and the amount of splash is determined by the energy dissipation before liquid taking off. Our study generalizes the drop impact research beyond the spherical geometry, and reveals the potential of using drop shape to control impact and splash.
△ Less
Submitted 28 November, 2022;
originally announced November 2022.
-
Fast and Efficient Malware Detection with Joint Static and Dynamic Features Through Transfer Learning
Authors:
Mao V. Ngo,
Tram Truong-Huu,
Dima Rabadi,
Jia Yi Loo,
Sin G. Teo
Abstract:
In malware detection, dynamic analysis extracts the runtime behavior of malware samples in a controlled environment and static analysis extracts features using reverse engineering tools. While the former faces the challenges of anti-virtualization and evasive behavior of malware samples, the latter faces the challenges of code obfuscation. To tackle these drawbacks, prior works proposed to develop…
▽ More
In malware detection, dynamic analysis extracts the runtime behavior of malware samples in a controlled environment and static analysis extracts features using reverse engineering tools. While the former faces the challenges of anti-virtualization and evasive behavior of malware samples, the latter faces the challenges of code obfuscation. To tackle these drawbacks, prior works proposed to develop detection models by aggregating dynamic and static features, thus leveraging the advantages of both approaches. However, simply concatenating dynamic and static features raises an issue of imbalanced contribution due to the heterogeneous dimensions of feature vectors to the performance of malware detection models. Yet, dynamic analysis is a time-consuming task and requires a secure environment, leading to detection delays and high costs for maintaining the analysis infrastructure. In this paper, we first introduce a method of constructing aggregated features via concatenating latent features learned through deep learning with equally-contributed dimensions. We then develop a knowledge distillation technique to transfer knowledge learned from aggregated features by a teacher model to a student model trained only on static features and use the trained student model for the detection of new malware samples. We carry out extensive experiments with a dataset of 86709 samples including both benign and malware samples. The experimental results show that the teacher model trained on aggregated features constructed by our method outperforms the state-of-the-art models with an improvement of up to 2.38% in detection accuracy. The distilled student model not only achieves high performance (97.81% in terms of accuracy) as that of the teacher model but also significantly reduces the detection time (from 70046.6 ms to 194.9 ms) without requiring dynamic analysis.
△ Less
Submitted 24 November, 2022;
originally announced November 2022.
-
An Iterative Method to Learn a Linear Control Barrier Function
Authors:
Zihao Liang,
Jason King Ching Lo
Abstract:
Control barrier function (CBF) has recently started to serve as a basis to develop approaches for enforcing safety requirements in control systems. However, constructing such function for a general system is a non-trivial task. This paper proposes an iterative, optimization-based framework to obtain a CBF from a given user-specified set for a general control affine system. Without losing generalit…
▽ More
Control barrier function (CBF) has recently started to serve as a basis to develop approaches for enforcing safety requirements in control systems. However, constructing such function for a general system is a non-trivial task. This paper proposes an iterative, optimization-based framework to obtain a CBF from a given user-specified set for a general control affine system. Without losing generality, we parameterize the CBF as a set of linear functions of states. By taking samples from the given user-specified set, we reformulate the problem of learning a CBF into an optimization problem that solves for linear function coefficients. The resulting linear functions construct the CBF and yield a safe set which has forward invariance property. In addition, the proposed framework explicitly addresses control input constraints during the construction of CBFs. Effectiveness of the proposed method is demonstrated by learning a CBF for an nonlinear Moore Greitzer jet engine, where the system trajectory is prevented from entering unsafe set.
△ Less
Submitted 17 November, 2022;
originally announced November 2022.
-
Intersection numbers on fibrations and Catalan numbers
Authors:
Rimma Hämäläinen,
Jason Lo,
Edward Morales
Abstract:
On an elliptic surface or threefold, Catalan numbers appear when one tries to compute the autoequivalence group action on the Bridgeland stability manifold. We explain why this happens by identifying a class of equations in the Chow ring of a fibration, where the solutions always involve Catalan numbers.
On an elliptic surface or threefold, Catalan numbers appear when one tries to compute the autoequivalence group action on the Bridgeland stability manifold. We explain why this happens by identifying a class of equations in the Chow ring of a fibration, where the solutions always involve Catalan numbers.
△ Less
Submitted 4 October, 2022;
originally announced October 2022.
-
Geometric stability conditions under autoequivalences and applications: Elliptic Surfaces
Authors:
Jason Lo,
Cristian Martinez
Abstract:
On a Weierstrass elliptic surface, we describe the action of the relative Fourier-Mukai transform on the geometric chamber of $\mathrm{Stab}(X)$, and in the K3 case we also study the action on one of its boundary components. Using new estimates for the Gieseker chamber we prove that Gieseker stability for polarizations on certain Friedman chamber is preserved by the derived dual of the relative Fo…
▽ More
On a Weierstrass elliptic surface, we describe the action of the relative Fourier-Mukai transform on the geometric chamber of $\mathrm{Stab}(X)$, and in the K3 case we also study the action on one of its boundary components. Using new estimates for the Gieseker chamber we prove that Gieseker stability for polarizations on certain Friedman chamber is preserved by the derived dual of the relative Fourier-Mukai transform. As an application of our description of the action, we also prove projectivity for some moduli spaces of Bridgeland semistable objects.
△ Less
Submitted 3 October, 2022;
originally announced October 2022.
-
Automated Assessment of Transthoracic Echocardiogram Image Quality Using Deep Neural Networks
Authors:
Robert B. Labs,
Apostolos Vrettos,
Jonathan Loo,
Massoud Zolgharni
Abstract:
Standard views in two-dimensional echocardiography are well established but the quality of acquired images are highly dependent on operator skills and are assessed subjectively. This study is aimed at providing an objective assessment pipeline for echocardiogram image quality by defining a new set of domain-specific quality indicators. Consequently, image quality assessment can thus be automated t…
▽ More
Standard views in two-dimensional echocardiography are well established but the quality of acquired images are highly dependent on operator skills and are assessed subjectively. This study is aimed at providing an objective assessment pipeline for echocardiogram image quality by defining a new set of domain-specific quality indicators. Consequently, image quality assessment can thus be automated to enhance clinical measurements, interpretation, and real-time optimization. We have developed deep neural networks for the automated assessment of echocardiographic frame which were randomly sampled from 11,262 adult patients. The private echocardiography dataset consists of 33,784 frames, previously acquired between 2010 and 2020. Deep learning approaches were used to extract the spatiotemporal features and the image quality indicators were evaluated against the mean absolute error. Our quality indicators encapsulate both anatomical and pathological elements to provide multivariate assessment scores for anatomical visibility, clarity, depth-gain and foreshortedness, respectively.
△ Less
Submitted 2 September, 2022;
originally announced September 2022.
-
Echocardiographic Image Quality Assessment Using Deep Neural Networks
Authors:
Robert B. Labs,
Massoud Zolgharni,
Jonathan P. Loo
Abstract:
Echocardiography image quality assessment is not a trivial issue in transthoracic examination. As the in vivo examination of heart structures gained prominence in cardiac diagnosis, it has been affirmed that accurate diagnosis of the left ventricle functions is hugely dependent on the quality of echo images. Up till now, visual assessment of echo images is highly subjective and requires specific d…
▽ More
Echocardiography image quality assessment is not a trivial issue in transthoracic examination. As the in vivo examination of heart structures gained prominence in cardiac diagnosis, it has been affirmed that accurate diagnosis of the left ventricle functions is hugely dependent on the quality of echo images. Up till now, visual assessment of echo images is highly subjective and requires specific definition under clinical pathologies. While poor-quality images impair quantifications and diagnosis, the inherent variations in echocardiographic image quality standards indicates the complexity faced among different observers and provides apparent evidence for incoherent assessment under clinical trials, especially with less experienced cardiologists. In this research, our aim was to analyse and define specific quality attributes mostly discussed by experts and present a fully trained convolutional neural network model for assessing such quality features objectively.
△ Less
Submitted 2 September, 2022;
originally announced September 2022.
-
A Non-parametric Bayesian Model for Detecting Differential Item Functioning: An Application to Political Representation in the US
Authors:
Yuki Shiraito,
James Lo,
Santiago Olivella
Abstract:
A common approach when studying the quality of representation involves comparing the latent preferences of voters and legislators, commonly obtained by fitting an item-response theory (IRT) model to a common set of stimuli. Despite being exposed to the same stimuli, voters and legislators may not share a common understanding of how these stimuli map onto their latent preferences, leading to differ…
▽ More
A common approach when studying the quality of representation involves comparing the latent preferences of voters and legislators, commonly obtained by fitting an item-response theory (IRT) model to a common set of stimuli. Despite being exposed to the same stimuli, voters and legislators may not share a common understanding of how these stimuli map onto their latent preferences, leading to differential item-functioning (DIF) and incomparability of estimates. We explore the presence of DIF and incomparability of latent preferences obtained through IRT models by re-analyzing an influential survey data set, where survey respondents expressed their preferences on roll call votes that U.S. legislators had previously voted on. To do so, we propose defining a Dirichlet Process prior over item-response functions in standard IRT models. In contrast to typical multi-step approaches to detecting DIF, our strategy allows researchers to fit a single model, automatically identifying incomparable sub-groups with different mappings from latent traits onto observed responses. We find that although there is a group of voters whose estimated positions can be safely compared to those of legislators, a sizeable share of surveyed voters understand stimuli in fundamentally different ways. Ignoring these issues can lead to incorrect conclusions about the quality of representation.
△ Less
Submitted 14 November, 2022; v1 submitted 12 May, 2022;
originally announced May 2022.
-
Optimal Hölder Regularity of Solution Operator to the $\bar\partial$-equation on Product Domains
Authors:
Yu Jun Loo
Abstract:
This note seeks to prove the existence of a canonical solution operator to the $\bar\partial$-equation that preserves Hölder regularity on product domains. It is a well known fact that such solution operators do not in general gain Hölder regularity, and as such, our solution operator is optimal in this regard.
This note seeks to prove the existence of a canonical solution operator to the $\bar\partial$-equation that preserves Hölder regularity on product domains. It is a well known fact that such solution operators do not in general gain Hölder regularity, and as such, our solution operator is optimal in this regard.
△ Less
Submitted 24 April, 2022; v1 submitted 15 April, 2022;
originally announced April 2022.
-
Virtual vs. Reality: External Validation of COVID-19 Classifiers using XCAT Phantoms for Chest Computed Tomography
Authors:
Fakrul Islam Tushar,
Ehsan Abadi,
Saman Sotoudeh-Paima,
Rafael B. Fricks,
Maciej A. Mazurowski,
W. Paul Segars,
Ehsan Samei,
Joseph Y. Lo
Abstract:
Research studies of artificial intelligence models in medical imaging have been hampered by poor generalization. This problem has been especially concerning over the last year with numerous applications of deep learning for COVID-19 diagnosis. Virtual imaging trials (VITs) could provide a solution for objective evaluation of these models. In this work utilizing the VITs, we created the CVIT-COVID…
▽ More
Research studies of artificial intelligence models in medical imaging have been hampered by poor generalization. This problem has been especially concerning over the last year with numerous applications of deep learning for COVID-19 diagnosis. Virtual imaging trials (VITs) could provide a solution for objective evaluation of these models. In this work utilizing the VITs, we created the CVIT-COVID dataset including 180 virtually imaged computed tomography (CT) images from simulated COVID-19 and normal phantom models under different COVID-19 morphology and imaging properties. We evaluated the performance of an open-source, deep-learning model from the University of Waterloo trained with multi-institutional data and an in-house model trained with the open clinical dataset called MosMed. We further validated the model's performance against open clinical data of 305 CT images to understand virtual vs. real clinical data performance. The open-source model was published with nearly perfect performance on the original Waterloo dataset but showed a consistent performance drop in external testing on another clinical dataset (AUC=0.77) and our simulated CVIT-COVID dataset (AUC=0.55). The in-house model achieved an AUC of 0.87 while testing on the internal test set (MosMed test set). However, performance dropped to an AUC of 0.65 and 0.69 when evaluated on clinical and our simulated CVIT-COVID dataset. The VIT framework offered control over imaging conditions, allowing us to show there was no change in performance as CT exposure was changed from 28.5 to 57 mAs. The VIT framework also provided voxel-level ground truth, revealing that performance of in-house model was much higher at AUC=0.87 for diffuse COVID-19 infection size >2.65% lung volume versus AUC=0.52 for focal disease with <2.65% volume. The virtual imaging framework enabled these uniquely rigorous analyses of model performance.
△ Less
Submitted 6 March, 2022;
originally announced March 2022.