Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–49 of 49 results for author: Feng, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00031  [pdf, other

    cs.DC cs.SE

    Supercharging Federated Learning with Flower and NVIDIA FLARE

    Authors: Holger R. Roth, Daniel J. Beutel, Yan Cheng, Javier Fernandez Marques, Heng Pan, Chester Chen, Zhihong Zhang, Yuhong Wen, Sean Yang, Isaac, Yang, Yuan-Ting Hsieh, Ziyue Xu, Daguang Xu, Nicholas D. Lane, Andrew Feng

    Abstract: Several open-source systems, such as Flower and NVIDIA FLARE, have been developed in recent years while focusing on different aspects of federated learning (FL). Flower is dedicated to implementing a cohesive approach to FL, analytics, and evaluation. Over time, Flower has cultivated extensive strategies and algorithms tailored for FL application development, fostering a vibrant FL community in re… ▽ More

    Submitted 21 May, 2024; originally announced July 2024.

  2. arXiv:2406.12072  [pdf, other

    cs.AI cs.LG

    DTGB: A Comprehensive Benchmark for Dynamic Text-Attributed Graphs

    Authors: Jiasheng Zhang, Jialin Chen, Menglin Yang, Aosong Feng, Shuang Liang, Jie Shao, Rex Ying

    Abstract: Dynamic text-attributed graphs (DyTAGs) are prevalent in various real-world scenarios, where each node and edge are associated with text descriptions, and both the graph structure and text descriptions evolve over time. Despite their broad applicability, there is a notable scarcity of benchmark datasets tailored to DyTAGs, which hinders the potential advancement in many research fields. To address… ▽ More

    Submitted 18 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: 28 pages, 13 figures

  3. CMDBench: A Benchmark for Coarse-to-fine Multimodal Data Discovery in Compound AI Systems

    Authors: Yanlin Feng, Sajjadur Rahman, Aaron Feng, Vincent Chen, Eser Kandogan

    Abstract: Compound AI systems (CASs) that employ LLMs as agents to accomplish knowledge-intensive tasks via interactions with tools and data retrievers have garnered significant interest within database and AI communities. While these systems have the potential to supplement typical analysis workflows of data analysts in enterprise data platforms, unfortunately, CASs are subject to the same data discovery c… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: Governance, Understanding and Integration of Data for Effective and Responsible AI (GUIDE-AI '24), June 14, 2024, Santiago, AA, Chile

  4. arXiv:2405.12369  [pdf, other

    cs.CV

    AtomGS: Atomizing Gaussian Splatting for High-Fidelity Radiance Field

    Authors: Rong Liu, Rui Xu, Yue Hu, Meida Chen, Andrew Feng

    Abstract: 3D Gaussian Splatting (3DGS) has recently advanced radiance field reconstruction by offering superior capabilities for novel view synthesis and real-time rendering speed. However, its strategy of blending optimization and adaptive density control might lead to sub-optimal results; it can sometimes yield noisy geometry and blurry artifacts due to prioritizing optimizing large Gaussians at the cost… ▽ More

    Submitted 22 May, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

  5. arXiv:2404.01340  [pdf, other

    cs.LG cs.AI

    From Similarity to Superiority: Channel Clustering for Time Series Forecasting

    Authors: Jialin Chen, Jan Eric Lenssen, Aosong Feng, Weihua Hu, Matthias Fey, Leandros Tassiulas, Jure Leskovec, Rex Ying

    Abstract: Time series forecasting has attracted significant attention in recent decades. Previous studies have demonstrated that the Channel-Independent (CI) strategy improves forecasting performance by treating different channels individually, while it leads to poor generalization on unseen instances and ignores potentially necessary interactions between channels. Conversely, the Channel-Dependent (CD) str… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: 20 pages, 6 figures

  6. arXiv:2403.10585  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    Solving General Noisy Inverse Problem via Posterior Sampling: A Policy Gradient Viewpoint

    Authors: Haoyue Tang, Tian Xie, Aosong Feng, Hanyu Wang, Chenyang Zhang, Yang Bai

    Abstract: Solving image inverse problems (e.g., super-resolution and inpainting) requires generating a high fidelity image that matches the given input (the low-resolution image or the masked image). By using the input image as guidance, we can leverage a pretrained diffusion generative model to solve a wide range of image inverse tasks without task specific model fine-tuning. To precisely estimate the guid… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: Accepted and to Appear, AISTATS 2024

  7. arXiv:2403.04882  [pdf, other

    cs.LG

    Efficient High-Resolution Time Series Classification via Attention Kronecker Decomposition

    Authors: Aosong Feng, Jialin Chen, Juan Garza, Brooklyn Berry, Francisco Salazar, Yifeng Gao, Rex Ying, Leandros Tassiulas

    Abstract: The high-resolution time series classification problem is essential due to the increasing availability of detailed temporal data in various domains. To tackle this challenge effectively, it is imperative that the state-of-the-art attention model is scalable to accommodate the growing sequence lengths typically encountered in high-resolution time series data, while also demonstrating robustness in… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  8. arXiv:2403.04880  [pdf, other

    cs.CV

    An Item is Worth a Prompt: Versatile Image Editing with Disentangled Control

    Authors: Aosong Feng, Weikang Qiu, Jinbin Bai, Xiao Zhang, Zhen Dong, Kaicheng Zhou, Rex Ying, Leandros Tassiulas

    Abstract: Building on the success of text-to-image diffusion models (DPMs), image editing is an important application to enable human interaction with AI-generated content. Among various editing methods, editing within the prompt space gains more attention due to its capacity and simplicity of controlling semantics. However, since diffusion models are commonly pretrained on descriptive text captions, direct… ▽ More

    Submitted 28 May, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

  9. arXiv:2402.14293  [pdf, other

    cs.CL

    Leveraging Large Language Models for Concept Graph Recovery and Question Answering in NLP Education

    Authors: Rui Yang, Boming Yang, Sixun Ouyang, Tianwei She, Aosong Feng, Yuang Jiang, Freddy Lecue, Jinghui Lu, Irene Li

    Abstract: In the domain of Natural Language Processing (NLP), Large Language Models (LLMs) have demonstrated promise in text-generation tasks. However, their educational applications, particularly for domain-specific queries, remain underexplored. This study investigates LLMs' capabilities in educational scenarios, focusing on concept graph recovery and question-answering (QA). We assess LLMs' zero-shot per… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  10. arXiv:2402.07792  [pdf, other

    cs.LG cs.DC

    Empowering Federated Learning for Massive Models with NVIDIA FLARE

    Authors: Holger R. Roth, Ziyue Xu, Yuan-Ting Hsieh, Adithya Renduchintala, Isaac Yang, Zhihong Zhang, Yuhong Wen, Sean Yang, Kevin Lu, Kristopher Kersten, Camir Ricketts, Daguang Xu, Chester Chen, Yan Cheng, Andrew Feng

    Abstract: In the ever-evolving landscape of artificial intelligence (AI) and large language models (LLMs), handling and leveraging data effectively has become a critical challenge. Most state-of-the-art machine learning algorithms are data-centric. However, as the lifeblood of model performance, necessary data cannot always be centralized due to various factors such as privacy, regulation, geopolitics, copy… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

  11. arXiv:2310.16002  [pdf, other

    cs.CV

    Integrating View Conditions for Image Synthesis

    Authors: Jinbin Bai, Zhen Dong, Aosong Feng, Xiao Zhang, Tian Ye, Kaicheng Zhou

    Abstract: In the field of image processing, applying intricate semantic modifications within existing images remains an enduring challenge. This paper introduces a pioneering framework that integrates viewpoint information to enhance the control of image editing tasks, especially for interior design scenes. By surveying existing object editing methodologies, we distill three essential criteria -- consistenc… ▽ More

    Submitted 8 May, 2024; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: Accepted by IJCAI 2024

  12. arXiv:2309.10011  [pdf, other

    cs.CV eess.IV

    Instant Photorealistic Style Transfer: A Lightweight and Adaptive Approach

    Authors: Rong Liu, Enyu Zhao, Zhiyuan Liu, Andrew Feng, Scott John Easley

    Abstract: In this paper, we propose an Instant Photorealistic Style Transfer (IPST) approach, designed to achieve instant photorealistic style transfer on super-resolution inputs without the need for pre-training on pair-wise datasets or imposing extra constraints. Our method utilizes a lightweight StyleNet to enable style transfer from a style image to a content image while preserving non-color information… ▽ More

    Submitted 20 October, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: 8 pages (reference excluded), 6 figures, 4 tables

  13. arXiv:2308.13420  [pdf, other

    cs.NE cs.AI cs.LG

    Reinforcement Learning-assisted Evolutionary Algorithm: A Survey and Research Opportunities

    Authors: Yanjie Song, Yutong Wu, Yangyang Guo, Ran Yan, P. N. Suganthan, Yue Zhang, Witold Pedrycz, Swagatam Das, Rammohan Mallipeddi, Oladayo Solomon Ajani. Qiang Feng

    Abstract: Evolutionary algorithms (EA), a class of stochastic search methods based on the principles of natural evolution, have received widespread acclaim for their exceptional performance in various real-world optimization problems. While researchers worldwide have proposed a wide variety of EAs, certain limitations remain, such as slow convergence speed and poor generalization capabilities. Consequently,… ▽ More

    Submitted 27 January, 2024; v1 submitted 25 August, 2023; originally announced August 2023.

    Comments: 28 pages, 16 figures

    Report number: SWEVO-S-2023-00771

  14. arXiv:2307.15208  [pdf, other

    eess.IV cs.CV

    Generative AI for Medical Imaging: extending the MONAI Framework

    Authors: Walter H. L. Pinaya, Mark S. Graham, Eric Kerfoot, Petru-Daniel Tudosiu, Jessica Dafflon, Virginia Fernandez, Pedro Sanchez, Julia Wolleb, Pedro F. da Costa, Ashay Patel, Hyungjin Chung, Can Zhao, Wei Peng, Zelong Liu, Xueyan Mei, Oeslle Lucena, Jong Chul Ye, Sotirios A. Tsaftaris, Prerna Dogra, Andrew Feng, Marc Modat, Parashkev Nachev, Sebastien Ourselin, M. Jorge Cardoso

    Abstract: Recent advances in generative AI have brought incredible breakthroughs in several areas, including medical imaging. These generative models have tremendous potential not only to help safely share medical data via synthetic datasets but also to perform an array of diverse applications, such as anomaly detection, image-to-image translation, denoising, and MRI reconstruction. However, due to the comp… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

  15. arXiv:2307.13560  [pdf, other

    cs.CL

    XDLM: Cross-lingual Diffusion Language Model for Machine Translation

    Authors: Linyao Chen, Aosong Feng, Boming Yang, Zihui Li

    Abstract: Recently, diffusion models have excelled in image generation tasks and have also been applied to neural language processing (NLP) for controllable text generation. However, the application of diffusion models in a cross-lingual setting is less unexplored. Additionally, while pretraining with diffusion models has been studied within a single language, the potential of cross-lingual pretraining rema… ▽ More

    Submitted 30 July, 2023; v1 submitted 25 July, 2023; originally announced July 2023.

  16. arXiv:2307.05780  [pdf

    cs.CV

    Automated Artifact Detection in Ultra-widefield Fundus Photography of Patients with Sickle Cell Disease

    Authors: Anqi Feng, Dimitri Johnson, Grace R. Reilly, Loka Thangamathesvaran, Ann Nampomba, Mathias Unberath, Adrienne W. Scott, Craig Jones

    Abstract: Importance: Ultra-widefield fundus photography (UWF-FP) has shown utility in sickle cell retinopathy screening; however, image artifact may diminish quality and gradeability of images. Objective: To create an automated algorithm for UWF-FP artifact classification. Design: A neural network based automated artifact detection algorithm was designed to identify commonly encountered UWF-FP artifacts in… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

  17. arXiv:2305.10655  [pdf, other

    eess.IV cs.CV cs.LG

    DeepEdit: Deep Editable Learning for Interactive Segmentation of 3D Medical Images

    Authors: Andres Diaz-Pinto, Pritesh Mehta, Sachidanand Alle, Muhammad Asad, Richard Brown, Vishwesh Nath, Alvin Ihsani, Michela Antonelli, Daniel Palkovics, Csaba Pinter, Ron Alkalay, Steve Pieper, Holger R. Roth, Daguang Xu, Prerna Dogra, Tom Vercauteren, Andrew Feng, Abood Quraini, Sebastien Ourselin, M. Jorge Cardoso

    Abstract: Automatic segmentation of medical images is a key step for diagnostic and interventional tasks. However, achieving this requires large amounts of annotated volumes, which can be tedious and time-consuming task for expert annotators. In this paper, we introduce DeepEdit, a deep learning-based method for volumetric medical image annotation, that allows automatic and semi-automatic segmentation, and… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

  18. arXiv:2305.03319  [pdf, other

    cs.CL

    HiPool: Modeling Long Documents Using Graph Neural Networks

    Authors: Irene Li, Aosong Feng, Dragomir Radev, Rex Ying

    Abstract: Encoding long sequences in Natural Language Processing (NLP) is a challenging problem. Though recent pretraining language models achieve satisfying performances in many NLP tasks, they are still restricted by a pre-defined maximum length, making them challenging to be extended to longer sequences. So some recent works utilize hierarchies to model long sequences. However, most of them apply sequent… ▽ More

    Submitted 14 May, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

    Journal ref: ACL 2023 main proceedings

  19. FIREBALL: A Dataset of Dungeons and Dragons Actual-Play with Structured Game State Information

    Authors: Andrew Zhu, Karmanya Aggarwal, Alexander Feng, Lara J. Martin, Chris Callison-Burch

    Abstract: Dungeons & Dragons (D&D) is a tabletop roleplaying game with complex natural language interactions between players and hidden state information. Recent work has shown that large language models (LLMs) that have access to state information can generate higher quality game turns than LLMs that use dialog history alone. However, previous work used game state information that was heuristically created… ▽ More

    Submitted 25 May, 2023; v1 submitted 2 May, 2023; originally announced May 2023.

    Comments: 21 pages, 2 figures. Accepted at ACL 2023

    Journal ref: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023, pp. 4171-4193

  20. arXiv:2303.12822  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    Co-Speech Gesture Synthesis using Discrete Gesture Token Learning

    Authors: Shuhong Lu, Youngwoo Yoon, Andrew Feng

    Abstract: Synthesizing realistic co-speech gestures is an important and yet unsolved problem for creating believable motions that can drive a humanoid robot to interact and communicate with human users. Such capability will improve the impressions of the robots by human users and will find applications in education, training, and medical services. One challenge in learning the co-speech gesture model is tha… ▽ More

    Submitted 3 March, 2023; originally announced March 2023.

    Comments: 8 pages, 3 figures, 3 tables

  21. Author as Character and Narrator: Deconstructing Personal Narratives from the r/AmITheAsshole Reddit Community

    Authors: Salvatore Giorgi, Ke Zhao, Alexander H. Feng, Lara J. Martin

    Abstract: In the r/AmITheAsshole subreddit, people anonymously share first person narratives that contain some moral dilemma or conflict and ask the community to judge who is at fault (i.e., who is "the asshole"). In general, first person narratives are a unique storytelling domain where the author is the narrator (the person telling the story) but can also be a character (the person living the story) and,… ▽ More

    Submitted 15 March, 2023; v1 submitted 19 January, 2023; originally announced January 2023.

    Comments: Accepted to the 17th International AAAI Conference on Web and Social Media (ICWSM), 2023

    Journal ref: Proceedings of the International AAAI Conference on Web and Social Media (ICWSM) 2023, 17(1), 233-244

  22. arXiv:2301.06114  [pdf, other

    eess.IV cs.LG

    Segmenting thalamic nuclei from manifold projections of multi-contrast MRI

    Authors: Chang Yan, Muhan Shao, Zhangxing Bian, Anqi Feng, Yuan Xue, Jiachen Zhuo, Rao P. Gullapalli, Aaron Carass, Jerry L. Prince

    Abstract: The thalamus is a subcortical gray matter structure that plays a key role in relaying sensory and motor signals within the brain. Its nuclei can atrophy or otherwise be affected by neurological disease and injuries including mild traumatic brain injury. Segmenting both the thalamus and its nuclei is challenging because of the relatively low contrast within and around the thalamus in conventional m… ▽ More

    Submitted 31 January, 2023; v1 submitted 15 January, 2023; originally announced January 2023.

    Comments: 8 pages, 3 figures, 2023 SPIE-MI Image Processing

  23. arXiv:2211.02701  [pdf, other

    cs.LG cs.AI cs.CV

    MONAI: An open-source framework for deep learning in healthcare

    Authors: M. Jorge Cardoso, Wenqi Li, Richard Brown, Nic Ma, Eric Kerfoot, Yiheng Wang, Benjamin Murrey, Andriy Myronenko, Can Zhao, Dong Yang, Vishwesh Nath, Yufan He, Ziyue Xu, Ali Hatamizadeh, Andriy Myronenko, Wentao Zhu, Yun Liu, Mingxin Zheng, Yucheng Tang, Isaac Yang, Michael Zephyr, Behrooz Hashemian, Sachidanand Alle, Mohammad Zalbagi Darestani, Charlie Budd , et al. (32 additional authors not shown)

    Abstract: Artificial Intelligence (AI) is having a tremendous impact across most areas of science. Applications of AI in healthcare have the potential to improve our ability to detect, diagnose, prognose, and intervene on human disease. For AI models to be used clinically, they need to be made safe, reproducible and robust, and the underlying software framework must be aware of the particularities (e.g. geo… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

    Comments: www.monai.io

  24. arXiv:2210.13291  [pdf, other

    cs.LG cs.AI cs.CV cs.NI cs.SE

    NVIDIA FLARE: Federated Learning from Simulation to Real-World

    Authors: Holger R. Roth, Yan Cheng, Yuhong Wen, Isaac Yang, Ziyue Xu, Yuan-Ting Hsieh, Kristopher Kersten, Ahmed Harouni, Can Zhao, Kevin Lu, Zhihong Zhang, Wenqi Li, Andriy Myronenko, Dong Yang, Sean Yang, Nicola Rieke, Abood Quraini, Chester Chen, Daguang Xu, Nic Ma, Prerna Dogra, Mona Flores, Andrew Feng

    Abstract: Federated learning (FL) enables building robust and generalizable AI models by leveraging diverse datasets from multiple collaborators without centralizing the data. We created NVIDIA FLARE as an open-source software development kit (SDK) to make it easier for data scientists to use FL in their research and real-world applications. The SDK includes solutions for state-of-the-art FL algorithms and… ▽ More

    Submitted 28 April, 2023; v1 submitted 24 October, 2022; originally announced October 2022.

    Comments: Accepted at the International Workshop on Federated Learning, NeurIPS 2022, New Orleans, USA (https://federated-learning.org/fl-neurips-2022); Revised version v2: added Key Components list, system metrics for homomorphic encryption experiment; Extended v3 for journal submission

    Journal ref: IEEE Data Eng. Bull., Vol. 46, No. 1, 2023

  25. arXiv:2210.11794  [pdf, other

    cs.LG cs.CL

    Diffuser: Efficient Transformers with Multi-hop Attention Diffusion for Long Sequences

    Authors: Aosong Feng, Irene Li, Yuang Jiang, Rex Ying

    Abstract: Efficient Transformers have been developed for long sequence modeling, due to their subquadratic memory and time complexity. Sparse Transformer is a popular approach to improving the efficiency of Transformers by restricting self-attention to locations specified by the predefined sparse patterns. However, leveraging sparsity may sacrifice expressiveness compared to full-attention, when important t… ▽ More

    Submitted 31 January, 2023; v1 submitted 21 October, 2022; originally announced October 2022.

  26. arXiv:2210.03534  [pdf, other

    cs.NI

    A Quantitative Theory of Bottleneck Structures for Data Networks

    Authors: Jordi Ros-Giralt, Noah Amsel, Sruthi Yellamraju, James Ezick, Richard Lethin, Yuang Jiang, Aosong Feng, Leandros Tassiulas

    Abstract: The conventional view of the congestion control problem in data networks is based on the principle that a flow's performance is uniquely determined by the state of its bottleneck link, regardless of the topological properties of the network. However, recent work has shown that the behavior of congestion-controlled networks is better explained by models that account for the interactions between bot… ▽ More

    Submitted 6 October, 2022; originally announced October 2022.

  27. Detecting Environmental Violations with Satellite Imagery in Near Real Time: Land Application under the Clean Water Act

    Authors: Ben Chugg, Nicolas Rothbacher, Alex Feng, Xiaoqi Long, Daniel E. Ho

    Abstract: This paper introduces a new, highly consequential setting for the use of computer vision for environmental sustainability. Concentrated Animal Feeding Operations (CAFOs) (aka intensive livestock farms or "factory farms") produce significant manure and pollution. Dumping manure in the winter months poses significant environmental risks and violates environmental law in many states. Yet the federal… ▽ More

    Submitted 18 August, 2022; originally announced August 2022.

    Comments: Accepted to CIKM '22

  28. arXiv:2207.05064  [pdf, other

    cs.LG cs.AI

    Adaptive Graph Spatial-Temporal Transformer Network for Traffic Flow Forecasting

    Authors: Aosong Feng, Leandros Tassiulas

    Abstract: Traffic flow forecasting on graphs has real-world applications in many fields, such as transportation system and computer networks. Traffic forecasting can be highly challenging due to complex spatial-temporal correlations and non-linear traffic patterns. Existing works mostly model such spatial-temporal dependencies by considering spatial correlations and temporal correlations separately and fail… ▽ More

    Submitted 9 July, 2022; originally announced July 2022.

  29. arXiv:2203.12362  [pdf, other

    cs.HC cs.CV cs.LG eess.IV

    MONAI Label: A framework for AI-assisted Interactive Labeling of 3D Medical Images

    Authors: Andres Diaz-Pinto, Sachidanand Alle, Vishwesh Nath, Yucheng Tang, Alvin Ihsani, Muhammad Asad, Fernando Pérez-García, Pritesh Mehta, Wenqi Li, Mona Flores, Holger R. Roth, Tom Vercauteren, Daguang Xu, Prerna Dogra, Sebastien Ourselin, Andrew Feng, M. Jorge Cardoso

    Abstract: The lack of annotated datasets is a major bottleneck for training new task-specific supervised machine learning models, considering that manual annotation is extremely expensive and time-consuming. To address this problem, we present MONAI Label, a free and open-source framework that facilitates the development of applications based on artificial intelligence (AI) models that aim at reducing the t… ▽ More

    Submitted 28 April, 2023; v1 submitted 23 March, 2022; originally announced March 2022.

  30. arXiv:2203.09065  [pdf, other

    cs.CV

    STPLS3D: A Large-Scale Synthetic and Real Aerial Photogrammetry 3D Point Cloud Dataset

    Authors: Meida Chen, Qingyong Hu, Zifan Yu, Hugues Thomas, Andrew Feng, Yu Hou, Kyle McCullough, Fengbo Ren, Lucio Soibelman

    Abstract: Although various 3D datasets with different functions and scales have been proposed recently, it remains challenging for individuals to complete the whole pipeline of large-scale data collection, sanitization, and annotation. Moreover, the created datasets usually suffer from extremely imbalanced class distribution or partial low-quality data samples. Motivated by this, we explore the procedurally… ▽ More

    Submitted 13 October, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

    Report number: https://bmvc2022.mpi-inf.mpg.de/0429.pdf

    Journal ref: https://bmvc2022.mpi-inf.mpg.de/0429.pdf

  31. arXiv:2202.06924  [pdf, other

    cs.LG cs.CR cs.CV cs.DC

    Do Gradient Inversion Attacks Make Federated Learning Unsafe?

    Authors: Ali Hatamizadeh, Hongxu Yin, Pavlo Molchanov, Andriy Myronenko, Wenqi Li, Prerna Dogra, Andrew Feng, Mona G. Flores, Jan Kautz, Daguang Xu, Holger R. Roth

    Abstract: Federated learning (FL) allows the collaborative training of AI models without needing to share raw data. This capability makes it especially interesting for healthcare applications where patient and data privacy is of utmost concern. However, recent works on the inversion of deep neural networks from model gradients raised concerns about the security of FL in preventing the leakage of training da… ▽ More

    Submitted 30 January, 2023; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: Revised version; Accepted to IEEE Transactions on Medical Imaging; Improved and reformatted version of https://www.researchsquare.com/article/rs-1147182/v2; Added NVFlare reference

  32. arXiv:2201.00491  [pdf, other

    cs.LG cs.AI

    KerGNNs: Interpretable Graph Neural Networks with Graph Kernels

    Authors: Aosong Feng, Chenyu You, Shiqiang Wang, Leandros Tassiulas

    Abstract: Graph kernels are historically the most widely-used technique for graph classification tasks. However, these methods suffer from limited performance because of the hand-crafted combinatorial features of graphs. In recent years, graph neural networks (GNNs) have become the state-of-the-art method in downstream graph-related tasks due to their superior performance. Most GNNs are based on Message Pas… ▽ More

    Submitted 25 February, 2022; v1 submitted 3 January, 2022; originally announced January 2022.

  33. arXiv:2110.15327  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    MEGAN: Memory Enhanced Graph Attention Network for Space-Time Video Super-Resolution

    Authors: Chenyu You, Lianyi Han, Aosong Feng, Ruihan Zhao, Hui Tang, Wei Fan

    Abstract: Space-time video super-resolution (STVSR) aims to construct a high space-time resolution video sequence from the corresponding low-frame-rate, low-resolution video sequence. Inspired by the recent success to consider spatial-temporal information for space-time super-resolution, our main goal in this work is to take full considerations of spatial and temporal correlations within the video sequences… ▽ More

    Submitted 29 November, 2021; v1 submitted 28 October, 2021; originally announced October 2021.

  34. arXiv:2109.12221  [pdf

    cs.CV

    Ground material classification for UAV-based photogrammetric 3D data A 2D-3D Hybrid Approach

    Authors: Meida Chen, Andrew Feng, Yu Hou, Kyle McCullough, Pratusha Bhuvana Prasad, Lucio Soibelman

    Abstract: In recent years, photogrammetry has been widely used in many areas to create photorealistic 3D virtual data representing the physical environment. The innovation of small unmanned aerial vehicles (sUAVs) has provided additional high-resolution imaging capabilities with low cost for mapping a relatively large area of interest. These cutting-edge technologies have caught the US Army and Navy's atten… ▽ More

    Submitted 15 October, 2022; v1 submitted 24 September, 2021; originally announced September 2021.

    Journal ref: Interservice/Industry Training, Simulation, and Education Conference (I/ITSEC) 2021

  35. arXiv:2103.14620  [pdf, other

    cs.CL

    LiGCN: Label-interpretable Graph Convolutional Networks for Multi-label Text Classification

    Authors: Irene Li, Aosong Feng, Hao Wu, Tianxiao Li, Toyotaro Suzumura, Ruihai Dong

    Abstract: Multi-label text classification (MLTC) is an attractive and challenging task in natural language processing (NLP). Compared with single-label text classification, MLTC has a wider range of applications in practice. In this paper, we propose a label-interpretable graph convolutional network model to solve the MLTC problem by modeling tokens and labels as nodes in a heterogeneous graph. In this way,… ▽ More

    Submitted 22 May, 2022; v1 submitted 26 March, 2021; originally announced March 2021.

    Comments: 8 tables, 3 figures

    Journal ref: DLG4NLP Workshop, NAACL 2022

  36. arXiv:2009.00185  [pdf

    cs.CV

    Utilizing Satellite Imagery Datasets and Machine Learning Data Models to Evaluate Infrastructure Change in Undeveloped Regions

    Authors: Kyle McCullough, Andrew Feng, Meida Chen, Ryan McAlinden

    Abstract: In the globalized economic world, it has become important to understand the purpose behind infrastructural and construction initiatives occurring within developing regions of the earth. This is critical when the financing for such projects must be coming from external sources, as is occurring throughout major portions of the African continent. When it comes to imagery analysis to research these re… ▽ More

    Submitted 31 August, 2020; originally announced September 2020.

    Journal ref: Interservice/Industry Training, Simulation, and Education Conference (I/ITSEC) 2020

  37. arXiv:2008.09648  [pdf

    cs.CV

    Semantic Segmentation and Data Fusion of Microsoft Bing 3D Cities and Small UAV-based Photogrammetric Data

    Authors: Meida Chen, Andrew Feng, Kyle McCullough, Pratusha Bhuvana Prasad, Ryan McAlinden, Lucio Soibelman

    Abstract: With state-of-the-art sensing and photogrammetric techniques, Microsoft Bing Maps team has created over 125 highly detailed 3D cities from 11 different countries that cover hundreds of thousands of square kilometer areas. The 3D city models were created using the photogrammetric technique with high-resolution images that were captured from aircraft-mounted cameras. Such a large 3D city database ha… ▽ More

    Submitted 21 August, 2020; originally announced August 2020.

    Journal ref: Interservice/Industry Training, Simulation, and Education Conference (I/ITSEC) 2020

  38. arXiv:2008.09647  [pdf

    cs.CV

    Generating synthetic photogrammetric data for training deep learning based 3D point cloud segmentation models

    Authors: Meida Chen, Andrew Feng, Kyle McCullough, Pratusha Bhuvana Prasad, Ryan McAlinden, Lucio Soibelman

    Abstract: At I/ITSEC 2019, the authors presented a fully-automated workflow to segment 3D photogrammetric point-clouds/meshes and extract object information, including individual tree locations and ground materials (Chen et al., 2019). The ultimate goal is to create realistic virtual environments and provide the necessary information for simulation. We tested the generalizability of the previously proposed… ▽ More

    Submitted 21 August, 2020; originally announced August 2020.

    Journal ref: Interservice/Industry Training, Simulation, and Education Conference (I/ITSEC) 2020

  39. arXiv:2008.03697  [pdf

    cs.CV

    Fully Automated Photogrammetric Data Segmentation and Object Information Extraction Approach for Creating Simulation Terrain

    Authors: Meida Chen, Andrew Feng, Kyle McCullough, Pratusha Bhuvana Prasad, Ryan McAlinden, Lucio Soibelman, Mike Enloe

    Abstract: Our previous works have demonstrated that visually realistic 3D meshes can be automatically reconstructed with low-cost, off-the-shelf unmanned aerial systems (UAS) equipped with capable cameras, and efficient photogrammetric software techniques. However, such generated data do not contain semantic information/features of objects (i.e., man-made objects, vegetation, ground, object materials, etc.)… ▽ More

    Submitted 9 August, 2020; originally announced August 2020.

    Journal ref: Interservice/Industry Training, Simulation, and Education Conference (I/ITSEC) 2019

  40. arXiv:2003.13968  [pdf, other

    cs.DB cs.IR

    Towards Productionizing Subjective Search Systems

    Authors: Aaron Feng, Shuwei Chen, Yuliang Li, Hiroshi Matsuda, Hidekazu Tamaki, Wang-Chiew Tan

    Abstract: Existing e-commerce search engines typically support search only over objective attributes, such as price and locations, leaving the more desirable subjective attributes, such as romantic vibe and worklife balance unsearchable. We found that this is also the case for Recruit Group, which operates a wide range of online booking and search services, including jobs, travel, housing, bridal, dining, b… ▽ More

    Submitted 31 March, 2020; originally announced March 2020.

    Comments: In Submission to VLDB 2020

  41. arXiv:2003.01204  [pdf

    cs.CV cs.LG

    Energy-efficient and Robust Cumulative Training with Net2Net Transformation

    Authors: Aosong Feng, Priyadarshini Panda

    Abstract: Deep learning has achieved state-of-the-art accuracies on several computer vision tasks. However, the computational and energy requirements associated with training such deep neural networks can be quite high. In this paper, we propose a cumulative training strategy with Net2Net transformation that achieves training computational efficiency without incurring large accuracy loss, in comparison to a… ▽ More

    Submitted 2 March, 2020; originally announced March 2020.

    Comments: 6 pages, 6 figures, 2 Tables

  42. arXiv:1912.00406  [pdf, ps, other

    cs.IT

    Reconsidering Design of Multi-Antenna NOMA Systems with Limited Feedback

    Authors: Zhiyao Tang, Liang Sun, Lu Cao, Shutong Qi, and Yong Feng

    Abstract: We provide in this paper a comprehensive solution to the design, performance analysis, and optimization of a multi-antenna non-orthogonal multiple access (NOMA) system for multiuser downlink communications under a general limited channel state information (CSI) feedback framework for frequency division duplex mode. We design a general framework including user clustering, joint power and bits alloc… ▽ More

    Submitted 1 December, 2019; originally announced December 2019.

    Comments: accepted to IEEE Transactions on Wireless Communications,2019

  43. arXiv:1910.00962  [pdf, other

    cs.CV

    Privacy-preserving Federated Brain Tumour Segmentation

    Authors: Wenqi Li, Fausto Milletarì, Daguang Xu, Nicola Rieke, Jonny Hancox, Wentao Zhu, Maximilian Baust, Yan Cheng, Sébastien Ourselin, M. Jorge Cardoso, Andrew Feng

    Abstract: Due to medical data privacy regulations, it is often infeasible to collect and share patient data in a centralised data lake. This poses challenges for training machine learning algorithms, such as deep convolutional networks, which often require large numbers of diverse training examples. Federated learning sidesteps this difficulty by bringing code to the patient data owners and only sharing int… ▽ More

    Submitted 2 October, 2019; originally announced October 2019.

    Comments: MICCAI MLMI 2019

  44. arXiv:1903.01498  [pdf, other

    cs.DB cs.AI

    Voyageur: An Experiential Travel Search Engine

    Authors: Sara Evensen, Aaron Feng, Alon Halevy, Jinfeng Li, Vivian Li, Yuliang Li, Huining Liu, George Mihaila, John Morales, Natalie Nuno, Ekaterina Pavlovic, Wang-Chiew Tan, Xiaolan Wang

    Abstract: We describe Voyageur, which is an application of experiential search to the domain of travel. Unlike traditional search engines for online services, experiential search focuses on the experiential aspects of the service under consideration. In particular, Voyageur needs to handle queries for subjective aspects of the service (e.g., quiet hotel, friendly staff) and combine these with objective attr… ▽ More

    Submitted 4 March, 2019; originally announced March 2019.

    Comments: Demo paper accepted to the Web Conference

  45. arXiv:1902.09661  [pdf, other

    cs.DB

    Subjective Databases

    Authors: Yuliang Li, Aaron Xixuan Feng, Jinfeng Li, Saran Mumick, Alon Halevy, Vivian Li, Wang-Chiew Tan

    Abstract: Online users are constantly seeking experiences, such as a hotel with clean rooms and a lively bar, or a restaurant for a romantic rendezvous. However, e-commerce search engines only support queries involving objective attributes such as location, price, and cuisine, and any experiential data is relegated to text reviews. In order to support experiential queries, a database system needs to model… ▽ More

    Submitted 24 July, 2019; v1 submitted 25 February, 2019; originally announced February 2019.

  46. arXiv:1805.01083  [pdf, other

    cs.DB cs.CL

    Scalable Semantic Querying of Text

    Authors: Xiaolan Wang, Aaron Feng, Behzad Golshan, Alon Halevy, George Mihaila, Hidekazu Oiwa, Wang-Chiew Tan

    Abstract: We present the KOKO system that takes declarative information extraction to a new level by incorporating advances in natural language processing techniques in its extraction language. KOKO is novel in that its extraction language simultaneously supports conditions on the surface of the text and on the structure of the dependency parse tree of sentences, thereby allowing for more refined extraction… ▽ More

    Submitted 2 May, 2018; originally announced May 2018.

  47. arXiv:1607.01869  [pdf, other

    cs.IR cs.AI cs.CL

    Scalable Semantic Matching of Queries to Ads in Sponsored Search Advertising

    Authors: Mihajlo Grbovic, Nemanja Djuric, Vladan Radosavljevic, Fabrizio Silvestri, Ricardo Baeza-Yates, Andrew Feng, Erik Ordentlich, Lee Yang, Gavin Owens

    Abstract: Sponsored search represents a major source of revenue for web search engines. This popular advertising model brings a unique possibility for advertisers to target users' immediate intent communicated through a search query, usually by displaying their ads alongside organic search results for queries deemed relevant to their products or services. However, due to a large number of unique queries it… ▽ More

    Submitted 6 July, 2016; originally announced July 2016.

    Comments: 10 pages, 4 figures, 39th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2016, Pisa, Italy

    Journal ref: 39th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2016, Pisa, Italy

  48. arXiv:1606.08495  [pdf, other

    cs.CL

    Network-Efficient Distributed Word2vec Training System for Large Vocabularies

    Authors: Erik Ordentlich, Lee Yang, Andy Feng, Peter Cnudde, Mihajlo Grbovic, Nemanja Djuric, Vladan Radosavljevic, Gavin Owens

    Abstract: Word2vec is a popular family of algorithms for unsupervised training of dense vector representations of words on large text corpuses. The resulting vectors have been shown to capture semantic relationships among their corresponding words, and have shown promise in reducing a number of natural language processing (NLP) tasks to mathematical operations on these vectors. While heretofore applications… ▽ More

    Submitted 27 June, 2016; originally announced June 2016.

    Comments: 10 pages, 2 figures

  49. arXiv:1105.1421  [pdf

    physics.soc-ph cs.SI

    An Empirical Investigation on Important Subgraphs in Cooperation-Competition networks

    Authors: A. -X. Feng, C. -H. Fu, X. -L. Xu, Ai-Fen Liu, H. Chang, D. -R. He, G. -L. Feng

    Abstract: Subgraphs are very important for understanding structure and function of complex networks. Dyad and triad are the elementary subgraphs. We focus on the distribution of their act degree defined as the number of activities, events or organizations they join, which indicates the importance of the subgraphs. The empirical studies show that, in a lot of real world systems, the dyad or triad act degree… ▽ More

    Submitted 4 September, 2011; v1 submitted 7 May, 2011; originally announced May 2011.

    Comments: 14 pages and 14 figures