
Showing 1–40 of 40 results for author: Zhao, E

Searching in archive cs.
  1. arXiv:2407.13979  [pdf, other]

    cs.LG cs.DS stat.ML

    Truthfulness of Calibration Measures

    Authors: Nika Haghtalab, Mingda Qiao, Kunhe Yang, Eric Zhao

    Abstract: We initiate the study of the truthfulness of calibration measures in sequential prediction. A calibration measure is said to be truthful if the forecaster (approximately) minimizes the expected penalty by predicting the conditional expectation of the next outcome, given the prior distribution of outcomes. Truthfulness is an important property of calibration measures, ensuring that the forecaster i…

    Submitted 18 July, 2024; originally announced July 2024.

  2. arXiv:2406.14135  [pdf, other]

    cs.RO cs.AI

    Autonomous Robotic Drilling System for Mice Cranial Window Creation

    Authors: Enduo Zhao, Murilo M. Marinho, Kanako Harada

    Abstract: Robotic assistance for experimental manipulation in the life sciences is expected to enable favorable outcomes, regardless of the skill of the scientist. Experimental specimens in the life sciences are subject to individual variability and hence require intricate algorithms for successful autonomous robotic control. As a use case, we are studying the creation of cranial windows in mice. This operation…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 15 pages, 14 figures, to be submitted to IEEE

  3. arXiv:2406.09640  [pdf, other]

    cs.RO

    GPT-Fabric: Folding and Smoothing Fabric by Leveraging Pre-Trained Foundation Models

    Authors: Vedant Raval, Enyu Zhao, Hejia Zhang, Stefanos Nikolaidis, Daniel Seita

    Abstract: Fabric manipulation has applications in folding blankets, handling patient clothing, and protecting items with covers. It is challenging for robots to perform fabric manipulation since fabrics have infinite-dimensional configuration spaces, complex dynamics, and may be in folded or crumpled configurations with severe self-occlusions. Prior work on robotic fabric manipulation relies either on heavi…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Code, prompts, and videos are available at https://tinyurl.com/gptfab

  4. arXiv:2403.04978  [pdf, other]

    cs.LG stat.ML

    Stacking as Accelerated Gradient Descent

    Authors: Naman Agarwal, Pranjal Awasthi, Satyen Kale, Eric Zhao

    Abstract: Stacking, a heuristic technique for training deep residual networks by progressively increasing the number of layers and initializing new layers by copying parameters from older layers, has proven quite successful in improving the efficiency of training deep neural networks. In this paper, we propose a theoretical explanation for the efficacy of stacking: viz., stacking implements a form of Nester…

    Submitted 7 March, 2024; originally announced March 2024.

  5. arXiv:2401.08294  [pdf, other]

    cs.CL

    Inferflow: an Efficient and Highly Configurable Inference Engine for Large Language Models

    Authors: Shuming Shi, Enbo Zhao, Deng Cai, Leyang Cui, Xinting Huang, Huayang Li

    Abstract: We present Inferflow, an efficient and highly configurable inference engine for large language models (LLMs). With Inferflow, users can serve most of the common transformer models by simply modifying some lines in corresponding configuration files, without writing a single line of source code. Compared with most existing inference engines, Inferflow has some key features. First, by implementing a…

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: Technical report of Inferflow

  6. arXiv:2312.12096  [pdf, other]

    cs.CV

    DLCA-Recon: Dynamic Loose Clothing Avatar Reconstruction from Monocular Videos

    Authors: Chunjie Luo, Fei Luo, Yusen Wang, Enxu Zhao, Chunxia Xiao

    Abstract: Reconstructing a dynamic human with loose clothing is an important but difficult task. To address this challenge, we propose a method named DLCA-Recon to create human avatars from monocular videos. The distance from loose clothing to the underlying body rapidly changes in every frame when the human freely moves and acts. Previous methods lack effective geometric initialization and constraints for…

    Submitted 20 December, 2023; v1 submitted 19 December, 2023; originally announced December 2023.

  7. arXiv:2310.07299  [pdf, other]

    cs.CL cs.AI

    RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation

    Authors: Yue Zhang, Leyang Cui, Enbo Zhao, Wei Bi, Shuming Shi

    Abstract: Grammatical Error Correction (GEC) systems play a vital role in assisting people with their daily writing tasks. However, users may sometimes come across a GEC system that initially performs well but fails to correct errors when the inputs are slightly modified. To ensure an ideal user experience, a reliable GEC system should have the ability to provide consistent and accurate suggestions when enc…

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023 (main conference, long paper)

  8. arXiv:2309.10011  [pdf, other]

    cs.CV eess.IV

    Instant Photorealistic Style Transfer: A Lightweight and Adaptive Approach

    Authors: Rong Liu, Enyu Zhao, Zhiyuan Liu, Andrew Feng, Scott John Easley

    Abstract: In this paper, we propose an Instant Photorealistic Style Transfer (IPST) approach, designed to achieve instant photorealistic style transfer on super-resolution inputs without the need for pre-training on pair-wise datasets or imposing extra constraints. Our method utilizes a lightweight StyleNet to enable style transfer from a style image to a content image while preserving non-color information…

    Submitted 20 October, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: 8 pages (reference excluded), 6 figures, 4 tables

  9. arXiv:2309.01219  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG

    Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models

    Authors: Yue Zhang, Yafu Li, Leyang Cui, Deng Cai, Lemao Liu, Tingchen Fu, Xinting Huang, Enbo Zhao, Yu Zhang, Yulong Chen, Longyue Wang, Anh Tuan Luu, Wei Bi, Freda Shi, Shuming Shi

    Abstract: While large language models (LLMs) have demonstrated remarkable capabilities across a range of downstream tasks, a significant concern revolves around their propensity to exhibit hallucinations: LLMs occasionally generate content that diverges from the user input, contradicts previously generated context, or misaligns with established world knowledge. This phenomenon poses a substantial challenge…

    Submitted 24 September, 2023; v1 submitted 3 September, 2023; originally announced September 2023.

    Comments: work in progress; 32 pages

  10. arXiv:2307.12135  [pdf, ps, other]

    cs.LG stat.ML

    The Sample Complexity of Multi-Distribution Learning for VC Classes

    Authors: Pranjal Awasthi, Nika Haghtalab, Eric Zhao

    Abstract: Multi-distribution learning is a natural generalization of PAC learning to settings with multiple data distributions. There remains a significant gap between the known upper and lower bounds for PAC-learnable classes. In particular, though we understand the sample complexity of learning a VC dimension $d$ class on $k$ distributions to be…

    Submitted 22 July, 2023; originally announced July 2023.

    Comments: 11 pages. Authors are ordered alphabetically. Open problem presented at the 36th Annual Conference on Learning Theory

  11. arXiv:2306.16759  [pdf, other]

    cs.CV

    SaaFormer: Spectral-spatial Axial Aggregation Transformer for Hyperspectral Image Classification

    Authors: Enzhe Zhao, Zhichang Guo, Yao Li, Dazhi Zhang

    Abstract: Hyperspectral images (HSI) captured from earth observing satellites and aircraft are becoming increasingly important for applications in agriculture, environmental monitoring, mining, etc. Due to the limited available hyperspectral datasets, the pixel-wise random sampling is the most commonly used training-test dataset partition approach, which has significant overlap between samples in training an…

    Submitted 4 July, 2023; v1 submitted 29 June, 2023; originally announced June 2023.

    Comments: arXiv admin note: text overlap with arXiv:2107.02988 by other authors

  12. Autonomous Robotic Drilling System for Mice Cranial Window Creation: An Evaluation with an Egg Model

    Authors: Enduo Zhao, Murilo M. Marinho, Kanako Harada

    Abstract: Robotic assistance for experimental manipulation in the life sciences is expected to enable precise manipulation of valuable samples, regardless of the skill of the scientist. Experimental specimens in the life sciences are subject to individual variability and deformation, and therefore require autonomous robotic control. As an example, we are studying the installation of a cranial window in a mo…

    Submitted 20 September, 2023; v1 submitted 21 March, 2023; originally announced March 2023.

    Comments: Accepted at IROS 2023, 8 pages

    Journal ref: 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Detroit, MI, USA, 2023, pp. 4592-4599

  13. arXiv:2302.10863  [pdf, other]

    cs.LG

    A Unifying Perspective on Multi-Calibration: Game Dynamics for Multi-Objective Learning

    Authors: Nika Haghtalab, Michael I. Jordan, Eric Zhao

    Abstract: We provide a unifying framework for the design and analysis of multicalibrated predictors. By placing the multicalibration problem in the general setting of multi-objective learning -- where learning guarantees must hold simultaneously over a set of distributions and loss functions -- we exploit connections to game dynamics to achieve state-of-the-art guarantees for a diverse set of multicalibrati…

    Submitted 19 September, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

    Comments: 45 pages. Authors are ordered alphabetically

  14. arXiv:2210.12927  [pdf]

    cs.RO

    The Design and Realization of Multi-agent Obstacle Avoidance based on Reinforcement Learning

    Authors: Enyu Zhao, Chanjuan Liu, Houfu Su, Yang Liu

    Abstract: Intelligent agents and multi-agent systems play important roles in scenarios such as the control system of grouped drones, and multi-agent navigation and obstacle avoidance, a foundational function of advanced applications, is of great importance. In multi-agent navigation and obstacle avoidance tasks, the decision-making interactions and dynamic changes of agents are difficult for traditional ro…

    Submitted 23 October, 2022; originally announced October 2022.

  15. arXiv:2210.12529  [pdf, other]

    cs.LG cs.CY

    On-Demand Sampling: Learning Optimally from Multiple Distributions

    Authors: Nika Haghtalab, Michael I. Jordan, Eric Zhao

    Abstract: Social and real-world considerations such as robustness, fairness, social welfare and multi-agent tradeoffs have given rise to multi-distribution learning paradigms, such as collaborative learning, group distributionally robust optimization, and fair federated learning. In each of these settings, a learner seeks to uniformly minimize its expected loss over $n$ predefined data distributions, while…

    Submitted 2 April, 2024; v1 submitted 22 October, 2022; originally announced October 2022.

    Comments: 28 pages, 1 figure. Authors are ordered alphabetically. Outstanding paper award at the Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS 2022). Version v2 updates a minor mistake in Lemma 3.1

  16. arXiv:2208.01815  [pdf, other]

    cs.CL

    Effidit: Your AI Writing Assistant

    Authors: Shuming Shi, Enbo Zhao, Duyu Tang, Yan Wang, Piji Li, Wei Bi, Haiyun Jiang, Guoping Huang, Leyang Cui, Xinting Huang, Cong Zhou, Yong Dai, Dongyang Ma

    Abstract: In this technical report, we introduce Effidit (Efficient and Intelligent Editing), a digital writing assistant that facilitates users to write higher-quality text more efficiently by using artificial intelligence (AI) technologies. Previous writing assistants typically provide the function of error checking (to detect and correct spelling and grammatical errors) and limited text-rewriting functio…

    Submitted 4 August, 2022; v1 submitted 2 August, 2022; originally announced August 2022.

    Comments: Technical report for Effidit. arXiv admin note: text overlap with arXiv:2202.06417

  17. arXiv:2207.05750  [pdf, other]

    cs.IR cs.AI cs.LG

    FD-GATDR: A Federated-Decentralized-Learning Graph Attention Network for Doctor Recommendation Using EHR

    Authors: Luning Bi, Yunlong Wang, Fan Zhang, Zhuqing Liu, Yong Cai, Emily Zhao

    Abstract: In the past decade, with the development of big data technology, an increasing amount of patient information has been stored as electronic health records (EHRs). Leveraging these data, various doctor recommendation systems have been proposed. Typically, such studies process the EHR data in a flat-structured manner, where each encounter was treated as an unordered set of features. Nevertheless, the…

    Submitted 11 July, 2022; originally announced July 2022.

  18. arXiv:2206.03321  [pdf, ps, other]

    cs.LG

    Early Abnormal Detection of Sewage Pipe Network: Bagging of Various Abnormal Detection Algorithms

    Authors: Zhen-Yu Zhang, Guo-Xiang Shao, Chun-Ming Qiu, Yue-Jie Hou, En-Ming Zhao, Chi-Chun Zhou

    Abstract: Abnormalities of the sewage pipe network will affect the normal operation of the whole city. Therefore, it is important to detect the abnormalities early. This paper proposes an early abnormal-detection method. The abnormalities are detected using conventional algorithms, such as the isolation forest algorithm, and two innovations are given: (1) The current and historical data measured by the sensor…

    Submitted 5 June, 2022; originally announced June 2022.

  19. arXiv:2204.12052  [pdf, other]

    cs.CL

    Pretraining Chinese BERT for Detecting Word Insertion and Deletion Errors

    Authors: Cong Zhou, Yong Dai, Duyu Tang, Enbo Zhao, Zhangyin Feng, Li Kuang, Shuming Shi

    Abstract: Chinese BERT models achieve remarkable progress in dealing with grammatical errors of word substitution. However, they fail to handle word insertion and deletion because BERT assumes the existence of a word at each position. To address this, we present a simple and effective Chinese pretrained model. The basic idea is to enable the model to determine whether a word exists at a particular position.…

    Submitted 25 April, 2022; originally announced April 2022.

    Comments: 12 pages

  20. arXiv:2203.00286  [pdf, other]

    cs.CL

    "Is Whole Word Masking Always Better for Chinese BERT?": Probing on Chinese Grammatical Error Correction

    Authors: Yong Dai, Linyang Li, Cong Zhou, Zhangyin Feng, Enbo Zhao, Xipeng Qiu, Piji Li, Duyu Tang

    Abstract: Whole word masking (WWM), which masks all subwords corresponding to a word at once, makes a better English BERT model. For the Chinese language, however, there is no subword because each token is an atomic character. The meaning of a word in Chinese is different in that a word is a compositional unit consisting of multiple characters. Such difference motivates us to investigate whether WWM leads t…

    Submitted 2 March, 2022; v1 submitted 1 March, 2022; originally announced March 2022.

    Comments: Short paper in Findings of ACL 2022

  21. arXiv:2201.02025  [pdf, other]

    cs.LG math.OC

    A deep learning-based model reduction (DeePMR) method for simplifying chemical kinetics

    Authors: Zhiwei Wang, Yaoyu Zhang, Enhan Zhao, Yiguang Ju, Weinan E, Zhi-Qin John Xu, Tianhan Zhang

    Abstract: A deep learning-based model reduction (DeePMR) method for simplifying chemical kinetics is proposed and validated using high-temperature auto-ignitions, perfectly stirred reactors (PSR), and one-dimensional freely propagating flames of n-heptane/air mixtures. The mechanism reduction is modeled as an optimization problem on Boolean space, where a Boolean vector, each entry corresponding to a specie…

    Submitted 8 September, 2022; v1 submitted 6 January, 2022; originally announced January 2022.

  22. arXiv:2112.01224  [pdf]

    cs.CY cs.LG

    Using word embedding for environmental violation analysis: Evidence from Pennsylvania unconventional oil and gas compliance reports

    Authors: Dan Bi, Ju-e Guo, Erlong Zhao, Shaolong Sun, Shouyang Wang

    Abstract: With the booming of the unconventional oil and gas industry, its inevitable damage to the environment and human health has attracted public attention. We applied text mining to a total of 6057 Environmental Health and Safety compliance reports from 2008 to 2018 launched by the Department of Environmental Protection in Pennsylvania, USA, to discover the internal mechanism of environmental vio…

    Submitted 9 November, 2021; originally announced December 2021.

    Comments: 23 pages

    MSC Class: 62P12

  23. arXiv:2107.11626  [pdf, other]

    cs.CV

    Multi-Label Image Classification with Contrastive Learning

    Authors: Son D. Dao, Ethan Zhao, Dinh Phung, Jianfei Cai

    Abstract: Recently, as an effective way of learning latent representations, contrastive learning has been increasingly popular and successful in various domains. The success of contrastive learning in single-label classifications motivates us to leverage this learning framework to enhance distinctiveness for better performance in multi-label image classification. In this paper, we show that a direct applic…

    Submitted 24 July, 2021; originally announced July 2021.

  24. arXiv:2106.05492  [pdf, other]

    cs.LG

    Learning to Play General-Sum Games Against Multiple Boundedly Rational Agents

    Authors: Eric Zhao, Alexander R. Trott, Caiming Xiong, Stephan Zheng

    Abstract: We study the problem of training a principal in a multi-agent general-sum game using reinforcement learning (RL). Learning a robust principal policy requires anticipating the worst possible strategic responses of other agents, which is generally NP-hard. However, we show that no-regret dynamics can identify these worst-case responses in poly-time in smooth games. We propose a framework that uses t…

    Submitted 19 December, 2022; v1 submitted 10 June, 2021; originally announced June 2021.

    Comments: 15 pages, 6 figures. Appearing at the Thirty-seventh AAAI Conference on Artificial Intelligence (AAAI 2023)

  25. arXiv:2104.12302  [pdf, other]

    cs.IR

    A unified Neural Network Approach to E-Commerce Relevance Learning

    Authors: Yunjiang Jiang, Yue Shang, Rui Li, Wen-Yun Yang, Guoyu Tang, Chaoyi Ma, Yun Xiao, Eric Zhao

    Abstract: Result relevance scoring is critical to e-commerce search user experience. Traditional information retrieval methods focus on keyword matching and hand-crafted or counting-based numeric features, with limited understanding of item semantic relevance. We describe a highly-scalable feed-forward neural model to provide a relevance score for (query, item) pairs, using only user query and item title as f…

    Submitted 25 April, 2021; originally announced April 2021.

    Comments: 6 pages

    Journal ref: DLP-KDD 2019

  26. arXiv:2103.12982  [pdf, other]

    cs.IR cs.AI cs.LG

    From Semantic Retrieval to Pairwise Ranking: Applying Deep Learning in E-commerce Search

    Authors: Rui Li, Yunjiang Jiang, Wenyun Yang, Guoyu Tang, Songlin Wang, Chaoyi Ma, Wei He, Xi Xiong, Yun Xiao, Eric Yihong Zhao

    Abstract: We introduce deep learning models to the two most important stages in product search at JD.com, one of the largest e-commerce platforms in the world. Specifically, we outline the design of a deep learning system that retrieves semantically relevant items to a query within milliseconds, and a pairwise deep re-ranking system, which learns subtle user preferences. Compared to traditional search syste…

    Submitted 24 March, 2021; originally announced March 2021.

    Comments: Accepted in SIGIR 2019

  27. arXiv:2102.09381  [pdf, other]

    cs.LG cs.AI cs.GT

    L2E: Learning to Exploit Your Opponent

    Authors: Zhe Wu, Kai Li, Enmin Zhao, Hang Xu, Meng Zhang, Haobo Fu, Bo An, Junliang Xing

    Abstract: Opponent modeling is essential to exploit sub-optimal opponents in strategic interactions. Most previous works focus on building explicit models to directly predict the opponents' styles or strategies, which require a large amount of data to train the model and lack adaptability to unknown opponents. In this work, we propose a novel Learning to Exploit (L2E) framework for implicit opponent modelin…

    Submitted 18 February, 2021; originally announced February 2021.

  28. arXiv:2012.15639  [pdf, other]

    cs.CL

    TexSmart: A Text Understanding System for Fine-Grained NER and Enhanced Semantic Analysis

    Authors: Haisong Zhang, Lemao Liu, Haiyun Jiang, Yangming Li, Enbo Zhao, Kun Xu, Linfeng Song, Suncong Zheng, Botong Zhou, Jianchen Zhu, Xiao Feng, Tao Chen, Tao Yang, Dong Yu, Feng Zhang, Zhanhui Kang, Shuming Shi

    Abstract: This technical report introduces TexSmart, a text understanding system that supports fine-grained named entity recognition (NER) and enhanced semantic analysis functionalities. Compared to most previous publicly available text understanding systems and tools, TexSmart holds some unique features. First, the NER function of TexSmart supports over 1,000 entity types, while most other public tools typ…

    Submitted 31 December, 2020; originally announced December 2020.

  29. arXiv:2012.10853  [pdf, other]

    cs.LG cs.AI

    eTREE: Learning Tree-structured Embeddings

    Authors: Faisal M. Almutairi, Yunlong Wang, Dong Wang, Emily Zhao, Nicholas D. Sidiropoulos

    Abstract: Matrix factorization (MF) plays an important role in a wide range of machine learning and data mining models. MF is commonly used to obtain item embeddings and feature representations due to its ability to capture correlations and higher-order statistical dependencies across dimensions. In many applications, the categories of items exhibit a hierarchical tree structure. For instance, human disease…

    Submitted 20 December, 2020; originally announced December 2020.

  30. arXiv:2012.06168  [pdf, other]

    cs.LG cs.AI cs.GT

    OpenHoldem: A Benchmark for Large-Scale Imperfect-Information Game Research

    Authors: Kai Li, Hang Xu, Enmin Zhao, Zhe Wu, Junliang Xing

    Abstract: Owing to the unremitting efforts by a few institutes, significant progress has recently been made in designing superhuman AIs in No-limit Texas Hold'em (NLTH), the primary testbed for large-scale imperfect-information game research. However, it remains challenging for new researchers to study this problem since there are no standard benchmarks for comparing with existing methods, which seriously…

    Submitted 13 December, 2021; v1 submitted 11 December, 2020; originally announced December 2020.

  31. arXiv:2007.08479  [pdf, other]

    cs.LG stat.ML

    Active Learning under Label Shift

    Authors: Eric Zhao, Anqi Liu, Animashree Anandkumar, Yisong Yue

    Abstract: We address the problem of active learning under label shift: when the class proportions of source and target domains differ. We introduce a "medial distribution" to incorporate a tradeoff between importance weighting and class-balanced sampling and propose their combined usage in active learning. Our method is known as Mediated Active Learning under Label Shift (MALLS). It balances the bias from c…

    Submitted 25 February, 2021; v1 submitted 16 July, 2020; originally announced July 2020.

    Comments: 18 pages, 9 figures, to appear at the 2021 International Conference on Artificial Intelligence and Statistics (AIStats)

  32. arXiv:1910.02574  [pdf, other]

    cs.LG cs.IR stat.ML

    Representation Learning of EHR Data via Graph-Based Medical Entity Embedding

    Authors: Tong Wu, Yunlong Wang, Yue Wang, Emily Zhao, Yilian Yuan, Zhi Yang

    Abstract: Automatic representation learning of key entities in electronic health record (EHR) data is a critical step for healthcare informatics that turns heterogeneous medical records into structured and actionable information. Here we propose ME2Vec, an algorithmic framework for learning low-dimensional vectors of the most common entities in EHR: medical services, doctors, and patients. ME2Vec leverages…

    Submitted 6 October, 2019; originally announced October 2019.

    Comments: 5 pages, 2 figures, NeurIPS 2019 Graph Representation Learning Workshop

  33. arXiv:1907.01099  [pdf, other]

    cs.LG stat.ML

    Predicting Treatment Initiation from Clinical Time Series Data via Graph-Augmented Time-Sensitive Model

    Authors: Fan Zhang, Tong Wu, Yunlong Wang, Yong Cai, Cao Xiao, Emily Zhao, Lucas Glass, Jimeng Sun

    Abstract: Many computational models were proposed to extract temporal patterns from clinical time series for each patient and among patient groups for predictive healthcare. However, the common relations among patients (e.g., sharing the same doctor) were rarely considered. In this paper, we represent patient-clinician relations by bipartite graphs, addressing for example from whom a patient gets a diagnos…

    Submitted 1 July, 2019; originally announced July 2019.

    Comments: 5 pages, 3 figures, accepted by ICML 2019 Time Series Workshop

  34. arXiv:1907.01022  [pdf, other]

    cs.LG stat.ML

    Rare Disease Detection by Sequence Modeling with Generative Adversarial Networks

    Authors: Kezi Yu, Yunlong Wang, Yong Cai, Cao Xiao, Emily Zhao, Lucas Glass, Jimeng Sun

    Abstract: Rare diseases affecting 350 million individuals are commonly associated with delay in diagnosis or misdiagnosis. To improve those patients' outcomes, rare disease detection is an important task for identifying patients with rare conditions based on longitudinal medical claims. In this paper, we present a deep learning method for detecting patients with exocrine pancreatic insufficiency (EPI) (a rar…

    Submitted 1 July, 2019; originally announced July 2019.

    Comments: International Conference on Machine Learning (ICML) 2019 time series workshop accepted paper

  35. arXiv:1905.07320  [pdf, ps, other]

    cs.NE cs.CV cs.LG stat.ML

    EENA: Efficient Evolution of Neural Architecture

    Authors: Hui Zhu, Zhulin An, Chuanguang Yang, Kaiqiang Xu, Erhu Zhao, Yongjun Xu

    Abstract: The latest algorithms for automatic neural architecture search perform remarkably but are basically directionless in the search space and computationally expensive in training every intermediate architecture. In this paper, we propose a method for efficient architecture search called EENA (Efficient Evolution of Neural Architecture). Due to the elaborately designed mutation and crossover operations, the…

    Submitted 26 August, 2019; v1 submitted 9 May, 2019; originally announced May 2019.

    Comments: Accepted by ICCV2019 Neural Architects Workshop (ICCVW)

  36. arXiv:1902.07243  [pdf, other]

    cs.IR cs.LG cs.SI

    Graph Neural Networks for Social Recommendation

    Authors: Wenqi Fan, Yao Ma, Qing Li, Yuan He, Eric Zhao, Jiliang Tang, Dawei Yin

    Abstract: In recent years, Graph Neural Networks (GNNs), which can naturally integrate node information and topological structure, have been demonstrated to be powerful in learning on graph data. These advantages of GNNs provide great potential to advance social recommendation since data in social recommender systems can be represented as user-user social graph and user-item graph; and learning latent facto…

    Submitted 22 November, 2019; v1 submitted 19 February, 2019; originally announced February 2019.

    Comments: Accepted by WWW2019 Conference. Our code is available at https://github.com/wenqifan03/GraphRec-WWW19

  37. arXiv:1901.07696  [pdf, other]

    cs.CL cs.AI

    Product-Aware Answer Generation in E-Commerce Question-Answering

    Authors: Shen Gao, Zhaochun Ren, Yihong Eric Zhao, Dongyan Zhao, Dawei Yin, Rui Yan

    Abstract: In e-commerce portals, generating answers for product-related questions has become a crucial task. In this paper, we propose the task of product-aware answer generation, which tends to generate an accurate and complete answer from large-scale unlabeled e-commerce reviews and product attributes. Unlike existing question-answering problems, answer generation in e-commerce confronts three main challe…

    Submitted 23 January, 2019; v1 submitted 22 January, 2019; originally announced January 2019.

    Comments: Accepted by WSDM 2019

  38. arXiv:1812.00554  [pdf, other]

    cs.LG stat.ML

    Modeling Treatment Delays for Patients using Feature Label Pairs in a Time Series

    Authors: Weiyu Huang, Yunlong Wang, Li Zhou, Emily Zhao, Yilian Yuan, Alejandro Ribero

    Abstract: Pharmaceutical targeting is one of the key inputs for sales and marketing strategy planning. A targeting list is built by predicting a physician's sales potential for a certain type of patient. In this paper, we present a time-sensitive targeting framework leveraging a time series model to predict a patient's disease and treatment progression. We create time features by extracting service history within a…

    Submitted 2 December, 2018; originally announced December 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

  39. arXiv:1812.00547  [pdf, other]

    cs.LG stat.ML

    Semi-supervised Rare Disease Detection Using Generative Adversarial Network

    Authors: Wenyuan Li, Yunlong Wang, Yong Cai, Corey Arnold, Emily Zhao, Yilian Yuan

    Abstract: Rare diseases affect a relatively small number of people, which limits investment in research for treatments and cures. Developing an efficient method for rare disease detection is a crucial first step towards subsequent clinical research. In this paper, we present a semi-supervised learning framework for rare disease detection using generative adversarial networks. Our method takes advantage of t…

    Submitted 2 December, 2018; originally announced December 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

  40. arXiv:1810.10627  [pdf, other]

    cs.LG stat.ML

    Streaming Graph Neural Networks

    Authors: Yao Ma, Ziyi Guo, Zhaochun Ren, Eric Zhao, Jiliang Tang, Dawei Yin

    Abstract: Graphs are essential representations of many real-world data such as social networks. Recent years have witnessed the increasing efforts made to extend the neural network models to graph-structured data. These methods, which are usually known as the graph neural networks, have been applied to advance many graph-related tasks such as reasoning dynamics of the physical system, graph classification,…

    Submitted 6 November, 2018; v1 submitted 24 October, 2018; originally announced October 2018.