Author: Tian, Yuan : Search

research-article

An empirical study of developers’ challenges in implementing Workflows as Code: A case study on Apache Airflow

Journal of Systems and Software (JSSO), Volume 219, Issue Chttps://doi.org/10.1016/j.jss.2024.112248

Abstract

The Workflows as Code paradigm is becoming increasingly essential to streamline the design and management of complex processes within data-intensive software systems. These systems require robust capabilities to process, analyze, and extract ...

Highlights

First study on “Workflows as Code” challenges in data-intensive Machine Learning systems using Airflow.
Hierarchical taxonomy identifies 7 categories, 14 subcategories, and 10 root causes of challenges.
Defining and executing ...

research-article

Open Access

Insights into Natural Language Database Query Errors: from Attention Misalignment to User Handling Strategies

ACM Transactions on Interactive Intelligent Systems (TIIS), Volume 14, Issue 4Article No.: 25, Pages 1–32https://doi.org/10.1145/3650114

Querying structured databases with natural language (NL2SQL) has remained a difficult problem for years. Recently, the advancement of machine learning (ML), natural language processing (NLP), and large language models (LLM) have led to significant ...

research-article

YOLO-SS: optimizing YOLO for enhanced small object detection in remote sensing imagery: YOLO-SS: optimizing YOLO for enhanced small object detection...

The Journal of Supercomputing (JSCO), Volume 81, Issue 1https://doi.org/10.1007/s11227-024-06765-8

Abstract

The identification of minuscule objects in remote sensing data presents a formidable challenge in computer vision, where objects may occupy a mere handful of pixels. The lack of unique shape features in such small objects hinders the effectiveness ...

research-article

Open Access

BadMerging: Backdoor Attacks Against Model Merging

CCS '24: Proceedings of the 2024 on ACM SIGSAC Conference on Computer and Communications SecurityPages 4450–4464https://doi.org/10.1145/3658644.3690284

Fine-tuning pre-trained models for downstream tasks has led to a proliferation of open-sourced task-specific models. Recently, Model Merging (MM) has emerged as an effective approach to facilitate knowledge transfer among these independently fine-tuned ...

research-article

Open Access

AuthSaber: Automated Safety Verification of OpenID Connect Programs

CCS '24: Proceedings of the 2024 on ACM SIGSAC Conference on Computer and Communications SecurityPages 2949–2962https://doi.org/10.1145/3658644.3670318

Single Sign-On (SSO)-based authentication protocols, like OpenID Connect (OIDC), play a crucial role in enhancing security and privacy in today's interconnected digital world, gaining widespread adoption among the majority of prominent authentication ...

research-article

Multi-head self-attention mechanism combined with feedforward network for time-varying nonlinear digital self-interference cancellation

Digital Signal Processing (DISP), Volume 155, Issue Chttps://doi.org/10.1016/j.dsp.2024.104699

Abstract

In-band full duplex (IBFD) allows simultaneous communication in the same frequency band, thus significantly improving spectral efficiency. However, self-interference (SI) at the local transmitter affects decoding at the receiver, and SI intensity ...

research-article

Free

JUST ACCEPTED

ZS4C: Zero-Shot Synthesis of Compilable Code for Incomplete Code Snippets using LLMs

ACM Transactions on Software Engineering and Methodology (TOSEM), Just Accepted https://doi.org/10.1145/3702979

Technical Q&A sites are valuable for software developers seeking knowledge, but the code snippets they provide are often uncompilable and incomplete due to unresolved types and missing libraries. This poses a challenge for users who wish to reuse or ...

short-paper

Engaging with AI: An Exploratory Study on Developers' Sharing and Reactions to ChatGPT in GitHub Pull Requests

ASEW '24: Proceedings of the 39th IEEE/ACM International Conference on Automated Software Engineering WorkshopsPages 156–160https://doi.org/10.1145/3691621.3694946

ChatGPT, as a representative Foundation Model (FM)-powered tool, has demonstrated significant potential in assisting developers with various software engineering tasks, such as code generation, program repair, and test creation. However, the timing of ...

research-article

A First Look at Self-Admitted Miscommunications in GitHub Issues

ASEW '24: Proceedings of the 39th IEEE/ACM International Conference on Automated Software Engineering WorkshopsPages 118–127https://doi.org/10.1145/3691621.3694942

Effective communication is crucial for the success of open-source software development, particularly within distributed and asynchronous working environments on collaborative coding supporting platforms like GitHub. However, these environments often ...

Article

Efficient Zero-Knowledge Argument for Bilinear Matrix Relation over the Residue Ring

Data Security and Privacy ProtectionPages 87–105https://doi.org/10.1007/978-981-97-8540-7_6

Abstract

In private computing applications various data relations can be represented by or reduced to some matrix relations. In addition, the residue ring $Z_{m}$ is one of the most widely used arithmetic systems in practice. One of the main challenges in ... $^{}$ $_{}$ $^{}$

research-article

Open Access

SQLucid: Grounding Natural Language Database Queries with Interactive Explanations

UIST '24: Proceedings of the 37th Annual ACM Symposium on User Interface Software and TechnologyArticle No.: 12, Pages 1–20https://doi.org/10.1145/3654777.3676368

Though recent advances in machine learning have led to significant improvements in natural language interfaces for databases, the accuracy and reliability of these systems remain limited, especially in high-stakes domains. This paper introduces SQLucid, ...

Article

Interactive Design of Serious Game Based on Gesture for Transmission of Traditional Handicraft

Entertainment Computing – ICEC 2024Pages 345–354https://doi.org/10.1007/978-3-031-74353-5_30

Abstract

Traditional handicraft making and display usually require fixed steps of hand operating. Gesture-based design is a considerable way when designing serious interactive games for traditional handicraft transmission . This paper developed a design ...

research-article

SCL-CVD: Supervised contrastive learning for code vulnerability detection via GraphCodeBERT

Computers and Security (CSEC), Volume 145, Issue Chttps://doi.org/10.1016/j.cose.2024.103994

Abstract

Detecting vulnerabilities in source code is crucial for protecting software systems from cyberattacks. Pre-trained language models such as CodeBERT and GraphCodeBERT have been applied in multiple code-related downstream tasks such as code search ...

research-article

An empirical study on the effectiveness of large language models for SATD identification and classification

Empirical Software Engineering (KLU-EMSE), Volume 29, Issue 6https://doi.org/10.1007/s10664-024-10548-3

Abstract

Self-Admitted Technical Debt (SATD), a concept highlighting sub-optimal choices in software development documented in code comments or other project resources, poses challenges in the maintainability and evolution of software systems. Large ...

Article

Free-VSC: Free Semantics from Visual Foundation Models for Unsupervised Video Semantic Compression

Computer Vision – ECCV 2024Pages 163–183https://doi.org/10.1007/978-3-031-72967-6_10

Abstract

Unsupervised video semantic compression (UVSC), i.e., compressing videos to better support various analysis tasks, has recently garnered attention. However, the semantic richness of previous methods remains limited, due to the single semantic ...

Article

Towards Detection-Recovery Strategy for Robust Decentralized Matrix Factorization

Computer Security – ESORICS 2024Pages 24–44https://doi.org/10.1007/978-3-031-70879-4_2

Abstract

Decentralized matrix factorization (DMF) has emerged as a prominent technique for handling large-scale matrix completion tasks, such as those encountered in commercial recommender systems and social network analysis. Despite its effectiveness and ...

research-article

An empirical study on developers’ shared conversations with ChatGPT in GitHub pull requests and issues

Empirical Software Engineering (KLU-EMSE), Volume 29, Issue 6https://doi.org/10.1007/s10664-024-10540-x

Abstract

ChatGPT has significantly impacted software development practices, providing substantial assistance to developers in various tasks, including coding, testing, and debugging. Despite its widespread adoption, the impact of ChatGPT as an assistant in ...

research-article

A cross-temporal multimodal fusion system based on deep learning for orthodontic monitoring

Computers in Biology and Medicine (CBIM), Volume 180, Issue Chttps://doi.org/10.1016/j.compbiomed.2024.109025

Abstract Introduction

In the treatment of malocclusion, continuous monitoring of the three-dimensional relationship between dental roots and the surrounding alveolar bone is essential for preventing complications from orthodontic procedures. Cone-beam ...

Graphical abstract

Display Omitted

Highlights

The first deep learning based orthodontic system to continuous risk monitoring.
Cross-temporal fusion framework for multimodal medical imaging registration.
Novel registration method based on segmentation for internal structure ...

research-article

Open Access

Where Have You Been? A Study of Privacy Risk for Point-of-Interest Recommendation

KDD '24: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data MiningPages 175–186https://doi.org/10.1145/3637528.3671758

As location-based services (LBS) have grown in popularity, more human mobility data has been collected. The collected data can be used to build machine learning (ML) models for LBS to enhance their performance and improve overall experience for users. ...

research-article

Remote keylogging attacks in multi-user VR applications

SEC '24: Proceedings of the 33rd USENIX Conference on Security SymposiumArticle No.: 154, Pages 2743–2760

As Virtual Reality (VR) applications grow in popularity, they have bridged distances and brought users closer together. However, with this growth, there have been increasing concerns about security and privacy, especially related to the motion data used ...

Applied Filters

People

Names

Institutions

Authors

Editors

Advisors

Reviewers

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Supplemental Material Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Save to Binder

Upcoming Conferences