Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- ArticleDecember 2024
TexDC: Text-Driven Disease-Aware 4D Cardiac Cine MRI Images Generation
AbstractGenerating disease-aware cardiac cine magnetic resonance imaging (cine MRI) images has immense potential in medical research, with recent advancements in text-driven image generation technology offering a viable solution. However, establishing ...
- research-articleNovember 2024
MII: A Multifaceted Framework for Intermittence-Aware Inference and Scheduling
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCADICS), Volume 43, Issue 11Pages 3708–3719https://doi.org/10.1109/TCAD.2024.3443710The concurrent execution of deep neural networks (DNNs) inference tasks on the intermittently-powered batteryless devices (IPDs) has recently garnered much attention due to its potential in a broad range of smart sensing applications. While the ...
- research-articleSeptember 2024
Automated Testing Linguistic Capabilities of NLP Models
ACM Transactions on Software Engineering and Methodology (TOSEM), Volume 33, Issue 7Article No.: 176, Pages 1–33https://doi.org/10.1145/3672455Natural language processing (NLP) has gained widespread adoption in the development of real-world applications. However, the black-box nature of neural networks in NLP applications poses a challenge when evaluating their performance, let alone ensuring ...
- research-articleSeptember 2024
ReHarvest: An ADC Resource-Harvesting Crossbar Architecture for ReRAM-Based DNN Accelerators
- Jiahong Xu,
- Haikun Liu,
- Zhuohui Duan,
- Xiaofei Liao,
- Hai Jin,
- Xiaokang Yang,
- Huize Li,
- Cong Liu,
- Fubing Mao,
- Yu Zhang
ACM Transactions on Architecture and Code Optimization (TACO), Volume 21, Issue 3Article No.: 63, Pages 1–26https://doi.org/10.1145/3659208ReRAM-based Processing-In-Memory (PIM) architectures have been increasingly explored to accelerate various Deep Neural Network (DNN) applications because they can achieve extremely high performance and energy-efficiency for in-situ analog Matrix-Vector ...
-
- research-articleOctober 2024
A data mining-then-predict method for proactive maritime traffic management by machine learning
Engineering Applications of Artificial Intelligence (EAAI), Volume 135, Issue Chttps://doi.org/10.1016/j.engappai.2024.108696AbstractProactive traffic management is increasingly critical in maritime intelligent transportation systems. Central to this is maritime traffic forecasting, which leverages specific structures and properties of the problem. This study focuses on the ...
- ArticleSeptember 2024
ICDAR 2024 Competition on Recognition of Chemical Structures
- Mingjun Chen,
- Hao Wu,
- Qikai Chang,
- Hanbo Cheng,
- Jiefeng Ma,
- Pengfei Hu,
- Zhenrong Zhang,
- Chenyu Liu,
- Changpeng Pi,
- Jinshui Hu,
- Baocai Yin,
- Bing Yin,
- Cong Liu,
- Jun Du
Document Analysis and Recognition - ICDAR 2024Pages 397–409https://doi.org/10.1007/978-3-031-70552-6_24AbstractThe recognition of chemical molecular structures is crucial in fields such as education and biochemistry. Due to the significant challenges in data acquisition and annotation, current methods mainly focus on recognizing printed structures with ...
- research-articleJanuary 2025
1DFormer: a transformer architecture learning 1D landmark representations for facial landmark tracking
IJCAI '24: Proceedings of the Thirty-Third International Joint Conference on Artificial IntelligenceArticle No.: 176, Pages 1588–1597https://doi.org/10.24963/ijcai.2024/176Recently, heatmap regression methods based on 1D landmark representations have shown prominent performance on locating facial landmarks. However, previous methods ignored to make deep explorations on the good potentials of 1D landmark representations for ...
- research-articleJuly 2024
Enforcing Correctness of Collaborative Business Processes Using Plans
IEEE Transactions on Software Engineering (ISOF), Volume 50, Issue 9Pages 2313–2336https://doi.org/10.1109/TSE.2024.3431585Generally, a collaborative business process is a distributed process, in which a set of parallel business processes are involved. These business processes have complementary competencies and knowledge, and cooperate with each other to achieve their common ...
- research-articleJuly 2024
Multi-modal fusion for business process prediction in call center scenarios
AbstractCall centers are critical for gathering customer feedback, making them essential for business communication. Predicting the ongoing business process status accurately has become a focus in both academia and industry. However, current methods ...
Highlights- Innovates business process prediction in call centers using multi-modal fusion.
- Proposes a multitask model for effective multi-source data integration.
- Provides empirical validation of improved prediction performance in call center ...
- research-articleJuly 2024
NDOrder: Exploring a novel decoding order for scene text recognition
- Dajian Zhong,
- Hongjian Zhan,
- Shujing Lyu,
- Cong Liu,
- Bing Yin,
- Palaiahnakote Shivakumara,
- Umapada Pal,
- Yue Lu
Expert Systems with Applications: An International Journal (EXWA), Volume 249, Issue PChttps://doi.org/10.1016/j.eswa.2024.123771AbstractText recognition in scene images is still considered as a challenging task for the computer vision and pattern recognition community. For text images affected by multiple adverse factors, such as occlusion (due to obstacles) and poor quality (due ...
Highlights- The NDOrder can recognize text images with occluded and low-quality characters.
- The ROG module ensures the method work with a novel decoding order.
- The VCP module constructs a robust connection among image, content and position.
- research-articleJuly 2024
DeciX: Explain Deep Learning Based Code Generation Applications
Proceedings of the ACM on Software Engineering (PACMSE), Volume 1, Issue FSEArticle No.: 107, Pages 2424–2446https://doi.org/10.1145/3660814Deep learning-based code generation (DL-CG) applications have shown great potential for assisting developers in programming with human-competitive accuracy. However, lacking transparency in such applications due to the uninterpretable nature of deep ...
- research-articleJuly 2024
PPM: Automated Generation of Diverse Programming Problems for Benchmarking Code Generation Models
Proceedings of the ACM on Software Engineering (PACMSE), Volume 1, Issue FSEArticle No.: 54, Pages 1194–1215https://doi.org/10.1145/3643780In recent times, a plethora of Large Code Generation Models (LCGMs) have been proposed, showcasing significant potential in assisting developers with complex programming tasks. Within the surge of LCGM proposals, a critical aspect of code generation ...
- rapid-communicationJuly 2024
Converting OMOP CDM to phenopackets: A model alignment and patient data representation evaluation
Journal of Biomedical Informatics (JOBI), Volume 155, Issue Chttps://doi.org/10.1016/j.jbi.2024.104659Graphical abstractDisplay Omitted
Abstract ObjectiveThis study aims to promote interoperability in precision medicine and translational research by aligning the Observational Medical Outcomes Partnership (OMOP) and Phenopackets data models. Phenopackets is an expert knowledge-driven ...
- research-articleJuly 2024
A hybrid deep learning method for the prediction of ship time headway using automatic identification system data
Engineering Applications of Artificial Intelligence (EAAI), Volume 133, Issue PBhttps://doi.org/10.1016/j.engappai.2024.108172AbstractShip Time Headway (STH) is used in maritime navigation to describe the time interval between the arrivals of two consecutive ships in the same water area. This measurement may offer a straightforward way to gauge the frequency of ship traffic and ...
- research-articleJune 2024
A transformer-encoder-based multimodal multi-attention fusion network for sentiment analysis
Applied Intelligence (KLU-APIN), Volume 54, Issue 17-18Pages 8415–8441https://doi.org/10.1007/s10489-024-05623-7AbstractFeature fusion for multimodal sentiment analysis is a challenging but worthwhile research topic. With the extension of the time dimension, there are interactions between multimodal signals and the lack of control over the target modal ...
- research-articleJune 2024
SEMv2: Table separation line detection based on instance segmentation
AbstractTable structure recognition is an indispensable element for enabling machines to comprehend tables. Its primary purpose is to identify the internal structure of a table. Nevertheless, due to the complexity and diversity of their structure and ...
Highlights- Introducing SEMv2: Enhances ’split’ stage in table line detection for robustness.
- Proposed parallel decoder in “merge” stage boosts efficiency and maintains accuracy.
- iFLYTAB: Challenging new dataset for diverse table structure ...
- research-articleFebruary 2024
Self-supervised medical slice interpolation network using controllable feature flow▪
Expert Systems with Applications: An International Journal (EXWA), Volume 238, Issue PEhttps://doi.org/10.1016/j.eswa.2023.121943AbstractDeep learning-based image interpolation methods are confronted with various challenges in its application to anisotropic medical volumetric data (i.e., CT and MR images) out of the complex nonlinear deformation and the scarcity of high-quality ...
- research-articleJanuary 2025
Image as a language: revisiting scene text recognition via balanced, unified and synchronized vision-language reasoning network
AAAI'24/IAAI'24/EAAI'24: Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence and Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence and Fourteenth Symposium on Educational Advances in Artificial IntelligenceArticle No.: 654, Pages 5885–5893https://doi.org/10.1609/aaai.v38i6.28402Scene text recognition is inherently a vision-language task. However, previous works have predominantly focused either on extracting more robust visual features or designing better language modeling. How to effectively and jointly model vision and ...
- review-articleFebruary 2024
Weakly supervised scene text generation for low-resource languages
Expert Systems with Applications: An International Journal (EXWA), Volume 237, Issue PChttps://doi.org/10.1016/j.eswa.2023.121622AbstractA large number of annotated training images is crucial for training successful scene text recognition models. However, collecting sufficient datasets can be a labor-intensive and costly process, particularly for low-resource languages. To address ...