-
Schottky Anomaly of Reissner-Nordström-de Sitter spacetime
Authors:
Hai-Long Zhen,
Yu-Bo Ma,
Huai-Fan Li,
Li-Chun Zhang,
Yun-Zhi Du
Abstract:
In the extended thermodynamics of black holes, there exists a thermodynamical pressure and its conjugate volume. Extensive studies have been conducted on the phase structure of numerous black holes, which have demonstrated striking similarities to the phase structure of various ordinary matter systems. A comparison of the thermodynamic properties of spherically symmetric AdS black holes with those…
▽ More
In the extended thermodynamics of black holes, there exists a thermodynamical pressure and its conjugate volume. Extensive studies have been conducted on the phase structure of numerous black holes, which have demonstrated striking similarities to the phase structure of various ordinary matter systems. A comparison of the thermodynamic properties of spherically symmetric AdS black holes with those of ordinary thermodynamic systems reveals that the isovolumetric heat capacity of the former is zero, whereas that of the latter is non-zero. It is a subject of interest for the intrinsic reason for this discrepancy. The effective thermodynamic quantities, based on the context of the boundary between the black hole horizon and the cosmological horizon in dS spacetime, as well as the interaction between the two horizons, are presented. The heat capacity in the Reissner-Nördstrom-de Sitter (RN-dS) spacetime is then investigated, and it is demonstrated that the behavior of the heat capacity in the RN-dS spacetime is analogous to that of Schottky specific heat. Treating the two horizon interfaces in the RN-dS spacetime as two distinct energy levels in a two-energy system, and the thermodynamic properties in the RN-dS spacetime are discussed with the method of studying the thermodynamic properties in an ordinary two-energy system, to elucidate the intrinsic reasons for the occurrence of Schottky specific heat in the RN-dS spacetime. The heat capacity observed in the RN-dS spacetime is not only consistent with that of the Schottky specific heat described by the effective thermodynamic quantities in the RN-dS spacetime, but also with that of the heat capacity in an ordinary two-energy-level system. The quantum properties in the RN-dS spacetime are reflected, thereby providing a new avenue for further in-depth study of the quantum properties of black holes and dS spacetime.
△ Less
Submitted 8 June, 2024;
originally announced June 2024.
-
RapVerse: Coherent Vocals and Whole-Body Motions Generations from Text
Authors:
Jiaben Chen,
Xin Yan,
Yihang Chen,
Siyuan Cen,
Qinwei Ma,
Haoyu Zhen,
Kaizhi Qian,
Lie Lu,
Chuang Gan
Abstract:
In this work, we introduce a challenging task for simultaneously generating 3D holistic body motions and singing vocals directly from textual lyrics inputs, advancing beyond existing works that typically address these two modalities in isolation. To facilitate this, we first collect the RapVerse dataset, a large dataset containing synchronous rapping vocals, lyrics, and high-quality 3D holistic bo…
▽ More
In this work, we introduce a challenging task for simultaneously generating 3D holistic body motions and singing vocals directly from textual lyrics inputs, advancing beyond existing works that typically address these two modalities in isolation. To facilitate this, we first collect the RapVerse dataset, a large dataset containing synchronous rapping vocals, lyrics, and high-quality 3D holistic body meshes. With the RapVerse dataset, we investigate the extent to which scaling autoregressive multimodal transformers across language, audio, and motion can enhance the coherent and realistic generation of vocals and whole-body human motions. For modality unification, a vector-quantized variational autoencoder is employed to encode whole-body motion sequences into discrete motion tokens, while a vocal-to-unit model is leveraged to obtain quantized audio tokens preserving content, prosodic information, and singer identity. By jointly performing transformer modeling on these three modalities in a unified way, our framework ensures a seamless and realistic blend of vocals and human motions. Extensive experiments demonstrate that our unified generation framework not only produces coherent and realistic singing vocals alongside human motions directly from textual inputs but also rivals the performance of specialized single-modality generation systems, establishing new benchmarks for joint vocal-motion generation. The project page is available for research purposes at https://vis-www.cs.umass.edu/RapVerse.
△ Less
Submitted 30 May, 2024;
originally announced May 2024.
-
GraSS: Combining Graph Neural Networks with Expert Knowledge for SAT Solver Selection
Authors:
Zhanguang Zhang,
Didier Chetelat,
Joseph Cotnareanu,
Amur Ghose,
Wenyi Xiao,
Hui-Ling Zhen,
Yingxue Zhang,
Jianye Hao,
Mark Coates,
Mingxuan Yuan
Abstract:
Boolean satisfiability (SAT) problems are routinely solved by SAT solvers in real-life applications, yet solving time can vary drastically between solvers for the same instance. This has motivated research into machine learning models that can predict, for a given SAT instance, which solver to select among several options. Existing SAT solver selection methods all rely on some hand-picked instance…
▽ More
Boolean satisfiability (SAT) problems are routinely solved by SAT solvers in real-life applications, yet solving time can vary drastically between solvers for the same instance. This has motivated research into machine learning models that can predict, for a given SAT instance, which solver to select among several options. Existing SAT solver selection methods all rely on some hand-picked instance features, which are costly to compute and ignore the structural information in SAT graphs. In this paper we present GraSS, a novel approach for automatic SAT solver selection based on tripartite graph representations of instances and a heterogeneous graph neural network (GNN) model. While GNNs have been previously adopted in other SAT-related tasks, they do not incorporate any domain-specific knowledge and ignore the runtime variation introduced by different clause orders. We enrich the graph representation with domain-specific decisions, such as novel node feature design, positional encodings for clauses in the graph, a GNN architecture tailored to our tripartite graphs and a runtime-sensitive loss function. Through extensive experiments, we demonstrate that this combination of raw representations and domain-specific choices leads to improvements in runtime for a pool of seven state-of-the-art solvers on both an industrial circuit design benchmark, and on instances from the 20-year Anniversary Track of the 2022 SAT Competition.
△ Less
Submitted 17 May, 2024;
originally announced May 2024.
-
EDA-Driven Preprocessing for SAT Solving
Authors:
Zhengyuan Shi,
Tiebing Tang,
Sadaf Khan,
Hui-Ling Zhen,
Mingxuan Yuan,
Zhufei Chu,
Qiang Xu
Abstract:
Effective formulation of problems into Conjunctive Normal Form (CNF) is critical in modern Boolean Satisfiability (SAT) solving for optimizing solver performance. Addressing the limitations of existing methods, our Electronic Design Automation (EDA)-driven preprocessing framework introduces a novel methodology for preparing SAT instances, leveraging both circuit and CNF formats for enhanced flexib…
▽ More
Effective formulation of problems into Conjunctive Normal Form (CNF) is critical in modern Boolean Satisfiability (SAT) solving for optimizing solver performance. Addressing the limitations of existing methods, our Electronic Design Automation (EDA)-driven preprocessing framework introduces a novel methodology for preparing SAT instances, leveraging both circuit and CNF formats for enhanced flexibility and efficiency. Central to our approach is the integration of a new logic synthesis technique, guided by a reinforcement learning agent, and a novel cost-customized LUT mapping strategy, enabling efficient handling of diverse SAT challenges. By transforming the SAT competition benchmarks into circuit instances, our framework demonstrates substantial performance improvements, as evidenced by a 52.42% reduction on average compared to solving directly. Moreover, our framework achieves a remarkable 96.14% runtime reduction on average for a set of logic equivalence checking problems that exhibit inherent circuit structures. These results highlight the effectiveness and versatility of our approach in handling both CNF and circuit instances. The code is available at https://github.com/cure-lab/EDA4SAT.
△ Less
Submitted 28 March, 2024;
originally announced March 2024.
-
Exploring the Berezinskii-Kosterlitz-Thouless Transition in a Two-dimensional Dipolar Bose Gas
Authors:
Yifei He,
Ziting Chen,
Haoting Zhen,
Mingchen Huang,
Mithilesh K Parit,
Gyu-Boong Jo
Abstract:
Long-range and anisotropic dipolar interactions induce complex order in quantum systems. It becomes particularly interesting in two-dimension (2D), where the superfluidity with quasi-long-range order emerges via Berezinskii-Kosterlitz-Thouless (BKT) mechanism, which still remains elusive with dipolar interactions. Here, we observe the BKT transition from a normal gas to the superfluid phase in a q…
▽ More
Long-range and anisotropic dipolar interactions induce complex order in quantum systems. It becomes particularly interesting in two-dimension (2D), where the superfluidity with quasi-long-range order emerges via Berezinskii-Kosterlitz-Thouless (BKT) mechanism, which still remains elusive with dipolar interactions. Here, we observe the BKT transition from a normal gas to the superfluid phase in a quasi-2D dipolar Bose gas of erbium atoms. Controlling the orientation of dipoles, we characterize the transition point by monitoring extended coherence and measuring the equation of state. This allows us to gain a systematic understanding of the BKT transition based on an effective short-range description of dipolar interaction in 2D. Additionally, we observe anisotropic density fluctuations and non-local effects in the superfluid regime, which establishes the dipolar nature of the 2D superfluid. Our results lay the ground for understanding the behavior of dipolar bosons in 2D and open up opportunities for examining complex orders in a dipolar superfluid.
△ Less
Submitted 27 March, 2024;
originally announced March 2024.
-
3D-VLA: A 3D Vision-Language-Action Generative World Model
Authors:
Haoyu Zhen,
Xiaowen Qiu,
Peihao Chen,
Jincheng Yang,
Xin Yan,
Yilun Du,
Yining Hong,
Chuang Gan
Abstract:
Recent vision-language-action (VLA) models rely on 2D inputs, lacking integration with the broader realm of the 3D physical world. Furthermore, they perform action prediction by learning a direct mapping from perception to action, neglecting the vast dynamics of the world and the relations between actions and dynamics. In contrast, human beings are endowed with world models that depict imagination…
▽ More
Recent vision-language-action (VLA) models rely on 2D inputs, lacking integration with the broader realm of the 3D physical world. Furthermore, they perform action prediction by learning a direct mapping from perception to action, neglecting the vast dynamics of the world and the relations between actions and dynamics. In contrast, human beings are endowed with world models that depict imagination about future scenarios to plan actions accordingly. To this end, we propose 3D-VLA by introducing a new family of embodied foundation models that seamlessly link 3D perception, reasoning, and action through a generative world model. Specifically, 3D-VLA is built on top of a 3D-based large language model (LLM), and a set of interaction tokens is introduced to engage with the embodied environment. Furthermore, to inject generation abilities into the model, we train a series of embodied diffusion models and align them into the LLM for predicting the goal images and point clouds. To train our 3D-VLA, we curate a large-scale 3D embodied instruction dataset by extracting vast 3D-related information from existing robotics datasets. Our experiments on held-in datasets demonstrate that 3D-VLA significantly improves the reasoning, multimodal generation, and planning capabilities in embodied environments, showcasing its potential in real-world applications.
△ Less
Submitted 14 March, 2024;
originally announced March 2024.
-
The Dawn of AI-Native EDA: Opportunities and Challenges of Large Circuit Models
Authors:
Lei Chen,
Yiqi Chen,
Zhufei Chu,
Wenji Fang,
Tsung-Yi Ho,
Ru Huang,
Yu Huang,
Sadaf Khan,
Min Li,
Xingquan Li,
Yu Li,
Yun Liang,
Jinwei Liu,
Yi Liu,
Yibo Lin,
Guojie Luo,
Zhengyuan Shi,
Guangyu Sun,
Dimitrios Tsaras,
Runsheng Wang,
Ziyi Wang,
Xinming Wei,
Zhiyao Xie,
Qiang Xu,
Chenhao Xue
, et al. (14 additional authors not shown)
Abstract:
Within the Electronic Design Automation (EDA) domain, AI-driven solutions have emerged as formidable tools, yet they typically augment rather than redefine existing methodologies. These solutions often repurpose deep learning models from other domains, such as vision, text, and graph analytics, applying them to circuit design without tailoring to the unique complexities of electronic circuits. Suc…
▽ More
Within the Electronic Design Automation (EDA) domain, AI-driven solutions have emerged as formidable tools, yet they typically augment rather than redefine existing methodologies. These solutions often repurpose deep learning models from other domains, such as vision, text, and graph analytics, applying them to circuit design without tailoring to the unique complexities of electronic circuits. Such an AI4EDA approach falls short of achieving a holistic design synthesis and understanding, overlooking the intricate interplay of electrical, logical, and physical facets of circuit data. This paper argues for a paradigm shift from AI4EDA towards AI-native EDA, integrating AI at the core of the design process. Pivotal to this vision is the development of a multimodal circuit representation learning technique, poised to provide a comprehensive understanding by harmonizing and extracting insights from varied data sources, such as functional specifications, RTL designs, circuit netlists, and physical layouts. We champion the creation of large circuit models (LCMs) that are inherently multimodal, crafted to decode and express the rich semantics and structures of circuit data, thus fostering more resilient, efficient, and inventive design methodologies. Embracing this AI-native philosophy, we foresee a trajectory that transcends the current innovation plateau in EDA, igniting a profound shift-left in electronic design methodology. The envisioned advancements herald not just an evolution of existing EDA tools but a revolution, giving rise to novel instruments of design tools that promise to radically enhance design productivity and inaugurate a new epoch where the optimization of circuit performance, power, and area (PPA) is achieved not incrementally, but through leaps that redefine the benchmarks of electronic systems' capabilities.
△ Less
Submitted 1 May, 2024; v1 submitted 11 March, 2024;
originally announced March 2024.
-
IB-Net: Initial Branch Network for Variable Decision in Boolean Satisfiability
Authors:
Tsz Ho Chan,
Wenyi Xiao,
Junhua Huang,
Huiling Zhen,
Guangji Tian,
Mingxuan Yuan
Abstract:
Boolean Satisfiability problems are vital components in Electronic Design Automation, particularly within the Logic Equivalence Checking process. Currently, SAT solvers are employed for these problems and neural network is tried as assistance to solvers. However, as SAT problems in the LEC context are distinctive due to their predominantly unsatisfiability nature and a substantial proportion of UN…
▽ More
Boolean Satisfiability problems are vital components in Electronic Design Automation, particularly within the Logic Equivalence Checking process. Currently, SAT solvers are employed for these problems and neural network is tried as assistance to solvers. However, as SAT problems in the LEC context are distinctive due to their predominantly unsatisfiability nature and a substantial proportion of UNSAT-core variables, existing neural network assistance has proven unsuccessful in this specialized domain. To tackle this challenge, we propose IB-Net, an innovative framework utilizing graph neural networks and novel graph encoding techniques to model unsatisfiable problems and interact with state-of-the-art solvers. Extensive evaluations across solvers and datasets demonstrate IB-Net's acceleration, achieving an average runtime speedup of 5.0% on industrial data and 8.3% on SAT competition data empirically. This breakthrough advances efficient solving in LEC workflows.
△ Less
Submitted 6 March, 2024;
originally announced March 2024.
-
DiLA: Enhancing LLM Tool Learning with Differential Logic Layer
Authors:
Yu Zhang,
Hui-Ling Zhen,
Zehua Pei,
Yingzhao Lian,
Lihao Yin,
Mingxuan Yuan,
Bei Yu
Abstract:
Considering the challenges faced by large language models (LLMs) in logical reasoning and planning, prior efforts have sought to augment LLMs with access to external solvers. While progress has been made on simple reasoning problems, solving classical constraint satisfaction problems, such as the Boolean Satisfiability Problem (SAT) and Graph Coloring Problem (GCP), remains difficult for off-the-s…
▽ More
Considering the challenges faced by large language models (LLMs) in logical reasoning and planning, prior efforts have sought to augment LLMs with access to external solvers. While progress has been made on simple reasoning problems, solving classical constraint satisfaction problems, such as the Boolean Satisfiability Problem (SAT) and Graph Coloring Problem (GCP), remains difficult for off-the-shelf solvers due to their intricate expressions and exponential search spaces. In this paper, we propose a novel differential logic layer-aided language modeling (DiLA) approach, where logical constraints are integrated into the forward and backward passes of a network layer, to provide another option for LLM tool learning. In DiLA, LLM aims to transform the language description to logic constraints and identify initial solutions of the highest quality, while the differential logic layer focuses on iteratively refining the LLM-prompted solution. Leveraging the logic layer as a bridge, DiLA enhances the logical reasoning ability of LLMs on a range of reasoning problems encoded by Boolean variables, guaranteeing the efficiency and correctness of the solution process. We evaluate the performance of DiLA on two classic reasoning problems and empirically demonstrate its consistent outperformance against existing prompt-based and solver-aided approaches.
△ Less
Submitted 18 June, 2024; v1 submitted 19 February, 2024;
originally announced February 2024.
-
BetterV: Controlled Verilog Generation with Discriminative Guidance
Authors:
Zehua Pei,
Hui-Ling Zhen,
Mingxuan Yuan,
Yu Huang,
Bei Yu
Abstract:
Due to the growing complexity of modern Integrated Circuits (ICs), there is a need for automated circuit design methods. Recent years have seen rising research in hardware design language generation to facilitate the design process. In this work, we propose a Verilog generation framework, BetterV, which fine-tunes the large language models (LLMs) on processed domain-specific datasets and incorpora…
▽ More
Due to the growing complexity of modern Integrated Circuits (ICs), there is a need for automated circuit design methods. Recent years have seen rising research in hardware design language generation to facilitate the design process. In this work, we propose a Verilog generation framework, BetterV, which fine-tunes the large language models (LLMs) on processed domain-specific datasets and incorporates generative discriminators for guidance on particular design demands. The Verilog modules are collected, filtered and processed from internet to form a clean and abundant dataset. Instruct-tuning methods are specially designed to fine-tune the LLMs to understand the knowledge about Verilog. Furthermore, data are augmented to enrich the training set and also used to train a generative discriminator on particular downstream task, which leads a guidance for the LLMs to optimize the Verilog implementation. BetterV has the ability to generate syntactically and functionally correct Verilog, which can outperform GPT-4 on the VerilogEval benchmark. With the help of task-specific generative discriminator, BetterV can achieve remarkable improvement on various electronic design automation (EDA) downstream tasks, including the netlist node reduction for synthesis and verification runtime reduction with Boolean Satisfiability (SAT) solving.
△ Less
Submitted 2 May, 2024; v1 submitted 3 February, 2024;
originally announced February 2024.
-
LLM4EDA: Emerging Progress in Large Language Models for Electronic Design Automation
Authors:
Ruizhe Zhong,
Xingbo Du,
Shixiong Kai,
Zhentao Tang,
Siyuan Xu,
Hui-Ling Zhen,
Jianye Hao,
Qiang Xu,
Mingxuan Yuan,
Junchi Yan
Abstract:
Driven by Moore's Law, the complexity and scale of modern chip design are increasing rapidly. Electronic Design Automation (EDA) has been widely applied to address the challenges encountered in the full chip design process. However, the evolution of very large-scale integrated circuits has made chip design time-consuming and resource-intensive, requiring substantial prior expert knowledge. Additio…
▽ More
Driven by Moore's Law, the complexity and scale of modern chip design are increasing rapidly. Electronic Design Automation (EDA) has been widely applied to address the challenges encountered in the full chip design process. However, the evolution of very large-scale integrated circuits has made chip design time-consuming and resource-intensive, requiring substantial prior expert knowledge. Additionally, intermediate human control activities are crucial for seeking optimal solutions. In system design stage, circuits are usually represented with Hardware Description Language (HDL) as a textual format. Recently, Large Language Models (LLMs) have demonstrated their capability in context understanding, logic reasoning and answer generation. Since circuit can be represented with HDL in a textual format, it is reasonable to question whether LLMs can be leveraged in the EDA field to achieve fully automated chip design and generate circuits with improved power, performance, and area (PPA). In this paper, we present a systematic study on the application of LLMs in the EDA field, categorizing it into the following cases: 1) assistant chatbot, 2) HDL and script generation, and 3) HDL verification and analysis. Additionally, we highlight the future research direction, focusing on applying LLMs in logic synthesis, physical design, multi-modal feature extraction and alignment of circuits. We collect relevant papers up-to-date in this field via the following link: https://github.com/Thinklab-SJTU/Awesome-LLM4EDA.
△ Less
Submitted 28 December, 2023;
originally announced January 2024.
-
Machine Learning Insides OptVerse AI Solver: Design Principles and Applications
Authors:
Xijun Li,
Fangzhou Zhu,
Hui-Ling Zhen,
Weilin Luo,
Meng Lu,
Yimin Huang,
Zhenan Fan,
Zirui Zhou,
Yufei Kuang,
Zhihai Wang,
Zijie Geng,
Yang Li,
Haoyang Liu,
Zhiwu An,
Muming Yang,
Jianshu Li,
Jie Wang,
Junchi Yan,
Defeng Sun,
Tao Zhong,
Yong Zhang,
Jia Zeng,
Mingxuan Yuan,
Jianye Hao,
Jun Yao
, et al. (1 additional authors not shown)
Abstract:
In an era of digital ubiquity, efficient resource management and decision-making are paramount across numerous industries. To this end, we present a comprehensive study on the integration of machine learning (ML) techniques into Huawei Cloud's OptVerse AI Solver, which aims to mitigate the scarcity of real-world mathematical programming instances, and to surpass the capabilities of traditional opt…
▽ More
In an era of digital ubiquity, efficient resource management and decision-making are paramount across numerous industries. To this end, we present a comprehensive study on the integration of machine learning (ML) techniques into Huawei Cloud's OptVerse AI Solver, which aims to mitigate the scarcity of real-world mathematical programming instances, and to surpass the capabilities of traditional optimization techniques. We showcase our methods for generating complex SAT and MILP instances utilizing generative models that mirror multifaceted structures of real-world problem. Furthermore, we introduce a training framework leveraging augmentation policies to maintain solvers' utility in dynamic environments. Besides the data generation and augmentation, our proposed approaches also include novel ML-driven policies for personalized solver strategies, with an emphasis on applications like graph convolutional networks for initial basis selection and reinforcement learning for advanced presolving and cut selection. Additionally, we detail the incorporation of state-of-the-art parameter tuning algorithms which markedly elevate solver performance. Compared with traditional solvers such as Cplex and SCIP, our ML-augmented OptVerse AI Solver demonstrates superior speed and precision across both established benchmarks and real-world scenarios, reinforcing the practical imperative and effectiveness of machine learning techniques in mathematical programming solvers.
△ Less
Submitted 17 January, 2024; v1 submitted 11 January, 2024;
originally announced January 2024.
-
CHORD: Category-level Hand-held Object Reconstruction via Shape Deformation
Authors:
Kailin Li,
Lixin Yang,
Haoyu Zhen,
Zenan Lin,
Xinyu Zhan,
Licheng Zhong,
Jian Xu,
Kejian Wu,
Cewu Lu
Abstract:
In daily life, humans utilize hands to manipulate objects. Modeling the shape of objects that are manipulated by the hand is essential for AI to comprehend daily tasks and to learn manipulation skills. However, previous approaches have encountered difficulties in reconstructing the precise shapes of hand-held objects, primarily owing to a deficiency in prior shape knowledge and inadequate data for…
▽ More
In daily life, humans utilize hands to manipulate objects. Modeling the shape of objects that are manipulated by the hand is essential for AI to comprehend daily tasks and to learn manipulation skills. However, previous approaches have encountered difficulties in reconstructing the precise shapes of hand-held objects, primarily owing to a deficiency in prior shape knowledge and inadequate data for training. As illustrated, given a particular type of tool, such as a mug, despite its infinite variations in shape and appearance, humans have a limited number of 'effective' modes and poses for its manipulation. This can be attributed to the fact that humans have mastered the shape prior of the 'mug' category, and can quickly establish the corresponding relations between different mug instances and the prior, such as where the rim and handle are located. In light of this, we propose a new method, CHORD, for Category-level Hand-held Object Reconstruction via shape Deformation. CHORD deforms a categorical shape prior for reconstructing the intra-class objects. To ensure accurate reconstruction, we empower CHORD with three types of awareness: appearance, shape, and interacting pose. In addition, we have constructed a new dataset, COMIC, of category-level hand-object interaction. COMIC contains a rich array of object instances, materials, hand interactions, and viewing directions. Extensive evaluation shows that CHORD outperforms state-of-the-art approaches in both quantitative and qualitative measures. Code, model, and datasets are available at https://kailinli.github.io/CHORD.
△ Less
Submitted 21 August, 2023;
originally announced August 2023.
-
Color-NeuS: Reconstructing Neural Implicit Surfaces with Color
Authors:
Licheng Zhong,
Lixin Yang,
Kailin Li,
Haoyu Zhen,
Mei Han,
Cewu Lu
Abstract:
The reconstruction of object surfaces from multi-view images or monocular video is a fundamental issue in computer vision. However, much of the recent research concentrates on reconstructing geometry through implicit or explicit methods. In this paper, we shift our focus towards reconstructing mesh in conjunction with color. We remove the view-dependent color from neural volume rendering while ret…
▽ More
The reconstruction of object surfaces from multi-view images or monocular video is a fundamental issue in computer vision. However, much of the recent research concentrates on reconstructing geometry through implicit or explicit methods. In this paper, we shift our focus towards reconstructing mesh in conjunction with color. We remove the view-dependent color from neural volume rendering while retaining volume rendering performance through a relighting network. Mesh is extracted from the signed distance function (SDF) network for the surface, and color for each surface vertex is drawn from the global color network. To evaluate our approach, we conceived a in hand object scanning task featuring numerous occlusions and dramatic shifts in lighting conditions. We've gathered several videos for this task, and the results surpass those of any existing methods capable of reconstructing mesh alongside color. Additionally, our method's performance was assessed using public datasets, including DTU, BlendedMVS, and OmniObject3D. The results indicated that our method performs well across all these datasets. Project page: https://colmar-zlicheng.github.io/color_neus.
△ Less
Submitted 19 December, 2023; v1 submitted 14 August, 2023;
originally announced August 2023.
-
3D-LLM: Injecting the 3D World into Large Language Models
Authors:
Yining Hong,
Haoyu Zhen,
Peihao Chen,
Shuhong Zheng,
Yilun Du,
Zhenfang Chen,
Chuang Gan
Abstract:
Large language models (LLMs) and Vision-Language Models (VLMs) have been proven to excel at multiple tasks, such as commonsense reasoning. Powerful as these models can be, they are not grounded in the 3D physical world, which involves richer concepts such as spatial relationships, affordances, physics, layout, and so on. In this work, we propose to inject the 3D world into large language models an…
▽ More
Large language models (LLMs) and Vision-Language Models (VLMs) have been proven to excel at multiple tasks, such as commonsense reasoning. Powerful as these models can be, they are not grounded in the 3D physical world, which involves richer concepts such as spatial relationships, affordances, physics, layout, and so on. In this work, we propose to inject the 3D world into large language models and introduce a whole new family of 3D-LLMs. Specifically, 3D-LLMs can take 3D point clouds and their features as input and perform a diverse set of 3D-related tasks, including captioning, dense captioning, 3D question answering, task decomposition, 3D grounding, 3D-assisted dialog, navigation, and so on. Using three types of prompting mechanisms that we design, we are able to collect over 300k 3D-language data covering these tasks. To efficiently train 3D-LLMs, we first utilize a 3D feature extractor that obtains 3D features from rendered multi- view images. Then, we use 2D VLMs as our backbones to train our 3D-LLMs. By introducing a 3D localization mechanism, 3D-LLMs can better capture 3D spatial information. Experiments on ScanQA show that our model outperforms state-of-the-art baselines by a large margin (e.g., the BLEU-1 score surpasses state-of-the-art score by 9%). Furthermore, experiments on our held-in datasets for 3D captioning, task composition, and 3D-assisted dialogue show that our model outperforms 2D VLMs. Qualitative examples also show that our model could perform more tasks beyond the scope of existing LLMs and VLMs. Project Page: : https://vis-www.cs.umass.edu/3dllm/.
△ Less
Submitted 24 July, 2023;
originally announced July 2023.
-
Magnetic field regression using artificial neural networks for cold atom experiments
Authors:
Ziting Chen,
Kin To Wong,
Bojeong Seo,
Mingchen Huang,
Mithilesh K. Parit,
Haoting Zhen,
Jensen Li,
Gyu-Boong Jo
Abstract:
Accurately measuring magnetic fields is essential for magnetic-field sensitive experiments in fields like atomic, molecular, and optical physics, condensed matter experiments, and other areas. However, since many experiments are conducted in an isolated vacuum environment that is inaccessible to experimentalists, it can be challenging to accurately determine the magnetic field. Here, we propose an…
▽ More
Accurately measuring magnetic fields is essential for magnetic-field sensitive experiments in fields like atomic, molecular, and optical physics, condensed matter experiments, and other areas. However, since many experiments are conducted in an isolated vacuum environment that is inaccessible to experimentalists, it can be challenging to accurately determine the magnetic field. Here, we propose an efficient method for detecting magnetic fields with the assistance of an artificial neural network (NN). Instead of measuring the magnetic field directly at the desired location, we detect magnetic fields at several surrounding positions, and a trained NN can accurately predict the magnetic field at the target location. After training, we achieve a relative error of magnetic field magnitude (magnitude of error over the magnitude of magnetic field) below 0.3$\%$, and we successfully apply this method to our erbium quantum gas apparatus. This approach significantly simplifies the process of determining magnetic fields in isolated vacuum environments and can be applied to various research fields across a wide range of magnetic field magnitudes.
△ Less
Submitted 30 May, 2023;
originally announced May 2023.
-
DeepGate2: Functionality-Aware Circuit Representation Learning
Authors:
Zhengyuan Shi,
Hongyang Pan,
Sadaf Khan,
Min Li,
Yi Liu,
Junhua Huang,
Hui-Ling Zhen,
Mingxuan Yuan,
Zhufei Chu,
Qiang Xu
Abstract:
Circuit representation learning aims to obtain neural representations of circuit elements and has emerged as a promising research direction that can be applied to various EDA and logic reasoning tasks. Existing solutions, such as DeepGate, have the potential to embed both circuit structural information and functional behavior. However, their capabilities are limited due to weak supervision or flaw…
▽ More
Circuit representation learning aims to obtain neural representations of circuit elements and has emerged as a promising research direction that can be applied to various EDA and logic reasoning tasks. Existing solutions, such as DeepGate, have the potential to embed both circuit structural information and functional behavior. However, their capabilities are limited due to weak supervision or flawed model design, resulting in unsatisfactory performance in downstream tasks. In this paper, we introduce DeepGate2, a novel functionality-aware learning framework that significantly improves upon the original DeepGate solution in terms of both learning effectiveness and efficiency. Our approach involves using pairwise truth table differences between sampled logic gates as training supervision, along with a well-designed and scalable loss function that explicitly considers circuit functionality. Additionally, we consider inherent circuit characteristics and design an efficient one-round graph neural network (GNN), resulting in an order of magnitude faster learning speed than the original DeepGate solution. Experimental results demonstrate significant improvements in two practical downstream tasks: logic synthesis and Boolean satisfiability solving. The code is available at https://github.com/cure-lab/DeepGate2
△ Less
Submitted 25 May, 2023;
originally announced May 2023.
-
Conflict-driven Structural Learning Towards Higher Coverage Rate in ATPG
Authors:
Hui-Ling Zhen,
Naixing Wang,
Junhua Huang,
Xinyue Huang,
Mingxuan Yuan,
Yu Huang
Abstract:
Due to the increasing challenges posed by the relentless rise in the design complexity of integrated circuits, Boolean Satisfiability (SAT) has emerged as a robust alternative to structural APTG techniques. However, the high cost of transforming a circuit testing problem to a Conjunctive Normal Form (CNF) limits the application of SAT in industrial ATPG scenarios, resulting in a loss of test cover…
▽ More
Due to the increasing challenges posed by the relentless rise in the design complexity of integrated circuits, Boolean Satisfiability (SAT) has emerged as a robust alternative to structural APTG techniques. However, the high cost of transforming a circuit testing problem to a Conjunctive Normal Form (CNF) limits the application of SAT in industrial ATPG scenarios, resulting in a loss of test coverage. In Order to address this problem, this paper proposes a conflict-driven structural learning (CDSL) ATPG algorithm firstly, in which the conflict-driven heuristic methods in modern SAT solver are implemented on the logic cone of fault propagation and activation directly. The proposed CDSL algorithm is composed of three parts: (1) According to the implication graph, various conflict constraints have been learned to prune search space. (2) Conflict-driven implication and justification have been applied to increase decision accuracy and solving efficiency. (3) A conflict-based diagnosis method is further proposed in the case of low coverage debug, leading to making the aborted faults testable by relaxing or modifying some constraints on primary inputs. Extensive experimental results on industrial circuits demonstrate the effectiveness and efficiency of the proposed CDSL algorithm. It is shown that compared with the SAT-based ATPG, the proposed CDSL can on average decrease $25.6\%$ aborted faults with $94.51\%$ less run time. With a two-stage computational flow, it has shown that the proposed CDSL can lead to $46.37\%$ less aborted faults than a one-stage structural algorithm, further with the $3.19\%$ improvement on fault coverage. In addition, the conflict diagnosis can lead to $8.89\%$ less aborted faults on average, and $0.271\%$ improvement in fault coverage rate.
△ Less
Submitted 3 March, 2023;
originally announced March 2023.
-
HardSATGEN: Understanding the Difficulty of Hard SAT Formula Generation and A Strong Structure-Hardness-Aware Baseline
Authors:
Yang Li,
Xinyan Chen,
Wenxuan Guo,
Xijun Li,
Wanqian Luo,
Junhua Huang,
Hui-Ling Zhen,
Mingxuan Yuan,
Junchi Yan
Abstract:
Industrial SAT formula generation is a critical yet challenging task. Existing SAT generation approaches can hardly simultaneously capture the global structural properties and maintain plausible computational hardness. We first present an in-depth analysis for the limitation of previous learning methods in reproducing the computational hardness of original instances, which may stem from the inhere…
▽ More
Industrial SAT formula generation is a critical yet challenging task. Existing SAT generation approaches can hardly simultaneously capture the global structural properties and maintain plausible computational hardness. We first present an in-depth analysis for the limitation of previous learning methods in reproducing the computational hardness of original instances, which may stem from the inherent homogeneity in their adopted split-merge procedure. On top of the observations that industrial formulae exhibit clear community structure and oversplit substructures lead to the difficulty in semantic formation of logical structures, we propose HardSATGEN, which introduces a fine-grained control mechanism to the neural split-merge paradigm for SAT formula generation to better recover the structural and computational properties of the industrial benchmarks. Experiments including evaluations on private and practical corporate testbed show the superiority of HardSATGEN being the only method to successfully augment formulae maintaining similar computational hardness and capturing the global structural properties simultaneously. Compared to the best previous methods, the average performance gains achieve 38.5% in structural statistics, 88.4% in computational metrics, and over 140.7% in the effectiveness of guiding solver tuning by our generated instances. Source code is available at http://github.com/Thinklab-SJTU/HardSATGEN
△ Less
Submitted 8 February, 2024; v1 submitted 4 February, 2023;
originally announced February 2023.
-
Co-supervised learning paradigm with conditional generative adversarial networks for sample-efficient classification
Authors:
Hao Zhen,
Yucheng Shi,
Jidong J. Yang,
Javad Mohammadpour Vehni
Abstract:
Classification using supervised learning requires annotating a large amount of classes-balanced data for model training and testing. This has practically limited the scope of applications with supervised learning, in particular deep learning. To address the issues associated with limited and imbalanced data, this paper introduces a sample-efficient co-supervised learning paradigm (SEC-CGAN), in wh…
▽ More
Classification using supervised learning requires annotating a large amount of classes-balanced data for model training and testing. This has practically limited the scope of applications with supervised learning, in particular deep learning. To address the issues associated with limited and imbalanced data, this paper introduces a sample-efficient co-supervised learning paradigm (SEC-CGAN), in which a conditional generative adversarial network (CGAN) is trained alongside the classifier and supplements semantics-conditioned, confidence-aware synthesized examples to the annotated data during the training process. In this setting, the CGAN not only serves as a co-supervisor but also provides complementary quality examples to aid the classifier training in an end-to-end fashion. Experiments demonstrate that the proposed SEC-CGAN outperforms the external classifier GAN (EC-GAN) and a baseline ResNet-18 classifier. For the comparison, all classifiers in above methods adopt the ResNet-18 architecture as the backbone. Particularly, for the Street View House Numbers dataset, using the 5% of training data, a test accuracy of 90.26% is achieved by SEC-CGAN as opposed to 88.59% by EC-GAN and 87.17% by the baseline classifier; for the highway image dataset, using the 10% of training data, a test accuracy of 98.27% is achieved by SEC-CGAN, compared to 97.84% by EC-GAN and 95.52% by the baseline classifier.
△ Less
Submitted 27 December, 2022;
originally announced December 2022.
-
SATformer: Transformer-Based UNSAT Core Learning
Authors:
Zhengyuan Shi,
Min Li,
Yi Liu,
Sadaf Khan,
Junhua Huang,
Hui-Ling Zhen,
Mingxuan Yuan,
Qiang Xu
Abstract:
This paper introduces SATformer, a novel Transformer-based approach for the Boolean Satisfiability (SAT) problem. Rather than solving the problem directly, SATformer approaches the problem from the opposite direction by focusing on unsatisfiability. Specifically, it models clause interactions to identify any unsatisfiable sub-problems. Using a graph neural network, we convert clauses into clause e…
▽ More
This paper introduces SATformer, a novel Transformer-based approach for the Boolean Satisfiability (SAT) problem. Rather than solving the problem directly, SATformer approaches the problem from the opposite direction by focusing on unsatisfiability. Specifically, it models clause interactions to identify any unsatisfiable sub-problems. Using a graph neural network, we convert clauses into clause embeddings and employ a hierarchical Transformer-based model to understand clause correlation. SATformer is trained through a multi-task learning approach, using the single-bit satisfiability result and the minimal unsatisfiable core (MUC) for UNSAT problems as clause supervision. As an end-to-end learning-based satisfiability classifier, the performance of SATformer surpasses that of NeuroSAT significantly. Furthermore, we integrate the clause predictions made by SATformer into modern heuristic-based SAT solvers and validate our approach with a logic equivalence checking task. Experimental results show that our SATformer can decrease the runtime of existing solvers by an average of 21.33%.
△ Less
Submitted 11 March, 2024; v1 submitted 2 September, 2022;
originally announced September 2022.
-
Branch Ranking for Efficient Mixed-Integer Programming via Offline Ranking-based Policy Learning
Authors:
Zeren Huang,
Wenhao Chen,
Weinan Zhang,
Chuhan Shi,
Furui Liu,
Hui-Ling Zhen,
Mingxuan Yuan,
Jianye Hao,
Yong Yu,
Jun Wang
Abstract:
Deriving a good variable selection strategy in branch-and-bound is essential for the efficiency of modern mixed-integer programming (MIP) solvers. With MIP branching data collected during the previous solution process, learning to branch methods have recently become superior over heuristics. As branch-and-bound is naturally a sequential decision making task, one should learn to optimize the utilit…
▽ More
Deriving a good variable selection strategy in branch-and-bound is essential for the efficiency of modern mixed-integer programming (MIP) solvers. With MIP branching data collected during the previous solution process, learning to branch methods have recently become superior over heuristics. As branch-and-bound is naturally a sequential decision making task, one should learn to optimize the utility of the whole MIP solving process instead of being myopic on each step. In this work, we formulate learning to branch as an offline reinforcement learning (RL) problem, and propose a long-sighted hybrid search scheme to construct the offline MIP dataset, which values the long-term utilities of branching decisions. During the policy training phase, we deploy a ranking-based reward assignment scheme to distinguish the promising samples from the long-term or short-term view, and train the branching model named Branch Ranking via offline policy learning. Experiments on synthetic MIP benchmarks and real-world tasks demonstrate that Branch Rankink is more efficient and robust, and can better generalize to large scales of MIP instances compared to the widely used heuristics and state-of-the-art learning-based branching models.
△ Less
Submitted 26 July, 2022;
originally announced July 2022.
-
Eco-driving Trajectory Planning of a Heterogeneous Platoon in Urban Environments
Authors:
Hao Zhen,
Sahand Mosharafian,
Jidong J. Yang,
Javad Mohammadpour Velni
Abstract:
Given the increasing popularity and demand for connected and autonomous vehicles (CAVs), Eco-driving and platooning in highways and urban areas to increase the efficiency of the traffic system is becoming a possibility. This paper presents Eco-driving trajectory planning for a platoon of heterogeneous electric vehicles (EVs) in urban environments. The proposed control strategy for the platoon cons…
▽ More
Given the increasing popularity and demand for connected and autonomous vehicles (CAVs), Eco-driving and platooning in highways and urban areas to increase the efficiency of the traffic system is becoming a possibility. This paper presents Eco-driving trajectory planning for a platoon of heterogeneous electric vehicles (EVs) in urban environments. The proposed control strategy for the platoon considers energy consumption, mobility and passenger comfort, with which vehicles may pass signalized intersections with no stops. For a given urban route, first, the platoon's leader vehicle employs dynamic programming (DP) to plan a trajectory for the anticipated path with the aim of balancing energy consumption, mobility and passenger comfort. Then, every other following CAV in the platoon either follows its preceding vehicle using a PID-based cooperative adaptive cruise control or plans its own trajectory by checking whether it can pass the next intersection without stopping. Furthermore, a heavy-duty vehicle that cannot efficiently follow a light-weight vehicle would instead employ the DP-based trajectory planner. Simulation studies demonstrate the efficacy of the proposed control strategy with which the platoon's energy consumption is shown to reduce while the mobility is not compromised.
△ Less
Submitted 19 May, 2022;
originally announced May 2022.
-
Machine Learning Methods in Solving the Boolean Satisfiability Problem
Authors:
Wenxuan Guo,
Junchi Yan,
Hui-Ling Zhen,
Xijun Li,
Mingxuan Yuan,
Yaohui Jin
Abstract:
This paper reviews the recent literature on solving the Boolean satisfiability problem (SAT), an archetypal NP-complete problem, with the help of machine learning techniques. Despite the great success of modern SAT solvers to solve large industrial instances, the design of handcrafted heuristics is time-consuming and empirical. Under the circumstances, the flexible and expressive machine learning…
▽ More
This paper reviews the recent literature on solving the Boolean satisfiability problem (SAT), an archetypal NP-complete problem, with the help of machine learning techniques. Despite the great success of modern SAT solvers to solve large industrial instances, the design of handcrafted heuristics is time-consuming and empirical. Under the circumstances, the flexible and expressive machine learning methods provide a proper alternative to solve this long-standing problem. We examine the evolving ML-SAT solvers from naive classifiers with handcrafted features to the emerging end-to-end SAT solvers such as NeuroSAT, as well as recent progress on combinations of existing CDCL and local search solvers with machine learning methods. Overall, solving SAT with machine learning is a promising yet challenging research topic. We conclude the limitations of current works and suggest possible future directions.
△ Less
Submitted 2 March, 2022;
originally announced March 2022.
-
A Survey for Solving Mixed Integer Programming via Machine Learning
Authors:
Jiayi Zhang,
Chang Liu,
Junchi Yan,
Xijun Li,
Hui-Ling Zhen,
Mingxuan Yuan
Abstract:
This paper surveys the trend of leveraging machine learning to solve mixed integer programming (MIP) problems. Theoretically, MIP is an NP-hard problem, and most of the combinatorial optimization (CO) problems can be formulated as the MIP. Like other CO problems, the human-designed heuristic algorithms for MIP rely on good initial solutions and cost a lot of computational resources. Therefore, we…
▽ More
This paper surveys the trend of leveraging machine learning to solve mixed integer programming (MIP) problems. Theoretically, MIP is an NP-hard problem, and most of the combinatorial optimization (CO) problems can be formulated as the MIP. Like other CO problems, the human-designed heuristic algorithms for MIP rely on good initial solutions and cost a lot of computational resources. Therefore, we consider applying machine learning methods to solve MIP, since ML-enhanced approaches can provide the solution based on the typical patterns from the historical data. In this paper, we first introduce the formulation and preliminaries of MIP and several traditional algorithms to solve MIP. Then, we advocate further promoting the different integration of machine learning and MIP and introducing related learning-based methods, which can be classified into exact algorithms and heuristic algorithms. Finally, we propose the outlook for learning-based MIP solvers, direction towards more combinatorial optimization problems beyond MIP, and also the mutual embrace of traditional solvers and machine learning components.
△ Less
Submitted 6 March, 2022;
originally announced March 2022.
-
Learning to Select Cuts for Efficient Mixed-Integer Programming
Authors:
Zeren Huang,
Kerong Wang,
Furui Liu,
Hui-ling Zhen,
Weinan Zhang,
Mingxuan Yuan,
Jianye Hao,
Yong Yu,
Jun Wang
Abstract:
Cutting plane methods play a significant role in modern solvers for tackling mixed-integer programming (MIP) problems. Proper selection of cuts would remove infeasible solutions in the early stage, thus largely reducing the computational burden without hurting the solution accuracy. However, the major cut selection approaches heavily rely on heuristics, which strongly depend on the specific proble…
▽ More
Cutting plane methods play a significant role in modern solvers for tackling mixed-integer programming (MIP) problems. Proper selection of cuts would remove infeasible solutions in the early stage, thus largely reducing the computational burden without hurting the solution accuracy. However, the major cut selection approaches heavily rely on heuristics, which strongly depend on the specific problem at hand and thus limit their generalization capability. In this paper, we propose a data-driven and generalizable cut selection approach, named Cut Ranking, in the settings of multiple instance learning. To measure the quality of the candidate cuts, a scoring function, which takes the instance-specific cut features as inputs, is trained and applied in cut ranking and selection. In order to evaluate our method, we conduct extensive experiments on both synthetic datasets and real-world datasets. Compared with commonly used heuristics for cut selection, the learning-based policy has shown to be more effective, and is capable of generalizing over multiple problems with different properties. Cut Ranking has been deployed in an industrial solver for large-scale MIPs. In the online A/B testing of the product planning problems with more than $10^7$ variables and constraints daily, Cut Ranking has achieved the average speedup ratio of 12.42% over the production solver without any accuracy loss of solution.
△ Less
Submitted 8 October, 2021; v1 submitted 28 May, 2021;
originally announced May 2021.
-
Temporal Properties of Precursors, Main peaks and Extended Emissions of Short GRBs in the Third Swift/BAT GRB Catalog
Authors:
X. J. Li,
Z. B. Zhang,
X. L. Zhang,
H. Y. Zhen
Abstract:
A comprehensive study is given to short gamma-ray bursts (sGRBs) in the third Swift/BAT GRB Catalog from December 2004 to July 2019. We examine in details the temporal properties of the three components in the prompt gamma-ray emission phase, including precursors, main peaks and extended emissions (EE). We investigate the similarity of the main peaks between one-component and two-component sGRBs.…
▽ More
A comprehensive study is given to short gamma-ray bursts (sGRBs) in the third Swift/BAT GRB Catalog from December 2004 to July 2019. We examine in details the temporal properties of the three components in the prompt gamma-ray emission phase, including precursors, main peaks and extended emissions (EE). We investigate the similarity of the main peaks between one-component and two-component sGRBs. It is found that there is no substantial difference among their main peaks. Importantly, comparisons are made between in the single-peaked sGRBs and the double-peaked sGRBs. It is found that our results of main peaks in Swift/BAT sGRBs are essentially consistent with those in CGRO/BATSE ones recently found in our paper I. Interestingly, we suspect, besides the newly-found MODE I/II evolution forms of pulses in BATSE sGRBs in paper I, that there would have more evolution modes of pulses across differently adjacent energy channels in view of the Swift/BAT observations. We further inspect the correlation of the main peaks with either the precursors or the EEs. We find that the main peaks tend to last longer than the precursors but shorter than the EEs. In particular, we verify the power-law correlations related with peak fluxes of the three components, strongly suggesting that they are produced from the similar central engine activities. Especially, we compare the temporal properties of GRB 170817A with other sGRBs with EE and find no obvious differences between them.
△ Less
Submitted 21 October, 2020;
originally announced October 2020.
-
Bilevel Learning Model Towards Industrial Scheduling
Authors:
Longkang Li,
Hui-Ling Zhen,
Mingxuan Yuan,
Jiawen Lu,
XialiangTong,
Jia Zeng,
Jun Wang,
Dirk Schnieders
Abstract:
Automatic industrial scheduling, aiming at optimizing the sequence of jobs over limited resources, is widely needed in manufacturing industries. However, existing scheduling systems heavily rely on heuristic algorithms, which either generate ineffective solutions or compute inefficiently when job scale increases. Thus, it is of great importance to develop new large-scale algorithms that are not on…
▽ More
Automatic industrial scheduling, aiming at optimizing the sequence of jobs over limited resources, is widely needed in manufacturing industries. However, existing scheduling systems heavily rely on heuristic algorithms, which either generate ineffective solutions or compute inefficiently when job scale increases. Thus, it is of great importance to develop new large-scale algorithms that are not only efficient and effective, but also capable of satisfying complex constraints in practice. In this paper, we propose a Bilevel Deep reinforcement learning Scheduler, \textit{BDS}, in which the higher level is responsible for exploring an initial global sequence, whereas the lower level is aiming at exploitation for partial sequence refinements, and the two levels are connected by a sliding-window sampling mechanism. In the implementation, a Double Deep Q Network (DDQN) is used in the upper level and Graph Pointer Network (GPN) lies within the lower level. After the theoretical guarantee for the convergence of BDS, we evaluate it in an industrial automatic warehouse scenario, with job number up to $5000$ in each production line. It is shown that our proposed BDS significantly outperforms two most used heuristics, three strong deep networks, and another bilevel baseline approach. In particular, compared with the most used greedy-based heuristic algorithm in real world which takes nearly an hour, our BDS can decrease the makespan by 27.5\%, 28.6\% and 22.1\% for 3 largest datasets respectively, with computational time less than 200 seconds.
△ Less
Submitted 10 August, 2020;
originally announced August 2020.
-
Pareto Multi-Task Learning
Authors:
Xi Lin,
Hui-Ling Zhen,
Zhenhua Li,
Qingfu Zhang,
Sam Kwong
Abstract:
Multi-task learning is a powerful method for solving multiple correlated tasks simultaneously. However, it is often impossible to find one single solution to optimize all the tasks, since different tasks might conflict with each other. Recently, a novel method is proposed to find one single Pareto optimal solution with good trade-off among different tasks by casting multi-task learning as multiobj…
▽ More
Multi-task learning is a powerful method for solving multiple correlated tasks simultaneously. However, it is often impossible to find one single solution to optimize all the tasks, since different tasks might conflict with each other. Recently, a novel method is proposed to find one single Pareto optimal solution with good trade-off among different tasks by casting multi-task learning as multiobjective optimization. In this paper, we generalize this idea and propose a novel Pareto multi-task learning algorithm (Pareto MTL) to find a set of well-distributed Pareto solutions which can represent different trade-offs among different tasks. The proposed algorithm first formulates a multi-task learning problem as a multiobjective optimization problem, and then decomposes the multiobjective optimization problem into a set of constrained subproblems with different trade-off preferences. By solving these subproblems in parallel, Pareto MTL can find a set of well-representative Pareto optimal solutions with different trade-off among all tasks. Practitioners can easily select their preferred solution from these Pareto solutions, or use different trade-off solutions for different situations. Experimental results confirm that the proposed algorithm can generate well-representative solutions and outperform some state-of-the-art algorithms on many multi-task learning applications.
△ Less
Submitted 30 December, 2019;
originally announced December 2019.
-
Anti-Chaos Control via Nonlinear Schrödinger Equations for the secured optical communication
Authors:
Zhenyu Tang,
Hui-Ling Zhen
Abstract:
Coupled nonlinear Schrödinger equations, governing the propagation of envelopes of electromagnetic waves in birefringent optical fibers, are studied in this paper for their potential applications in the secured optical communication. Periodicity and integrability of the CNLS equations are obtained via the phase-plane analysis. With the time-delay and perturbations introduced, CNLS equations are ch…
▽ More
Coupled nonlinear Schrödinger equations, governing the propagation of envelopes of electromagnetic waves in birefringent optical fibers, are studied in this paper for their potential applications in the secured optical communication. Periodicity and integrability of the CNLS equations are obtained via the phase-plane analysis. With the time-delay and perturbations introduced, CNLS equations are chaotified and a chaotic system is proposed. Numerical and analytical methods are conducted on such system: (I) Phase projections are given and the final chaotic states can be observed. (II) Power spectra and the largest Lyapunov exponents are calculated to corroborate that those motions are indeed chaotic.
△ Less
Submitted 18 November, 2018;
originally announced December 2018.
-
Chaotification via Higher-order Nonlinear Schrödinger Equations for Secured Communication
Authors:
Zhenyu Tang,
Hui-Ling Zhen
Abstract:
Higher-order nonlinear Schrödinger(HNLS) equation which can be used to describe the propagation of short light pulses in the optical fibers, is studied in this paper. Using the phase plane analysis, HNLS equation is reduced into the equivalent dynamical system, the periodicity of such system is obtained with the phase projections and power spectra given. By means of the time-delay feedback method,…
▽ More
Higher-order nonlinear Schrödinger(HNLS) equation which can be used to describe the propagation of short light pulses in the optical fibers, is studied in this paper. Using the phase plane analysis, HNLS equation is reduced into the equivalent dynamical system, the periodicity of such system is obtained with the phase projections and power spectra given. By means of the time-delay feedback method, with the original dynamical system rewritten, we construct a single-input single-output system, and propose a chaotic system based on the chaotification of HNLS. Numerical studies have been conducted on such system. Chaotic motions with different time delays are displayed. Power spectra of such chaotic motions are calculated. Lyapunov exponents are given to corroborate that those motions are indeed chaotic.
△ Less
Submitted 20 November, 2018;
originally announced November 2018.
-
A Batched Scalable Multi-Objective Bayesian Optimization Algorithm
Authors:
Xi Lin,
Hui-Ling Zhen,
Zhenhua Li,
Qingfu Zhang,
Sam Kwong
Abstract:
The surrogate-assisted optimization algorithm is a promising approach for solving expensive multi-objective optimization problems. However, most existing surrogate-assisted multi-objective optimization algorithms have three main drawbacks: 1) cannot scale well for solving problems with high dimensional decision space, 2) cannot incorporate available gradient information, and 3) do not support batc…
▽ More
The surrogate-assisted optimization algorithm is a promising approach for solving expensive multi-objective optimization problems. However, most existing surrogate-assisted multi-objective optimization algorithms have three main drawbacks: 1) cannot scale well for solving problems with high dimensional decision space, 2) cannot incorporate available gradient information, and 3) do not support batch optimization. These drawbacks prevent their use for solving many real-world large scale optimization problems. This paper proposes a batched scalable multi-objective Bayesian optimization algorithm to tackle these issues. The proposed algorithm uses the Bayesian neural network as the scalable surrogate model. Powered with Monte Carlo dropout and Sobolov training, the model can be easily trained and can incorporate available gradient information. We also propose a novel batch hypervolume upper confidence bound acquisition function to support batch optimization. Experimental results on various benchmark problems and a real-world application demonstrate the efficiency of the proposed algorithm.
△ Less
Submitted 4 November, 2018;
originally announced November 2018.
-
Nonlinear Collaborative Scheme for Deep Neural Networks
Authors:
Hui-Ling Zhen,
Xi Lin,
Alan Z. Tang,
Zhenhua Li,
Qingfu Zhang,
Sam Kwong
Abstract:
Conventional research attributes the improvements of generalization ability of deep neural networks either to powerful optimizers or the new network design. Different from them, in this paper, we aim to link the generalization ability of a deep network to optimizing a new objective function. To this end, we propose a \textit{nonlinear collaborative scheme} for deep network training, with the key t…
▽ More
Conventional research attributes the improvements of generalization ability of deep neural networks either to powerful optimizers or the new network design. Different from them, in this paper, we aim to link the generalization ability of a deep network to optimizing a new objective function. To this end, we propose a \textit{nonlinear collaborative scheme} for deep network training, with the key technique as combining different loss functions in a nonlinear manner. We find that after adaptively tuning the weights of different loss functions, the proposed objective function can efficiently guide the optimization process. What is more, we demonstrate that, from the mathematical perspective, the nonlinear collaborative scheme can lead to (i) smaller KL divergence with respect to optimal solutions; (ii) data-driven stochastic gradient descent; (iii) tighter PAC-Bayes bound. We also prove that its advantage can be strengthened by nonlinearity increasing. To some extent, we bridge the gap between learning (i.e., minimizing the new objective function) and generalization (i.e., minimizing a PAC-Bayes bound) in the new scheme. We also interpret our findings through the experiments on Residual Networks and DenseNet, showing that our new scheme performs superior to single-loss and multi-loss schemes no matter with randomization or not.
△ Less
Submitted 3 November, 2018;
originally announced November 2018.
-
Unsupervised prototype learning in an associative-memory network
Authors:
Huiling Zhen,
Shang-Nan Wang,
Hai-Jun Zhou
Abstract:
Unsupervised learning in a generalized Hopfield associative-memory network is investigated in this work. First, we prove that the (generalized) Hopfield model is equivalent to a semi-restricted Boltzmann machine with a layer of visible neurons and another layer of hidden binary neurons, so it could serve as the building block for a multilayered deep-learning system. We then demonstrate that the Ho…
▽ More
Unsupervised learning in a generalized Hopfield associative-memory network is investigated in this work. First, we prove that the (generalized) Hopfield model is equivalent to a semi-restricted Boltzmann machine with a layer of visible neurons and another layer of hidden binary neurons, so it could serve as the building block for a multilayered deep-learning system. We then demonstrate that the Hopfield network can learn to form a faithful internal representation of the observed samples, with the learned memory patterns being prototypes of the input data. Furthermore, we propose a spectral method to extract a small set of concepts (idealized prototypes) as the most concise summary or abstraction of the empirical data.
△ Less
Submitted 24 July, 2017; v1 submitted 10 April, 2017;
originally announced April 2017.
-
Power Penalty Due to First-order PMD in Optical OFDM/QAM and FBMC/OQAM Transmission System
Authors:
Jianping Wang,
Ke Zhang,
Xianyu Du,
He Zhen,
Jing Yan
Abstract:
Polarization mode dispersion (PMD) is a challenge for high-data-rate optical-communication systems. More researches are desirable for impairments that is induced by PMD in high-speed optical orthogonal frequency division multiplexing (OFDM) transmission system. In this paper, an approximately analytical method for evaluating the power penalty due to first-order PMD in optical OFDM with quadrature…
▽ More
Polarization mode dispersion (PMD) is a challenge for high-data-rate optical-communication systems. More researches are desirable for impairments that is induced by PMD in high-speed optical orthogonal frequency division multiplexing (OFDM) transmission system. In this paper, an approximately analytical method for evaluating the power penalty due to first-order PMD in optical OFDM with quadrature amplitude modulation (OFDM/QAM) and filter bank based multi-carrier with offset quadrature amplitude modulation (FBMC/OQAM) transmission system is presented. The simulation results show that, compared with the single carrier with quadrature phase shift keying(SC-QPSK), both the OFDM/QAM and the FBMC/OQAM can decrease the power penalty caused by PMD by half. Furthermore, the FBMC/OQAM shows better power penalty immunity than the OFDM/QAM under the influence of first order PMD.
△ Less
Submitted 29 November, 2013;
originally announced November 2013.
-
Multiplicity fluctuation analysis of target residues in nucleus-emulsion collisions at a few hundred MeV/nucleon
Authors:
D. H. Zhang,
Y. L. Chen,
G. R. Wang,
W. D. Li,
Q. Wang,
J. J. Yao,
J. G. Zhou,
S. H. Zhen,
L. L. Xu,
H. F. Miao,
P. Wang
Abstract:
Multiplicity fluctuation of the target evaporated fragments emitted in 290 A MeV 12C-AgBr, 400 A MeV 12C-AgBr, 400 A MeV 20Ne-AgBr and 500 A MeV 56Fe-AgBr interactions is investigated using scaled factorial moment method in two-dimensional normal phase space and cumulative variable space, respectively. It is found that in normal phase space the scaled factorial moment (ln<Fq>) increases linearly w…
▽ More
Multiplicity fluctuation of the target evaporated fragments emitted in 290 A MeV 12C-AgBr, 400 A MeV 12C-AgBr, 400 A MeV 20Ne-AgBr and 500 A MeV 56Fe-AgBr interactions is investigated using scaled factorial moment method in two-dimensional normal phase space and cumulative variable space, respectively. It is found that in normal phase space the scaled factorial moment (ln<Fq>) increases linearly with increase of the divided number of phase space (lnM) for lower q-value and increases linearly with the increase of lnM and then becomes saturated or decreased for higher q-value, in cumulative variable space ln<Fq> decreases linearly with increase of lnM, which indicates that no evidence of non-statistical multiplicity fluctuation is observed in our data sets. So any fluctuation indicated in the results of normal variable space analysis is totally caused by non-uniformity of single-particle density distribution.
△ Less
Submitted 18 August, 2013;
originally announced August 2013.
-
Preliminary study on CAD-based method of characteristics for neutron transport calculation
Authors:
Zhen-Ping Chen,
Hua-Qing Zhen,
Guang-Yao Sun,
Jing Song,
Li-Juan Hao,
Li-Qin Hu,
Yi-Can Wu
Abstract:
The method of characteristics (MOC) is widely used for neutron transport calculation in recent decades. However, the key problem determining whether MOC can be applied in highly heterogeneous geometry is how to combine an effective geometry modeling method with it. Most of the existing MOC codes conventionally describe the geometry model just by lines and arcs with extensive input data. Thus they…
▽ More
The method of characteristics (MOC) is widely used for neutron transport calculation in recent decades. However, the key problem determining whether MOC can be applied in highly heterogeneous geometry is how to combine an effective geometry modeling method with it. Most of the existing MOC codes conventionally describe the geometry model just by lines and arcs with extensive input data. Thus they have difficulty in geometry modeling and ray tracing for complicated geometries. In this study, a new method making use of a CAD-based automatic modeling tool MCAM which is a CAD/Image-based Automatic Modeling Program for Neutronics and Radiation Transport developed by FDS Team in China was introduced for geometry modeling and ray tracing of particle transport to remove those limitations. The diamond -difference scheme was applied to MOC to reduce the spatial discretization errors of the flat flux approximation. Based on MCAM and MOC, a new MOC code was developed and integrated into SuperMC system, whic h is a Super Multi-function Computational system for neutronics and radiation simulation. The numerical results demonstrated the feasibility and effectiveness of the new method for neutron transport calculation in MOC.
△ Less
Submitted 21 June, 2013;
originally announced June 2013.
-
Quantum Computing Using an Open System and Projected Subspace
Authors:
Bi Qiao,
Harry. E. Ruda,
X. H. Zhen
Abstract:
Using the subdynamical kinetic equation for an open quantum system, a formulation is presented for performing decoherence-free (DF) quantum computing in Rigged Liouville Space (RLS). Three types of interactions were considered, and in each case, stationary and evolutionary states were evaluated for DF behavior in both the total space and the projected subspace. Projected subspaces were found usi…
▽ More
Using the subdynamical kinetic equation for an open quantum system, a formulation is presented for performing decoherence-free (DF) quantum computing in Rigged Liouville Space (RLS). Three types of interactions were considered, and in each case, stationary and evolutionary states were evaluated for DF behavior in both the total space and the projected subspace. Projected subspaces were found using the subdynamics kinetic equation. It was shown that although the total space may be decoherent, the subspace can be DF. In the projected subspace, the evolution of the density operator may be time asymmetric. Hence, a formulation for performing quantum computing in RLS or rigged Hilbert space (RHS) was proposed, and a quantum Controlled-Not Logical gate with corresponding operations in RLS (RHS) was constructed. A generalized quantum Turing machine in RHS was also discussed. Key Words: Quantum Computing, Subdynamics, Rigged Liouvile Space, Decoherence, Open System PACS: 05.30.-d+85.30+82.20.Db+84.35.+i
△ Less
Submitted 28 September, 2001;
originally announced October 2001.