Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–37 of 37 results for author: Zhong, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.07551  [pdf, other

    cs.CL cs.AI

    MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical Reasoning

    Authors: Shuo Yin, Weihao You, Zhilong Ji, Guoqiang Zhong, Jinfeng Bai

    Abstract: The tool-use Large Language Models (LLMs) that integrate with external Python interpreters have significantly enhanced mathematical reasoning capabilities for open-source LLMs, while tool-free methods chose another track: augmenting math reasoning data. However, a great method to integrate the above two research paths and combine their advantages remains to be explored. In this work, we firstly in… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: The state-of-the-art open-source tool-use LLMs for mathematical reasoning

  2. arXiv:2404.01194  [pdf, other

    cs.CV

    Adaptive Query Prompting for Multi-Domain Landmark Detection

    Authors: Qiusen Wei, Guoheng Huang, Xiaochen Yuan, Xuhang Chen, Guo Zhong, Jianwen Huang, Jiajie Huang

    Abstract: Medical landmark detection is crucial in various medical imaging modalities and procedures. Although deep learning-based methods have achieve promising performance, they are mostly designed for specific anatomical regions or tasks. In this work, we propose a universal model for multi-domain landmark detection by leveraging transformer architecture and developing a prompting component, named as Ada… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  3. arXiv:2404.01127  [pdf, other

    cs.CV cs.AI

    Medical Visual Prompting (MVP): A Unified Framework for Versatile and High-Quality Medical Image Segmentation

    Authors: Yulin Chen, Guoheng Huang, Kai Huang, Zijin Lin, Guo Zhong, Shenghong Luo, Jie Deng, Jian Zhou

    Abstract: Accurate segmentation of lesion regions is crucial for clinical diagnosis and treatment across various diseases. While deep convolutional networks have achieved satisfactory results in medical image segmentation, they face challenges such as loss of lesion shape information due to continuous convolution and downsampling, as well as the high cost of manually labeling lesions with varying shapes and… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  4. arXiv:2312.06207  [pdf, other

    cs.DC

    A Primer on RecoNIC: RDMA-enabled Compute Offloading on SmartNIC

    Authors: Guanwen Zhong, Aditya Kolekar, Burin Amornpaisannon, Inho Choi, Haris Javaid, Mario Baldi

    Abstract: Today's data centers consist of thousands of network-connected hosts, each with CPUs and accelerators such as GPUs and FPGAs. These hosts also contain network interface cards (NICs), operating at speeds of 100Gb/s or higher, that are used to communicate with each other. We propose RecoNIC, an FPGA-based RDMA-enabled SmartNIC platform that is designed for compute acceleration while minimizing the o… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: RecoNIC is available at https://github.com/Xilinx/RecoNIC

  5. RBA-GCN: Relational Bilevel Aggregation Graph Convolutional Network for Emotion Recognition

    Authors: Lin Yuan, Guoheng Huang, Fenghuan Li, Xiaochen Yuan, Chi-Man Pun, Guo Zhong

    Abstract: Emotion recognition in conversation (ERC) has received increasing attention from researchers due to its wide range of applications.As conversation has a natural graph structure,numerous approaches used to model ERC based on graph convolutional networks (GCNs) have yielded significant results.However,the aggregation approach of traditional GCNs suffers from the node information redundancy problem,l… ▽ More

    Submitted 31 August, 2023; v1 submitted 18 August, 2023; originally announced August 2023.

    Journal ref: IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 31, pp. 2325-2337,2023

  6. arXiv:2308.01147  [pdf, other

    cs.CV cs.MM eess.IV

    Contrast-augmented Diffusion Model with Fine-grained Sequence Alignment for Markup-to-Image Generation

    Authors: Guojin Zhong, Jin Yuan, Pan Wang, Kailun Yang, Weili Guan, Zhiyong Li

    Abstract: The recently rising markup-to-image generation poses greater challenges as compared to natural image generation, due to its low tolerance for errors as well as the complex sequence and context correlations between markup and rendered image. This paper proposes a novel model named "Contrast-augmented Diffusion Model with Fine-grained Sequence Alignment" (FSA-CDM), which introduces contrastive posit… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

    Comments: Accepted to ACM MM 2023. The code will be released at https://github.com/zgj77/FSACDM

  7. arXiv:2307.14132  [pdf, other

    cs.SD cs.CL eess.AS

    CIF-T: A Novel CIF-based Transducer Architecture for Automatic Speech Recognition

    Authors: Tian-Hao Zhang, Dinghao Zhou, Guiping Zhong, Jiaming Zhou, Baoxiang Li

    Abstract: RNN-T models are widely used in ASR, which rely on the RNN-T loss to achieve length alignment between input audio and target sequence. However, the implementation complexity and the alignment-based optimization target of RNN-T loss lead to computational redundancy and a reduced role for predictor network, respectively. In this paper, we propose a novel model named CIF-Transducer (CIF-T) which inco… ▽ More

    Submitted 14 December, 2023; v1 submitted 26 July, 2023; originally announced July 2023.

    Comments: Accepted by ICASSP 2024

  8. TempEE: Temporal-Spatial Parallel Transformer for Radar Echo Extrapolation Beyond Auto-Regression

    Authors: Shengchao Chen, Ting Shu, Huan Zhao, Guo Zhong, Xunlai Chen

    Abstract: Meteorological radar reflectivity data (i.e. radar echo) significantly influences precipitation prediction. It can facilitate accurate and expeditious forecasting of short-term heavy rainfall bypassing the need for complex Numerical Weather Prediction (NWP) models. In comparison to conventional models, Deep Learning (DL)-based radar echo extrapolation algorithms exhibit higher effectiveness and ef… ▽ More

    Submitted 14 September, 2023; v1 submitted 27 April, 2023; originally announced April 2023.

    Comments: Have been accepted by IEEE Transactions on Geoscience and Remote Sensing, see https://ieeexplore.ieee.org/document/10238744

    Journal ref: IEEE Transactions on Geoscience and Remote Sensing 61, 5108914 (2023)

  9. arXiv:2304.04368  [pdf, other

    cs.CV

    Locality Preserving Multiview Graph Hashing for Large Scale Remote Sensing Image Search

    Authors: Wenyun Li, Guo Zhong, Xingyu Lu, Chi-Man Pun

    Abstract: Hashing is very popular for remote sensing image search. This article proposes a multiview hashing with learnable parameters to retrieve the queried images for a large-scale remote sensing dataset. Existing methods always neglect that real-world remote sensing data lies on a low-dimensional manifold embedded in high-dimensional ambient space. Unlike previous methods, this article proposes to learn… ▽ More

    Submitted 9 April, 2023; originally announced April 2023.

    Comments: 5 pages,icassp accepted

  10. arXiv:2303.13769  [pdf, other

    cs.CV cs.AI cs.LG

    Unknown Sniffer for Object Detection: Don't Turn a Blind Eye to Unknown Objects

    Authors: Wenteng Liang, Feng Xue, Yihao Liu, Guofeng Zhong, Anlong Ming

    Abstract: The recently proposed open-world object and open-set detection have achieved a breakthrough in finding never-seen-before objects and distinguishing them from known ones. However, their studies on knowledge transfer from known classes to unknown ones are not deep enough, resulting in the scanty capability for detecting unknowns hidden in the background. In this paper, we propose the unknown sniffer… ▽ More

    Submitted 19 April, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: CVPR 2023 camera-ready; Code: https://github.com/Went-Liang/UnSniffer Project: https://xuefengbupt.github.io/project_page/unsniffer_cvpr23.html Demo: https://www.bilibili.com/video/BV1xM4y1z7Hv Supplymentary: https://xuefengbupt.github.io/project_page/pdf/supplementary_cvpr2023.pdf

  11. arXiv:2303.03707  [pdf

    quant-ph cs.CV cs.LG

    Hybrid quantum-classical convolutional neural network for phytoplankton classification

    Authors: Shangshang Shi, Zhimin Wang, Ruimin Shang, Yanan Li, Jiaxin Li, Guoqiang Zhong, Yongjian Gu

    Abstract: The taxonomic composition and abundance of phytoplankton, having direct impact on marine ecosystem dynamic and global environment change, are listed as essential ocean variables. Phytoplankton classification is very crucial for Phytoplankton analysis, but it is very difficult because of the huge amount and tiny volume of Phytoplankton. Machine learning is the principle way of performing phytoplank… ▽ More

    Submitted 7 March, 2023; originally announced March 2023.

    Comments: 20 pages, 13 figures

  12. arXiv:2302.03244  [pdf, other

    quant-ph cs.LG

    Quantum Recurrent Neural Networks for Sequential Learning

    Authors: Yanan Li, Zhimin Wang, Rongbing Han, Shangshang Shi, Jiaxin Li, Ruimin Shang, Haiyong Zheng, Guoqiang Zhong, Yongjian Gu

    Abstract: Quantum neural network (QNN) is one of the promising directions where the near-term noisy intermediate-scale quantum (NISQ) devices could find advantageous applications against classical resources. Recurrent neural networks are the most fundamental networks for sequential learning, but up to now there is still a lack of canonical model of quantum recurrent neural network (QRNN), which certainly re… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

  13. arXiv:2203.15613  [pdf, other

    cs.SD cs.CL eess.AS

    Dynamic Latency for CTC-Based Streaming Automatic Speech Recognition With Emformer

    Authors: Jingyu Sun, Guiping Zhong, Dinghao Zhou, Baoxiang Li

    Abstract: An inferior performance of the streaming automatic speech recognition models versus non-streaming model is frequently seen due to the absence of future context. In order to improve the performance of the streaming model and reduce the computational complexity, a frame-level model using efficient augment memory transformer block and dynamic latency training method is employed for streaming automati… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: 5 pages, 2 figures, submitted to interspeech 2022

  14. arXiv:2203.15609  [pdf, other

    cs.SD eess.AS

    Locality Matters: A Locality-Biased Linear Attention for Automatic Speech Recognition

    Authors: Jingyu Sun, Guiping Zhong, Dinghao Zhou, Baoxiang Li, Yiran Zhong

    Abstract: Conformer has shown a great success in automatic speech recognition (ASR) on many public benchmarks. One of its crucial drawbacks is the quadratic time-space complexity with respect to the input sequence length, which prohibits the model to scale-up as well as process longer input audio sequences. To solve this issue, numerous linear attention methods have been proposed. However, these methods oft… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: 5 pages, 2 figures, submitted to interspeech 2022

  15. arXiv:2006.10250  [pdf, other

    cs.CV eess.IV

    Progressively Unfreezing Perceptual GAN

    Authors: Jinxuan Sun, Yang Chen, Junyu Dong, Guoqiang Zhong

    Abstract: Generative adversarial networks (GANs) are widely used in image generation tasks, yet the generated images are usually lack of texture details. In this paper, we propose a general framework, called Progressively Unfreezing Perceptual GAN (PUPGAN), which can generate images with fine texture details. Particularly, we propose an adaptive perceptual discriminator with a pre-trained perceptual feature… ▽ More

    Submitted 17 June, 2020; originally announced June 2020.

  16. arXiv:1909.13411  [pdf, other

    cs.CV

    SymmetricNet: A mesoscale eddy detection method based on multivariate fusion data

    Authors: Zhenlin Fan, Guoqiang Zhong

    Abstract: Mesoscale eddies play a significant role in marine energy transport, marine biological environment and marine climate. Due to their huge impact on the ocean, mesoscale eddy detection has become a hot research area in recent years. Therefore, more and more people are entering the field of mesoscale eddy detection. However, the existing detection methods mainly based on traditional detection methods… ▽ More

    Submitted 29 September, 2019; originally announced September 2019.

  17. arXiv:1909.07827  [pdf, other

    cs.CV

    Weak Edge Identification Nets for Ocean Front Detection

    Authors: Qingyang Li, Guoqiang Zhong, Cui Xie

    Abstract: The ocean front has an important impact in many areas, it is meaningful to obtain accurate ocean front positioning, therefore, ocean front detection is a very important task. However, the traditional edge detection algorithm does not detect the weak edge information of the ocean front very well. In response to this problem, we collected relevant ocean front gradient images and found relevant exper… ▽ More

    Submitted 17 September, 2019; originally announced September 2019.

  18. arXiv:1812.01353  [pdf, other

    stat.ML cs.LG

    Structured Semantic Model supported Deep Neural Network for Click-Through Rate Prediction

    Authors: Chenglei Niu, Guojing Zhong, Ying Liu, Yandong Zhang, Yongsheng Sun, Ailong He, Zhaoji Chen

    Abstract: With the rapid development of online advertising and recommendation systems, click-through rate prediction is expected to play an increasingly important role.Recently many DNN-based models which follow a similar Embedding&MLP paradigm have been proposed, and have achieved good result in image/voice and nlp fields. In these methods the Wide&Deep model announced by Google plays a key role.Most model… ▽ More

    Submitted 29 April, 2019; v1 submitted 4 December, 2018; originally announced December 2018.

  19. arXiv:1811.02132  [pdf, other

    cs.LG stat.ML

    Student's t-Generative Adversarial Networks

    Authors: Jinxuan Sun, Guoqiang Zhong, Yang Chen, Yongbin Liu, Tao Li, Zhongwen Guo

    Abstract: Generative Adversarial Networks (GANs) have a great performance in image generation, but they need a large scale of data to train the entire framework, and often result in nonsensical results. We propose a new method referring to conditional GAN, which equipments the latent noise with mixture of Student's t-distribution with attention mechanism in addition to class information. Student's t-distrib… ▽ More

    Submitted 5 November, 2018; originally announced November 2018.

  20. arXiv:1810.13155  [pdf, other

    cs.LG stat.ML

    Structure Learning of Deep Neural Networks with Q-Learning

    Authors: Guoqiang Zhong, Wencong Jiao, Wei Gao

    Abstract: Recently, with convolutional neural networks gaining significant achievements in many challenging machine learning fields, hand-crafted neural networks no longer satisfy our requirements as designing a network will cost a lot, and automatically generating architectures has attracted increasingly more attention and focus. Some research on auto-generated networks has achieved promising results. Howe… ▽ More

    Submitted 31 October, 2018; originally announced October 2018.

  21. arXiv:1810.12754  [pdf, other

    cs.LG cs.CL cs.NE stat.ML

    Recurrent Attention Unit

    Authors: Guoqiang Zhong, Guohua Yue, Xiao Ling

    Abstract: Recurrent Neural Network (RNN) has been successfully applied in many sequence learning problems. Such as handwriting recognition, image description, natural language processing and video motion analysis. After years of development, researchers have improved the internal structure of the RNN and introduced many variants. Among others, Gated Recurrent Unit (GRU) is one of the most widely used RNN mo… ▽ More

    Submitted 30 October, 2018; originally announced October 2018.

  22. arXiv:1810.12752  [pdf, other

    cs.LG cs.CL cs.NE

    Long Short-Term Attention

    Authors: Guoqiang Zhong, Xin Lin, Kang Chen, Qingyang Li, Kaizhu Huang

    Abstract: Attention is an important cognition process of humans, which helps humans concentrate on critical information during their perception and learning. However, although many machine learning models can remember information of data, they have no the attention mechanism. For example, the long short-term memory (LSTM) network is able to remember sequential information, but it cannot pay special attentio… ▽ More

    Submitted 4 September, 2019; v1 submitted 30 October, 2018; originally announced October 2018.

  23. arXiv:1810.10687  [pdf, other

    cs.NE cs.CV

    Structure Learning of Deep Networks via DNA Computing Algorithm

    Authors: Guoqiang Zhong, Tao Li, Wenxue Liu, Yang Chen

    Abstract: Convolutional Neural Network (CNN) has gained state-of-the-art results in many pattern recognition and computer vision tasks. However, most of the CNN structures are manually designed by experienced researchers. Therefore, auto- matically building high performance networks becomes an important problem. In this paper, we introduce the idea of using DNA computing algorithm to automatically learn hig… ▽ More

    Submitted 24 October, 2018; originally announced October 2018.

  24. arXiv:1810.07550  [pdf

    cs.LG physics.class-ph

    The Newton Scheme for Deep Learning

    Authors: Junqing Qiu, Guoren Zhong, Yihua Lu, Kun Xin, Huihuan Qian, Xi Zhu

    Abstract: We introduce a neural network (NN) strictly governed by Newton's Law, with the nature required basis functions derived from the fundamental classic mechanics. Then, by classifying the training model as a quick procedure of 'force pattern' recognition, we developed the Newton physics-based NS scheme. Once the force pattern is confirmed, the neuro network simply does the checking of the 'pattern sta… ▽ More

    Submitted 16 October, 2018; originally announced October 2018.

    Comments: 7 pages, 10 figures

  25. arXiv:1807.03923  [pdf, other

    cs.CV

    Generative Adversarial Networks with Decoder-Encoder Output Noise

    Authors: Guoqiang Zhong, Wei Gao, Yongbin Liu, Youzhao Yang

    Abstract: In recent years, research on image generation methods has been developing fast. The auto-encoding variational Bayes method (VAEs) was proposed in 2013, which uses variational inference to learn a latent space from the image database and then generates images using the decoder. The generative adversarial networks (GANs) came out as a promising framework, which uses adversarial training to improve t… ▽ More

    Submitted 10 July, 2018; originally announced July 2018.

  26. arXiv:1804.08758  [pdf, other

    cs.CV

    Switchable Temporal Propagation Network

    Authors: Sifei Liu, Guangyu Zhong, Shalini De Mello, Jinwei Gu, Varun Jampani, Ming-Hsuan Yang, Jan Kautz

    Abstract: Videos contain highly redundant information between frames. Such redundancy has been extensively studied in video compression and encoding, but is less explored for more advanced video processing. In this paper, we propose a learnable unified framework for propagating a variety of visual properties of video images, including but not limited to color, high dynamic range (HDR), and segmentation info… ▽ More

    Submitted 4 May, 2018; v1 submitted 23 April, 2018; originally announced April 2018.

  27. arXiv:1804.00706  [pdf, other

    cs.DC cs.AR cs.LG

    Synergy: A HW/SW Framework for High Throughput CNNs on Embedded Heterogeneous SoC

    Authors: Guanwen Zhong, Akshat Dubey, Tan Cheng, Tulika Mitra

    Abstract: Convolutional Neural Networks (CNN) have been widely deployed in diverse application domains. There has been significant progress in accelerating both their training and inference using high-performance GPUs, FPGAs, and custom ASICs for datacenter-scale environments. The recent proliferation of mobile and IoT devices have necessitated real-time, energy-efficient deep neural network inference on em… ▽ More

    Submitted 28 March, 2018; originally announced April 2018.

    Comments: 34 pages, submitted to ACM Transactions on Embedded Computing Systems (TECS)

    ACM Class: C.1.3

    Journal ref: TECS, 18 (2019) 13-39

  28. arXiv:1801.10281  [pdf, other

    cs.CV

    Learning Video-Story Composition via Recurrent Neural Network

    Authors: Guangyu Zhong, Yi-Hsuan Tsai, Sifei Liu, Zhixun Su, Ming-Hsuan Yang

    Abstract: In this paper, we propose a learning-based method to compose a video-story from a group of video clips that describe an activity or experience. We learn the coherence between video clips from real videos via the Recurrent Neural Network (RNN) that jointly incorporates the spatial-temporal semantics and motion dynamics to generate smooth and relevant compositions. We further rearrange the results g… ▽ More

    Submitted 30 January, 2018; originally announced January 2018.

  29. arXiv:1710.01020  [pdf, other

    cs.CV cs.LG

    Learning Affinity via Spatial Propagation Networks

    Authors: Sifei Liu, Shalini De Mello, Jinwei Gu, Guangyu Zhong, Ming-Hsuan Yang, Jan Kautz

    Abstract: In this paper, we propose spatial propagation networks for learning the affinity matrix for vision tasks. We show that by constructing a row/column linear propagation model, the spatially varying transformation matrix exactly constitutes an affinity matrix that models dense, global pairwise relationships of an image. Specifically, we develop a three-way connection for the linear propagation model,… ▽ More

    Submitted 3 October, 2017; originally announced October 2017.

    Comments: A long version of NIPS 2017

  30. arXiv:1708.02191  [pdf, other

    cs.CV cs.AI

    Unsupervised Domain Adaptation for Face Recognition in Unlabeled Videos

    Authors: Kihyuk Sohn, Sifei Liu, Guangyu Zhong, Xiang Yu, Ming-Hsuan Yang, Manmohan Chandraker

    Abstract: Despite rapid advances in face recognition, there remains a clear gap between the performance of still image-based face recognition and video-based face recognition, due to the vast difference in visual quality between the domains and the difficulty of curating diverse large-scale video datasets. This paper addresses both of those challenges, through an image to video feature-level domain adaptati… ▽ More

    Submitted 7 August, 2017; originally announced August 2017.

    Comments: accepted for publication at International Conference on Computer Vision (ICCV) 2017

  31. Prediction of Sea Surface Temperature using Long Short-Term Memory

    Authors: Qin Zhang, Hui Wang, Junyu Dong, Guoqiang Zhong, Xin Sun

    Abstract: This letter adopts long short-term memory(LSTM) to predict sea surface temperature(SST), which is the first attempt, to our knowledge, to use recurrent neural network to solve the problem of SST prediction, and to make one week and one month daily prediction. We formulate the SST prediction problem as a time series regression problem. LSTM is a special kind of recurrent neural network, which intro… ▽ More

    Submitted 19 May, 2017; originally announced May 2017.

    Comments: 5 page, 5 figures

  32. arXiv:1703.09784  [pdf, other

    cs.CV cs.AI cs.LG

    Perception Driven Texture Generation

    Authors: Yanhai Gan, Huifang Chi, Ying Gao, Jun Liu, Guoqiang Zhong, Junyu Dong

    Abstract: This paper investigates a novel task of generating texture images from perceptual descriptions. Previous work on texture generation focused on either synthesis from examples or generation from procedural models. Generating textures from perceptual attributes have not been well studied yet. Meanwhile, perceptual attributes, such as directionality, regularity and roughness are important factors for… ▽ More

    Submitted 23 March, 2017; originally announced March 2017.

    Comments: 7 pages, 4 figures, icme2017

  33. arXiv:1611.08331  [pdf, ps, other

    cs.LG stat.ML

    An Overview on Data Representation Learning: From Traditional Feature Learning to Recent Deep Learning

    Authors: Guoqiang Zhong, Li-Na Wang, Junyu Dong

    Abstract: Since about 100 years ago, to learn the intrinsic structure of data, many representation learning approaches have been proposed, including both linear ones and nonlinear ones, supervised ones and unsupervised ones. Particularly, deep architectures are widely applied for representation learning in recent years, and have delivered top results in many tasks, such as image classification, object detec… ▽ More

    Submitted 24 November, 2016; originally announced November 2016.

    Comments: About 20 pages. Submitted to Journal of Finance and Data Science as an invited paper

    MSC Class: 68T05

  34. arXiv:1507.06105  [pdf, other

    cs.LG cs.CV stat.ML

    Banzhaf Random Forests

    Authors: Jianyuan Sun, Guoqiang Zhong, Junyu Dong, Yajuan Cai

    Abstract: Random forests are a type of ensemble method which makes predictions by combining the results of several independent trees. However, the theory of random forests has long been outpaced by their application. In this paper, we propose a novel random forests algorithm based on cooperative game theory. Banzhaf power index is employed to evaluate the power of each feature by traversing possible feature… ▽ More

    Submitted 22 July, 2015; originally announced July 2015.

    Comments: arXiv admin note: text overlap with arXiv:1302.4853 by other authors

  35. arXiv:1507.04437  [pdf, ps, other

    cs.CV

    A Deep Hashing Learning Network

    Authors: Guoqiang Zhong, Pan Yang, Sijiang Wang, Junyu Dong

    Abstract: Hashing-based methods seek compact and efficient binary codes that preserve the neighborhood structure in the original data space. For most existing hashing methods, an image is first encoded as a vector of hand-crafted visual feature, followed by a hash projection and quantization step to get the compact binary vector. Most of the hand-crafted features just encode the low-level information of the… ▽ More

    Submitted 15 July, 2015; originally announced July 2015.

    Comments: 7 pages, 5 figures

  36. arXiv:1505.03703  [pdf, other

    cs.LG cs.CV cs.NE

    A PCA-Based Convolutional Network

    Authors: Yanhai Gan, Jun Liu, Junyu Dong, Guoqiang Zhong

    Abstract: In this paper, we propose a novel unsupervised deep learning model, called PCA-based Convolutional Network (PCN). The architecture of PCN is composed of several feature extraction stages and a nonlinear output stage. Particularly, each feature extraction stage includes two layers: a convolutional layer and a feature pooling layer. In the convolutional layer, the filter banks are simply learned by… ▽ More

    Submitted 14 May, 2015; originally announced May 2015.

    Comments: 8 pages,5 figures

  37. arXiv:1306.2663  [pdf, other

    cs.LG math.NA

    Large Margin Low Rank Tensor Analysis

    Authors: Guoqiang Zhong, Mohamed Cheriet

    Abstract: Other than vector representations, the direct objects of human cognition are generally high-order tensors, such as 2D images and 3D textures. From this fact, two interesting questions naturally arise: How does the human brain represent these tensor perceptions in a "manifold" way, and how can they be recognized on the "manifold"? In this paper, we present a supervised model to learn the intrinsic… ▽ More

    Submitted 11 June, 2013; originally announced June 2013.

    Comments: 30 pages

    MSC Class: 57-04