Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 50 results for author: Park, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.04650  [pdf

    cs.CL cs.AI cs.HC cs.LG

    Building Trust in Mental Health Chatbots: Safety Metrics and LLM-Based Evaluation Tools

    Authors: Jung In Park, Mahyar Abbasian, Iman Azimi, Dawn Bounds, Angela Jun, Jaesu Han, Robert McCarron, Jessica Borelli, Jia Li, Mona Mahmoudi, Carmen Wiedenhoeft, Amir Rahmani

    Abstract: Objective: This study aims to develop and validate an evaluation framework to ensure the safety and reliability of mental health chatbots, which are increasingly popular due to their accessibility, human-like interactions, and context-aware support. Materials and Methods: We created an evaluation framework with 100 benchmark questions and ideal responses, and five guideline questions for chatbot r… ▽ More

    Submitted 3 August, 2024; originally announced August 2024.

  2. arXiv:2408.00109  [pdf, other

    q-bio.NC cs.NE nlin.AO

    Back to the Continuous Attractor

    Authors: Ábel Ságodi, Guillermo Martín-Sánchez, Piotr Sokół, Il Memming Park

    Abstract: Continuous attractors offer a unique class of solutions for storing continuous-valued variables in recurrent system states for indefinitely long time intervals. Unfortunately, continuous attractors suffer from severe structural instability in general--they are destroyed by most infinitesimal changes of the dynamical law that defines them. This fragility limits their utility especially in biologica… ▽ More

    Submitted 31 July, 2024; originally announced August 2024.

  3. arXiv:2406.07488  [pdf, other

    cs.CV

    ReduceFormer: Attention with Tensor Reduction by Summation

    Authors: John Yang, Le An, Su Inn Park

    Abstract: Transformers have excelled in many tasks including vision. However, efficient deployment of transformer models in low-latency or high-throughput applications is hindered by the computation in the attention mechanism which involves expensive operations such as matrix multiplication and Softmax. To address this, we introduce ReduceFormer, a family of models optimized for efficiency with the spirit o… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  4. arXiv:2406.06004  [pdf, other

    cs.CV cs.AI cs.CL

    FLEUR: An Explainable Reference-Free Evaluation Metric for Image Captioning Using a Large Multimodal Model

    Authors: Yebin Lee, Imseong Park, Myungjoo Kang

    Abstract: Most existing image captioning evaluation metrics focus on assigning a single numerical score to a caption by comparing it with reference captions. However, these methods do not provide an explanation for the assigned score. Moreover, reference captions are expensive to acquire. In this paper, we propose FLEUR, an explainable reference-free metric to introduce explainability into image captioning… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Accepted at ACL (Main) 2024

  5. arXiv:2405.03958  [pdf, other

    cs.CV cs.AI cs.LG

    Simple Drop-in LoRA Conditioning on Attention Layers Will Improve Your Diffusion Model

    Authors: Joo Young Choi, Jaesung R. Park, Inkyu Park, Jaewoong Cho, Albert No, Ernest K. Ryu

    Abstract: Current state-of-the-art diffusion models employ U-Net architectures containing convolutional and (qkv) self-attention layers. The U-Net processes images while being conditioned on the time embedding input for each sampling step and the class or caption embedding input corresponding to the desired conditional generation. Such conditioning involves scale-and-shift operations to the convolutional la… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  6. arXiv:2404.11615  [pdf, other

    cs.CV

    Factorized Diffusion: Perceptual Illusions by Noise Decomposition

    Authors: Daniel Geng, Inbum Park, Andrew Owens

    Abstract: Given a factorization of an image into a sum of linear components, we present a zero-shot method to control each individual component through diffusion model sampling. For example, we can decompose an image into low and high spatial frequencies and condition these components on different text prompts. This produces hybrid images, which change appearance depending on viewing distance. By decomposin… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  7. arXiv:2403.01371  [pdf, other

    stat.ML cs.LG

    eXponential FAmily Dynamical Systems (XFADS): Large-scale nonlinear Gaussian state-space modeling

    Authors: Matthew Dowling, Yuan Zhao, Il Memming Park

    Abstract: State-space graphical models and the variational autoencoder framework provide a principled apparatus for learning dynamical systems from data. State-of-the-art probabilistic approaches are often able to scale to large problems at the cost of flexibility of the variational posterior or expressivity of the dynamics model. However, those consolidations can be detrimental if the ultimate goal is to l… ▽ More

    Submitted 31 May, 2024; v1 submitted 2 March, 2024; originally announced March 2024.

  8. arXiv:2401.16553  [pdf, other

    cs.CL cs.AI

    SelectLLM: Can LLMs Select Important Instructions to Annotate?

    Authors: Ritik Sachin Parkar, Jaehyung Kim, Jong Inn Park, Dongyeop Kang

    Abstract: Instruction tuning benefits from large and diverse datasets; however, creating such datasets involves a high cost of human labeling. While synthetic datasets generated by large language models (LLMs) have partly solved this issue, they often contain low-quality data. One effective solution is selectively annotating unlabelled instructions, especially given the relative ease of acquiring unlabeled… ▽ More

    Submitted 27 August, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: First Authors: Ritik Sachin Parkar and Jaehyung Kim | Second Author: Jong Inn Park | PI: Dongyeop Kang

  9. arXiv:2401.08655  [pdf, other

    cs.CV cs.AI cs.GR cs.LG cs.MM

    SAiD: Speech-driven Blendshape Facial Animation with Diffusion

    Authors: Inkyu Park, Jaewoong Cho

    Abstract: Speech-driven 3D facial animation is challenging due to the scarcity of large-scale visual-audio datasets despite extensive research. Most prior works, typically focused on learning regression models on a small dataset using the method of least squares, encounter difficulties generating diverse lip movements from speech and require substantial effort in refining the generated outputs. To address t… ▽ More

    Submitted 24 January, 2024; v1 submitted 24 December, 2023; originally announced January 2024.

    Comments: Fix bug related to the font size

  10. arXiv:2311.17919  [pdf, other

    cs.CV

    Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models

    Authors: Daniel Geng, Inbum Park, Andrew Owens

    Abstract: We address the problem of synthesizing multi-view optical illusions: images that change appearance upon a transformation, such as a flip or rotation. We propose a simple, zero-shot method for obtaining these illusions from off-the-shelf text-to-image diffusion models. During the reverse diffusion process, we estimate the noise from different views of a noisy image, and then combine these noise est… ▽ More

    Submitted 2 April, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

    Comments: CVPR 2024 camera ready

  11. arXiv:2309.17012  [pdf, other

    cs.CL cs.AI cs.LG

    Benchmarking Cognitive Biases in Large Language Models as Evaluators

    Authors: Ryan Koo, Minhwa Lee, Vipul Raheja, Jong Inn Park, Zae Myung Kim, Dongyeop Kang

    Abstract: Large Language Models (LLMs) have recently been shown to be effective as automatic evaluators with simple prompting and in-context learning. In this work, we assemble 15 LLMs of four different size ranges and evaluate their output responses by preference ranking from the other LLMs as evaluators, such as System Star is better than System Square. We then evaluate the quality of ranking outputs intr… ▽ More

    Submitted 12 August, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

    Comments: Publishsed at 2024. 29 pages, 9 figures, 14 tables

    ACM Class: I.2.7

  12. arXiv:2308.12585  [pdf, other

    q-bio.NC cs.LG cs.NE nlin.AO

    Persistent learning signals and working memory without continuous attractors

    Authors: Il Memming Park, Ábel Ságodi, Piotr Aleksander Sokół

    Abstract: Neural dynamical systems with stable attractor structures, such as point attractors and continuous attractors, are hypothesized to underlie meaningful temporal behavior that requires working memory. However, working memory may not support useful learning signals necessary to adapt to changes in the temporal structure of the environment. We show that in addition to the continuous attractors that ar… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

  13. arXiv:2308.05542  [pdf, other

    cs.CV

    Robust Asymmetric Loss for Multi-Label Long-Tailed Learning

    Authors: Wongi Park, Inhyuk Park, Sungeun Kim, Jongbin Ryu

    Abstract: In real medical data, training samples typically show long-tailed distributions with multiple labels. Class distribution of the medical data has a long-tailed shape, in which the incidence of different diseases is quite varied, and at the same time, it is not unusual for images taken from symptomatic patients to be multi-label diseases. Therefore, in this paper, we concurrently address these two i… ▽ More

    Submitted 10 August, 2023; originally announced August 2023.

    Journal ref: ICCVW 2023

  14. arXiv:2306.13776  [pdf, other

    cs.CV cs.LG

    Swin-Free: Achieving Better Cross-Window Attention and Efficiency with Size-varying Window

    Authors: Jinkyu Koo, John Yang, Le An, Gwenaelle Cunha Sergio, Su Inn Park

    Abstract: Transformer models have shown great potential in computer vision, following their success in language tasks. Swin Transformer is one of them that outperforms convolution-based architectures in terms of accuracy, while improving efficiency when compared to Vision Transformer (ViT) and its variants, which have quadratic complexity with respect to the input size. Swin Transformer features shifting wi… ▽ More

    Submitted 23 June, 2023; originally announced June 2023.

    Comments: 8 pages, 3 figures

  15. arXiv:2306.01802  [pdf, other

    q-bio.NC cs.LG stat.AP stat.ML

    Linear Time GPs for Inferring Latent Trajectories from Neural Spike Trains

    Authors: Matthew Dowling, Yuan Zhao, Il Memming Park

    Abstract: Latent Gaussian process (GP) models are widely used in neuroscience to uncover hidden state evolutions from sequential observations, mainly in neural activity recordings. While latent GP models provide a principled and powerful solution in theory, the intractable posterior in non-conjugate settings necessitates approximate inference schemes, which may lack scalability. In this work, we propose cvH… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: Published at ICML 2023

  16. arXiv:2305.11278  [pdf, other

    stat.ML cs.LG q-bio.NC

    Real-Time Variational Method for Learning Neural Trajectory and its Dynamics

    Authors: Matthew Dowling, Yuan Zhao, Il Memming Park

    Abstract: Latent variable models have become instrumental in computational neuroscience for reasoning about neural computation. This has fostered the development of powerful offline algorithms for extracting latent neural trajectories from neural recordings. However, despite the potential of real time alternatives to give immediate feedback to experimentalists, and enhance experimental design, they have rec… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: Published at ICLR 2023

  17. arXiv:2305.04468  [pdf, other

    cs.LG cs.AI

    AnomalyBERT: Self-Supervised Transformer for Time Series Anomaly Detection using Data Degradation Scheme

    Authors: Yungi Jeong, Eunseok Yang, Jung Hyun Ryu, Imseong Park, Myungjoo Kang

    Abstract: Mechanical defects in real situations affect observation values and cause abnormalities in multivariate time series, such as sensor values or network data. To perceive abnormalities in such data, it is crucial to understand the temporal context and interrelation between variables simultaneously. The anomaly detection task for time series, especially for unlabeled data, has been a challenging probl… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: 11 pages, Presented at ICLR 2023 workshop on Machine Learning for IoT

  18. arXiv:2303.02060  [pdf, other

    stat.ML cs.LG

    Spectral learning of Bernoulli linear dynamical systems models

    Authors: Iris R. Stone, Yotam Sagiv, Il Memming Park, Jonathan W. Pillow

    Abstract: Latent linear dynamical systems with Bernoulli observations provide a powerful modeling framework for identifying the temporal dynamics underlying binary time series data, which arise in a variety of contexts such as binary decision-making and discrete stochastic processes (e.g., binned neural spike trains). Here we develop a spectral learning method for fast, efficient fitting of probit-Bernoulli… ▽ More

    Submitted 26 July, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

    Comments: Published in Transactions on Machine Learning Research (https://jmlr.org/tmlr/papers/)

    Journal ref: Transactions on Machine Learning Research (2023)

  19. arXiv:2212.04319  [pdf, other

    cs.CV cs.AI

    On the Robustness of Normalizing Flows for Inverse Problems in Imaging

    Authors: Seongmin Hong, Inbum Park, Se Young Chun

    Abstract: Conditional normalizing flows can generate diverse image samples for solving inverse problems. Most normalizing flows for inverse problems in imaging employ the conditional affine coupling layer that can generate diverse images quickly. However, unintended severe artifacts are occasionally observed in the output of them. In this work, we address this critical issue by investigating the origins of… ▽ More

    Submitted 16 March, 2023; v1 submitted 8 December, 2022; originally announced December 2022.

    Comments: 16 pages

  20. arXiv:2211.07077  [pdf, other

    cs.CV

    IFQA: Interpretable Face Quality Assessment

    Authors: Byungho Jo, Donghyeon Cho, In Kyu Park, Sungeun Hong

    Abstract: Existing face restoration models have relied on general assessment metrics that do not consider the characteristics of facial regions. Recent works have therefore assessed their methods using human studies, which is not scalable and involves significant effort. This paper proposes a novel face-centric metric based on an adversarial framework where a generator simulates face restoration and a discr… ▽ More

    Submitted 16 November, 2022; v1 submitted 13 November, 2022; originally announced November 2022.

    Comments: WACV 2023, Code: https://github.com/VCLLab/IFQA

  21. arXiv:2208.08005  [pdf, other

    cs.CL cs.AI

    Transformer Encoder for Social Science

    Authors: Haosen Ge, In Young Park, Xuancheng Qian, Grace Zeng

    Abstract: High-quality text data has become an important data source for social scientists. We have witnessed the success of pretrained deep neural network models, such as BERT and RoBERTa, in recent social science research. In this paper, we propose a compact pretrained deep neural network, Transformer Encoder for Social Science (TESS), explicitly designed to tackle text processing tasks in social science… ▽ More

    Submitted 16 August, 2022; originally announced August 2022.

  22. arXiv:2204.13791  [pdf, other

    cs.CV cs.LG

    Depth Estimation with Simplified Transformer

    Authors: John Yang, Le An, Anurag Dixit, Jinkyu Koo, Su Inn Park

    Abstract: Transformer and its variants have shown state-of-the-art results in many vision tasks recently, ranging from image classification to dense prediction. Despite of their success, limited work has been reported on improving the model efficiency for deployment in latency-critical applications, such as autonomous driving and robotic navigation. In this paper, we aim at improving upon the existing trans… ▽ More

    Submitted 27 May, 2022; v1 submitted 28 April, 2022; originally announced April 2022.

    Comments: Accepted for the CVPR 2022 Transformers For Vision (T4V) workshop

  23. arXiv:2204.01264  [pdf, other

    cs.CV

    Probabilistic Implicit Scene Completion

    Authors: Dongsu Zhang, Changwoon Choi, Inbum Park, Young Min Kim

    Abstract: We propose a probabilistic shape completion method extended to the continuous geometry of large-scale 3D scenes. Real-world scans of 3D scenes suffer from a considerable amount of missing data cluttered with unsegmented objects. The problem of shape completion is inherently ill-posed, and high-quality result requires scalable solutions that consider multiple possible outcomes. We employ the Genera… ▽ More

    Submitted 4 April, 2022; originally announced April 2022.

    Comments: Accepted to ICLR 2022 as spotlight, code available at https://github.com/96lives/gca

  24. Human and Scene Motion Deblurring using Pseudo-blur Synthesizer

    Authors: Jonathan Samuel Lumentut, In Kyu Park

    Abstract: Present-day deep learning-based motion deblurring methods utilize the pair of synthetic blur and sharp data to regress any particular framework. This task is designed for directly translating a blurry image input into its restored version as output. The aforementioned approach relies heavily on the quality of the synthetic blurry data, which are only available before the training stage. Handling t… ▽ More

    Submitted 24 November, 2021; originally announced November 2021.

  25. arXiv:2109.04463  [pdf, other

    cs.LG q-bio.NC

    Neural Latents Benchmark '21: Evaluating latent variable models of neural population activity

    Authors: Felix Pei, Joel Ye, David Zoltowski, Anqi Wu, Raeed H. Chowdhury, Hansem Sohn, Joseph E. O'Doherty, Krishna V. Shenoy, Matthew T. Kaufman, Mark Churchland, Mehrdad Jazayeri, Lee E. Miller, Jonathan Pillow, Il Memming Park, Eva L. Dyer, Chethan Pandarinath

    Abstract: Advances in neural recording present increasing opportunities to study neural activity in unprecedented detail. Latent variable models (LVMs) are promising tools for analyzing this rich activity across diverse neural systems and behaviors, as LVMs do not depend on known relationships between the activity and external experimental variables. However, progress with LVMs for neuronal population activ… ▽ More

    Submitted 17 January, 2022; v1 submitted 9 September, 2021; originally announced September 2021.

  26. arXiv:2107.07098  [pdf, other

    stat.ML cs.LG

    Hida-Matérn Kernel

    Authors: Matthew Dowling, Piotr Sokół, Il Memming Park

    Abstract: We present the class of Hida-Matérn kernels, which is the canonical family of covariance functions over the entire space of stationary Gauss-Markov Processes. It extends upon Matérn kernels, by allowing for flexible construction of priors over processes with oscillatory components. Any stationary kernel, including the widely used squared-exponential and spectral mixture kernels, are either directl… ▽ More

    Submitted 27 December, 2021; v1 submitted 14 July, 2021; originally announced July 2021.

  27. Deep Context- and Relation-Aware Learning for Aspect-based Sentiment Analysis

    Authors: Shinhyeok Oh, Dongyub Lee, Taesun Whang, IlNam Park, Gaeun Seo, EungGyun Kim, Harksoo Kim

    Abstract: Existing works for aspect-based sentiment analysis (ABSA) have adopted a unified approach, which allows the interactive relations among subtasks. However, we observe that these methods tend to predict polarities based on the literal meaning of aspect and opinion terms and mainly consider relations implicitly among subtasks at the word level. In addition, identifying multiple aspect-opinion pairs w… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

    Comments: Accepted to ACL-IJCNLP 2021

  28. arXiv:2103.16851  [pdf, other

    cs.CV

    Attention Map-guided Two-stage Anomaly Detection using Hard Augmentation

    Authors: Jou Won Song, Kyeongbo Kong, Ye In Park, Suk-Ju Kang

    Abstract: Anomaly detection is a task that recognizes whether an input sample is included in the distribution of a target normal class or an anomaly class. Conventional generative adversarial network (GAN)-based methods utilize an entire image including foreground and background as an input. However, in these methods, a useless region unrelated to the normal class (e.g., unrelated background) is learned as… ▽ More

    Submitted 31 March, 2021; originally announced March 2021.

  29. arXiv:2102.11517  [pdf, other

    cs.LG cs.DB cs.SI

    SliceNStitch: Continuous CP Decomposition of Sparse Tensor Streams

    Authors: Taehyung Kwon, Inkyu Park, Dongjin Lee, Kijung Shin

    Abstract: Consider traffic data (i.e., triplets in the form of source-destination-timestamp) that grow over time. Tensors (i.e., multi-dimensional arrays) with a time mode are widely used for modeling and analyzing such multi-aspect data streams. In such tensors, however, new entries are added only once per period, which is often an hour, a day, or even a year. This discreteness of tensors has limited their… ▽ More

    Submitted 2 March, 2021; v1 submitted 23 February, 2021; originally announced February 2021.

    Comments: Updated Figures 4, 5, 6, 7, and 8 after fixing a bug in preprocessing the Divvy dataset. To appear at the 37th IEEE International Conference on Data Engineering (ICDE '21)

    ACM Class: H.2.8

  30. arXiv:2012.04729  [pdf, other

    cs.LG

    On 1/n neural representation and robustness

    Authors: Josue Nassar, Piotr Aleksander Sokol, SueYeon Chung, Kenneth D. Harris, Il Memming Park

    Abstract: Understanding the nature of representation in neural networks is a goal shared by neuroscience and machine learning. It is therefore exciting that both fields converge not only on shared questions but also on similar approaches. A pressing question in these areas is understanding how the structure of the representation used by neural networks affects both their generalization, and robustness to pe… ▽ More

    Submitted 8 December, 2020; originally announced December 2020.

  31. arXiv:2010.12362  [pdf, other

    stat.ML cs.LG

    Rescuing neural spike train models from bad MLE

    Authors: Diego M. Arribas, Yuan Zhao, Il Memming Park

    Abstract: The standard approach to fitting an autoregressive spike train model is to maximize the likelihood for one-step prediction. This maximum likelihood estimation (MLE) often leads to models that perform poorly when generating samples recursively for more than one time step. Moreover, the generated spike trains can fail to capture important features of the data and even show diverging firing rates. To… ▽ More

    Submitted 23 October, 2020; originally announced October 2020.

    Comments: To appear in Advances in Neural Information Processing 2020

  32. arXiv:2009.01362  [pdf, other

    stat.ML cs.LG

    Non-parametric generalized linear model

    Authors: Matthew Dowling, Yuan Zhao, Il Memming Park

    Abstract: A fundamental problem in statistical neuroscience is to model how neurons encode information by analyzing electrophysiological recordings. A popular and widely-used approach is to fit the spike trains with an autoregressive point process model. These models are characterized by a set of convolutional temporal filters, whose subsequent analysis can help reveal how neurons encode stimuli, interact w… ▽ More

    Submitted 2 September, 2020; originally announced September 2020.

  33. Integrated Eojeol Embedding for Erroneous Sentence Classification in Korean Chatbots

    Authors: DongHyun Choi, IlNam Park, Myeong Cheol Shin, EungGyun Kim, Dong Ryeol Shin

    Abstract: This paper attempts to analyze the Korean sentence classification system for a chatbot. Sentence classification is the task of classifying an input sentence based on predefined categories. However, spelling or space error contained in the input sentence causes problems in morphological analysis and tokenization. This paper proposes a novel approach of Integrated Eojeol (Korean syntactic word separ… ▽ More

    Submitted 12 April, 2020; originally announced April 2020.

    Comments: 9 pages, 2 figures

    Journal ref: IEEE Access, 2021

  34. arXiv:1912.10687  [pdf, other

    cs.CV

    5D Light Field Synthesis from a Monocular Video

    Authors: Kyuho Bae, Andre Ivan, Hajime Nagahara, In Kyu Park

    Abstract: Commercially available light field cameras have difficulty in capturing 5D (4D + time) light field videos. They can only capture still light filed images or are excessively expensive for normal users to capture the light field video. To tackle this problem, we propose a deep learning-based method for synthesizing a light field video from a monocular video. We propose a new synthetic light field vi… ▽ More

    Submitted 23 December, 2019; originally announced December 2019.

  35. arXiv:1912.10427  [pdf, other

    cs.CV

    Joint Face Super-Resolution and Deblurring Using a Generative Adversarial Network

    Authors: Jung Un Yun, In Kyu Park

    Abstract: Facial image super-resolution (SR) is an important preprocessing for facial image analysis, face recognition, and image-based 3D face reconstruction. Recent convolutional neural network (CNN) based method has shown excellent performance by learning mapping relation using pairs of low-resolution (LR) and high-resolution (HR) facial images. However, since the HR facial image reconstruction using CNN… ▽ More

    Submitted 22 December, 2019; originally announced December 2019.

  36. Joint Spatial and Angular Super-Resolution from a Single Image

    Authors: Andre Ivan, Williem, In Kyu Park

    Abstract: Synthesizing a densely sampled light field from a single image is highly beneficial for many applications. Moreover, jointly solving both angular and spatial super-resolution problem also introduces new possibilities in light field imaging. The conventional method relies on physical-based rendering and a secondary network to solve the angular super-resolution problem. In addition, pixel-based loss… ▽ More

    Submitted 27 June, 2020; v1 submitted 22 November, 2019; originally announced November 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1903.12364

    Journal ref: IEEE Access, vol. 8, June 2020 page(s): 112562-112573

  37. Streaming Variational Monte Carlo

    Authors: Yuan Zhao, Josue Nassar, Ian Jordan, Mónica Bugallo, Il Memming Park

    Abstract: Nonlinear state-space models are powerful tools to describe dynamical structures in complex time series. In a streaming setting where data are processed one sample at a time, simultaneous inference of the state and its nonlinear dynamics has posed significant challenges in practice. We develop a novel online learning framework, leveraging variational inference and sequential Monte Carlo, which ena… ▽ More

    Submitted 8 November, 2021; v1 submitted 4 June, 2019; originally announced June 2019.

  38. Gated recurrent units viewed through the lens of continuous time dynamical systems

    Authors: Ian D. Jordan, Piotr Aleksander Sokol, Il Memming Park

    Abstract: Gated recurrent units (GRUs) are specialized memory elements for building recurrent neural networks. Despite their incredible success on various tasks, including extracting dynamics underlying neural data, little is understood about the specific dynamics representable in a GRU network. As a result, it is both difficult to know a priori how successful a GRU network will perform on a given task, and… ▽ More

    Submitted 28 July, 2021; v1 submitted 3 June, 2019; originally announced June 2019.

    Journal ref: Frontiers in Computational Neuroscience, 2021

  39. arXiv:1904.06109  [pdf, other

    cs.CV

    Face De-occlusion using 3D Morphable Model and Generative Adversarial Network

    Authors: Xiaowei Yuan, In Kyu Park

    Abstract: In recent decades, 3D morphable model (3DMM) has been commonly used in image-based photorealistic 3D face reconstruction. However, face images are often corrupted by serious occlusion by non-face objects including eyeglasses, masks, and hands. Such objects block the correct capture of landmarks and shading information. Therefore, the reconstructed 3D face model is hardly reusable. In this paper, a… ▽ More

    Submitted 6 September, 2019; v1 submitted 12 April, 2019; originally announced April 2019.

    Comments: Presented in ICCV 2019

  40. arXiv:1904.03326  [pdf, other

    cs.CV

    360 Panorama Synthesis from a Sparse Set of Images with Unknown Field of View

    Authors: Julius Surya Sumantri, In Kyu Park

    Abstract: 360 images represent scenes captured in all possible viewing directions and enable viewers to navigate freely around the scene thereby providing an immersive experience. Conversely, conventional images represent scenes in a single viewing direction with a small or limited field of view (FOV). As a result, only certain parts of the scenes are observed, and valuable information about the surrounding… ▽ More

    Submitted 22 December, 2019; v1 submitted 5 April, 2019; originally announced April 2019.

    Comments: Presented in WACV 2020

  41. Fast and Full-Resolution Light Field Deblurring using a Deep Neural Network

    Authors: Jonathan Samuel Lumentut, Tae Hyun Kim, Ravi Ramamoorthi, In Kyu Park

    Abstract: Restoring a sharp light field image from its blurry input has become essential due to the increasing popularity of parallax-based image processing. State-of-the-art blind light field deblurring methods suffer from several issues such as slow processing, reduced spatial size, and a limited motion blur model. In this work, we address these challenging problems by generating a complex blurry light fi… ▽ More

    Submitted 31 March, 2019; originally announced April 2019.

    Comments: 9 pages, 8 figures

    Journal ref: IEEE Signal Processing Letters, vol. 26, no. 12, pp. 1788-1792, December 2019

  42. arXiv:1903.12364  [pdf, other

    cs.CV

    Synthesizing a 4D Spatio-Angular Consistent Light Field from a Single Image

    Authors: Andre Ivan, Williem, In Kyu Park

    Abstract: Synthesizing a densely sampled light field from a single image is highly beneficial for many applications. The conventional method reconstructs a depth map and relies on physical-based rendering and a secondary network to improve the synthesized novel views. Simple pixel-based loss also limits the network by making it rely on pixel intensity cue rather than geometric reasoning. In this study, we s… ▽ More

    Submitted 29 March, 2019; originally announced March 2019.

  43. arXiv:1811.12386  [pdf, other

    stat.ML cs.LG

    Tree-Structured Recurrent Switching Linear Dynamical Systems for Multi-Scale Modeling

    Authors: Josue Nassar, Scott W. Linderman, Monica Bugallo, Il Memming Park

    Abstract: Many real-world systems studied are governed by complex, nonlinear dynamics. By modeling these dynamics, we can gain insight into how these systems work, make predictions about how they will behave, and develop strategies for controlling them. While there are many methods for modeling nonlinear dynamical systems, existing techniques face a trade off between offering interpretable descriptions and… ▽ More

    Submitted 4 June, 2019; v1 submitted 29 November, 2018; originally announced November 2018.

  44. Multi-Scale Distributed Representation for Deep Learning and its Application to b-Jet Tagging

    Authors: Jason Lee, Inkyu Park, Sangnam Park

    Abstract: Recently machine learning algorithms based on deep layered artificial neural networks (DNNs) have been applied to a wide variety of high energy physics problems such as jet tagging or event classification. We explore a simple but effective preprocessing step which transforms each real-valued observational quantity or input feature into a binary number with a fixed number of digits. Each binary dig… ▽ More

    Submitted 29 November, 2018; originally announced November 2018.

    Comments: 13 pages, 8 figures

    Journal ref: J.Korean Phys.Soc. 72 (2018) no.11, 1292-1300

  45. arXiv:1810.03785  [pdf, other

    stat.ML cs.LG

    Information Geometry of Orthogonal Initializations and Training

    Authors: Piotr A. Sokol, Il Memming Park

    Abstract: Recently mean field theory has been successfully used to analyze properties of wide, random neural networks. It gave rise to a prescriptive theory for initializing feed-forward neural networks with orthogonal weights, which ensures that both the forward propagated activations and the backpropagated gradients are near $\ell_2$ isometries and as a consequence training is orders of magnitude faster.… ▽ More

    Submitted 4 June, 2019; v1 submitted 8 October, 2018; originally announced October 2018.

    Comments: 10 pages and 5 figures; 5 page appendix

  46. arXiv:1711.10918  [pdf, other

    cs.CV

    Joint Blind Motion Deblurring and Depth Estimation of Light Field

    Authors: Dongwoo Lee, Haesol Park, In Kyu Park, Kyoung Mu Lee

    Abstract: Removing camera motion blur from a single light field is a challenging task since it is highly ill-posed inverse problem. The problem becomes even worse when blur kernel varies spatially due to scene depth variation and high-order camera motion. In this paper, we propose a novel algorithm to estimate all blur model variables jointly, including latent sub-aperture image, camera motion, and scene de… ▽ More

    Submitted 14 June, 2018; v1 submitted 29 November, 2017; originally announced November 2017.

  47. arXiv:1310.5347  [pdf, other

    stat.ML cs.LG

    Bayesian Extensions of Kernel Least Mean Squares

    Authors: Il Memming Park, Sohan Seth, Steven Van Vaerenbergh

    Abstract: The kernel least mean squares (KLMS) algorithm is a computationally efficient nonlinear adaptive filtering method that "kernelizes" the celebrated (linear) least mean squares algorithm. We demonstrate that the least mean squares algorithm is closely related to the Kalman filtering, and thus, the KLMS can be interpreted as an approximate Bayesian filtering method. This allows us to systematically d… ▽ More

    Submitted 20 October, 2013; originally announced October 2013.

    Comments: 7 pages, 4 fiures

  48. arXiv:1302.0328  [pdf, other

    cs.IT

    Bayesian Entropy Estimation for Countable Discrete Distributions

    Authors: Evan Archer, Il Memming Park, Jonathan Pillow

    Abstract: We consider the problem of estimating Shannon's entropy $H$ from discrete data, in cases where the number of possible symbols is unknown or even countably infinite. The Pitman-Yor process, a generalization of Dirichlet process, provides a tractable prior distribution over the space of countably infinite discrete distributions, and has found major applications in Bayesian non-parametric statistics… ▽ More

    Submitted 9 April, 2014; v1 submitted 1 February, 2013; originally announced February 2013.

    Comments: 38 pages LaTeX. Revised and resubmitted to JMLR

  49. arXiv:1202.2143  [pdf, other

    stat.ME cs.LG stat.ML

    Active Bayesian Optimization: Minimizing Minimizer Entropy

    Authors: Il Memming Park, Marcel Nassar, Mijung Park

    Abstract: The ultimate goal of optimization is to find the minimizer of a target function.However, typical criteria for active optimization often ignore the uncertainty about the minimizer. We propose a novel criterion for global optimization and an associated sequential active learning strategy using Gaussian processes.Our criterion is the reduction of uncertainty in the posterior distribution of the funct… ▽ More

    Submitted 9 February, 2012; originally announced February 2012.

  50. arXiv:0901.3475  [pdf, ps, other

    cs.IT

    Efficient decoding algorithm using triangularity of $\mbf{R}$ matrix of QR-decomposition

    Authors: In Sook Park

    Abstract: An efficient decoding algorithm named `divided decoder' is proposed in this paper. Divided decoding can be combined with any decoder using QR-decomposition and offers different pairs of performance and complexity. Divided decoding provides various combinations of two or more different searching algorithms. Hence it makes flexibility in error rate and complexity for the algorithms using it. We ca… ▽ More

    Submitted 22 January, 2009; originally announced January 2009.

    Comments: This paper is submitted to IEEE transactions on Information theory