Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–39 of 39 results for author: Park, J S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.20980  [pdf, other

    cs.DC cs.CR

    Impact of Conflicting Transactions in Blockchain: Detecting and Mitigating Potential Attacks

    Authors: Faisal Haque Bappy, Kamrul Hasan, Joon S. Park, Carlos Caicedo, Tariqul Islam

    Abstract: Conflicting transactions within blockchain networks not only pose performance challenges but also introduce security vulnerabilities, potentially facilitating malicious attacks. In this paper, we explore the impact of conflicting transactions on blockchain attack vectors. Through modeling and simulation, we delve into the dynamics of four pivotal attacks - block withholding, double spending, balan… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

  2. arXiv:2407.01942  [pdf, other

    cs.AI cs.CL cs.CV

    Certainly Uncertain: A Benchmark and Metric for Multimodal Epistemic and Aleatoric Awareness

    Authors: Khyathi Raghavi Chandu, Linjie Li, Anas Awadalla, Ximing Lu, Jae Sung Park, Jack Hessel, Lijuan Wang, Yejin Choi

    Abstract: The ability to acknowledge the inevitable uncertainty in their knowledge and reasoning is a prerequisite for AI systems to be truly truthful and reliable. In this paper, we present a taxonomy of uncertainty specific to vision-language AI systems, distinguishing between epistemic uncertainty (arising from a lack of information) and aleatoric uncertainty (due to inherent unpredictability), and furth… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 26 pages

  3. arXiv:2405.18400  [pdf, other

    cs.CL cs.LG

    Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass

    Authors: Ethan Shen, Alan Fan, Sarah M. Pratt, Jae Sung Park, Matthew Wallingford, Sham M. Kakade, Ari Holtzman, Ranjay Krishna, Ali Farhadi, Aditya Kusupati

    Abstract: Many applications today provide users with multiple auto-complete drafts as they type, including GitHub's code completion, Gmail's smart compose, and Apple's messaging auto-suggestions. Under the hood, language models support this by running an autoregressive inference pass to provide a draft. Consequently, providing $k$ drafts to the user requires running an expensive language model $k$ times. To… ▽ More

    Submitted 24 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: 22 pages, 15 figures

  4. arXiv:2403.10748  [pdf, other

    cs.CE cs.LG cs.MS math.NA

    A Comprehensive Review of Latent Space Dynamics Identification Algorithms for Intrusive and Non-Intrusive Reduced-Order-Modeling

    Authors: Christophe Bonneville, Xiaolong He, April Tran, Jun Sur Park, William Fries, Daniel A. Messenger, Siu Wun Cheung, Yeonjong Shin, David M. Bortz, Debojyoti Ghosh, Jiun-Shyan Chen, Jonathan Belof, Youngsoo Choi

    Abstract: Numerical solvers of partial differential equations (PDEs) have been widely employed for simulating physical systems. However, the computational cost remains a major bottleneck in various scientific and engineering applications, which has motivated the development of reduced-order models (ROMs). Recently, machine-learning-based ROMs have gained significant popularity and are promising for addressi… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  5. arXiv:2403.05848  [pdf, other

    cs.LG math.DS

    tLaSDI: Thermodynamics-informed latent space dynamics identification

    Authors: Jun Sur Richard Park, Siu Wun Cheung, Youngsoo Choi, Yeonjong Shin

    Abstract: We propose a latent space dynamics identification method, namely tLaSDI, that embeds the first and second principles of thermodynamics. The latent variables are learned through an autoencoder as a nonlinear dimension reduction model. The latent dynamics are constructed by a neural network-based model that precisely preserves certain structures for the thermodynamic laws through the GENERIC formali… ▽ More

    Submitted 21 March, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

    Comments: 32 pages, 8 figures

  6. arXiv:2401.03568  [pdf, other

    cs.AI cs.HC cs.LG

    Agent AI: Surveying the Horizons of Multimodal Interaction

    Authors: Zane Durante, Qiuyuan Huang, Naoki Wake, Ran Gong, Jae Sung Park, Bidipta Sarkar, Rohan Taori, Yusuke Noda, Demetri Terzopoulos, Yejin Choi, Katsushi Ikeuchi, Hoi Vo, Li Fei-Fei, Jianfeng Gao

    Abstract: Multi-modal AI systems will likely become a ubiquitous presence in our everyday lives. A promising approach to making these systems more interactive is to embody them as agents within physical and virtual environments. At present, systems leverage existing foundation models as the basic building blocks for the creation of embodied agents. Embedding agents within such environments facilitates the a… ▽ More

    Submitted 25 January, 2024; v1 submitted 7 January, 2024; originally announced January 2024.

  7. arXiv:2312.04837  [pdf, other

    cs.AI cs.CL cs.CV

    Localized Symbolic Knowledge Distillation for Visual Commonsense Models

    Authors: Jae Sung Park, Jack Hessel, Khyathi Raghavi Chandu, Paul Pu Liang, Ximing Lu, Peter West, Youngjae Yu, Qiuyuan Huang, Jianfeng Gao, Ali Farhadi, Yejin Choi

    Abstract: Instruction following vision-language (VL) models offer a flexible interface that supports a broad range of multimodal tasks in a zero-shot fashion. However, interfaces that operate on full images do not directly enable the user to "point to" and access specific regions within images. This capability is important not only to support reference-grounded VL benchmarks, but also, for practical applica… ▽ More

    Submitted 12 December, 2023; v1 submitted 8 December, 2023; originally announced December 2023.

    Comments: Neurips 2023

  8. arXiv:2311.04287  [pdf, other

    cs.CV cs.LG

    Holistic Evaluation of Text-To-Image Models

    Authors: Tony Lee, Michihiro Yasunaga, Chenlin Meng, Yifan Mai, Joon Sung Park, Agrim Gupta, Yunzhi Zhang, Deepak Narayanan, Hannah Benita Teufel, Marco Bellagente, Minguk Kang, Taesung Park, Jure Leskovec, Jun-Yan Zhu, Li Fei-Fei, Jiajun Wu, Stefano Ermon, Percy Liang

    Abstract: The stunning qualitative improvement of recent text-to-image models has led to their widespread attention and adoption. However, we lack a comprehensive quantitative understanding of their capabilities and risks. To fill this gap, we introduce a new benchmark, Holistic Evaluation of Text-to-Image Models (HEIM). Whereas previous evaluations focus mostly on text-image alignment and image quality, we… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: NeurIPS 2023. First three authors contributed equally

  9. arXiv:2308.04453  [pdf, other

    cs.CR

    Towards Immutability: A Secure and Efficient Auditing Framework for Cloud Supporting Data Integrity and File Version Control

    Authors: Faisal Haque Bappy, Saklain Zaman, Tariqul Islam, Redwan Ahmed Rizvee, Joon S. Park, Kamrul Hasan

    Abstract: Although wide-scale integration of cloud services with myriad applications increases quality of services (QoS) for enterprise users, verifying the existence and manipulation of stored cloud information remains an open research problem. Decentralized blockchain-based solutions are becoming more appealing for cloud auditing environments because of the immutable nature of blockchain. However, the dec… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

  10. arXiv:2307.04280  [pdf, ps, other

    cs.HC

    Shaping the Emerging Norms of Using Large Language Models in Social Computing Research

    Authors: Hong Shen, Tianshi Li, Toby Jia-Jun Li, Joon Sung Park, Diyi Yang

    Abstract: The emergence of Large Language Models (LLMs) has brought both excitement and concerns to social computing research. On the one hand, LLMs offer unprecedented capabilities in analyzing vast amounts of textual data and generating human-like responses, enabling researchers to delve into complex social phenomena. On the other hand, concerns are emerging regarding the validity, privacy, and ethics of… ▽ More

    Submitted 9 July, 2023; originally announced July 2023.

  11. arXiv:2305.00970  [pdf, other

    cs.CV

    ArK: Augmented Reality with Knowledge Interactive Emergent Ability

    Authors: Qiuyuan Huang, Jae Sung Park, Abhinav Gupta, Paul Bennett, Ran Gong, Subhojit Som, Baolin Peng, Owais Khan Mohammed, Chris Pal, Yejin Choi, Jianfeng Gao

    Abstract: Despite the growing adoption of mixed reality and interactive AI agents, it remains challenging for these systems to generate high quality 2D/3D scenes in unseen environments. The common practice requires deploying an AI agent to collect large amounts of data for model training for every new task. This process is costly, or even impossible, for many domains. In this study, we develop an infinite a… ▽ More

    Submitted 1 May, 2023; originally announced May 2023.

    Report number: EFI-94-11

  12. arXiv:2304.03442  [pdf, other

    cs.HC cs.AI cs.LG

    Generative Agents: Interactive Simulacra of Human Behavior

    Authors: Joon Sung Park, Joseph C. O'Brien, Carrie J. Cai, Meredith Ringel Morris, Percy Liang, Michael S. Bernstein

    Abstract: Believable proxies of human behavior can empower interactive applications ranging from immersive environments to rehearsal spaces for interpersonal communication to prototyping tools. In this paper, we introduce generative agents--computational software agents that simulate believable human behavior. Generative agents wake up, cook breakfast, and head to work; artists paint, while authors write; t… ▽ More

    Submitted 5 August, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

  13. arXiv:2212.09746  [pdf, other

    cs.CL

    Evaluating Human-Language Model Interaction

    Authors: Mina Lee, Megha Srivastava, Amelia Hardy, John Thickstun, Esin Durmus, Ashwin Paranjape, Ines Gerard-Ursin, Xiang Lisa Li, Faisal Ladhak, Frieda Rong, Rose E. Wang, Minae Kwon, Joon Sung Park, Hancheng Cao, Tony Lee, Rishi Bommasani, Michael Bernstein, Percy Liang

    Abstract: Many real-world applications of language models (LMs), such as writing assistance and code autocomplete, involve human-LM interaction. However, most benchmarks are non-interactive in that a model produces output without human involvement. To evaluate human-LM interaction, we develop a new framework, Human-AI Language-based Interaction Evaluation (HALIE), that defines the components of interactive… ▽ More

    Submitted 5 January, 2024; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: Authored by the Center for Research on Foundation Models (CRFM) at the Stanford Institute for Human-Centered Artificial Intelligence (HAI)

  14. arXiv:2208.13094  [pdf, other

    cs.HC

    Measuring the Prevalence of Anti-Social Behavior in Online Communities

    Authors: Joon Sung Park, Joseph Seering, Michael S. Bernstein

    Abstract: With increasing attention to online anti-social behaviors such as personal attacks and bigotry, it is critical to have an accurate accounting of how widespread anti-social behaviors are. In this paper, we empirically measure the prevalence of anti-social behavior in one of the world's most popular online community platforms. We operationalize this goal as measuring the proportion of unmoderated co… ▽ More

    Submitted 27 August, 2022; originally announced August 2022.

    Comments: This work will appear in the Proc. ACM Hum.-Comput. Interact. 6, CSCW (CSCW'22)

  15. arXiv:2208.04024  [pdf, other

    cs.HC

    Social Simulacra: Creating Populated Prototypes for Social Computing Systems

    Authors: Joon Sung Park, Lindsay Popowski, Carrie J. Cai, Meredith Ringel Morris, Percy Liang, Michael S. Bernstein

    Abstract: Social computing prototypes probe the social behaviors that may arise in an envisioned system design. This prototyping practice is currently limited to recruiting small groups of people. Unfortunately, many challenges do not arise until a system is populated at a larger scale. Can a designer understand how a social system might behave when populated, and make adjustments to the design before the s… ▽ More

    Submitted 8 August, 2022; originally announced August 2022.

    Comments: This work will appear in the 35th Annual ACM Symposium on User Interface Software and Technology (UIST '22)

  16. arXiv:2206.02837  [pdf, other

    eess.IV cs.CV

    EVAC+: Multi-scale V-net with Deep Feature CRF Layers for Brain Extraction

    Authors: Jong Sung Park, Shreyas Fadnavis, Eleftherios Garyfallidis

    Abstract: Brain extraction is one of the first steps of pre-processing 3D brain MRI data and a prerequisite for any forthcoming brain imaging analyses. However, it is not a simple segmentation problem due to the complex structure of the brain and human head. Although multiple solutions have been proposed in the literature, we are still far from having truly robust methods. While previous methods have used m… ▽ More

    Submitted 5 January, 2023; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: Replaced with advancements in the model and results

  17. arXiv:2202.04800  [pdf, other

    cs.CV cs.CL

    The Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning

    Authors: Jack Hessel, Jena D. Hwang, Jae Sung Park, Rowan Zellers, Chandra Bhagavatula, Anna Rohrbach, Kate Saenko, Yejin Choi

    Abstract: Humans have remarkable capacity to reason abductively and hypothesize about what lies beyond the literal content of an image. By identifying concrete visual clues scattered throughout a scene, we almost can't help but draw probable inferences beyond the literal scene based on our everyday experience and knowledge about the world. For example, if we see a "20 mph" sign alongside a road, we might as… ▽ More

    Submitted 25 July, 2022; v1 submitted 9 February, 2022; originally announced February 2022.

    Comments: code, data, models at http://visualabduction.com/

    Journal ref: ECCV 2022

  18. arXiv:2202.02950  [pdf, other

    cs.HC cs.AI cs.LG

    Jury Learning: Integrating Dissenting Voices into Machine Learning Models

    Authors: Mitchell L. Gordon, Michelle S. Lam, Joon Sung Park, Kayur Patel, Jeffrey T. Hancock, Tatsunori Hashimoto, Michael S. Bernstein

    Abstract: Whose labels should a machine learning (ML) algorithm learn to emulate? For ML tasks ranging from online comment toxicity to misinformation detection to medical diagnosis, different groups in society may have irreconcilable disagreements about ground truth labels. Supervised ML today resolves these label disagreements implicitly using majority vote, which overrides minority groups' labels. We intr… ▽ More

    Submitted 7 February, 2022; originally announced February 2022.

    Comments: To appear at CHI 2022

  19. arXiv:2108.07258  [pdf, other

    cs.LG cs.AI cs.CY

    On the Opportunities and Risks of Foundation Models

    Authors: Rishi Bommasani, Drew A. Hudson, Ehsan Adeli, Russ Altman, Simran Arora, Sydney von Arx, Michael S. Bernstein, Jeannette Bohg, Antoine Bosselut, Emma Brunskill, Erik Brynjolfsson, Shyamal Buch, Dallas Card, Rodrigo Castellon, Niladri Chatterji, Annie Chen, Kathleen Creel, Jared Quincy Davis, Dora Demszky, Chris Donahue, Moussa Doumbouya, Esin Durmus, Stefano Ermon, John Etchemendy, Kawin Ethayarajh , et al. (89 additional authors not shown)

    Abstract: AI is undergoing a paradigm shift with the rise of models (e.g., BERT, DALL-E, GPT-3) that are trained on broad data at scale and are adaptable to a wide range of downstream tasks. We call these models foundation models to underscore their critically central yet incomplete character. This report provides a thorough account of the opportunities and risks of foundation models, ranging from their cap… ▽ More

    Submitted 12 July, 2022; v1 submitted 16 August, 2021; originally announced August 2021.

    Comments: Authored by the Center for Research on Foundation Models (CRFM) at the Stanford Institute for Human-Centered Artificial Intelligence (HAI). Report page with citation guidelines: https://crfm.stanford.edu/report.html

  20. arXiv:2106.02636  [pdf, other

    cs.CV cs.CL cs.LG

    MERLOT: Multimodal Neural Script Knowledge Models

    Authors: Rowan Zellers, Ximing Lu, Jack Hessel, Youngjae Yu, Jae Sung Park, Jize Cao, Ali Farhadi, Yejin Choi

    Abstract: As humans, we understand events in the visual world contextually, performing multimodal reasoning across time to make inferences about the past, present, and future. We introduce MERLOT, a model that learns multimodal script knowledge by watching millions of YouTube videos with transcribed speech -- in an entirely label-free, self-supervised manner. By pretraining with a mix of both frame-level (s… ▽ More

    Submitted 21 October, 2021; v1 submitted 4 June, 2021; originally announced June 2021.

    Comments: project page at https://rowanzellers.com/merlot; NeurIPS 2021 camera ready

  21. arXiv:2106.01487  [pdf, other

    cs.LG cs.CV

    LLC: Accurate, Multi-purpose Learnt Low-dimensional Binary Codes

    Authors: Aditya Kusupati, Matthew Wallingford, Vivek Ramanujan, Raghav Somani, Jae Sung Park, Krishna Pillutla, Prateek Jain, Sham Kakade, Ali Farhadi

    Abstract: Learning binary representations of instances and classes is a classical problem with several high potential applications. In modern settings, the compression of high-dimensional neural representations to low-dimensional binary codes is a challenging task and often require large bit-codes to be accurate. In this work, we propose a novel method for Learning Low-dimensional binary Codes (LLC) for ins… ▽ More

    Submitted 6 October, 2021; v1 submitted 2 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021 Camera Ready. 19 pages, 6 figures

  22. Understanding the Representation and Representativeness of Age in AI Data Sets

    Authors: Joon Sung Park, Michael S. Bernstein, Robin N. Brewer, Ece Kamar, Meredith Ringel Morris

    Abstract: A diverse representation of different demographic groups in AI training data sets is important in ensuring that the models will work for a large range of users. To this end, recent efforts in AI fairness and inclusion have advocated for creating AI data sets that are well-balanced across race, gender, socioeconomic status, and disability status. In this paper, we contribute to this line of work by… ▽ More

    Submitted 6 May, 2021; v1 submitted 10 March, 2021; originally announced March 2021.

    Comments: 9 pages

    Journal ref: In Proceedings of the 2021 AAAI/ACM Conference on AI, Ethics, and Society (AIES '21)

  23. arXiv:2102.07028  [pdf, other

    cs.LG cs.AI

    ThetA -- fast and robust clustering via a distance parameter

    Authors: Eleftherios Garyfallidis, Shreyas Fadnavis, Jong Sung Park, Bramsh Qamar Chandio, Javier Guaje, Serge Koudoro, Nasim Anousheh

    Abstract: Clustering is a fundamental problem in machine learning where distance-based approaches have dominated the field for many decades. This set of problems is often tackled by partitioning the data into K clusters where the number of clusters is chosen apriori. While significant progress has been made on these lines over the years, it is well established that as the number of clusters or dimensions in… ▽ More

    Submitted 1 March, 2021; v1 submitted 13 February, 2021; originally announced February 2021.

  24. arXiv:2010.07526  [pdf, other

    cs.CL cs.CV

    Natural Language Rationales with Full-Stack Visual Reasoning: From Pixels to Semantic Frames to Commonsense Graphs

    Authors: Ana Marasović, Chandra Bhagavatula, Jae Sung Park, Ronan Le Bras, Noah A. Smith, Yejin Choi

    Abstract: Natural language rationales could provide intuitive, higher-level explanations that are easily understandable by humans, complementing the more broadly studied lower-level explanations based on gradients or attention weights. We present the first study focused on generating natural language rationales across several complex visual reasoning tasks: visual commonsense reasoning, visual-textual entai… ▽ More

    Submitted 15 October, 2020; originally announced October 2020.

    Comments: Accepted to Findings of EMNLP

  25. arXiv:2008.09791  [pdf, other

    cs.CV

    Identity-Aware Multi-Sentence Video Description

    Authors: Jae Sung Park, Trevor Darrell, Anna Rohrbach

    Abstract: Standard video and movie description tasks abstract away from person identities, thus failing to link identities across sentences. We propose a multi-sentence Identity-Aware Video Description task, which overcomes this limitation and requires to re-identify persons locally within a set of consecutive clips. We introduce an auxiliary task of Fill-in the Identity, that aims to predict persons' IDs c… ▽ More

    Submitted 22 August, 2020; originally announced August 2020.

    Comments: Project link at https://sites.google.com/site/describingmovies/lsmdc-2019/

  26. arXiv:2006.03193  [pdf, other

    eess.SP cs.LG

    LSTM-based Anomaly Detection for Non-linear Dynamical System

    Authors: Yue Tan, Chunjing Hu, Kuan Zhang, Kan Zheng, Ethan A. Davis, Jae Sung Park

    Abstract: Anomaly detection for non-linear dynamical system plays an important role in ensuring the system stability. However, it is usually complex and has to be solved by large-scale simulation which requires extensive computing resources. In this paper, we propose a novel anomaly detection scheme in non-linear dynamical system based on Long Short-Term Memory (LSTM) to capture complex temporal changes of… ▽ More

    Submitted 4 June, 2020; originally announced June 2020.

    Comments: 8 pages, 6 figures

  27. arXiv:2006.00424  [pdf, other

    cs.RO

    HMPO: Human Motion Prediction in Occluded Environments for Safe Motion Planning

    Authors: Jae Sung Park, Dinesh Manocha

    Abstract: We present a novel approach to generate collision-free trajectories for a robot operating in close proximity with a human obstacle in an occluded environment. The self-occlusions of the robot can significantly reduce the accuracy of human motion prediction, and we present a novel deep learning-based prediction algorithm. Our formulation uses CNNs and LSTMs and we augment human-action datasets with… ▽ More

    Submitted 30 May, 2020; originally announced June 2020.

    Comments: 11 pages, 5 figures, 2 tables

  28. arXiv:2004.10796  [pdf, other

    cs.CV cs.CL

    VisualCOMET: Reasoning about the Dynamic Context of a Still Image

    Authors: Jae Sung Park, Chandra Bhagavatula, Roozbeh Mottaghi, Ali Farhadi, Yejin Choi

    Abstract: Even from a single frame of a still image, people can reason about the dynamic story of the image before, after, and beyond the frame. For example, given an image of a man struggling to stay afloat in water, we can reason that the man fell into the water sometime in the past, the intent of that man at the moment is to stay alive, and he will need help in the near future or else he will get washed… ▽ More

    Submitted 1 August, 2020; v1 submitted 22 April, 2020; originally announced April 2020.

    Comments: Project Page: http://visualcomet.xyz (ECCV 2020 Spotlight)

  29. arXiv:1905.09616  [pdf, other

    eess.SP cs.IT

    A Comparative Study of Analog/Digital Self-Interference Cancellation for Full Duplex Radios

    Authors: Jong Woo Kwak, Min Soo Sim, In-Woong Kang, Jong Sung Park, Jaedon Park, Chan-Byoung Chae

    Abstract: Self-interference (SI) is the main obstacle to full-duplex radios. To overcome the SI, researchers have proposed several analog and digital domain self-interference cancellation (SIC) techniques. How well the digital cancellation works depends on the results of analog cancellation. Therefore, to analyze overall SIC performance, one should do so in an integrated manner. In this paper, we build a si… ▽ More

    Submitted 23 May, 2019; originally announced May 2019.

  30. arXiv:1905.00933  [pdf, other

    eess.IV cs.CV

    Joint High Dynamic Range Imaging and Super-Resolution from a Single Image

    Authors: Jae Woong Soh, Jae Sung Park, Nam Ik Cho

    Abstract: This paper presents a new framework for jointly enhancing the resolution and the dynamic range of an image, i.e., simultaneous super-resolution (SR) and high dynamic range imaging (HDRI), based on a convolutional neural network (CNN). From the common trends of both tasks, we train a CNN for the joint HDRI and SR by focusing on the reconstruction of high-frequency details. Specifically, the high-fr… ▽ More

    Submitted 2 May, 2019; originally announced May 2019.

    Comments: 11 pages

    MSC Class: 68T45

  31. arXiv:1903.04271  [pdf, ps, other

    cs.CR

    CloudSafe: A Tool for an Automated Security Analysis for Cloud Computing

    Authors: Seoungmo An, Taehoon Eom, Jong Sou Park, Jin B. Hong, Armstrong Nhlabatsi, Noora Fetais, Khaled M. Khan, Dong Seong Kim

    Abstract: Cloud computing has been adopted widely, providing on-demand computing resources to improve perfornance and reduce the operational costs. However, these new functionalities also bring new ways to exploit the cloud computing environment. To assess the security of the cloud, graphical security models can be used, such as Attack Graphs and Attack Trees. However, existing models do not consider all ty… ▽ More

    Submitted 11 March, 2019; originally announced March 2019.

  32. arXiv:1902.10252  [pdf, other

    cs.RO

    Efficient Probabilistic Collision Detection for Non-Gaussian Noise Distributions

    Authors: Jae Sung Park, Dinesh Manocha

    Abstract: We present an efficient algorithm to compute tight upper bounds of collision probability between two objects with positional uncertainties, whose error distributions are represented with non-Gaussian forms. Our approach can handle noisy datasets from depth sensors, whose distributions may correspond to Truncated Gaussian, Weighted Samples, or Truncated Gaussian Mixture Model. We derive tight proba… ▽ More

    Submitted 15 December, 2019; v1 submitted 26 February, 2019; originally announced February 2019.

    Comments: 11 pages, 6 figures, 1 table

  33. arXiv:1812.05634  [pdf, other

    cs.CV cs.CL

    Adversarial Inference for Multi-Sentence Video Description

    Authors: Jae Sung Park, Marcus Rohrbach, Trevor Darrell, Anna Rohrbach

    Abstract: While significant progress has been made in the image captioning task, video description is still in its infancy due to the complex nature of video data. Generating multi-sentence descriptions for long videos is even more challenging. Among the main issues are the fluency and coherence of the generated descriptions, and their relevance to the video. Recently, reinforcement and adversarial learning… ▽ More

    Submitted 15 April, 2019; v1 submitted 13 December, 2018; originally announced December 2018.

    Comments: Accepted to Computer Vision and Pattern Recognition (CVPR) 2019

  34. arXiv:1811.10673  [pdf, other

    eess.IV cs.CV

    Adversarial Video Compression Guided by Soft Edge Detection

    Authors: Sungsoo Kim, Jin Soo Park, Christos G. Bampis, Jaeseong Lee, Mia K. Markey, Alexandros G. Dimakis, Alan C. Bovik

    Abstract: We propose a video compression framework using conditional Generative Adversarial Networks (GANs). We rely on two encoders: one that deploys a standard video codec and another which generates low-level maps via a pipeline of down-sampling, a newly devised soft edge detector, and a novel lossless compression scheme. For decoding, we use a standard video decoder as well as a neural network based one… ▽ More

    Submitted 26 November, 2018; originally announced November 2018.

  35. arXiv:1708.00636  [pdf, ps, other

    cs.CV cs.GR

    Generation of High Dynamic Range Illumination from a Single Image for the Enhancement of Undesirably Illuminated Images

    Authors: Jae Sung Park, Nam Ik Cho

    Abstract: This paper presents an algorithm that enhances undesirably illuminated images by generating and fusing multi-level illuminations from a single image.The input image is first decomposed into illumination and reflectance components by using an edge-preserving smoothing filter. Then the reflectance component is scaled up to improve the image details in bright areas. The illumination component is scal… ▽ More

    Submitted 2 August, 2017; originally announced August 2017.

  36. arXiv:1707.02387  [pdf, other

    cs.RO

    Efficient Generation of Motion Plans from Attribute-Based Natural Language Instructions Using Dynamic Constraint Mapping

    Authors: Jae Sung Park, Biao Jia, Mohit Bansal, Dinesh Manocha

    Abstract: We present an algorithm for combining natural language processing (NLP) and fast robot motion planning to automatically generate robot movements. Our formulation uses a novel concept called Dynamic Constraint Mapping to transform complex, attribute-based natural language instructions into appropriate cost functions and parametric constraints for optimization-based motion planning. We generate a fa… ▽ More

    Submitted 15 October, 2018; v1 submitted 7 July, 2017; originally announced July 2017.

    Comments: 12 pages, 8 figures, 4 tables

  37. arXiv:1610.03651  [pdf, other

    cs.RO

    Efficient Probabilistic Collision Detection for Non-Convex Shapes

    Authors: Jae Sung Park, Chonhyon Park, Dinesh Manocha

    Abstract: We present new algorithms to perform fast probabilistic collision queries between convex as well as non-convex objects. Our approach is applicable to general shapes, where one or more objects are represented using Gaussian probability distributions. We present a fast new algorithm for a pair of convex objects, and extend the approach to non-convex models using hierarchical representations. We high… ▽ More

    Submitted 12 October, 2016; originally announced October 2016.

    Comments: 9 pages, 6 figures

  38. arXiv:1608.04837  [pdf, other

    cs.RO

    I-Planner: Intention-Aware Motion Planning Using Learning Based Human Motion Prediction

    Authors: Jae Sung Park, Chonhyon Park, Dinesh Manocha

    Abstract: We present a motion planning algorithm to compute collision-free and smooth trajectories for high-DOF robots interacting with humans in a shared workspace. Our approach uses offline learning of human actions along with temporal coherence to predict the human actions. Our intention-aware online planning algorithm uses the learned database to compute a reliable trajectory based on the predicted acti… ▽ More

    Submitted 27 November, 2017; v1 submitted 16 August, 2016; originally announced August 2016.

    Comments: 14 pages, 8 figures, 3 tables

  39. arXiv:1607.04788  [pdf, other

    cs.RO

    Fast and Bounded Probabilistic Collision Detection in Dynamic Environments for High-DOF Trajectory Planning

    Authors: Chonhyon Park, Jae Sung Park, Dinesh Manocha

    Abstract: We present a novel approach to perform probabilistic collision detection between a high-DOF robot and high-DOF obstacles in dynamic, uncertain environments. In dynamic environments with a high-DOF robot and moving obstacles, our approach efficiently computes accurate collision probability between the robot and obstacles with upper error bounds. Furthermore, we describe a prediction algorithm for f… ▽ More

    Submitted 16 July, 2016; originally announced July 2016.