Showing 1–50 of 57 results for author: Rad, M

  1. arXiv:2405.13035  [pdf, other]

    cs.HC cs.AI

    SIGMA: An Open-Source Interactive System for Mixed-Reality Task Assistance Research

    Authors: Dan Bohus, Sean Andrist, Nick Saw, Ann Paradiso, Ishani Chakraborty, Mahdi Rad

    Abstract: We introduce an open-source system called SIGMA (short for "Situated Interactive Guidance, Monitoring, and Assistance") as a platform for conducting research on task-assistive agents in mixed-reality scenarios. The system leverages the sensing and rendering affordances of a head-mounted mixed-reality device in conjunction with large language and vision models to guide users step by step through pr…

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 10 pages, 5 figures

  2. arXiv:2405.08111  [pdf, other]

    cs.LG

    Conformalized Physics-Informed Neural Networks

    Authors: Lena Podina, Mahdi Torabi Rad, Mohammad Kohandel

    Abstract: Physics-informed neural networks (PINNs) are an influential method of solving differential equations and estimating their parameters given data. However, since they make use of neural networks, they provide only a point estimate of differential equation parameters, as well as the solution at any given point, without any measure of uncertainty. Ensemble and Bayesian methods have been previously app…

    Submitted 13 May, 2024; originally announced May 2024.

  3. arXiv:2404.10179  [pdf, other]

    cs.RO cs.AI cs.HC cs.LG

    Scaling Instructable Agents Across Many Simulated Worlds

    Authors: SIMA Team, Maria Abi Raad, Arun Ahuja, Catarina Barros, Frederic Besse, Andrew Bolt, Adrian Bolton, Bethanie Brownfield, Gavin Buttimore, Max Cant, Sarah Chakera, Stephanie C. Y. Chan, Jeff Clune, Adrian Collister, Vikki Copeman, Alex Cullum, Ishita Dasgupta, Dario de Cesare, Julia Di Trapani, Yani Donchev, Emma Dunleavy, Martin Engelcke, Ryan Faulkner, Frankie Garcia, Charles Gbadamosi, et al. (68 additional authors not shown)

    Abstract: Building embodied AI systems that can follow arbitrary language instructions in any 3D environment is a key challenge for creating general AI. Accomplishing this goal requires learning to ground language in perception and embodied actions, in order to accomplish complex tasks. The Scalable, Instructable, Multiworld Agent (SIMA) project tackles this by training agents to follow free-form instructio…

    Submitted 17 April, 2024; v1 submitted 13 March, 2024; originally announced April 2024.

  4. arXiv:2403.13793  [pdf, other]

    cs.LG

    Evaluating Frontier Models for Dangerous Capabilities

    Authors: Mary Phuong, Matthew Aitchison, Elliot Catt, Sarah Cogan, Alexandre Kaskasoli, Victoria Krakovna, David Lindner, Matthew Rahtz, Yannis Assael, Sarah Hodkinson, Heidi Howard, Tom Lieberum, Ramana Kumar, Maria Abi Raad, Albert Webson, Lewis Ho, Sharon Lin, Sebastian Farquhar, Marcus Hutter, Gregoire Deletang, Anian Ruoss, Seliem El-Sayed, Sasha Brown, Anca Dragan, Rohin Shah, et al. (2 additional authors not shown)

    Abstract: To understand the risks posed by a new AI system, we must understand what it can and cannot do. Building on prior work, we introduce a programme of new "dangerous capability" evaluations and pilot them on Gemini 1.0 models. Our evaluations cover four areas: (1) persuasion and deception; (2) cyber-security; (3) self-proliferation; and (4) self-reasoning. We do not find evidence of strong dangerous…

    Submitted 5 April, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  5. arXiv:2403.05530  [pdf, other]

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love, et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February…

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  6. arXiv:2402.14838  [pdf, other]

    cs.CL cs.AI cs.LG

    RFBES at SemEval-2024 Task 8: Investigating Syntactic and Semantic Features for Distinguishing AI-Generated and Human-Written Texts

    Authors: Mohammad Heydari Rad, Farhan Farsi, Shayan Bali, Romina Etezadi, Mehrnoush Shamsfard

    Abstract: Nowadays, the usage of Large Language Models (LLMs) has increased, and LLMs have been used to generate texts in different languages and for different tasks. Additionally, due to the participation of remarkable companies such as Google and OpenAI, LLMs are now more accessible, and people can easily use them. However, an important issue is how we can detect AI-generated texts from human-written ones…

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: Mohammad Heydari Rad, Farhan Farsi, and Shayan Bali have made equal contributions to this work

  7. arXiv:2312.11805  [pdf, other]

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee, et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr…

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  8. arXiv:2309.17024  [pdf, other]

    cs.CV

    HoloAssist: an Egocentric Human Interaction Dataset for Interactive AI Assistants in the Real World

    Authors: Xin Wang, Taein Kwon, Mahdi Rad, Bowen Pan, Ishani Chakraborty, Sean Andrist, Dan Bohus, Ashley Feniello, Bugra Tekin, Felipe Vieira Frujeri, Neel Joshi, Marc Pollefeys

    Abstract: Building an interactive AI assistant that can perceive, reason, and collaborate with humans in the real world has been a long-standing pursuit in the AI community. This work is part of a broader research effort to develop intelligent agents that can interactively guide humans through performing tasks in the physical world. As a first step in this direction, we introduce HoloAssist, a large-scale e…

    Submitted 29 September, 2023; originally announced September 2023.

    Comments: ICCV 2023

  9. arXiv:2309.10001  [pdf, other]

    cs.CV

    CaSAR: Contact-aware Skeletal Action Recognition

    Authors: Junan Lin, Zhichao Sun, Enjie Cao, Taein Kwon, Mahdi Rad, Marc Pollefeys

    Abstract: Skeletal Action recognition from an egocentric view is important for applications such as interfaces in AR/VR glasses and human-robot interaction, where the device has limited resources. Most of the existing skeletal action recognition approaches use 3D coordinates of hand joints and 8-corner rectangular bounding boxes of objects as inputs, but they do not capture how the hands and objects interac…

    Submitted 17 September, 2023; originally announced September 2023.

    Comments: 10 pages, 8 figures

  10. arXiv:2307.01611  [pdf]

    cond-mat.mtrl-sci

    Intermittent in-situ high-resolution X-ray microscopy of 400-nm porous glass under uniaxial compression: study of pore changes and crack formation

    Authors: Sebastian Schäfer, François Willot, Mansoureh Norouzi Rad, Stephen T. Kelly, Dirk Enke, Juliana Martins de Souza e Silva

    Abstract: The properties of porous glasses and their field of application strongly depend on the characteristics of the void space. Understanding the relationship between their porous structure and failure behaviour can contribute to the development of porous glasses with long-term reliability optimized for specific applications. In the present work, we used X-ray computed tomography with nanometric resolut…

    Submitted 4 July, 2023; originally announced July 2023.

    Comments: 14 pages, 6 figures

  11. arXiv:2301.06653  [pdf]

    physics.optics physics.app-ph

    Broadband high-resolution integrated spectrometer architecture & data processing method

    Authors: Mehedi Hasan, Gazi Mahamud Hasan, Houman Ghorbani, Mohammad Rad, Peng Liu, Eric Bernier, Trevor Hall

    Abstract: Up-to-date network telemetry is the key enabler for resource optimization by capacity scaling, fault recovery, and network reconfiguration among other means. Reliable optical performance monitoring in general and, specifically, the monitoring of the spectral profile of WDM signals in fixed- and flex- grid architectures across the entire C-band, remains challenging. This article describes a two-sta…

    Submitted 16 January, 2023; originally announced January 2023.

    Comments: 16 pages, 7 figures. arXiv admin note: substantial text overlap with arXiv:2108.10121

  12. arXiv:2211.16214  [pdf]

    q-bio.NC

    A biologically interfaced evolvable organic pattern classifier

    Authors: Jennifer Gerasimov, Deyu Tu, Vivek Hitaishi, Padinhare Cholakkal Harikesh, Chi-Yuan Yang, Tobias Abrahamsson, Meysam Rad, Mary J. Donahue, Malin Silverå Ejneby, Magnus Berggren, Robert Forchheimer, Simone Fabiano

    Abstract: Future brain-computer interfaces will require local and highly individualized signal processing of fully integrated electronic circuits within the nervous system and other living tissue. New devices will need to be developed that can receive data from a sensor array, process data into meaningful information, and translate that information into a format that living systems can interpret. Here, we r…

    Submitted 29 November, 2022; originally announced November 2022.

  13. On Radical of Intuitionistic Fuzzy Primary Submodule

    Authors: Abbas Taherpour, Shaban Ghalandarzadeh, Parastoo Malakooti Rad, Parvin Safari

    Abstract: In this paper, we further study the theory of Intuitionistic fuzzy submodules and we will define intuitionistic fuzzy primary submodule with the help of the definition of a radical submodule, and we also study the properties of these submodules. Furthermore, homomorphic image and pre-image of intuitionistic fuzzy primary submodule are investigated.

    Submitted 20 July, 2022; originally announced July 2022.

    Comments: 21 pages

    MSC Class: 08A72; 03F55

  14. arXiv:2207.07428  [pdf, other]

    cond-mat.mtrl-sci

    Automatic detection of equiaxed dendrites using computer vision neural networks

    Authors: A. Viardin, K. Noth, M. Torabi Rad, L. Sturz

    Abstract: Equiaxed dendrites are frequently encountered in solidification. They typically form in large numbers, which makes their detection, localization, and tracking practically impossible for a human eye. In this paper, we show how recent progress in the field of machine learning can be leveraged to tackle this problem and we present computer vision neural network to automatically detect equiaxed dendri…

    Submitted 15 July, 2022; originally announced July 2022.

  15. arXiv:2207.03204  [pdf, other]

    cs.CV cs.AI cs.GT

    MCTS with Refinement for Proposals Selection Games in Scene Understanding

    Authors: Sinisa Stekovic, Mahdi Rad, Alireza Moradi, Friedrich Fraundorfer, Vincent Lepetit

    Abstract: We propose a novel method applicable in many scene understanding problems that adapts the Monte Carlo Tree Search (MCTS) algorithm, originally designed to learn to play games of high-state complexity. From a generated pool of proposals, our method jointly selects and optimizes proposals that minimize the objective term. In our first application for floor plan reconstruction from point clouds, our…

    Submitted 7 July, 2022; originally announced July 2022.

    Comments: Submitted to: TPAMI Special Section on the Best Papers of ICCV2021 GitHub Repository: https://github.com/vevenom/MonteScene. arXiv admin note: substantial text overlap with arXiv:2103.11161

  16. arXiv:2206.01640  [pdf, other]

    cs.LG cs.AI stat.ME

    PROMISSING: Pruning Missing Values in Neural Networks

    Authors: Seyed Mostafa Kia, Nastaran Mohammadian Rad, Daniel van Opstal, Bart van Schie, Andre F. Marquand, Josien Pluim, Wiepke Cahn, Hugo G. Schnack

    Abstract: While data are the primary fuel for machine learning models, they often suffer from missing values, especially when collected in real-world scenarios. However, many off-the-shelf machine learning models, including artificial neural network models, are unable to handle these missing values directly. Therefore, extra data preprocessing and curation steps, such as data imputation, are inevitable befo…

    Submitted 3 June, 2022; originally announced June 2022.

  17. arXiv:2203.07420  [pdf]

    physics.optics

    Optical Wavelength Meter with Machine Learning Enhanced Precision

    Authors: Gazi Mahamud Hasan, Mehedi Hasan, Peng Liu, Mohammad Rad, Eric Bernier, Trevor James Hall

    Abstract: Diverse applications in photonics and microwave engineering require a means of measurement of the instantaneous frequency of a signal. A photonic implementation typically applies an interferometer equipped with three or more output ports to measure the frequency dependent phase shift provided by an optical delay line. The components constituting the interferometer are prone to impairments which re…

    Submitted 14 March, 2022; originally announced March 2022.

    Comments: 18 pages, 9 figures, 1 table

  18. arXiv:2111.08098  [pdf, other]

    physics.comp-ph cond-mat.mtrl-sci

    Geometry of triple junctions during grain boundary premelting

    Authors: M. Torabi Rad, G. Boussinot, M. Apel

    Abstract: Grain Boundaries (GB) whose energy is larger than twice the energy of the solid/liquid interface exhibit the premelting phenomenon, for which an atomically thin liquid layer develops at temperatures slightly below the bulk melting temperature. Premelting can have a severe impact on the structural integrity of a polycrystalline material and on the mechanical high temperature properties, also in the…

    Submitted 15 November, 2021; originally announced November 2021.

  19. arXiv:2110.02117  [pdf, other]

    cs.CV

    Self-Supervised Generative Style Transfer for One-Shot Medical Image Segmentation

    Authors: Devavrat Tomar, Behzad Bozorgtabar, Manana Lortkipanidze, Guillaume Vray, Mohammad Saeed Rad, Jean-Philippe Thiran

    Abstract: In medical image segmentation, supervised deep networks' success comes at the cost of requiring abundant labeled data. While asking domain experts to annotate only one or a few of the cohort's images is feasible, annotating all available images is impractical. This issue is further exacerbated when pre-trained deep networks are exposed to a new image dataset from an unfamiliar distribution. Using…

    Submitted 5 October, 2021; originally announced October 2021.

    Comments: Accepted in WACV 2022

  20. arXiv:2108.10121  [pdf]

    eess.SP physics.app-ph physics.optics

    Circuit design and integration feasibility of a high-resolution broadband on-chip spectral monitor

    Authors: Mehedi Hasan, Gazi Mahamud Hasan, Houman Ghorbani, Mohammad Rad, Peng Liu, Eric Bernier, Trevor Hall

    Abstract: Up-to-date network telemetry is the key enabler for resource optimization by a variety of means including capacity scaling, fault recovery, network reconfiguration. Reliable optical performance monitoring in general and specifically the monitoring of the spectral profile of WDM signals in fixed- and flex-grid architecture across the entire C-band remains challenging. This article describes a spect…

    Submitted 11 August, 2021; originally announced August 2021.

    Comments: 17 pages, 13 figures

  21. Hybrid Deep Neural Network for Brachial Plexus Nerve Segmentation in Ultrasound Images

    Authors: Juul P. A. van Boxtel, Vincent R. J. Vousten, Josien Pluim, Nastaran Mohammadian Rad

    Abstract: Ultrasound-guided regional anesthesia (UGRA) can replace general anesthesia (GA), improving pain control and recovery time. This method can be applied on the brachial plexus (BP) after clavicular surgeries. However, identification of the BP from ultrasound (US) images is difficult, even for trained professionals. To address this problem, convolutional neural networks (CNNs) and more advanced deep…

    Submitted 1 June, 2021; originally announced June 2021.

    Comments: The first two authors contributed equally

    ACM Class: I.4.6

  22. arXiv:2104.14639  [pdf, other]

    cs.CV

    Keypoint Transformer: Solving Joint Identification in Challenging Hands and Object Interactions for Accurate 3D Pose Estimation

    Authors: Shreyas Hampali, Sayan Deb Sarkar, Mahdi Rad, Vincent Lepetit

    Abstract: We propose a robust and accurate method for estimating the 3D poses of two hands in close interaction from a single color image. This is a very challenging problem, as large occlusions and many confusions between the joints may happen. State-of-the-art methods solve this problem by regressing a heatmap for each joint, which requires solving two problems simultaneously: localizing the joints and re…

    Submitted 19 April, 2022; v1 submitted 29 April, 2021; originally announced April 2021.

    Comments: Accepted at CVPR2022

  23. arXiv:2104.02663  [pdf, other]

    cs.CV

    Test-Time Adaptation for Super-Resolution: You Only Need to Overfit on a Few More Images

    Authors: Mohammad Saeed Rad, Thomas Yu, Behzad Bozorgtabar, Jean-Philippe Thiran

    Abstract: Existing reference (RF)-based super-resolution (SR) models try to improve perceptual quality in SR under the assumption of the availability of high-resolution RF images paired with low-resolution (LR) inputs at testing. As the RF images should be similar in terms of content, colors, contrast, etc. to the test image, this hinders the applicability in a real scenario. Other approaches to increase th…

    Submitted 6 April, 2021; originally announced April 2021.

  24. arXiv:2103.11161  [pdf, other]

    cs.CV cs.AI cs.LG

    MonteFloor: Extending MCTS for Reconstructing Accurate Large-Scale Floor Plans

    Authors: Sinisa Stekovic, Mahdi Rad, Friedrich Fraundorfer, Vincent Lepetit

    Abstract: We propose a novel method for reconstructing floor plans from noisy 3D point clouds. Our main contribution is a principled approach that relies on the Monte Carlo Tree Search (MCTS) algorithm to maximize a suitable objective function efficiently despite the complexity of the problem. Like previous work, we first project the input point cloud to a top view to create a density map and extract room p…

    Submitted 13 September, 2021; v1 submitted 20 March, 2021; originally announced March 2021.

    Comments: Accepted for oral presentation at ICCV 2021

  25. arXiv:2102.04890  [pdf, other]

    cs.LG physics.app-ph

    On Theory-training Neural Networks to Infer the Solution of Highly Coupled Differential Equations

    Authors: M. Torabi Rad, A. Viardin, M. Apel

    Abstract: Deep neural networks are transforming fields ranging from computer vision to computational medicine, and we recently extended their application to the field of phase-change heat transfer by introducing theory-trained neural networks (TTNs) for a solidification problem \cite{TTN}. Here, we present general, in-depth, and empirical insights into theory-training networks for learning the solution of h…

    Submitted 10 February, 2021; v1 submitted 9 February, 2021; originally announced February 2021.

  26. arXiv:2007.03053  [pdf, other]

    eess.IV cs.CV

    Benefiting from Bicubically Down-Sampled Images for Learning Real-World Image Super-Resolution

    Authors: Mohammad Saeed Rad, Thomas Yu, Claudiu Musat, Hazim Kemal Ekenel, Behzad Bozorgtabar, Jean-Philippe Thiran

    Abstract: Super-resolution (SR) has traditionally been based on pairs of high-resolution images (HR) and their low-resolution (LR) counterparts obtained artificially with bicubic downsampling. However, in real-world SR, there is a large variety of realistic image degradations and analytically modeling these realistic degradations can prove quite difficult. In this work, we propose to handle real-world SR by…

    Submitted 5 November, 2020; v1 submitted 6 July, 2020; originally announced July 2020.

    Comments: WACV 2021

  27. ALCN: Adaptive Local Contrast Normalization

    Authors: Mahdi Rad, Peter M. Roth, Vincent Lepetit

    Abstract: To make Robotics and Augmented Reality applications robust to illumination changes, the current trend is to train a Deep Network with training images captured under many different lighting conditions. Unfortunately, creating such a training set is a very unwieldy and complex task. We therefore propose a novel illumination normalization method that can easily be used for different problems with cha…

    Submitted 15 April, 2020; originally announced April 2020.

    Comments: This version corresponds to the pre-print of the paper accepted for Computer Vision and Image Understanding (CVIU). arXiv admin note: substantial text overlap with arXiv:1708.09633

  28. arXiv:2004.01887  [pdf, ps, other]

    math.PR

    Stability for Hawkes processes with inhibition

    Authors: Mads Bonde Raad, Eva Löcherbach

    Abstract: We consider a multivariate non-linear Hawkes process in a multi-class setup where particles are organised within two populations of possibly different sizes, such that one of the populations acts excitatory on the system while the other population acts inhibitory on the system. The goal of this note is to present a class of Hawkes Processes with stable dynamics without assumptions on the spectral…

    Submitted 4 April, 2020; originally announced April 2020.

    MSC Class: 60G55; 60G57; 60J25; 60Fxx

  29. arXiv:2003.13764  [pdf, other]

    cs.CV

    Measuring Generalisation to Unseen Viewpoints, Articulations, Shapes and Objects for 3D Hand Pose Estimation under Hand-Object Interaction

    Authors: Anil Armagan, Guillermo Garcia-Hernando, Seungryul Baek, Shreyas Hampali, Mahdi Rad, Zhaohui Zhang, Shipeng Xie, MingXiu Chen, Boshen Zhang, Fu Xiong, Yang Xiao, Zhiguo Cao, Junsong Yuan, Pengfei Ren, Weiting Huang, Haifeng Sun, Marek Hrúz, Jakub Kanis, Zdeněk Krňoul, Qingfu Wan, Shile Li, Linlin Yang, Dongheui Lee, Angela Yao, Weiguo Zhou, et al. (10 additional authors not shown)

    Abstract: We study how well different types of approaches generalise in the task of 3D hand pose estimation under single hand scenarios and hand-object interaction. We show that the accuracy of state-of-the-art methods can drop, and that they fail mostly on poses absent from the training set. Unfortunately, since the space of hand poses is highly dimensional, it is inherently not feasible to cover the whole…

    Submitted 10 September, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

    Comments: European Conference on Computer Vision (ECCV), 2020

  30. arXiv:2001.02149  [pdf, other]

    cs.CV

    General 3D Room Layout from a Single View by Render-and-Compare

    Authors: Sinisa Stekovic, Shreyas Hampali, Mahdi Rad, Sayan Deb Sarkar, Friedrich Fraundorfer, Vincent Lepetit

    Abstract: We present a novel method to reconstruct the 3D layout of a room (walls, floors, ceilings) from a single perspective view in challenging conditions, by contrast with previous single-view methods restricted to cuboid-shaped layouts. This input view can consist of a color image only, but considering a depth map results in a more accurate reconstruction. Our approach is formalized as solving a constr…

    Submitted 21 July, 2020; v1 submitted 7 January, 2020; originally announced January 2020.

  31. arXiv:1912.09800  [pdf]

    physics.app-ph

    Theory-training deep neural networks for an alloy solidification benchmark problem

    Authors: M. Torabi Rad, A. Viardin, G. J. Schmitz, M. Apel

    Abstract: Deep neural networks are machine learning tools that are transforming fields ranging from speech recognition to computational medicine. In this study, we extend their application to the field of alloy solidification modeling. To that end, and for the first time in the field, theory-trained deep neural networks (TTNs) for solidification are introduced. These networks are trained using the framework…

    Submitted 20 December, 2019; originally announced December 2019.

    Comments: 43 pages, 9 figures

  32. arXiv:1912.04133  [pdf]

    physics.app-ph physics.optics

    Circuit architecture of a sub-GHz resolution panoramic C band on-chip spectral sensor

    Authors: Mehedi Hasan, Mohammad Rad, Gazi Mahamud Hasan, Peng Liu, Patric Dumais, Eric Bernier, Trevor Hall

    Abstract: Monitoring the state of the optical network is a key enabler for programmability of network functions, protocols and efficient use of the spectrum. A particular challenge is to provide the SDN-EON controller with a panoramic view of the complete state of the optical spectrum. This paper describes the architecture for compact on-chip spectrometry targeting high resolution across the entire C-band t…

    Submitted 24 October, 2019; originally announced December 2019.

    Comments: 14 pages, 10 figures

  33. Artificial Intelligence Approaches

    Authors: Yingjie Hu, Wenwen Li, Dawn Wright, Orhun Aydin, Daniel Wilson, Omar Maher, Mansour Raad

    Abstract: Artificial Intelligence (AI) has received tremendous attention from academia, industry, and the general public in recent years. The integration of geography and AI, or GeoAI, provides novel approaches for addressing a variety of problems in the natural environment and our human society. This entry briefly reviews the recent development of AI with a focus on machine learning and deep learning appro…

    Submitted 27 August, 2019; originally announced August 2019.

    Comments: 12 pages, 5 figures

    Journal ref: Artificial Intelligence Approaches. The Geographic Information Science & Technology Body of Knowledge (3rd Quarter 2019 Edition), John P. Wilson (ed.)

  34. arXiv:1908.07222  [pdf, other]

    cs.CV

    SROBB: Targeted Perceptual Loss for Single Image Super-Resolution

    Authors: Mohammad Saeed Rad, Behzad Bozorgtabar, Urs-Viktor Marti, Max Basler, Hazim Kemal Ekenel, Jean-Philippe Thiran

    Abstract: By benefiting from perceptual losses, recent studies have improved significantly the performance of the super-resolution task, where a high-resolution image is resolved from its low-resolution counterpart. Although such objective functions generate near-photorealistic results, their capability is limited, since they estimate the reconstruction error for an entire image in the same way, without con…

    Submitted 20 August, 2019; originally announced August 2019.

    Comments: ICCV 2019

  35. arXiv:1907.12488  [pdf, other]

    cs.CV

    Benefiting from Multitask Learning to Improve Single Image Super-Resolution

    Authors: Mohammad Saeed Rad, Behzad Bozorgtabar, Claudiu Musat, Urs-Viktor Marti, Max Basler, Hazim Kemal Ekenel, Jean-Philippe Thiran

    Abstract: Despite significant progress toward super resolving more realistic images by deeper convolutional neural networks (CNNs), reconstructing fine and natural textures still remains a challenging problem. Recent works on single image super resolution (SISR) are mostly based on optimizing pixel and content wise similarity between recovered and high-resolution (HR) images and do not benefit from recogniz…

    Submitted 29 July, 2019; originally announced July 2019.

    Comments: accepted at Neurocomputing (Special Issue on Deep Learning for Image Super-Resolution), 2019

  36. arXiv:1907.01481  [pdf, other]

    cs.CV

    HOnnotate: A method for 3D Annotation of Hand and Object Poses

    Authors: Shreyas Hampali, Mahdi Rad, Markus Oberweger, Vincent Lepetit

    Abstract: We propose a method for annotating images of a hand manipulating an object with the 3D poses of both the hand and the object, together with a dataset created using this method. Our motivation is the current lack of annotated real images for this problem, as estimating the 3D poses is challenging, mostly because of the mutual occlusions between the hand and the object. To tackle this challenge, we…

    Submitted 30 May, 2020; v1 submitted 2 July, 2019; originally announced July 2019.

    Comments: Accepted to CVPR2020

  37. arXiv:1906.02036  [pdf, ps, other]

    math.PR

    Renewal Time Points for Hawkes Processes

    Authors: Mads Bonde Raad

    Abstract: In the last decade Hawkes processes have received much attention as models for functional connectivity in neural spiking networks and other dynamical systems with a cascade behavior. In this paper we establish a renewal approach for analyzing this process. We consider the ordinary nonlinear Hawkes process as well as the more recently described age dependent Hawkes process. We construct renewal-tim…

    Submitted 8 June, 2019; v1 submitted 5 June, 2019; originally announced June 2019.

  38. arXiv:1905.08090  [pdf, other]

    cs.CV

    Using Photorealistic Face Synthesis and Domain Adaptation to Improve Facial Expression Analysis

    Authors: Behzad Bozorgtabar, Mohammad Saeed Rad, Hazim Kemal Ekenel, Jean-Philippe Thiran

    Abstract: Cross-domain synthesizing realistic faces to learn deep models has attracted increasing attention for facial expression analysis as it helps to improve the performance of expression recognition accuracy despite having small number of real training images. However, learning from synthetic face images can be problematic due to the distribution discrepancy between low-quality synthetic images and rea…

    Submitted 17 May, 2019; originally announced May 2019.

    Comments: 8 pages, 8 figures, 5 tables, accepted by FG 2019. arXiv admin note: substantial text overlap with arXiv:1905.00286

  39. arXiv:1905.00286  [pdf, other]

    cs.CV

    Learn to synthesize and synthesize to learn

    Authors: Behzad Bozorgtabar, Mohammad Saeed Rad, Hazım Kemal Ekenel, Jean-Philippe Thiran

    Abstract: Attribute guided face image synthesis aims to manipulate attributes on a face image. Most existing methods for image-to-image translation can either perform a fixed translation between any two image domains using a single attribute or require training data with the attributes of interest for each subject. Therefore, these methods could only train one specific model for each pair of image domains,…

    Submitted 1 May, 2019; originally announced May 2019.

    Comments: Accepted to Computer Vision and Image Understanding (CVIU)

  40. Beyond traditional coatings, a review on thermal sprayed functional and smart coatings

    Authors: Daniel Tejero-Martin, Milad Rezvani Rad, André McDonald, Tanvir Hussain

    Abstract: Thermal spraying has been present for over a century, being greatly refined and optimised during this time, becoming nowadays a reliable and cost-efficient method to deposit thick coatings with a wide variety of feedstock materials and substrates. Thermal sprayed coatings have been successfully applied in fields such as aerospace or electricity production, becoming an essential component of today'…

    Submitted 25 March, 2019; v1 submitted 13 November, 2018; originally announced November 2018.

    Comments: 100 pages, 28 figures

  41. arXiv:1810.03707  [pdf, other]

    cs.CV

    Domain Transfer for 3D Pose Estimation from Color Images without Manual Annotations

    Authors: Mahdi Rad, Markus Oberweger, Vincent Lepetit

    Abstract: We introduce a novel learning method for 3D pose estimation from color images. While acquiring annotations for color images is a difficult task, our approach circumvents this problem by learning a mapping from paired color and depth images captured with an RGB-D camera. We jointly learn the pose from synthetic depth images that are easy to generate, and learn to align these synthetic depth images…

    Submitted 21 February, 2019; v1 submitted 8 October, 2018; originally announced October 2018.

    Comments: ACCV 2018 (oral)

  42. arXiv:1806.06370  [pdf, other]

    math.PR

    Age Dependent Hawkes Process

    Authors: Mads Bonde Raad, Susanne Ditlevsen, Eva Löcherbach

    Abstract: In the last decade, Hawkes processes have received a lot of attention as good models for functional connectivity in neural spiking networks. In this paper we consider a variant of this process, the Age Dependent Hawkes process, which incorporates individual post-jump behaviour into the framework of the usual Hawkes model. This allows one to model recovery properties such as refractory periods, where t…

    Submitted 7 October, 2019; v1 submitted 17 June, 2018; originally announced June 2018.

    Comments: 43 pages, 1 figure

  43. arXiv:1804.03959  [pdf, other]

    cs.CV

    Making Deep Heatmaps Robust to Partial Occlusions for 3D Object Pose Estimation

    Authors: Markus Oberweger, Mahdi Rad, Vincent Lepetit

    Abstract: We introduce a novel method for robust and accurate 3D object pose estimation from a single color image under large occlusions. Following recent approaches, we first predict the 2D projections of 3D points related to the target object and then compute the 3D pose from these correspondences using a geometric method. Unfortunately, as the results of our experiments show, predicting these 2D projecti…

    Submitted 26 July, 2018; v1 submitted 11 April, 2018; originally announced April 2018.

    Journal ref: Proc. of ECCV 2018

  44. arXiv:1712.03904  [pdf, other]

    cs.CV

    Feature Mapping for Learning Fast and Accurate 3D Pose Inference from Synthetic Images

    Authors: Mahdi Rad, Markus Oberweger, Vincent Lepetit

    Abstract: We propose a simple and efficient method for exploiting synthetic images when training a Deep Network to predict a 3D pose from an image. The ability of using synthetic images for training a Deep Network is extremely valuable as it is easy to create a virtually infinite training set made of such images, while capturing and annotating real images can be very cumbersome. However, synthetic images do…

    Submitted 26 March, 2018; v1 submitted 11 December, 2017; originally announced December 2017.

    Comments: CVPR 2018

  45. On graphs of bounded semilattices

    Authors: Parastoo Malakooti Rad, Peyman Nasehpour

    Abstract: In this paper, we introduce the graph $G(S)$ of a bounded semilattice $S$, which is a generalization of the intersection graph of the substructures of an algebraic structure. We prove some general theorems about these graphs; as an example, we show that if $S$ is a product of three or more chains, then $G(S)$ is Eulerian if and only if either the length of every chain is even or all the chains are…

    Submitted 5 November, 2018; v1 submitted 3 November, 2017; originally announced November 2017.

    Comments: totally revised! Comments are still welcomed!

    MSC Class: 05C99; 06A12

    Journal ref: Math Notes 107, 264--273 (2020)

  46. A Computer Vision System to Localize and Classify Wastes on the Streets

    Authors: Mohammad Saeed Rad, Andreas von Kaenel, Andre Droux, Francois Tieche, Nabil Ouerhani, Hazim Kemal Ekenel, Jean-Philippe Thiran

    Abstract: Littering quantification is an important step for improving cleanliness of cities. When human interpretation is too cumbersome or in some cases impossible, an objective index of cleanliness could reduce the littering by awareness actions. In this paper, we present a fully automated computer vision application for littering quantification based on images taken from the streets and sidewalks. We hav…

    Submitted 31 October, 2017; originally announced October 2017.

    Journal ref: Liu M., Chen H., Vincze M. (eds) Computer Vision Systems. pp 195-204. ICVS 2017. Lecture Notes in Computer Science, vol 10528. Springer, Cham

  47. arXiv:1709.05956  [pdf, other]

    cs.CV cs.NE

    Deep Learning for Automatic Stereotypical Motor Movement Detection using Wearable Sensors in Autism Spectrum Disorders

    Authors: Nastaran Mohammadian Rad, Seyed Mostafa Kia, Calogero Zarbo, Twan van Laarhoven, Giuseppe Jurman, Paola Venuti, Elena Marchiori, Cesare Furlanello

    Abstract: Autism Spectrum Disorders are associated with atypical movements, of which stereotypical motor movements (SMMs) interfere with learning and social interaction. The automatic SMM detection using inertial measurement units (IMU) remains complex due to the strong intra and inter-subject variability, especially when handcrafted features are extracted from the signal. We propose a new application of th…

    Submitted 14 September, 2017; originally announced September 2017.

  48. arXiv:1708.09633  [pdf, other]

    cs.CV

    ALCN: Meta-Learning for Contrast Normalization Applied to Robust 3D Pose Estimation

    Authors: Mahdi Rad, Peter M. Roth, Vincent Lepetit

    Abstract: To be robust to illumination changes when detecting objects in images, the current trend is to train a Deep Network with training images captured under many different lighting conditions. Unfortunately, creating such a training set is very cumbersome, or sometimes even impossible, for some applications such as 3D pose estimation of specific objects, which is the application we focus on in this pap…

    Submitted 31 August, 2017; originally announced August 2017.

    Comments: BMVC' 17

  49. arXiv:1707.01330  [pdf, ps, other]

    cs.CV

    A dataset for Computer-Aided Detection of Pulmonary Embolism in CTA images

    Authors: Mojtaba Masoudi, Hamidreza Pourreza, Mahdi Saadatmand Tarzjan, Fateme Shafiee Zargar, Masoud Pezeshki Rad, Noushin Eftekhari

    Abstract: Today, researchers in the field of Pulmonary Embolism (PE) analysis need to use a publicly available dataset to assess and compare their methods. Different systems have been designed for the detection of pulmonary embolism (PE), but none of them have used any public datasets. All papers have used their own private dataset. In order to fill this gap, we have collected 5160 slices of computed tomog…

    Submitted 5 July, 2017; originally announced July 2017.

  50. arXiv:1703.10896  [pdf, other]

    cs.CV

    BB8: A Scalable, Accurate, Robust to Partial Occlusion Method for Predicting the 3D Poses of Challenging Objects without Using Depth

    Authors: Mahdi Rad, Vincent Lepetit

    Abstract: We introduce a novel method for 3D object detection and pose estimation from color images only. We first use segmentation to detect the objects of interest in 2D even in presence of partial occlusions and cluttered background. By contrast with recent patch-based methods, we rely on a "holistic" approach: We apply to the detected objects a Convolutional Neural Network (CNN) trained to predict their…

    Submitted 26 March, 2018; v1 submitted 31 March, 2017; originally announced March 2017.

    Comments: ICCV 2017