Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleJuly 2023
TCSD: Triple Complementary Streams Detector for Comprehensive Deepfake Detection
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 19, Issue 6Article No.: 213, Pages 1–22https://doi.org/10.1145/3558004Advancements in computer vision and deep learning have made it difficult to distinguish deepfake visual media. While existing detection frameworks have achieved significant performance on challenging deepfake datasets, these approaches consider only a ...
- research-articleJuly 2023
Pose driven motion image generation aided by depth information
RobCE '23: Proceedings of the 2023 3rd International Conference on Robotics and Control EngineeringMay 2023, Pages 153–160https://doi.org/10.1145/3598151.3598177Motion transfer which can be used as drive technology of the interaction between users and virtual roles has been a research hotspot in recent years. It is essentially a deformation process of human appearances, consequently motion transfer is ...
- research-articleJanuary 2023
Depth information acquisition and image measurement algorithm using microarray camera
International Journal of Autonomous and Adaptive Communications Systems (IJAACS), Volume 16, Issue 52023, Pages 419–435https://doi.org/10.1504/ijaacs.2023.134112The traditional algorithm for obtaining the depth information of an image is a binocular stereo vision algorithm. Meanwhile, the multi-eye stereo vision system can obtain more scene information. Based on the 3 × 3 array lens, we obtained the depth ...
- research-articleJanuary 2020
A parameter adaptive differential evolution based on depth information
Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology (JIFS), Volume 38, Issue 52020, Pages 5661–5671https://doi.org/10.3233/JIFS-179655Differential Evolution (DE) was an easy-coding and efficient stochastic algorithm for global optimization, and the whole optimization process simulates biological evolution. Superior individuals of the population that were suitable for the environment ...
- research-articleDecember 2019
HGR‐Net: a fusion network for hand gesture segmentation and recognition
IET Computer Vision (CVI2), Volume 13, Issue 8December 2019, Pages 700–707https://doi.org/10.1049/iet-cvi.2018.5796We propose a two‐stage convolutional neural network (CNN) architecture for robust recognition of hand gestures, called HGR‐Net, where the first stage performs accurate semantic segmentation to determine hand regions, and the second stage identifies the ...
-
- posterJuly 2018
Design of a vision substitution vibrotactile vest for the visually impaired
SETN '18: Proceedings of the 10th Hellenic Conference on Artificial IntelligenceJuly 2018, Article No.: 52, Pages 1–2https://doi.org/10.1145/3200947.3201055In this paper, we create a low-cost, discrete, vibrotactile vest to address the error-prone task of navigation for the visually impaired. Our implementation is based upon sensory substitution principle, which dictates that information captured by a ...
- research-articleAugust 2017
Double-Ring Marker Based 3D Pose Estimation for Rod-Shaped Object from a Single 2D Image
ICBIP '17: Proceedings of the 2nd International Conference on Biomedical Signal and Image ProcessingAugust 2017, Pages 53–57https://doi.org/10.1145/3133793.3133808In this paper, we propose like Double-Ring Marker based method which can estimate the 3D pose parameters of a Rod-shaped object such as MIS (Minimally Invasive Surgery) instrument using just a single 2D image. The core of the proposed method is a set of ...
- demonstrationMay 2016
VibroVision: An On-Body Tactile Image Guide for the Blind
- Philipp Wacker,
- Chat Wacharamanotham,
- Daniel Spelmezan,
- Jan Thar,
- David A. Sánchez,
- René Bohne,
- Jan Borchers
CHI EA '16: Proceedings of the 2016 CHI Conference Extended Abstracts on Human Factors in Computing SystemsMay 2016, Pages 3788–3791https://doi.org/10.1145/2851581.2890254Today, persons with a visual impairment use a cane to explore their surroundings and sense objects in their vicinity. While electronic aids have been proposed to aid them, they communicate limited information or require a fixed position. We propose ...
- ArticleAugust 2015
Finger Spelling Recognition from Depth Data Using Direction Cosines and Histogram of Cumulative Magnitudes
SIBGRAPI '15: Proceedings of the 2015 28th SIBGRAPI Conference on Graphics, Patterns and ImagesAugust 2015, Pages 173–179https://doi.org/10.1109/SIBGRAPI.2015.49In this paper, we propose a new approach for finger spelling recognition using depth information captured by Kinect sensor. We only use depth information to characterize hand configurations corresponding to alphabet letters. First, we use depth data to ...
- research-articleAugust 2015
Saliency detection for RGBD images
ICIMCS '15: Proceedings of the 7th International Conference on Internet Multimedia Computing and ServiceAugust 2015, Article No.: 72, Pages 1–4https://doi.org/10.1145/2808492.2808565Additional depth information from RGBD images is one of characteristics different from conventional 2D images. In this paper, we propose an effective saliency model to detect salient regions in RGBD images. Color contrast and depth contrast are first ...
- research-articleDecember 2014
Elliptical density shape model for hand gesture recognition
SoICT '14: Proceedings of the 5th Symposium on Information and Communication TechnologyDecember 2014, Pages 186–191https://doi.org/10.1145/2676585.2676600Recently, the Microsoft Kinect sensor has provided the whole new type of data in computer vision, the depth information. The most important contribution of depth information is to overcome one of the hardest parts in visual information extraction, the ...
- demonstrationNovember 2014
Eat as much as you can: a kinect-based facial rehabilitation game based on mouth and tongue movements
MM '14: Proceedings of the 22nd ACM international conference on MultimediaNovember 2014, Pages 743–744https://doi.org/10.1145/2647868.2654887In this demo, we present a Kinect-based interactive game which provides patients of facial palsy with a better and more fun way to perform facial physical therapy. By letting the user get scores when he/she bites or licks the virtual foods falling from ...
- research-articleJuly 2014
Depth Information Fused Salient Object Detection
ICIMCS '14: Proceedings of International Conference on Internet Multimedia Computing and ServiceJuly 2014, Pages 66–70https://doi.org/10.1145/2632856.2632938Saliency Detection has emerged as a hot topic due to its potential application in image and video understanding. Most existing saliency detection algorithms focus on two-dimensional information while the depth information is often ignored. In this paper,...
- research-articleFebruary 2014
Automatic object segmentation of unstructured scenes using colour and depth maps
IET Computer Vision (CVI2), Volume 8, Issue 1February 2014, Pages 45–53https://doi.org/10.1049/iet-cvi.2013.0018This study presents a segmentation pipeline that fuses colour and depth information to automatically separate objects of interest in video sequences captured from a quadcopter. Many approaches assume that cameras are static with known position, a ...
- articleJanuary 2014
Convolutional nets and watershed cuts for real-time semantic Labeling of RGBD videos
This work addresses multi-class segmentation of indoor scenes with RGB-D inputs. While this area of research has gained much attention recently, most works still rely on handcrafted features. In contrast, we apply a multiscale convolutional network to ...
- ArticleOctober 2013
Foreground Extraction Algorithm Using Depth Information for Image Segmentation
BWCCA '13: Proceedings of the 2013 Eighth International Conference on Broadband and Wireless Computing, Communication and ApplicationsOctober 2013, Pages 581–584https://doi.org/10.1109/BWCCA.2013.101Image segmentation is one of the most important topics in the field of computer vision. So lots of approaches for image segmentation have been proposed, and interactive methods based on energy minimization such as Grab Cut, etc have shown successful ...
- ArticleJuly 2013
Multi-person Identification and Localization for Ambient Assistive Living
Proceedings of the First International Conference on Distributed, Ambient, and Pervasive Interactions - Volume 8028July 2013, Pages 109–114https://doi.org/10.1007/978-3-642-39351-8_12In this paper, we present a novel, non-intrusive system that uses RFID technology and the Kinect sensor in order to identify and track multiple people in an assistive apartment. RFID is used for both identification and location estimation while ...
- ArticleJuly 2013
Robust multi-modal speech recognition in two languages utilizing video and distance information from the kinect
HCI'13: Proceedings of the 15th international conference on Human-Computer Interaction: interaction modalities and techniques - Volume Part IVJuly 2013, Pages 43–48https://doi.org/10.1007/978-3-642-39330-3_5We investigate the performance of our audio-visual speech recognition system in both English and Greek under the influence of audio noise. We present the architecture of our recently built system that utilizes information from three streams including 3-...
- ArticleDecember 2012
The research of the face's depth information generation technology based on the candide model
PCM'12: Proceedings of the 13th Pacific-Rim conference on Advances in Multimedia Information ProcessingDecember 2012, Pages 823–831https://doi.org/10.1007/978-3-642-34778-8_77Now in the 2D to 3D conversion of the many movie scenes, the obtained depth information is not satisfied because of its inaccuracy and poor stereoscopic result. The paper has provided a simple and effective approach to convert a specific two-dimensional ...
- research-articleNovember 2012
Hiding depth information into H.264 compressed video using reversible watermarking
CMBAS-EH '12: Proceedings of the 1st ACM multimedia international workshop on Cloud-based multimedia applications and services for e-healthNovember 2012, Pages 27–32https://doi.org/10.1145/2390906.2390915A scheme is proposed to hide 3D information (depth map) into H.264 compressed video using reversible watermarking. The watermark embedder works jointly with the H.264 encoder with concern of bit rate control and low complexity. The depth information is ...