default search action
25th ISM 2023: Laguna Hills, CA, USA
- IEEE International Symposium on Multimedia, ISM 2023, Laguna Hills, CA, USA, December 11-13, 2023. IEEE 2023, ISBN 979-8-3503-9576-1
- Amir Said, Hoang Le, Farzad Farhadzadeh:
Bitstream Organization for Parallel Entropy Coding on Neural Network-based Video Codecs. 1-9 - Jukka I. Ahonen, Nam Le, Honglei Zhang, Antti Hallapuro, Francesco Cricri, Hamed Rezazadegan Tavakoli, Miska M. Hannuksela, Esa Rahtu:
NN-VVC: Versatile Video Coding boosted by self-supervisedly learned image coding for machines. 10-19 - Sayed Mohammad Majidi Dorcheh, Mehdi Houshmand Sarkhoosh, Cise Midoglu, Saeed Shafiee Sabet, Tomas Kupka, Michael A. Riegler, Dag Johansen, Pål Halvorsen:
SmartCrop: AI-Based Cropping of Soccer Videos. 20-27 - Qian Zhou, Mingyuan Wu, Yinjie Zhang, Michael Zink, Ramesh K. Sitaraman, Klara Nahrstedt:
360TripleView: 360-Degree Video View Management System Driven by Convergence Value of Viewing Preferences. 28-35 - Yinjie Zhang, Mingyuan Wu, Beitong Tian, Jiaxi Li, Bo Chen, Qian Zhou, Klara Nahrstedt:
SAVG360: Saliency-aware Viewport-guidance-enabled 360-video Streaming System. 36-43 - Hannes Mareen, Casper Haems, Tim Wauters, Filip De Turck, Peter Lambert, Glenn Van Wallendael:
Temporal Layer Injection for Fast Bitrate Ladder Creation in Video Live Streaming. 44-51 - Lauri Ilola, Sudarshan Bisht, Ugurcan Budak, Peter Fasogbon, Jaakko Keränen, Lukasz Kondrad:
Real-time Delivery of Visual Volumetric Video-based Coding Data. 52-56 - Nicola Giuliani, Hongjie You, Ahmet Burakhan Koyuncu, Atanas Boev, Elena Alshina, Eckehard G. Steinbach:
CALC-VFS: Content-adaptive low-complexity Video Frame Synthesis. 57-61 - João Pedro Oliveira Batisteli, Silvio Jamil Ferzoli Guimarães, Zenilton K. G. do Patrocínio:
Multi-Scale Image Graph Representation: A Novel GNN Approach for Image Classification through Scale Importance Estimation. 62-68 - Abhilash Dharmavarapu, Stefano Petrangeli, Jiashen Cao, Hyesoon Kim:
EHT-SR: An Entropy-Based Hybrid Approach for Faster Super-Resolution. 69-78 - Gabriel Lugo Bustillo, Joey Quinlan, Lingrui Zhou, Md Nahid Sadik, Irene Cheng:
Active Learning for Multi-Class Vehicle Categorization and Traffic Analysis in complex environments. 79-88 - Yang Li, Gang Wu, Stefano Petrangeli, Haoliang Wang, Ryan A. Rossi, Viswanathan Swaminathan:
Active Context Modeling for Efficient Image and Burst Compression. 89-94 - Jiawei Qin, Xueting Wang:
Angle Range and Identity Similarity Enhanced Gaze and Head Redirection based on Synthetic data. 95-102 - Afnan Althoupety, Li-Yun Wang, Wu-Chi Feng, Banafsheh Rekabdar:
Illuminating the Bias in Pedestrian Detection. 103-107 - Takumi Inagawa, Tomoya Hayashi, Yohei Nakada:
Validating Quantification Method for Object Visual Appeal to Motorists in Simulated Hazardous Driving Scene. 108-112 - Taiji Kurami, Takuya Ishikawa, Kazuhiro Hotta:
DeformableFormer for Classifying Endoscopic Ultrasound-Guided Fine-Needle Biopsy in Pancreatic Diseases. 113-114 - Li-Yun Wang, Wu-chi Feng:
Video as Text: A New Paradigm for Flexible Video Analysis. 115-122 - Dominik Keller, Rakesh Rao Ramachandra Rao, Steve Göring, Alexander Raake:
The Effect of Viewing Distances on 4K and 8K HDR Video Quality Perception. 123-130 - Stephan Fremerey, Raja Faseeh Uz Zaman, Touseef Ashraf, Rakesh Rao Ramachandra Rao, Steve Göring, Alexander Raake:
Towards evaluation of immersion, visual comfort and exploration behaviour for non-stereoscopic and stereoscopic 360° videos. 131-138 - Simran Singh, Jacob Chakareski:
Aerial 360-Degree Video Delivery for Immersive First Person View UAV Navigation. 139-146 - Anustup Choudhury, Guan-Ming Su:
Progressive Coding for Neural Field Transmission. 147-151 - Zachary McBride Lazri, Guan-Ming Su, Peng Yin:
A Framework for Multi-plane Image Layer Merging. 152-159 - Joshua Howell, Angela Chan, Glen Hordemann, Francis K. H. Quek:
Mathematics as Interactive Multimedia. 160-167 - Taner Gülez, Mustafa Sert:
Self-Supervised Learning of Free-Hand Sketches with Bézier Curve Features. 168-171 - Mohammadreza Ghafari, André F. R. Guarda, Nuno M. M. Rodrigues, Fernando Pereira:
Deep Learning-based Point Cloud Geometry Coding with Attention Models. 172-176 - Jie Li, Bahareh Abbasi:
Estimation of the Caloric Intake of Food Consumption Using Convolutional Neural Network. 177-181 - Guruprasad Nayak, Gerald Friedland:
Deep Layers Beware: Unraveling the Surprising Benefits of JPEG Compression for Image Classification Pre-processing. 182-185 - Yesim Akar, Mustafa Sert:
Weakly Labeled Sound Event Detection using Attention Mechanism with Teacher-Student Model. 186-192 - Wen-Hung Liao, Yen-Chun Ou, Po-Han Chen, Yi-Chieh Wu:
The Impact of Parroting Mode on Cross-Lingual Speaker Recognition. 193-197 - Vania Miriam Ortiz Ramos, Sukhan Lee:
Synthesis of Disparate Audio Species via Recurrent Neural Embedding. 198-201 - Fernando Terroso-Sáenz, Andrés Muñoz, Philippe Roose:
War & Music: The impact of the Ukrainian War on the Music Listening Behaviour in Eastern Europe. 202-205 - Donghuo Zeng, Kazushi Ikeda:
Triplet Loss with Curriculum Learning for Audio-Visual Retrieval. 206-207 - Abhinav Upadhyay, Alpana Dubey, Suma Mani Kuriakose:
3DTextureNet: Neural 3D Texture Style Transfer. 208-215 - Sivaji Retta, Ramarajulu Srinivasan:
Towards Imperceptible Adversarial Image Generation: Minimizing Perceptual Difference. 216-220 - Junda Wu, Haoliang Wang, Tong Yu, Gang Wu, Stefano Petrangeli, Handong Zhao, Sungchul Kim, Viswanathan Swaminathan:
Content-aware Progressive Image Compression and Syncing. 221-224 - Kai Liu, Zheng Guo, Lei Gao, Naimul Mefraz Khan, Ling Guan:
Towards Efficient Multi-view Representation Learning. 225-229 - Rakesh Rao Ramachandra Rao, Steve Göring, Alexander Raake:
Adaptation of Bitstream-based Video Quality Models for Image Quality Assessment. 230-231 - Conrad Hsu, Ross Greer:
Bridging Subjectivity and Objectivity in Evaluation of Machine-Generated Jazz Music: A Multimetric Approach. 232-237 - Manu Agarwal, Ross Greer:
Spectrogram-Based Deep Learning for Flute Audition Assessment and Intelligent Feedback. 238-242 - Aniket Jagtap, RamaKrishna Venkatesh Saripalli, Joe Lemley, Waseem Shariff, Alan F. Smeaton:
Heart Rate Detection Using an Event Camera. 243-246 - Dimitrios Daskalakis, Nikolaos Gkalelis, Vasileios Mezaris:
Masked Feature Modelling for the unsupervised pre-training of a Graph Attention Network block for bottom-up video event recognition. 247-250 - Mingyuan Wu, Yuhan Lu, Shiv Trivedi, Bo Chen, Qian Zhou, Lingdong Wang, Simran Singh, Michael Zink, Ramesh K. Sitaraman, Jacob Chakareski, Klara Nahrstedt:
Interactive Scene Graph Analysis for Future Intelligent Teleconferencing Systems. 251-255 - Dylan Seychell, Matthew Kenely, Matthias Bartolo, Carl James Debono, Mark Bugeja, Matthew Sacco:
Efficient Automatic Annotation of Binary Masks for Enhanced Training of Computer Vision Models. 256-259 - Michael G. Adam, Sebastian Eger, Martin Piccolrovazzi, Maged Iskandar, Joern Vogel, Alexander Dietrich, Seongjien Bien, Jon Skerlj, Abdeldjallil Naceri, Eckehard G. Steinbach, Alin Albu-Schäffer, Sami Haddadin, Wolfram Burgard:
Care3D: An Active 3D Object Detection Dataset of Real Robotic-Care Environments. 260-261 - Andreas Mallas, Hara Papadatou, Michalis Xenos:
Maintaining Text Legibility Regarding Font Size Based on User Distance in Mobile Devices: Application Development and User Evaluation. 262-269 - Majid Pourmemar, Charalambos Poullis:
Analysis of Hand Movement and Head Orientation in Hierarchical Menu Selection in Immersive AR. 270-275 - Gabriel Lugo Bustillo, Rutvik Chauhan, Irene Cheng:
Exploring terrestrial point clouds with Google Street View for discovery and fine-grained catalog of urban objects. 276-281 - Florian Eggenkemper, Lars Kölker, Mike Valente, Constantin A. Rothkopf, Robert Mertens:
Learning Individualized Automatic Content Magnification in Gaze-based Interaction. 282-286 - Taner Cagali, Hadi Wazni, Saba Nazir, Mehrnoosh Sadrzadeh, Chris Newell:
Semantic and Lexical Token Based Vectors Improve Precision of Recommendations for TV Programmes. 287-290 - Florian Schimanke, Robert Mertens, Lars Kölker:
An app-based Spaced Repetition Learning Environment with User Generated Video Content. 291-296 - Alexander Gantikow, Andreas Isking, Paul Libbrecht, Wolfgang Müller, Sandra Rebholz:
On the Creation of Classifiers to Support Assessment of E-Portfolios. 297-302 - Sanghamitra Das, Mario Wolf, Philippe Schmidt, Heinrich Söbke:
Mental Workload in Augmented Reality-based Urban Planning Education. 303-308 - Samuli Laato, Sampsa Rauti, Alexander Espeseth, Heinrich Söbke, Juho Hamari, Oguz 'Oz' Buruk:
Composing Music Through Tile-based Games. 309-314 - Dipayan Biswas, Shishir K. Shah, Jaspal Subhlok:
Identification of Visual Objects in Lecture Videos with Color and Keypoints Analysis. 315-320 - Yu Shen, Gang Wu, Vishy Swaminathan, Haoliang Wang, Stefano Petrangeli, Tong Yu:
GPU-accelerated Lossless Image Compression with Massive Parallelization. 321-324 - Fabio Persia, Daniela D'Auria:
Complex Event Processing in Heterogeneous Domains. 325-330
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.