Proceedings of the 23rd Brazillian Symposium on Multimedia and the Web
This paper presents a general approach to perform crowdsourcing video annotation without requirin... more This paper presents a general approach to perform crowdsourcing video annotation without requiring trained workers nor experts. It consists of dividing complex annotation tasks into simple and small microtasks and cascading them to generate a final result. Moreover, this approach allows using simple annotation tools rather than complex and expensive annotation systems. Also, it tends to avoid activities that may be tedious and time-consuming for workers. The cascade microtasks strategy is included in a workflow of three steps: Preparation, Annotation, and Presentation. A crowdsourcing video annotation process in which four different microtasks were cascaded was developed to evaluate the proposed approach. In the process, extra content such as images, text, hyperlinks and other elements are applied in the video enrichment. To support the experiment was developed a toolkit that includes Web-based annotation tools and aggregation methods, besides a presentation system for the annotated videos. This toolkit is open source and can be downloaded and used to replicate this experiment, as so to construct different crowdsourcing video annotation systems.
2017 International Conference on Systems, Signals and Image Processing (IWSSIP)
User Generated Videos are contents created by heterogeneous users around an event. Each user film... more User Generated Videos are contents created by heterogeneous users around an event. Each user films the event with his point of view, and according to his limitations. In this scenario, it is impossible to guarantee that all the videos will be stable, focused on a point of the event or other characteristics that turn the automatic video synchronization process possible. Focused on this scenario we propose the use of crowdsourcing techniques in video synchronization (CrowdSync). The crowd is not affected by heterogeneous videos as the automatic processes are, so it is possible to use them to process videos and find the synchronization points. In order to make this process possible, a structure is described that can manage both crowd and video synchronization: the Dynamic Alignment List (DAL). Therefore, we carried out two experiments to verify that the crowd can perform the proposed approach through two experiments: a crowd simulator and a small task based experiment.
2018 25th International Conference on Systems, Signals and Image Processing (IWSSIP), 2018
Media annotation consists of supplementing media objects, such as videos, images, and audios, by ... more Media annotation consists of supplementing media objects, such as videos, images, and audios, by adding metadata about their content and context, and describing media characteristics such as quality, encoding, among other features. Complex media annotation involves annotating different aspects of media objects as well as relating them to a context. This kind of annotation usually is associated with a demanding process that requires experts and elaborated annotation systems. This paper tries to overcome this issue and presents a method to achieve complex media annotation without requiring complex tools, experts nor trained workers. In this method, the complex annotation process is divided into a set of simple annotation microtasks, and based on them, a process workflow for generating complex annotations is defined. To demonstrate the execution of our approach, we developed a video enrichment system and carried out an experiment in which a crowd was responsible for executing a set of simple microtasks which are aggregated to produce enriched video content.
Proceedings of the Brazilian Symposium on Multimedia and the Web, 2020
Like the spoken languages, sign languages are not universal and vary in different countries. LIBR... more Like the spoken languages, sign languages are not universal and vary in different countries. LIBRAS (Brazilian Sign Language) is the second official language of Brazil and it is the language adopted by Brazilian Deaf's community to communicate. The signs of LIBRAS are composed of hand configurations, facial expressions and are affected by space and intensity modifiers, which makes their recognition more complicated than the simple identification of hand signs. The signs are arranged, according to a grammar, respecting form phrases, clauses, and sentences like any other spoken or sign language. The automatic machine translation of a sign language typically includes an initial phase for detecting sign boundaries. In this paper, we apply a crowdsourcing method to identifying signs boundaries present in pre-recorded videos those features LIBRAS interpreters. The limits or boundaries of the signs in the videos were established from the processing of contributions from workers from different countries, who have supposedly never heard of LIBRAS nor any other sign languages. To evaluate the segmentation process, we compared the sign boundaries identified by the crowd with the ground truth provided by a team of LIBRAS experts, who also assessed the quality of the delimitation of the identified signs. Our analysis showed that our crowdsourcing method was able to get 93.75% of the sign boundaries successfully.
Video annotation is an activity that aims to supplement this type of multimedia object with addit... more Video annotation is an activity that aims to supplement this type of multimedia object with additional content or information about its context, nature, content, quality and other aspects. These annotations are the basis for building a variety of multimedia applications for various purposes ranging from entertainment to security. Manual annotation is a strategy that uses the intelligence and workforce of people in the annotation process and is an alternative to cases where automatic methods cannot be applied. However, manual video annotation can be a costly process because as the content to be annotated increases, so does the workload for annotating. Crowdsourcing appears as a viable solution strategy in this context because it relies on outsourcing the tasks to a multitude of workers, who perform specific parts of the work in a distributed way. However, as the complexity of required media annoyances increases, it becomes necessary to employ skilled labor, or willing In: II Concurso...
This work presents a tool that allows users to synchronize live videos from multiple sources such... more This work presents a tool that allows users to synchronize live videos from multiple sources such as YouTube or any other video streaming sources. The proposed approach to proceed the multiple camera video synchronization is based in crowdsourcing techniques, using the power of a crowd of collaborators to synchronize videos, requiring from each user the sync of only a pairs of videos. Additional sync relations are inferred from the known contributions, using transitivity properties and an appropriate structure for this inference, the Dynamic Alignment List.
Proceedings of the 22nd Brazilian Symposium on Multimedia and the Web - Webmedia '16, 2016
This paper summarizes some information about a 4 hours short course presented by the authors duri... more This paper summarizes some information about a 4 hours short course presented by the authors during the WebMedia'16. Our short course aims to present the concept of crowdsourcing and empower participants to implement this model of production in various activities linked with Multimedia and Web Systems, such as annotation, generation, summarization, synchronization, recommendation, retrieval, presentation and evaluation of the content quality. The idea behind crowdsourcing is to take advantage of the processing power of a multitude of employees to accomplish tasks that are "difficult for a computer", but which are apparently "simple to human intelligence". Describing the contents of an image or a video as inappropriate is an example of such difficult task, because the description of the problem by means of algorithms and automated techniques applied to parameters of this content is very complex and inaccurate. Another complex task is the subjective assessment of the quality of video encoders, the results depend on the user's perception and not only on parameters such as signal-to-noise ratio, resolution or frame rate. The crowdsourcing model tends to provide reliable results for this and other problems related to Multimedia and Web Systems. The additional content support for this short course brings the fundamental concepts of crowdsourcing, a discussion of suitable scenarios for their use within the multimedia and examples of practical use of the concept in real-world scenarios.
Proceedings of the 23rd Brazillian Symposium on Multimedia and the Web
This paper presents a general approach to perform crowdsourcing video annotation without requirin... more This paper presents a general approach to perform crowdsourcing video annotation without requiring trained workers nor experts. It consists of dividing complex annotation tasks into simple and small microtasks and cascading them to generate a final result. Moreover, this approach allows using simple annotation tools rather than complex and expensive annotation systems. Also, it tends to avoid activities that may be tedious and time-consuming for workers. The cascade microtasks strategy is included in a workflow of three steps: Preparation, Annotation, and Presentation. A crowdsourcing video annotation process in which four different microtasks were cascaded was developed to evaluate the proposed approach. In the process, extra content such as images, text, hyperlinks and other elements are applied in the video enrichment. To support the experiment was developed a toolkit that includes Web-based annotation tools and aggregation methods, besides a presentation system for the annotated videos. This toolkit is open source and can be downloaded and used to replicate this experiment, as so to construct different crowdsourcing video annotation systems.
2017 International Conference on Systems, Signals and Image Processing (IWSSIP)
User Generated Videos are contents created by heterogeneous users around an event. Each user film... more User Generated Videos are contents created by heterogeneous users around an event. Each user films the event with his point of view, and according to his limitations. In this scenario, it is impossible to guarantee that all the videos will be stable, focused on a point of the event or other characteristics that turn the automatic video synchronization process possible. Focused on this scenario we propose the use of crowdsourcing techniques in video synchronization (CrowdSync). The crowd is not affected by heterogeneous videos as the automatic processes are, so it is possible to use them to process videos and find the synchronization points. In order to make this process possible, a structure is described that can manage both crowd and video synchronization: the Dynamic Alignment List (DAL). Therefore, we carried out two experiments to verify that the crowd can perform the proposed approach through two experiments: a crowd simulator and a small task based experiment.
2018 25th International Conference on Systems, Signals and Image Processing (IWSSIP), 2018
Media annotation consists of supplementing media objects, such as videos, images, and audios, by ... more Media annotation consists of supplementing media objects, such as videos, images, and audios, by adding metadata about their content and context, and describing media characteristics such as quality, encoding, among other features. Complex media annotation involves annotating different aspects of media objects as well as relating them to a context. This kind of annotation usually is associated with a demanding process that requires experts and elaborated annotation systems. This paper tries to overcome this issue and presents a method to achieve complex media annotation without requiring complex tools, experts nor trained workers. In this method, the complex annotation process is divided into a set of simple annotation microtasks, and based on them, a process workflow for generating complex annotations is defined. To demonstrate the execution of our approach, we developed a video enrichment system and carried out an experiment in which a crowd was responsible for executing a set of simple microtasks which are aggregated to produce enriched video content.
Proceedings of the Brazilian Symposium on Multimedia and the Web, 2020
Like the spoken languages, sign languages are not universal and vary in different countries. LIBR... more Like the spoken languages, sign languages are not universal and vary in different countries. LIBRAS (Brazilian Sign Language) is the second official language of Brazil and it is the language adopted by Brazilian Deaf's community to communicate. The signs of LIBRAS are composed of hand configurations, facial expressions and are affected by space and intensity modifiers, which makes their recognition more complicated than the simple identification of hand signs. The signs are arranged, according to a grammar, respecting form phrases, clauses, and sentences like any other spoken or sign language. The automatic machine translation of a sign language typically includes an initial phase for detecting sign boundaries. In this paper, we apply a crowdsourcing method to identifying signs boundaries present in pre-recorded videos those features LIBRAS interpreters. The limits or boundaries of the signs in the videos were established from the processing of contributions from workers from different countries, who have supposedly never heard of LIBRAS nor any other sign languages. To evaluate the segmentation process, we compared the sign boundaries identified by the crowd with the ground truth provided by a team of LIBRAS experts, who also assessed the quality of the delimitation of the identified signs. Our analysis showed that our crowdsourcing method was able to get 93.75% of the sign boundaries successfully.
Video annotation is an activity that aims to supplement this type of multimedia object with addit... more Video annotation is an activity that aims to supplement this type of multimedia object with additional content or information about its context, nature, content, quality and other aspects. These annotations are the basis for building a variety of multimedia applications for various purposes ranging from entertainment to security. Manual annotation is a strategy that uses the intelligence and workforce of people in the annotation process and is an alternative to cases where automatic methods cannot be applied. However, manual video annotation can be a costly process because as the content to be annotated increases, so does the workload for annotating. Crowdsourcing appears as a viable solution strategy in this context because it relies on outsourcing the tasks to a multitude of workers, who perform specific parts of the work in a distributed way. However, as the complexity of required media annoyances increases, it becomes necessary to employ skilled labor, or willing In: II Concurso...
This work presents a tool that allows users to synchronize live videos from multiple sources such... more This work presents a tool that allows users to synchronize live videos from multiple sources such as YouTube or any other video streaming sources. The proposed approach to proceed the multiple camera video synchronization is based in crowdsourcing techniques, using the power of a crowd of collaborators to synchronize videos, requiring from each user the sync of only a pairs of videos. Additional sync relations are inferred from the known contributions, using transitivity properties and an appropriate structure for this inference, the Dynamic Alignment List.
Proceedings of the 22nd Brazilian Symposium on Multimedia and the Web - Webmedia '16, 2016
This paper summarizes some information about a 4 hours short course presented by the authors duri... more This paper summarizes some information about a 4 hours short course presented by the authors during the WebMedia'16. Our short course aims to present the concept of crowdsourcing and empower participants to implement this model of production in various activities linked with Multimedia and Web Systems, such as annotation, generation, summarization, synchronization, recommendation, retrieval, presentation and evaluation of the content quality. The idea behind crowdsourcing is to take advantage of the processing power of a multitude of employees to accomplish tasks that are "difficult for a computer", but which are apparently "simple to human intelligence". Describing the contents of an image or a video as inappropriate is an example of such difficult task, because the description of the problem by means of algorithms and automated techniques applied to parameters of this content is very complex and inaccurate. Another complex task is the subjective assessment of the quality of video encoders, the results depend on the user's perception and not only on parameters such as signal-to-noise ratio, resolution or frame rate. The crowdsourcing model tends to provide reliable results for this and other problems related to Multimedia and Web Systems. The additional content support for this short course brings the fundamental concepts of crowdsourcing, a discussion of suitable scenarios for their use within the multimedia and examples of practical use of the concept in real-world scenarios.
Uploads