Este relat?rio apresenta os resultados de um estudo realizado com dados abertos governamentais qu... more Este relat?rio apresenta os resultados de um estudo realizado com dados abertos governamentais que s?o empenhados mensalmente, visando ? identifica??o de anomalias na administra??o desses gastos. Para isso, foi utilizada a t?cnica de agrupamento com o intuito de identificar grupos baseados em algumas informa??es de cada gasto p?blico. O objetivo geral foi a identifica??o de registros fora dos grupos gerados e o estudo mais aprofundado sobre estes itens
In recent years, social networks have gained a huge popularity among internet users, serving dive... more In recent years, social networks have gained a huge popularity among internet users, serving diverse purposes and communities. Meanwhile, in data-oriented applications, the increasing amount of available data has made it hard for users to find the information they need in the way they consider relevant. To help matters, a user-centric approach may be used to enhance query answering and, particularly, provide query personalization. In this work, we address the issue of personalizing query answers in data-oriented applications considering the user context provided by social network information. To this end, we propose a context-aware plugin named CODI4In. The CODI4In extracts users' social network information regarding their "likes" and use them as context information to provide query personalization. In this paper, we present the developed approach and some experimental results we have accomplished with real users. These results show that by considering the acquired use...
In distributed data environments, ontologies have been used as a support for data management. For... more In distributed data environments, ontologies have been used as a support for data management. For instance, ontologies may be used to describe the semantics of data at different sources, helping to overcome problems of data heterogeneity and semantic interoperability. Generally, the task of accessing data by means of conceptual ontologies has been called Ontology-based Data Access (OBDA). A typical scenario for OBDA instantiation is a Peer Data Management System (PDMS) where queries submitted at a peer are answered with data residing at that peer and with data acquired from neighbor peers through the use of mappings. In this work, we apply the principles underlying an ODBA in the light of a PDMS, using geographic databases as data sources. When dealing with geospatial data, specific problems regarding query answering and data visualization may occur. To help matters, we propose an approach named easeGO, which provides access to a geographic database using an ontology. We also presen...
One of the major database research areas is Data Integration, which refers to providing users wit... more One of the major database research areas is Data Integration, which refers to providing users with a uniform view over a set of heterogeneous, distributed and autonomous data sources. Data Integration settings concern, for instance, mediator-based ...
Realizar consultas em sistemas computacionais pode ser uma tarefa desafiadora, visto que, quando ... more Realizar consultas em sistemas computacionais pode ser uma tarefa desafiadora, visto que, quando uma consulta e submetida por varios usuarios, normalmente, as mesmas respostas sao retornadas, independentemente de suas preferencias e do contexto no qual a consulta ocorreu. Para facilitar esse processo, uma abordagem centrada no usuario pode ser usada visando prover a personalizacao da consulta. Neste trabalho, esta personalizacao e realizada considerando o contexto do usuario. Para tal, foi desenvolvido uma primeira versao de um plugin chamado CODI4In que prove a persistencia das informacoes contextuais do usuario. Estas sao representadas por meio de uma ontologia e armazenadas em um banco de dados baseado em grafos.
Query answering has been addressed as a key issue in dynamic distributed environments such as Pee... more Query answering has been addressed as a key issue in dynamic distributed environments such as Peer Data Management Systems (PDMS). An important step in this process regards query routing, i.e., how to find peers (data sources) that are most likely to provide matching results according to the semantics of a submitted query. To help matters, we argue that semantic information like contextual information, combined with Information Quality (IQ) provided by IQ measures, may be employed together to enrich query routing processes. In this work, we propose an instantiation of a metamodel which combines both concepts as a means to produce semantic knowledge to be used in query routing processes. We also present an example of such instantiation.
The lack of metadata to describe datasets published on the Web makes their location and access by... more The lack of metadata to describe datasets published on the Web makes their location and access by search engines or applications more difficult. Providing a dataset profile facilitates communication between publishers and consumers and also the integrated use of datasets. This paper proposes an approach that describes datasets on the Web by the generation of a semantically enriched descriptive and structural metadata profile. The enrichment occurs by means of the knowledge domain identification of the dataset at hand and a vocabulary recommendation in order to semantically reference the data. This work presents some accomplished experiments that indicate the relevance of this enrichment. Resumo. A ausência de metadados para a descrição de conjuntos de dados publicados na Web dificulta sua localização e acesso por parte de mecanismos de busca ou aplicações. Prover um perfil do conjunto de dados facilita a comunicação entre publicadores e consumidores e o uso integrado dos conjuntos d...
Ontology-Based Data Access (OBDA) is the problem of accessing one or more data sources by means o... more Ontology-Based Data Access (OBDA) is the problem of accessing one or more data sources by means of a conceptual representation expressed in terms of an ontology. We apply the principles underlying an ODBA in the light of a Peer Data Management System, using geographic databases as data sources. When dealing with geospatial data, specific problems regarding query answering and data visualization occur. To help matters, in this work, we present an approach and a tool, named easeGO, which provides access to a geographic database using an ontology as a middle layer between the user interface and the data. It also allows users to formulate queries using visual elements and spatial operators. We present the principles underlying our approach and examples illustrating how it works.
One key issue in Peer Data Management Systems (PDMSs) is the heterogeneity of the peer schemas. T... more One key issue in Peer Data Management Systems (PDMSs) is the heterogeneity of the peer schemas. To help matters, ontologies may be used as uniform conceptual representation of these schemas. In this work, we are working with geographic databases to be used in a PDMS. When dealing with geospatial data, specific problems with representation and usage occur. In this sense, we have developed an approach and a tool, named GeoMap, which builds a peer ontology from a geographic database schema. In order to provide geospatial semantics when mapping, we have defined and used a reference geospatial ontology. We present the principles underlying our approach and examples illustrating how they work by means of the tool.
Clinical decision support systems is a research area in which Machine Learning (ML) techniques ca... more Clinical decision support systems is a research area in which Machine Learning (ML) techniques can be applied. Nevertheless, specifically in assisting pneumonia decision making, the use of ML has not been so expressive. To help matters, this work aims to contribute to the evolution of the intersection of such areas by presenting a Systematic Review of the Literature. It provides results which may help to identify, interpret and evaluate how ML techniques have been applied and some research enhancements yet to be done. CCS Concepts: • Applied computing;
Revista Principia - Divulgação Científica e Tecnológica do IFPB
Nowadays, the Web may be considered an adequate ecosystem for publication and open data consumpti... more Nowadays, the Web may be considered an adequate ecosystem for publication and open data consumption . Published datasets may provide open and, additionally, linked data, which results in the use of semantic technologies such as recommended vocabularies and their connection with other datasets. Taking into account a data scope from the Academic Unit of Informatics at IFPB-Campus João Pessoa, a set of open and linked data was created and published for consumption. This dataset includes information obtained from the Lattes Platform and from some internal data regarding teachers, projects, courses and areas of expertise. Source data went through a process of extraction, transformation and load based on the use of an ontology, named “Ontology for University and Academic Institutions” (OUAI), which was developed in this work. As a result, the dataset was published in the RDF model and was made available for consumption through an endpoint. Based ondata consumption, the OpenUAI application...
Revista Principia - Divulgação Científica e Tecnológica do IFPB, 2015
&... more <p>As atuais perspectivas computacionais, vindas sobretudo da Web, têm gerado novas demandas relacionadas ao gerenciamento de dados, principalmente em termos de volume, heterogeneidade e dinamismo. Uma tendência atual para facilitar o gerenciamento de dados na Web é a utilização dos denominados Sistemas NoSQL, que se diferenciam dos sistemas que seguem o Modelo Relacional por possibilitarem a implementação de estruturas mais flexíveis. Contudo, a maioria dos bancos de dados de aplicações existentes encontra-se em estruturas relacionais, e a migração de uma base que segue o Modelo Relacional para uma NoSQL requer grande esforço dos projetistas diante das diferenças existentes. Nesse panorama, este artigo descreve os modelos citados, em termos de conceitos e estruturas, e apresenta um estudo comparativo apontando possíveis mapeamentos conceituais entre eles. Aborda também, de forma comparativa, trabalhos de conversão de dados existentes, e indica desafios e possibilidades para novas pesquisas sobre o…
Este relat?rio apresenta os resultados de um estudo realizado com dados abertos governamentais qu... more Este relat?rio apresenta os resultados de um estudo realizado com dados abertos governamentais que s?o empenhados mensalmente, visando ? identifica??o de anomalias na administra??o desses gastos. Para isso, foi utilizada a t?cnica de agrupamento com o intuito de identificar grupos baseados em algumas informa??es de cada gasto p?blico. O objetivo geral foi a identifica??o de registros fora dos grupos gerados e o estudo mais aprofundado sobre estes itens
In recent years, social networks have gained a huge popularity among internet users, serving dive... more In recent years, social networks have gained a huge popularity among internet users, serving diverse purposes and communities. Meanwhile, in data-oriented applications, the increasing amount of available data has made it hard for users to find the information they need in the way they consider relevant. To help matters, a user-centric approach may be used to enhance query answering and, particularly, provide query personalization. In this work, we address the issue of personalizing query answers in data-oriented applications considering the user context provided by social network information. To this end, we propose a context-aware plugin named CODI4In. The CODI4In extracts users' social network information regarding their "likes" and use them as context information to provide query personalization. In this paper, we present the developed approach and some experimental results we have accomplished with real users. These results show that by considering the acquired use...
In distributed data environments, ontologies have been used as a support for data management. For... more In distributed data environments, ontologies have been used as a support for data management. For instance, ontologies may be used to describe the semantics of data at different sources, helping to overcome problems of data heterogeneity and semantic interoperability. Generally, the task of accessing data by means of conceptual ontologies has been called Ontology-based Data Access (OBDA). A typical scenario for OBDA instantiation is a Peer Data Management System (PDMS) where queries submitted at a peer are answered with data residing at that peer and with data acquired from neighbor peers through the use of mappings. In this work, we apply the principles underlying an ODBA in the light of a PDMS, using geographic databases as data sources. When dealing with geospatial data, specific problems regarding query answering and data visualization may occur. To help matters, we propose an approach named easeGO, which provides access to a geographic database using an ontology. We also presen...
One of the major database research areas is Data Integration, which refers to providing users wit... more One of the major database research areas is Data Integration, which refers to providing users with a uniform view over a set of heterogeneous, distributed and autonomous data sources. Data Integration settings concern, for instance, mediator-based ...
Realizar consultas em sistemas computacionais pode ser uma tarefa desafiadora, visto que, quando ... more Realizar consultas em sistemas computacionais pode ser uma tarefa desafiadora, visto que, quando uma consulta e submetida por varios usuarios, normalmente, as mesmas respostas sao retornadas, independentemente de suas preferencias e do contexto no qual a consulta ocorreu. Para facilitar esse processo, uma abordagem centrada no usuario pode ser usada visando prover a personalizacao da consulta. Neste trabalho, esta personalizacao e realizada considerando o contexto do usuario. Para tal, foi desenvolvido uma primeira versao de um plugin chamado CODI4In que prove a persistencia das informacoes contextuais do usuario. Estas sao representadas por meio de uma ontologia e armazenadas em um banco de dados baseado em grafos.
Query answering has been addressed as a key issue in dynamic distributed environments such as Pee... more Query answering has been addressed as a key issue in dynamic distributed environments such as Peer Data Management Systems (PDMS). An important step in this process regards query routing, i.e., how to find peers (data sources) that are most likely to provide matching results according to the semantics of a submitted query. To help matters, we argue that semantic information like contextual information, combined with Information Quality (IQ) provided by IQ measures, may be employed together to enrich query routing processes. In this work, we propose an instantiation of a metamodel which combines both concepts as a means to produce semantic knowledge to be used in query routing processes. We also present an example of such instantiation.
The lack of metadata to describe datasets published on the Web makes their location and access by... more The lack of metadata to describe datasets published on the Web makes their location and access by search engines or applications more difficult. Providing a dataset profile facilitates communication between publishers and consumers and also the integrated use of datasets. This paper proposes an approach that describes datasets on the Web by the generation of a semantically enriched descriptive and structural metadata profile. The enrichment occurs by means of the knowledge domain identification of the dataset at hand and a vocabulary recommendation in order to semantically reference the data. This work presents some accomplished experiments that indicate the relevance of this enrichment. Resumo. A ausência de metadados para a descrição de conjuntos de dados publicados na Web dificulta sua localização e acesso por parte de mecanismos de busca ou aplicações. Prover um perfil do conjunto de dados facilita a comunicação entre publicadores e consumidores e o uso integrado dos conjuntos d...
Ontology-Based Data Access (OBDA) is the problem of accessing one or more data sources by means o... more Ontology-Based Data Access (OBDA) is the problem of accessing one or more data sources by means of a conceptual representation expressed in terms of an ontology. We apply the principles underlying an ODBA in the light of a Peer Data Management System, using geographic databases as data sources. When dealing with geospatial data, specific problems regarding query answering and data visualization occur. To help matters, in this work, we present an approach and a tool, named easeGO, which provides access to a geographic database using an ontology as a middle layer between the user interface and the data. It also allows users to formulate queries using visual elements and spatial operators. We present the principles underlying our approach and examples illustrating how it works.
One key issue in Peer Data Management Systems (PDMSs) is the heterogeneity of the peer schemas. T... more One key issue in Peer Data Management Systems (PDMSs) is the heterogeneity of the peer schemas. To help matters, ontologies may be used as uniform conceptual representation of these schemas. In this work, we are working with geographic databases to be used in a PDMS. When dealing with geospatial data, specific problems with representation and usage occur. In this sense, we have developed an approach and a tool, named GeoMap, which builds a peer ontology from a geographic database schema. In order to provide geospatial semantics when mapping, we have defined and used a reference geospatial ontology. We present the principles underlying our approach and examples illustrating how they work by means of the tool.
Clinical decision support systems is a research area in which Machine Learning (ML) techniques ca... more Clinical decision support systems is a research area in which Machine Learning (ML) techniques can be applied. Nevertheless, specifically in assisting pneumonia decision making, the use of ML has not been so expressive. To help matters, this work aims to contribute to the evolution of the intersection of such areas by presenting a Systematic Review of the Literature. It provides results which may help to identify, interpret and evaluate how ML techniques have been applied and some research enhancements yet to be done. CCS Concepts: • Applied computing;
Revista Principia - Divulgação Científica e Tecnológica do IFPB
Nowadays, the Web may be considered an adequate ecosystem for publication and open data consumpti... more Nowadays, the Web may be considered an adequate ecosystem for publication and open data consumption . Published datasets may provide open and, additionally, linked data, which results in the use of semantic technologies such as recommended vocabularies and their connection with other datasets. Taking into account a data scope from the Academic Unit of Informatics at IFPB-Campus João Pessoa, a set of open and linked data was created and published for consumption. This dataset includes information obtained from the Lattes Platform and from some internal data regarding teachers, projects, courses and areas of expertise. Source data went through a process of extraction, transformation and load based on the use of an ontology, named “Ontology for University and Academic Institutions” (OUAI), which was developed in this work. As a result, the dataset was published in the RDF model and was made available for consumption through an endpoint. Based ondata consumption, the OpenUAI application...
Revista Principia - Divulgação Científica e Tecnológica do IFPB, 2015
&... more <p>As atuais perspectivas computacionais, vindas sobretudo da Web, têm gerado novas demandas relacionadas ao gerenciamento de dados, principalmente em termos de volume, heterogeneidade e dinamismo. Uma tendência atual para facilitar o gerenciamento de dados na Web é a utilização dos denominados Sistemas NoSQL, que se diferenciam dos sistemas que seguem o Modelo Relacional por possibilitarem a implementação de estruturas mais flexíveis. Contudo, a maioria dos bancos de dados de aplicações existentes encontra-se em estruturas relacionais, e a migração de uma base que segue o Modelo Relacional para uma NoSQL requer grande esforço dos projetistas diante das diferenças existentes. Nesse panorama, este artigo descreve os modelos citados, em termos de conceitos e estruturas, e apresenta um estudo comparativo apontando possíveis mapeamentos conceituais entre eles. Aborda também, de forma comparativa, trabalhos de conversão de dados existentes, e indica desafios e possibilidades para novas pesquisas sobre o…
Uploads
Papers by Damires Souza