A framework for evaluating automatic indexing or classification in the context of retrieval
Tools for automatic subject assignment help deal with scale and sustainability in creating and enriching metadata, establishing more connections across and between resources and enhancing consistency. Although some software vendors and experimental ...
Classifying Twitter favorites: Like, bookmark, or Thanks?
Since its foundation in 2006, Twitter has enjoyed a meteoric rise in popularity, currently boasting over 500 million users. Its short text nature means that the service is open to a variety of different usage patterns, which have evolved rapidly in ...
Personal information concerns and provision in social network sites: Interplay between secure preservation and true presentation
Encouraging users of social network sites (SNS) to actively provide personal information is vital if SNS are to prosper, but privacy concerns have hindered users from giving such information. Previous research dealing with privacy concerns has studied ...
An exploratory study of the information-seeking activities of adolescents in a discussion forum
The aim of this study is to understand how teenagers use Internet forums to search for information. The activities of asking for and providing information in a forum were explored, and a set of messages extracted from a French forum targeting ...
User satisfaction with microblogging: Information dissemination versus social networking
Microblogging is growing in popularity and significance. Although many researchers have attempted to explain why and how people use this new medium, previous studies have produced relatively inconclusive results. For instance, in most of these studies, ...
SemGraph: Extracting keyphrases following a novel semantic graph-based approach
Keyphrases represent the main topics a text is about. In this article, we introduce SemGraph, an unsupervised algorithm for extracting keyphrases from a collection of texts based on a semantic relationship graph. The main novelty of this algorithm is ...
On cold start for associative tag recommendation
Tag recommendation strategies that exploit term co-occurrence patterns with tags previously assigned to the target object have consistently produced state-of-the-art results. However, such techniques work only for objects with previously assigned tags. ...
Descriptive document clustering via discriminant learning in a co-embedded space of multilevel similarities
Descriptive document clustering aims at discovering clusters of semantically interrelated documents together with meaningful labels to summarize the content of each document cluster. In this work, we propose a novel descriptive clustering framework, ...
Extending the understanding of critical success factors for implementing business intelligence systems
Extant studies suggest implementing a business intelligence (BI) system is a costly, resource-intensive and complex undertaking. Literature draws attention to the critical success factors (CSFs) for implementation of BI systems. Leveraging case studies of ...
C-sanitized: A privacy model for document redaction and sanitization
Vast amounts of information are daily exchanged and/or released. The sensitive nature of much of this information creates a serious privacy threat when documents are uncontrollably made available to untrusted third parties. In such cases, appropriate ...
The invariant distribution of references in scientific articles
The organization of scientific papers typically follows a standardized pattern, the well-known IMRaD structure (introduction, methods, results, and discussion). Using the full text of 45,000 papers published in the PLoS series of journals as a case study, ...
Updating the SCImago journal and country rank classification: A new approach using Ward's clustering and alternative combination of citation measures
This study introduces a new proposal to refine the classification of the SCImago Journal and Country Rank (SJR) platform by using clustering techniques and an alternative combination of citation measures from an initial 18,891 SJR journal network. Thus, a ...
When are readership counts as useful as citation counts? Scopus versus Mendeley for LIS journals
In theory, articles can attract readers on the social reference sharing site Mendeley before they can attract citations, so Mendeley altmetrics could provide early indications of article impact. This article investigates the influence of time on the ...
A new approach to the QS university ranking using the composite I-distance indicator: Uncertainty and sensitivity analyses
Some major concerns of universities are to provide quality in higher education and enhance global competitiveness, thus ensuring a high global rank and an excellent performance evaluation. This article examines the Quacquarelli Symonds (QS) World ...
Explaining the unexpected and continued use of an information system with the help of evolved evolutionary mechanisms
Explaining how, why, and to what extent humans use information systems has been at the heart of the information systems (IS) discipline, and although successful models have emerged, mostly relying on social and cognitive psychology in their theoretical ...
Tweets as impact indicators: Examining the implications of automated "bot" accounts on Twitter
This brief communication presents preliminary findings on automated Twitter accounts distributing links to scientific articles deposited on the preprint repository arXiv. It discusses the implication of the presence of such bots from the perspective of ...
Computational authorship verification method attributes a new work to a major 2nd century African author
We discuss a real-world application of a recently proposed machine learning method for authorship verification. Authorship verification is considered an extremely difficult task in computational text classification, because it does not assume that the ...