Academia.edu no longer supports Internet Explorer.
To browse Academia.edu and the wider internet faster and more securely, please take a few seconds to upgrade your browser.
2007
Abstract—Latent semantic analysis (LSA) is an algorithm applied to approximate the meaning of texts, thereby exposing semantic structure to computation. LSA combines the classical vector-space model− well known in computational linguistics− with a singular value decomposition (SVD), a two-mode factor analysis. Thus, bag-of-words representations of texts can be mapped into a modified vector space that is assumed to reflect semantic structure.
Behavior research methods, 2014
In this article, the R package LSAfun is presented. This package enables a variety of functions and computations based on Vector Semantic Models such as Latent Semantic Analysis (LSA) Landauer, Foltz and Laham (Discourse Processes 25:259-284, 1998), which are procedures to obtain a high-dimensional vector representation for words (and documents) from a text corpus. Such representations are thought to capture the semantic meaning of a word (or document) and allow for semantic similarity comparisons between words to be calculated as the cosine of the angle between their associated vectors. LSAfun uses pre-created LSA spaces and provides functions for (a) Similarity Computations between words, word lists, and documents; (b) Neighborhood Computations, such as obtaining a word's or document's most similar words, (c) plotting such a neighborhood, as well as similarity structures for any word lists, in a two- or three-dimensional approximation using Multidimensional Scaling, (d) Ap...
European Journal of Information Systems, 2012
International Journal of Recent Trends in Engineering, 2009
Different mathematical techniques are being developed to reduce the dimensionality of data within large datasets, for robust retrieval of required information. Latent Semantic Analysis (LSA), a modified low rank approximation form of Vector Space Model, can be used for detecting underlying semantic relationships within text corpora. LSA performs a low-rank approximation on term-document matrix, which is generated by transforming textual data into a vector representation, thereby bringing out the semantic connectedness present among the documents of the corpus. Singular Value Decomposition (SVD) is the traditional approximation method used for LSA, wherein lower dimensional components from the decomposition are truncated. On truncation, the linguistic noise present in the vector representation is removed, and the semantic connectedness is made visible. One of the pitfalls of using SVD is that the truncated matrix will have negative components, which is not natural for interpreting the textual representation. Nonnegative Matrix Factorization (NMF) addresses this issue by generating non-negative parts-based representation as the low rank approximation for performing LSA. The paper provides an in-depth overview of how both methods are being used for the purpose of Information Retrieval. Performance evaluation of the methods has been performed using standard test datasets.
Abstract—This paper presents a statistical method for analysis and processing of text using a technique called Latent Semantic Analysis. Latent semantic analysis was a technique that was devised to mimic human understanding of words and language. Hence it is a method for computer simulation of the meaning of word and passages by analysis of natural language or text. It uses a mathematical model called Singular Value Decomposition which is a technique used to factorize a matrix. The paper discusses its application in information retrieval, which is called latent semantic indexing in this context. We also present an example which demonstrates this technique.
Studia Norwidiana, 2017
International Journal of Research and Innovation in Applied Science
Educar em Revista, 1988
Physiology and Molecular Biology of Plants, 2013
The Astrophysical Journal, 2010
Chemical and Pharmaceutical Bulletin, 2001
Comptes Rendus Biologies, 2011
International Journal of High Dilution Research, 2021
Biological Psychiatry, 1998