Natural language processing

Applied Filters

People

Publications

Publication Date

Searched The ACM Guide to Computing Literature (3,835,209 records)|Limit your search to The ACM Full-Text Collection (773,459 records)

Showing 1 - 20of59 Results

Filters

Select All

Export Citations Save to Binder

per page:

Recency

research-article
Free
March 2024
Discovering salient neurons in deep NLP models
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 362, Pages 17438–17477

While a lot of work has been done in understanding representations learned within deep NLP models and what knowledge they capture, work done towards analyzing individual neurons is relatively sparse. We present a technique called Linguistic Correlation ...
0
96
Metrics
Total Citations0
Total Downloads96
Last 12 Months96
Last 6 weeks25
View online with eReader
PDF
research-article
Free
March 2024
ProtoryNet - interpretable text classification via prototype trajectories
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 264, Pages 12344–12382

We propose a novel interpretable deep neural network for text classification, called ProtoryNet, based on a new concept of prototype trajectories. Motivated by the prototype theory in modern linguistics, ProtoryNet makes a prediction by finding the most ...
0
83
Metrics
Total Citations0
Total Downloads83
Last 12 Months83
Last 6 weeks23
View online with eReader
PDF
research-article
Free
March 2024
Atlas: few-shot learning with retrieval augmented language models
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 251, Pages 11912–11954

Large language models have shown impressive few-shot results on a wide range of tasks. However, when knowledge is key for such results, as is the case for tasks such as question answering and fact checking, massive parameter counts to store knowledge ...
0
398
Metrics
Total Citations0
Total Downloads398
Last 12 Months398
Last 6 weeks67
View online with eReader
PDF
research-article
Free
March 2024
PaLM: scaling language modeling with pathways
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 240, Pages 11324–11436

Large language models have been shown to achieve remarkable performance across a variety of natural language tasks using few-shot learning, which drastically reduces the number of task-specific training examples needed to adapt the model to a particular ...
0
675
Metrics
Total Citations0
Total Downloads675
Last 12 Months675
Last 6 weeks136
View online with eReader
PDF
research-article
Free
January 2022
Switch transformers: scaling to trillion parameter models with simple and efficient sparsity
The Journal of Machine Learning Research (JMLR), Volume 23, Issue 1Article No.: 120, Pages 5232–5270

In deep learning, models typically reuse the same parameters for all inputs. Mixture of Experts (MoE) models defy this and instead select different parameters for each incoming example. The result is a sparsely-activated model--with an outrageous number ...
74
2,334
Metrics
Total Citations74
Total Downloads2,334
Last 12 Months1,142
Last 6 weeks50
View online with eReader
PDF
research-article
Free
January 2022
A statistical approach for optimal topic model identification
- Craig M. Lewis,
- Francesco Grossetti
The Journal of Machine Learning Research (JMLR), Volume 23, Issue 1Article No.: 58, Pages 2553–2572

Latent Dirichlet Allocation (LDA) is a popular machine-learning technique that identifies latent structures in a corpus of documents. This paper addresses the ongoing concern that formal procedures for determining the optimal LDA configuration do not ...
0
149
Metrics
Total Citations0
Total Downloads149
Last 12 Months114
Last 6 weeks10
View online with eReader
PDF
research-article
Free
January 2021
Further results on latent discourse models and word embeddings
The Journal of Machine Learning Research (JMLR), Volume 22, Issue 1Article No.: 270, Pages 12376–12411

We discuss some properties of generative models for word embeddings. Namely, (Arora et al., 2016) proposed a latent discourse model implying the concentration of the partition function of the word vectors. This concentration phenomenon led to an ...
0
43
Metrics
Total Citations0
Total Downloads43
Last 12 Months22
Last 6 weeks5
View online with eReader
PDF
research-article
Free
January 2021
Beyond english-centric multilingual machine translation
The Journal of Machine Learning Research (JMLR), Volume 22, Issue 1Article No.: 107, Pages 4839–4886

Existing work in translation demonstrated the potential of massively multilingual machine translation by training a single model able to translate between any pair of languages. However, much of this work is English-Centric, training only on data which ...
10
1,299
Metrics
Total Citations10
Total Downloads1,299
Last 12 Months644
Last 6 weeks73
View online with eReader
PDF
research-article
Free
January 2021
LocalGAN: modeling local distributions for adversarial response generation
The Journal of Machine Learning Research (JMLR), Volume 22, Issue 1Article No.: 101, Pages 4578–4606

This paper presents a new methodology for modeling the local semantic distribution of responses to a given query in the human-conversation corpus, and on this basis, explores a specified adversarial learning mechanism for training Neural Response ...
0
78
Metrics
Total Citations0
Total Downloads78
Last 12 Months54
Last 6 weeks8
View online with eReader
PDF
research-article
Free
January 2021
Bayesian text classification and summarization via a class-specified topic model
The Journal of Machine Learning Research (JMLR), Volume 22, Issue 1Article No.: 89, Pages 3971–4018

We propose the class-specified topic model (CSTM) to deal with the tasks of text classification and class-specific text summarization. The model assumes that in addition to a set of latent topics that are shared across classes, there is a set of class-...
0
199
Metrics
Total Citations0
Total Downloads199
Last 12 Months133
Last 6 weeks15
View online with eReader
PDF
research-article
Free
January 2021
Residual energy-based models for text
The Journal of Machine Learning Research (JMLR), Volume 22, Issue 1Article No.: 40, Pages 1840–1880

Current large-scale auto-regressive language models (Radford et al., 2019; Liu et al., 2018; Graves, 2013) display impressive fluency and can generate convincing text. In this work we start by asking the question: Can the generations of these models be ...
2
178
Metrics
Total Citations2
Total Downloads178
Last 12 Months82
Last 6 weeks9
View online with eReader
PDF
research-article
Free
January 2020
Exploring the limits of transfer learning with a unified text-to-text transformer
The Journal of Machine Learning Research (JMLR), Volume 21, Issue 1Article No.: 140, Pages 5485–5551

Transfer learning, where a model is first pre-trained on a data-rich task before being fine-tuned on a downstream task, has emerged as a powerful technique in natural language processing (NLP). The effectiveness of transfer learning has given rise to a ...
684
8,702
Metrics
Total Citations684
Total Downloads8,702
Last 12 Months2,919
Last 6 weeks279
View online with eReader
PDF
article
Free
February 2013
Ranked bandits in metric spaces: learning diverse rankings over large document collections
The Journal of Machine Learning Research (JMLR), Volume 14, Issue 1Pages 399–436

Most learning to rank research has assumed that the utility of different documents is independent, which results in learned ranking functions that return redundant results. The few approaches that avoid this have rather unsatisfyingly lacked theoretical ...
34
234
Metrics
Total Citations34
Total Downloads234
Last 12 Months50
Last 6 weeks6
View online with eReader
PDF
article
Free
January 2013
MAGIC summoning: towards automatic suggesting and testing of gestures with low probability of false positives during use
- Daniel Kyu Hwa Kohlsdorf,
- Thad E. Starner
The Journal of Machine Learning Research (JMLR), Volume 14, Issue 1Pages 209–242

Gestures for interfaces should be short, pleasing, intuitive, and easily recognized by a computer. However, it is a challenge for interface designers to create gestures easily distinguishable from users' normal movements. Our tool MAGIC Summoning ...
4
247
Metrics
Total Citations4
Total Downloads247
Last 12 Months54
Last 6 weeks7
View online with eReader
PDF
article
Free
December 2012
Exploration in relational domains for model-based reinforcement learning
The Journal of Machine Learning Research (JMLR), Volume 13, Issue 1Pages 3725–3768

A fundamental problem in reinforcement learning is balancing exploration and exploitation. We address this problem in the context of model-based reinforcement learning in large stochastic relational domains by developing relational extensions of the ...
12
268
Metrics
Total Citations12
Total Downloads268
Last 12 Months51
Last 6 weeks5
View online with eReader
PDF
article
Free
December 2012
Security analysis of online centroid anomaly detection
- Marius Kloft,
- Pavel Laskov
The Journal of Machine Learning Research (JMLR), Volume 13, Issue 1Pages 3681–3724

Security issues are crucial in a number of machine learning applications, especially in scenarios dealing with human activity rather than natural phenomena (e.g., information ranking, spam detection, malware detection, etc.). In such cases, learning ...
12
412
Metrics
Total Citations12
Total Downloads412
Last 12 Months80
Last 6 weeks29
View online with eReader
PDF
article
Free
December 2012
Smoothing multivariate performance measures
The Journal of Machine Learning Research (JMLR), Volume 13, Issue 1Pages 3623–3680

Optimizing multivariate performance measure is an important task in Machine Learning. Joachims (2005) introduced a Support Vector Method whose underlying optimization problem is commonly solved by cutting plane methods (CPMs) such as SVM-Perf and BMRM. ...
13
120
Metrics
Total Citations13
Total Downloads120
Last 12 Months47
Last 6 weeks6
View online with eReader
PDF
article
Free
December 2012
SVDFeature: a toolkit for feature-based collaborative filtering
The Journal of Machine Learning Research (JMLR), Volume 13, Issue 1Pages 3619–3622

In this paper we introduce SVDFeature, a machine learning toolkit for feature-based collaborative filtering. SVDFeature is designed to efficiently solve the feature-based matrix factorization. The feature-based setting allows us to build factorization ...
83
519
Metrics
Total Citations83
Total Downloads519
Last 12 Months90
Last 6 weeks9
View online with eReader
PDF
article
Free
December 2012
Learning symbolic representations of hybrid dynamical systems
- Daniel L. Ly,
- Hod Lipson
The Journal of Machine Learning Research (JMLR), Volume 13, Issue 1Pages 3585–3618

A hybrid dynamical system is a mathematical model suitable for describing an extensive spectrum of multi-modal, time-series behaviors, ranging from bouncing balls to air traffic controllers. This paper describes multi-modal symbolic regression (MMSR): a ...
6
255
Metrics
Total Citations6
Total Downloads255
Last 12 Months63
Last 6 weeks7
View online with eReader
PDF
article
Free
December 2012
Regularized bundle methods for convex and non-convex risks
- Trinh-Minh-Tri Do,
- Thierry Artières
The Journal of Machine Learning Research (JMLR), Volume 13, Issue 1Pages 3539–3583

Machine learning is most often cast as an optimization problem. Ideally, one expects a convex objective function to rely on efficient convex optimizers with nice guarantees such as no local optima. Yet, non-convexity is very frequent in practice and it ...
10
188
Metrics
Total Citations10
Total Downloads188
Last 12 Months80
Last 6 weeks7
View online with eReader
PDF