This paper contains a preliminary corpus study of oxymorons, a figure of speech so far under-investigated in NLP-oriented research. The study resulted in a list of 376 oxymorons, identified by extracting a set of antonymous pairs (under various configurations) from corpora of written Italian and by manually checking the results. A complementary method is also envisaged for discovering contextual oxymorons, which are highly relevant for the detection of humor, irony and sarcasm.
The dataset contains 376 Italian oxymorons (first version, released in June 2020) with their English translations. For each oxymoron, the dataset also provides its syntactic structure, the pair of antonyms it relies on, and the corpus or corpora from which it was extracted.
Papers by Marta La Pietra