Un duel probabiliste pour d\'epartager deux pr\'esidents (LIA @ DEFT'2005)

El-Bèze, Marc; Torres-Moreno, Juan-Manuel; Béchet, Frédéric

Computer Science > Computation and Language

arXiv:1903.07397 (cs)

[Submitted on 11 Mar 2019]

Title:Un duel probabiliste pour départager deux présidents (LIA @ DEFT'2005)

Authors:Marc El-Bèze, Juan-Manuel Torres-Moreno, Frédéric Béchet

View PDF

Abstract:We present a set of probabilistic models applied to binary classification as defined in the DEFT'05 challenge. The challenge consisted a mixture of two differents problems in Natural Language Processing : identification of author (a sequence of François Mitterrand's sentences might have been inserted into a speech of Jacques Chirac) and thematic break detection (the subjects addressed by the two authors are supposed to be different). Markov chains, Bayes models and an adaptative process have been used to identify the paternity of these sequences. A probabilistic model of the internal coherence of speeches which has been employed to identify thematic breaks. Adding this model has shown to improve the quality results. A comparison with different approaches demostrates the superiority of a strategy that combines learning, coherence and adaptation. Applied to the DEFT'05 data test the results in terms of precision (0.890), recall (0.955) and Fscore (0.925) measure are very promising.

Comments:	27 figures, 1 table (in French)
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1903.07397 [cs.CL]
	(or arXiv:1903.07397v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1903.07397
Journal reference:	RNTI (E10)776:1889-1918, 2007

Submission history

From: Juan-Manuel Torres-Moreno [view email]
[v1] Mon, 11 Mar 2019 11:02:24 UTC (226 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2019-03

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Marc El-Bèze
Juan-Manuel Torres-Moreno
Frédéric Béchet

export BibTeX citation

Computer Science > Computation and Language

Title:Un duel probabiliste pour départager deux présidents (LIA @ DEFT'2005)

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Un duel probabiliste pour départager deux présidents (LIA @ DEFT'2005)

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators