Abstract
Sentence compression is an important problem in natural language processing. In this paper, we firstly establish a new sentence compression model based on the probability model and the parse tree model. Our sentence compression model is equivalent to an integer linear program (ILP) which can both guarantee the syntax correctness of the compression and save the main meaning. We propose using a DC (Difference of convex) programming approach (DCA) for finding local optimal solution of our model. Combing DCA with a parallel-branch-and-bound framework, we can find global optimal solution. Numerical results demonstrate the good quality of our sentence compression model and the excellent performance of our proposed solution algorithm.
The research is funded by Natural Science Foundation of China (Grant No: 11601327) and by the Key Construction National “985” Program of China (Grant No: WF220426001).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
Punctuation is also deemed as word.
- 2.
\(\left[ \!\left[ m,n\right] \!\right] \) with \(m\le n\) stands for the set of integers between m and n.
- 3.
A function \(f:\mathbb {R}^n\rightarrow \mathbb {R}\) is called DC if there exist two convex functions g and h (called DC components) such that \(f=g-h\).
- 4.
The compression rate is computed by the length of compression over the length of original sentence.
References
Clarke, J., Lapata, M.: Global inference for sentence compression: an integer linear programming approach. J. Artif. Intell. Res. 31, 399–429 (2008)
Gurobi 8.1.0. http://www.gurobi.com
Jing, H.: Sentence reduction for automatic text summarization. In: Proceedings of the 6th Applied Natural Language Processing Conference, pp. 310–315 (2000)
Knight, K., Marcu, D.: Summarization beyond sentence extraction: a probalistic approach to sentence compression. Artif. Intell. 139, 91–107 (2002)
Le Thi, H.A., Pham, D.T., Le Dung, M.: Exact penalty in dc programming. Vietnam J. Math. 27(2), 169–178 (1999)
Le Thi, H.A., Pham, D.T.: A continuous approach for large-scale constrained quadratic zero-one programming. Optimization 45(3), 1–28 (2001)
Le Thi, H.A., Pham, D.T.: The dc (difference of convex functions) programming and dca revisited with dc models of real world nonconvex optimization problems. Ann. Oper. Res. 133, 23–46 (2005)
Le Thi, H.A., Nguyen, Q.T., Nguyen, H.T., et al.: Solving the earliness tardiness scheduling problem by DC programming and DCA. Math. Balk. 23, 271–288 (2009)
Le Thi, H.A., Moeini, M., Pham, D.T.: Portfolio selection under downside risk measures and cardinality constraints based on DC programming and DCA. Comput. Manag. Sci. 6(4), 459–475 (2009)
Le Thi, H.A., Minh, L.H., Pham, D.T., Bouvry, P.: Solving the perceptron problem by deterministic optimization approach based on DC programming and DCA. In: Proceeding in INDIN 2009, Cardiff. IEEE (2009)
Le Thi, H.A., Pham, D.T., Huynh, V.N.: Exact penalty and error bounds in dc programming. J. Glob. Optim. 52(3), 509–535 (2012)
MacDonald, D.: Discriminative sentence compression with soft syntactic constraints. In: Proceedings of EACL, pp. 297–304 (2006)
Niu, Y.S., Pham, D.T.: A DC programming approach for mixed-integer linear programs. In: Modelling, Computation and Optimization in Information Systems and Management Sciences, CCIS, vol. 14, pp. 244–253 (2008)
Niu, Y.S.: Programmation DC & DCA en Optimisation Combinatoire et Optimisation Polynomiale via les Techniques de SDP. Ph.D. thesis, INSA, France (2010)
Niu, Y.S., Pham, D.T.: Efficient DC programming approaches for mixed-integer quadratic convex programs. In: Proceedings of the International Conference on Industrial Engineering and Systems Management (IESM2011), pp. 222–231 (2011)
Niu, Y.S.: On difference-of-SOS and difference-of-convex-SOS decompositions for polynomials (2018). arXiv:1803.09900
Niu, Y.S.: A parallel branch and bound with DC algorithm for mixed integer optimization. In: The 23rd International Symposium in Mathematical Programming (ISMP2018), Bordeaux, France (2018)
Nguyen, H.T., Pham, D.T.: A continuous DC programming approach to the strategic supply chain design problem from qualified partner set. Eur. J. Oper. Res. 183(3), 1001–1012 (2007)
NLTK 3.2.5: The Natural Language Toolkit. http://www.nltk.org
Pham, D.T., Le Thi, H.A., Pham, V.N., Niu, Y.S.: DC programming approaches for discrete portfolio optimization under concave transaction costs. Optim. Lett. 10(2), 261–282 (2016)
Schleich, J., Le Thi, H.A., Bouvry, P.: Solving the minimum m-dominating set problem by a continuous optimization approach based on DC programming and DCA. J. Comb. Optim. 24(4), 397–412 (2012)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Niu, YS., Hu, XW., You, Y., Benammour, F.M., Zhang, H. (2020). Sentence Compression via DC Programming Approach. In: Le Thi, H., Le, H., Pham Dinh, T. (eds) Optimization of Complex Systems: Theory, Models, Algorithms and Applications. WCGO 2019. Advances in Intelligent Systems and Computing, vol 991. Springer, Cham. https://doi.org/10.1007/978-3-030-21803-4_35
Download citation
DOI: https://doi.org/10.1007/978-3-030-21803-4_35
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-21802-7
Online ISBN: 978-3-030-21803-4
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)