Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

A neural network for semantic labelling of structured information

Published: 01 April 2020 Publication History
  • Get Citation Alerts
  • Highlights

    A novel approach to perform semantic labelling with neural networks is presented.
    The existing proposals focus on features engineering instead of classification techniques.
    Neural networks handle well the large number of features and allow nonlinearity.
    Experimental results show consistent improvement in every tested scenario.
    Tests with different subsets of features compare their usefulness and impact.

    Abstract

    Intelligent systems rely on rich sources of information to make informed decisions. Using information from external sources requires establishing correspondences between the information and known information classes. This can be achieved with semantic labelling, which assigns known labels to structured information by classifying it according to computed features. The existing proposals have explored different sets of features, without focusing on what classification techniques are used. In this paper we present three contributions: first, insights on architectural issues that arise when using neural networks for semantic labelling; second, a novel implementation of semantic labelling that uses a state-of-the-art neural network classifier which achieves significantly better results than other four traditional classifiers; third, a comparison of the results obtained by the former network when using different subsets of features, comparing textual features to structural ones, and domain-dependent features to domain-independent ones. The experiments were carried away with datasets from three real world sources. Our results show that there is a need to develop more semantic labelling proposals with sophisticated classification techniques and large features catalogues.

    References

    [1]
    D. Ayala, I. Hernández, D. Ruiz, M. Toro, Tapon: A two-phase machine learning approach for semantic labelling, Knowledge-Based Systems 163 (2019) 931–943.
    [2]
    D. Ayala, I. Hernández, D. Ruiz, M. Toro, Tapon-mt: A versatile framework for semantic labelling, Information Systems 83 (2019) 57–68.
    [3]
    X. Chen, S. Jia, Y. Xiang, A review: Knowledge reasoning over knowledge graph, Expert Systems with Applications (2019) 112948.
    [4]
    X. Ding, Y. Zhang, T. Liu, J. Duan, Knowledge-driven event embedding for stock prediction, Proceedings of coling 2016, the 26th international conference on computational linguistics: Technical papers, 2016, pp. 2133–2142.
    [5]
    Foundation, N. S. (2018). NSF Awards API specification. https://www.research.gov/common/webapi/awardapisearch-v1.htm. Accessed: 2018-09-17.
    [6]
    Foundation, T. A. S. (2018). Apache spark. https://spark.apache.org/. Accessed: 2018-09-17.
    [7]
    M. Franco-Salvador, P. Gupta, P. Rosso, R.E. Banchs, Cross-language plagiarism detection over continuous-space-and knowledge graph-based representations of language, Knowledge-based Systems 111 (2016) 87–99.
    [8]
    H. Huang, L. Heck, H. Ji, Leveraging deep neural networks and knowledge graphs for entity disambiguation, 2015, arXiv:1504.07678.
    [9]
    Jisc (2018). Jisc repository. http://jisc.rkbexplorer.com/. Accessed: 2018-09-17.
    [10]
    S.M. Kazemi, D. Poole, Simple embedding for link prediction in knowledge graphs, Advances in neural information processing systems, 2018, pp. 4284–4295.
    [11]
    N. Kushmerick, Regression testing for wrapper maintenance, Aaai/iaai, 1999, pp. 74–79.
    [12]
    N. Kushmerick, Wrapper verification, WWW 3 (2) (2000) 79–94,.
    [13]
    C. Laudy, J.-G. Ganascia, Information fusion in a tv program recommendation system, 2008 11th international conference on information fusion, IEEE, 2008, pp. 1–8.
    [14]
    Y. LeCun, Y. Bengio, G.E. Hinton, Deep learning, Nature 521 (7553) (2015) 436–444,.
    [15]
    J. Leng, P. Jiang, A deep learning approach for relationship extraction from interaction context in social manufacturing paradigm, Knowledge-Based Systems 100 (2016) 188–199.
    [16]
    K. Lerman, S. Minton, C.A. Knoblock, Wrapper maintenance: A machine learning approach, Journal of Artificial Intelligence Research 18 (2003) 149–181,.
    [17]
    G. Limaye, S. Sarawagi, S. Chakrabarti, Annotating and searching web tables using entities, types and relationships, PVLDB 3 (1) (2010) 1338–1347.
    [18]
    R. McCann, B.K. AlShebli, Q. Le, H. Nguyen, L. Vu, A. Doan, Mapping maintenance for data integration systems, Vldb, 2005, pp. 1018–1030.
    [19]
    V. Mulwad, T. Finin, A. Joshi, Semantic message passing for generating linked data from tables, Iswc, 2013, pp. 363–378,.
    [20]
    S. Neumaier, J. Umbrich, J.X. Parreira, A. Polleres, Multi-level semantic labelling of numerical values, International semantic web conference (1), 2016, pp. 428–445.
    [21]
    M. Pham, S. Alse, C.A. Knoblock, P.A. Szekely, Semantic labeling: A domain-independent approach, International semantic web conference (1), 2016, pp. 446–462.
    [22]
    PyTorch (2018). Pytorch. https://pytorch.org/. Accessed: 2018-09-17.
    [23]
    S.K. Ramnandan, A. Mittal, C.A. Knoblock, P.A. Szekely, Assigning semantic labels to data sources, Eswc, 2015, pp. 403–417,.
    [24]
    F. Ren, M.G. Sohrab, Class-indexing-based term weighting for automatic text classification, Information Sciences 236 (2013) 109–125.
    [25]
    D. Ritze, O. Lehmberg, C. Bizer, Matching html tables to dbpedia, Proceedings of the 5th international conference on web intelligence, mining and semantics, ACM, 2015, p. 10.
    [26]
    L.N. Smith, Cyclical learning rates for training neural networks, 2017 IEEE winter conference on applications of computer vision, WACV 2017, santa rosa, ca, usa, march 24–31, 2017, 2017, pp. 464–472,.
    [27]
    A. Tagarelli, Exploring dictionary-based semantic relatedness in labeled tree data, Information Sciences 220 (2013) 244–268.
    [28]
    University, N. (2018). Newcastle university repository. http://newcastle.rkbexplorer.com/. Accessed: 2018-09-17.
    [29]
    P. Venetis, A.Y. Halevy, J. Madhavan, M. Pasca, W. Shen, F. Wu, …., C. Wu, Recovering semantics of tables on the web, PVLDB 4 (9) (2011) 528–538.
    [30]
    I.H. Witten, E. Frank, M.A. Hall, C.J. Pal, Data mining: Practical machine learning tools and techniques, Morgan Kaufmann, 2016.
    [31]
    Q. Wu, P. Wang, C. Shen, A. Dick, A. van den Hengel, Ask me anything: Free-form visual question answering based on knowledge from external sources, Proceedings of the ieee conference on computer vision and pattern recognition, 2016, pp. 4622–4630.
    [32]
    Z. Zhang, Effective and efficient semantic table interpretation using tableminer+, Semantic Web 8 (6) (2017) 921–957.

    Index Terms

    1. A neural network for semantic labelling of structured information
              Index terms have been assigned to the content through auto-classification.

              Recommendations

              Comments

              Information & Contributors

              Information

              Published In

              cover image Expert Systems with Applications: An International Journal
              Expert Systems with Applications: An International Journal  Volume 143, Issue C
              Apr 2020
              425 pages

              Publisher

              Pergamon Press, Inc.

              United States

              Publication History

              Published: 01 April 2020

              Author Tags

              1. Semantic labelling
              2. Information integration
              3. Neural networks

              Qualifiers

              • Research-article

              Contributors

              Other Metrics

              Bibliometrics & Citations

              Bibliometrics

              Article Metrics

              • 0
                Total Citations
              • 0
                Total Downloads
              • Downloads (Last 12 months)0
              • Downloads (Last 6 weeks)0

              Other Metrics

              Citations

              View Options

              View options

              Get Access

              Login options

              Media

              Figures

              Other

              Tables

              Share

              Share

              Share this Publication link

              Share on social media