Abstract
In view of the limitation of the forward dictionaries to attend to the needs specific to the language producers, an alternate resource in the form of ‘Reverse Dictionary’ needs to be built. Reverse Dictionary aims to lexicalize the concept in the user’s mind by taking as input a natural language description of the concept and returning word/s that are in semantic correspondence to the description. A critical survey of the existing Reverse Dictionary works is presented in this paper. It is concluded that this problem has been addressed through five categories of approaches, namely, Information Retrieval-based approach, Graph-based approach, Mental dictionary-based approach, Vector Space Model-based approach, and Neural Language Model-based approach. We identify and highlight that the works reported so far do not account for human perceptions in the user input. However, as a NL is a system of perceptions and a Reverse Dictionary deals with natural language input, dealing with perception based information in the user input is important to capture his/her intent. To address the identified research gap, we have considered the concept of Precisiated Natural Language (PNL) based on Zadeh’s paradigm of Computational Theory of Perceptions. We have proposed to incorporate it into the traditional Information Retrieval (IR) architecture in building a Reverse Dictionary. To gain insights for the same, we have reported an experimental analysis of IR system based Wordster Reverse Dictionary.
Similar content being viewed by others
Notes
Reddit: What’s the Word: https://www.reddit.com/r/whatstheword/, Words For That: http://www.wordsforthat.com/.
References
Sierra G. In: Proceedings of the Ninth Euralex International Congress. 2000, p. 223–35.
Calvo H, Méndez O, Moreno-Armendáriz MA. Integrated concept blending with vector space models. Comput Speech Lange. 2016;40:79–96.
Hill F, Cho K, Korhonen A, Bengio Y. Learning to understand phrases by embedding the dictionary. TACL. 2016;4:17–30.
Miller GA. Wordnet: a lexical database for english. Commun ACM. 1995;38(11):39–41.
Schwartz BL, Metcalfe J. Tip-of-the-tongue (tot) states: retrieval, behavior, and experience. Mem Cogn. 2011;39(5):737–49.
Brown R, McNeill D. The “tip of the tongue’’ phenomenon. J Verbal Learn Verbal Behav. 1966;5(4):325–37.
Rohrer JD, Knight WD, Warren JE, Fox NC, Rossor MN, Warren JD. Word-finding difficulty: a clinical analysis of the progressive aphasias. Brain. 2008;131(1):8–38.
Tripathi P, Deshmukh M. Building a database driven reverse medical dictionary. Int J Comput Sci Trends Technol. 2017;5(2):341–5.
Tripathi P, Deshmukh M. Building a scalable database driven reverse medical dictionary using nlp techniques. Adv Comput Sci Technol. 2017;10(5):1221–31.
Jincy AK, Sindhu L. A survey of building a reverse dictionary. Int J Comput Sci Inf Technol. 2014;5(6):7727–8.
Gaya D. A study of building an reverse dictionary. Int J Sci Res Publ. 2015;5(7):5–7.
Tiwari A, Jadhav R. Survey on building a database driven reverse dictionary. Int J Mod Trends Eng Res. 2014;1(6):154–60.
Siddique B, Beg MMS. In: International conference on next generation computing technologies. Springer; 2018. p. 128–39.
Crawford HV, Crawford J. Reverse electronic dictionary using synonyms to expand search capabilities. US Patent 5,649,221. 1997.
Thorat S, Choudhari V. In: Proceedings of the 26th international conference on computational linguistics (COLING 2016). 2016. p. 2797–806.
Shaw R, Datta A, VanderMeer D, Dutta K. Building a scalable database-driven reverse dictionary. IEEE Trans Knowl Data Eng. 2013;25(3):528–40.
Dutoit D, Nugues P. In: Proceedings of the 15th European conference on artificial intelligence. 2002. p. 450–54.
Bilac S, Watanabe W, Hashimoto T, Tokunaga T, Tanaka H. In: Proceedings of the tenth annual meeting of the association for NLP (NLP2004). 2004. p. 556–9.
El-Kahlout ID, Oflazer K. In: Proceedings of the global Wordnet conference (GWC2004). 2004. p. 118–23.
Gurram V, Rathod S, Lushte A, Vaidya P, Alone V, Pawar M. Reverse dictionary. Imp J Interdiscip Res. 2016;2(4):531–3.
Gurram V, Rathod S, Lushte A, Vaidya P, Alone V, Pawar M. Reverse dictionary. Int J Eng Sci Comput. 2016;6(3):2727–30.
Shete PS, Patil G. In: 2018 fourth international conference on computing communication control and automation (ICCUBEA). IEEE; 2018. p. 1–9.
Rajan S, Soumya KR. In: 2016 IEEE international conference on engineering and technology (ICETECH). IEEE; 2016. p. 448–51.
Arivazhagan P, Thairiyalakshmi S. Intelligent database driven reverse dictionary. Int J Innov Res Comput Commun Eng. 2014;2(5):4027–33.
Thanigaivel R, Kumar S. Intelligent database driven reverse dictionary. IOSR J Comput Eng. 2014;16(2):40–4.
Praveena RC, Manjula A. Improving the reverse dictionary using scalable dataset. Int J Eng Comput Sci. 2016;5(11):18827–31.
Racha N, Balghare V, Khan S, Bhardwaj S. A proposed system for a scalable database driven reverse dictionary. Int J Adv Res Comput Sci Manage Stud. 2015;3(2):355–9.
K PH, Muthamilselvan T. Building a ranking based reverse dictionary. Int J Innov Res Comput Commun Eng. 2014;2(4):3831–5.
Mangalakshmi P, Marie A, Jayanthi M. International journal of advanced research in computer and communication engineering. Int J Innov Res Comput Commun Eng. 2014;3(1):5141–3.
Nanaware P, Ambekar RK. Building and improving scalable database driven reverse dictionary. Int J Adv Res Comput Sci Softw Eng. 2014;4(7):1014–7.
Reyes-Magana J, Bel-Enguix G, Gómez-Adorno H, Sierra G. A lexical search model based on word association norms. J Intell Fuzzy Syst. 2019;36(5):4587–97.
Reyes-Magaña J, Bel-Enguix G, Sierra G, Gómez-Adorno H. In: Proceedings of electronic lexicography in the 21st century conference. 2019. p. 865–80.
Dutoit D. In: Proceedings of the 14th conference on computational linguistics. Association for Computational Linguistics. 1992. p. 982–7.
Kiss GR, Armstrong C, Milroy R, Piper J. An associative thesaurus of english and its computer analysis. The computer and literary studies. 1973. p. 153–65.
Nelson DL, McEvoy CL, Schreiber TA. The University of South Florida free association, rhyme, and word fragment norms. Behav Res Methods Instrum Comput. 2004;36(3):402–7.
Freeman LC. A set of measures of centrality based on betweenness. Sociometry. 1977;35–41.
Page L, Brin S, Motwani R, Winograd T. The PageRank Citation Ranking: Bringing Order to the Web. Stanford InfoLab: Tech. rep; 1999.
Collins AM, Loftus EF. A spreading-activation theory of semantic processing. Psychol Rev. 1975;82(6):407–28.
Zock M, Bilac S. In: Proceedings of the workshop on enhancing and using electronic dictionaries (Association for Computational Linguistics). 2004. p. 29–35.
Zock M, Schwab D. In: Proceedings of the workshop on cognitive aspects of the Lexicon (Association for Computational Linguistics). 2008. p. 9–17.
Turney PD, Pantel P. From frequency to meaning: vector space models of semantics. J Artif Intell Res. 2010;37:141–88.
Méndez O, Calvo H, Moreno-Armendáriz MA. In: Mexican international conference on artificial intelligence. Springer. 2013. p. 275–85.
Socher R, Manning CD, Ng AY. In: Proceedings of the NIPS-2010 deep learning and unsupervised feature learning workshop. 2010. p. 1–9.
Cho K, van Merrienboer B, Gulcehre C, Bahdanauu D, Bougares F, Schwenk H, Bengio Y. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP) (Association for Computational Linguistics). 2014. p. 1724–34.
Mikolov T, Sutskever I, Chen K, Corrado G, Dean J. Distributed representations of words and phrases and their compositionality. In: Advances in neural information processing systems. 2013. p. 3111–9.
Yan H, Li X, Qiu X. BERT for monolingual and cross-lingual reverse dictionary. 2020. arXiv preprint arXiv:2009.14790
Morinaga Y, Yamaguchi K. Improvement of neural reverse dictionary by using cascade forward neural network. J Inf Process. 2020;28:715–23.
Morinaga Y, Yamaguchi K. In: International conference on information and software technologies. Springer. 2018. p. 533–45.
Qi F, Zhang L, Yang Y, Liu Z, Sun M. In: Proceedings of the 2020 conference on empirical methods in natural language processing: system demonstrations. 2020. p. 175–81.
Zheng L, Qi F, Liu Z, Wang Y, Liu Q, Sun M. In: Proceedings of the AAAI conference on artificial intelligence, vol. 34. 2020. p. 312–9.
Pilehvar MT. In: Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies, vol. 1 (Long and Short Papers). 2019. p. 2151–6.
Agrawal A, Shanly KA, Vaishnaw K, Singh M. In: 8th ACM IKDD CODS and 26th COMAD. 2021. p. 420.
Malekzadeh A, Gheibi A, Mohades A. PREDICT: PERSIAN REVERSE DICTIONARY. 2021. arXiv preprint arXiv:2105.00309.
Hedderich MA, Yates A, Klakow D, De Melo G. Using multi-sense vector embeddings for reverse dictionaries. 2019. arXiv preprint arXiv:1904.01451.
Mane D, Sunil B, Patil H, Madaswar K, Sadavarte P. WordAlchemy: a transformer-based reverse dictionary. 2022. arXiv preprint arXiv:2204.10181
Chen G, Su J. In: 2021 IEEE 33rd international conference on tools with artificial intelligence (ICTAI). IEEE; 2021. p. 1113–20.
Gadetsky A, Yakubovskiy I, Vetrov D. Conditional generators of words definitions. 2018. arXiv preprint arXiv:1806.10090
Maillard J, Clark S, Yogatama D. Jointly learning sentence embeddings and syntax with unsupervised Tree-LSTMs. Nat Lang Eng. 2019;25(4):433–49.
Bastos A. Learning sentence embeddings using recursive networks. 2018. arXiv preprint arXiv:1805.08353.
Kartsaklis D, Pilehvar MT, Collier N. Mapping text to knowledge graph entities using multi-sense LSTMs. 2018. arXiv preprint arXiv:1808.07724.
Urra F. Measuring enrichment of word embeddings with subword and dictionary information. 2019.
Onelook.com. OneLook Reverse Dictionary. 2009. http://www.onelook.com/
Zadeh LA. From computing with numbers to computing with words: from manipulation of measurements to manipulation of perceptions. Int J Appl Math Comput Sci. 2002;12(3):307–24.
Wang TC, Chang TH. Forecasting the probability of successful knowledge management by consistent fuzzy preference relations. Expert Syst Appl. 2007;32(3):801–13.
Herrera F, Herrera-viedma E, Luis M. A fusion approach for managing multi-granularity linguistic term sets in decision making. Fuzzy Sets Syst. 2000;114(1):43–58.
Zadrozny S, Kacprzyk J. Computing with words for text processing: an approach to the text categorization. Inf Sci. 2006;176(4):415–37.
Luis M, Barranco MJ, Luis GP, Espinilla M. A knowledge based recommender system with multigranular linguistic information. Int J Comput Intell Syst. 2008;1(3):225–36.
Larsen HL, Berzal F, Martin-bautista MJ, Vila Ma. In: IFSA world congress and 20th NAFIPS international conference, 2001. Joint 9th. IEEE; 2001, p. 3088–92.
Kostek B. “Computing with words’’ concept applied musical information retrieval. Electron Notes Theor Comput Sci. 2003;82(4):141–52.
Herrera-viedma E, Lopez-Herrera AG, Luque M, Porcel C. A fuzzy linguistic IRS model based on a 2-tuple fuzzy linguistic approach. Int J Uncertain Fuzziness Knowl-Based Syst. 2007;15(2):225–50.
Herrera-viedma E, López-herrera AG. A model of an information retrieval system with unbalanced fuzzy linguistic information. Int J Intell Syst. 2007;22(11):1197–214.
Zadeh LA. Precisiated natural language (pnl). AI Mag. 2004;25(3):74–74.
Zadeh LA. Precisiated natural language (PNL). AI Mag. 2004;25(3):74–92.
Zadeh LA. In: Capturing Intelligence, vol. 1. Elsevier; 2006. p. 163–210.
Siddique B, Beg MS. In: 2021 10th international conference on internet of everything, microwave engineering, communication and networks (IEMECON). IEEE; 2021. p. 1–5.
theidioms.com, The Idioms. https://www.theidioms.com/
Howe D. Free Online Dictionary of Computing. http://foldoc.org/
Zhibiao W, Palmer M. In: Proceedings of the 32nd annual meeting on Association for Computational Linguistics. 1994, p. 133–8.
Leacock C, Chodorow M. Combining local context and wordnet sense similarity for word sense identification. wordnet, an electronic lexical database. The MIT Press; 1998.
Lin D. In: Proceedings of the fifteenth international conference on machine learning (ICML 1998). Madison, Wisconsin, USA, July 24-27, 1998 (1998), p. 296–304.
Resnik P. In: Proceedings of the fourteenth international joint conference on artificial intelligence, IJCAI 95, Montréal Québec, Canada, August 20-25 1995, 2 Volumes (1995), p. 448–53.
Hirst G, St-onge D. Lexical chains as representations of context for the detection and correction of malapropisms. In: Fellbaum, C, editor. WordNet: an electronic lexical database (April); 1995. p. 305–32.
Banerjee S, Pedersen T. In: Proceedings of the third international conference on intelligent text processing and computational linguistics. 2002, p. 136–45.
Rajan S, Soumya KR. Evaluation of scalable database driven reverse dictionary. Proc Technol. 2016;25:272–9.
Siddique B, Beg MS. Adjective phrases in PNL and its application to reverse dictionary. IEEE Access. 2022;10:28385–96.
Funding
The study conducted is non-funded.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that there is no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Siddique, B., Sufyan Beg, M.M. Reverse Dictionary Formation: State of the Art and Future Directions. SN COMPUT. SCI. 4, 168 (2023). https://doi.org/10.1007/s42979-022-01495-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s42979-022-01495-1