Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.3115/1118637.1118644dlproceedingsArticle/Chapter ViewAbstractPublication PagessemiticConference Proceedingsconference-collections
Article
Free access

QARAB: a question answering system to support the Arabic language

Published: 11 July 2002 Publication History

Abstract

We describe the design and implementation of a question answering (QA) system called QARAB. It is a system that takes natural language questions expressed in the Arabic language and attempts to provide short answers. The system's primary source of knowledge is a collection of Arabic newspaper text extracted from Al-Raya, a newspaper published in Qatar. During the last few years the information retrieval community has attacked this problem for English using standard IR techniques with only mediocre success. We are tackling this problem for Arabic using traditional Information Retrieval (IR) techniques coupled with a sophisticated Natural Language Processing (NLP) approach. To identify the answer, we adopt a keyword matching strategy along with matching simple structures extracted from both the question and the candidate documents selected by the IR system. To achieve this goal, we use an existing tagger to identify proper names and other crucial lexical items and build lexical entries for them on the fly. We also carry out an analysis of Arabic question forms and attempt a better understanding of what kinds of answers users find satisfactory. The paucity of studies of real users has limited results in earlier research.

References

[1]
Abuleil, S., and Evens, M., 1998. "Discovering Lexical Information by Tagging Arabic Newspaper Text", Workshop on Semantic Language Processing. COLING-ACL '98, University of Montreal, Montreal, PQ, Canada, Aug. 16 1998, pp. 1--7.
[2]
Al-Daimi, K., and Abdel-Amir, M. 1994. "The Syntactic Analysis of Arabic by Machine". Computers and Humanities, Vol. 28, No. 1, pp. 29--37.
[3]
Allan, J., Callan, J., Feng, F-F., and Malin D. 1999. "INQUERY and TREC-8". Proceedings of the 8th Text REtrieval Conference (TREC-8), NIST Special Publications 500--246, pp. 637--645.
[4]
Ask Jeeves. 1996. www.ask.com Site last visited in March 2001.
[5]
Breck, E., Burger, J., Ferro, L., House, D., Light, M., and Mani, I. 1999. "A Sys Called Qanda". Proceedings of the 8th Text REtrieval Conference, NIST Special Publications, pp. 499--507.
[6]
Budzik, J. and Hammond, K. 1999. "Q&A: A System for the Capture, Organization and Reuse of Expertise". Proceedings of the Sixty-second Annual Meeting of the American Society for Information Science. Information Today, Inc., Medford, NJ. Available on the Web at http://dent.infolab.nwu.edu/infolab/downloads/papers/paper10061.pdf. Site last visited in August 2001.
[7]
Burke, R., Hammond, K., Kulyukin, V., Lytinen, S., Tomuro, N., and Schoenberg, S. 1997. "Question Answering from Frequently-Asked Question Files: Experiences with the FAQ Finder System". AI Magazine, Vol. 18, No.2, pp. 57--66.
[8]
Cardie, C., Ng, V., Pierce, D., and Buckley, C. 2000. "Examining the Role of Statistical and Linguistic Knowledge Sources in a General-Knowledge Question-Answering System". Proceedings of the Sixth Applied Natural Language Processing Conference, pp. 180--187.
[9]
Cormack, G., Clarke, C., and Kisman, D. 1999. "Fast Automatic Passage Ranking (MultiText Experiments for TREC-8)". Proceedings of the 8th Text REtrieval Conference (TREC-8), NIST Special Publications 500--246, pp. 735--743.
[10]
Ferret, O., Grau, B., Illouz, G., Jacquemin, C., and Masson, N. 1999. "QALC - the Question-Answering Program of the Language and Cognition Group at LIMSI-CNRS". Proceedings of the 8th Text REtrieval Conference, NIST Special Publications, pp. 465--475.
[11]
Harabagiu, S., Pasca, M., and Maiorano, S. 2000. "Experiments with Open-Domain Textual Question Answering". Proceedings of 18th International Conference on Computational Linguistics (COLING-2000), Saarbrucken, Germany, pp. 292--298
[12]
Hull, D. 1999. "Xerox TREC-8 Question Answering Track Report". Proceedings of the 8thText REtrieval Conference (TREC-8), NIST Special Publications 500--246, pp. 743--751.
[13]
Humphreys, K., Gaizauskas, R., Hepple, M., and Sanderson, M. 1999. "University of Sheffield TREC-8 Q & A System". Proceedings of the 8th Text REtrieval Conference (TREC-8), NIST Special Publications 500--246, pp. 707--717.
[14]
Jacobs, P., and Rau, L. 1990. "SCISOR: Extracting Information from On-line News". Communications of the ACM, Vol. 33, No.11, pp. 88--97.
[15]
Katz, B. 1997. "From Sentence Processing to Information Access on the World Wide Web". Proceedings of the American Association for Artificial Intelligence Conference, Spring Symposium, NLP for WWW, pp. 77--86.
[16]
Khoja, S. 1999. "Stemming Arabic Text". Available on the Web at: http://www.comp.lancs.ac.uk/computing/users/khoja/stemmer.ps. Site last visited in March 2001.
[17]
Kupiec, J. 1993. "MURAX: A Robust Linguistic Approach for Question Answering Using an On-line Encyclopedia". Proceedings of the 16th Annual Int. ACM SIGIR Conference, pp. 181--190.
[18]
Lehnert, W. 1978. The Process of Question Answering. Lawrence Erlbaum Associates, Hillsdale, NJ.
[19]
Lin, C-J, and Chen, H-H. 1999. "Description of Preliminary Results to TREC-8 QA Task". Proceedings of the 8th Text REtrieval Conference(TERC-8), NIST Special Publications 500--246, pp. 507--513.
[20]
Litkowski, K. 1999. "Question-Answering Using Semantic Relation Triples". Proceedings of the 8th Text REtrieval Conference (TREC-8), NIST Special Publications 500--248, pp. 349--357
[21]
Lundquist, C., Grossman, D., and Frieder, O. 1999. "Improving Relevance Feedback in the Vector Space Model". Proceedings of 6th ACM Annual Conference on Information and Knowledge Management (CIKM), pp. 16--23.
[22]
Moldovan, D., Harabagiu, S., Pasca, M., Mihalcea, R., Girju, R., Goodrum, R., and Rus, V. 2000. "The Structure and Performance of an Open-Domain Question-Answering System". Proceedings of the 38th Annual Meeting of the Association for Computational Linguistics, pp. 563--570.
[23]
Oard, D., Wang, J., Lin, D., and Soboroff, I. 1999. "TREC-8 Experiments at Maryland: CLIR, QA and Routing". Proceedings of the 8th Text REtrieval Conference (TERC-8), NIST Special Publications 500--246, pp. 623--637.
[24]
Ogden, B., Cowie, J., Ludovik, E. Molina-Salgado, H., Nirenburg, S., Sharples, N., and Sheremtyeva, S. 1999. "CRL's TREC-8 Systems Cross-Lingual IR, and Q&A". Proceedings of the 8th Text REtrieval Conference (TERC-8), NIST Special Publications 500--246, pp. 513--523.
[25]
Salton, G. 1971. The SMART Retrieval System Experiments in Automatic Document Processing. Prentice Hall Inc., Englewood Cliffs, NJ.
[26]
Schank, R., and Abelson, R. 1977. Scripts, Plans, Goals, and Understanding. Lawrence Erlbaum Associates, Hillsdale, NJ.
[27]
Shin, D-H, Kim, Y-H, Kim, S., Eom, J-H, Shin, H-J, and Zhang B-T. 1999. "SCAI TREC-8 Experiments". Proceedings of the 8th Text REtrieval Conference (TREC-8), NIST Special Publications 500--246, pp. 583--591.
[28]
Singhal, A., Abney, S., Bacchiani, M., Collins, M., Hindle, D., and Pereira, F. 1999. "AT&T at TREC-8". Proceedings of the 8th Text REtrieval Conference, NIST Special Publications, pp. 317--331.
[29]
Srihari, R., and Li, W. 1999. "Information Extraction Supported Question Answering". Proceedings of the 8th Text REtrieval Conference (TREC-8), NIST Special Publications 500--246, pp. 185--197.
[30]
Takaki, T. 1999. "NTT DATA: Overview of System Approach at TREC-8 ad-hoc and Question Answering". Proceedings of the 8th Text REtrieval Conference (TREC-8), NIST Special Publications 500--246, pp. 523--531.
[31]
TREC-8. 1999. NIST Special Publication 500--246: The Eighth Text REtrieval Conference. Available on the Web at: http://trec.nist.gov/pubs/trec8/t8_proceedings.html. Site last visited in August 2001.
[32]
TREC-9. 2000. NIST Special Publication: The Ninth Text REtrieval Conference. Available on the Web at: http://trec.nist.gov/pubs/trec9/t9_proceedings.html. Site last visited in August 2001.
[33]
Vicedo, J., and Ferrández, A. 2000. "Importance of Pronominal Anaphora Resolution in Question- Answering System". Proceedings of the 38th Annual Meeting of the Association for Computational Linguistics, pp. 555--562.
[34]
Voorhees, E., and Tice, D. 1999. "The TREC-8 Question Answering Track Evaluation". Proceedings of the 8th Text REtrieval Conference (TREC-8), NIST Special Publication 500--246, pp. 83--106.
[35]
Voorhees, E., and Tice, D. 2000. "Building a Question Answering Test Collection". Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Athens, Greece, pp. 200--207.
[36]
Winograd, T. 1972. Understanding Natural Language. Academic Press, New York, NY.
[37]
Woods, W., Kaplan, R., and Webber, B. 1972. "The Lunar Sciences Natural Language Information System: Final Report". Bolt Beranek and Newman Inc. (BBN), Report No. 2378.

Cited By

View all
  • (2023)So2al-wa-Gwab: A New Arabic Question-Answering Dataset Trained on Answer Extraction ModelsACM Transactions on Asian and Low-Resource Language Information Processing10.1145/360555022:8(1-21)Online publication date: 24-Aug-2023
  • (2022)Arabic Medical Community Question Answering Using ON-LSTM and CNNProceedings of the 2022 14th International Conference on Machine Learning and Computing10.1145/3529836.3529913(298-307)Online publication date: 18-Feb-2022
  • (2018)A proposed system to increase work readiness of fresh IT graduates at interviewsProceedings of the 30th Australian Conference on Computer-Human Interaction10.1145/3292147.3292182(443-447)Online publication date: 4-Dec-2018
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
SEMITIC '02: Proceedings of the ACL-02 workshop on Computational approaches to semitic languages
July 2002
85 pages

Publisher

Association for Computational Linguistics

United States

Publication History

Published: 11 July 2002

Qualifiers

  • Article

Acceptance Rates

Overall Acceptance Rate 12 of 21 submissions, 57%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)75
  • Downloads (Last 6 weeks)17
Reflects downloads up to 13 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2023)So2al-wa-Gwab: A New Arabic Question-Answering Dataset Trained on Answer Extraction ModelsACM Transactions on Asian and Low-Resource Language Information Processing10.1145/360555022:8(1-21)Online publication date: 24-Aug-2023
  • (2022)Arabic Medical Community Question Answering Using ON-LSTM and CNNProceedings of the 2022 14th International Conference on Machine Learning and Computing10.1145/3529836.3529913(298-307)Online publication date: 18-Feb-2022
  • (2018)A proposed system to increase work readiness of fresh IT graduates at interviewsProceedings of the 30th Australian Conference on Computer-Human Interaction10.1145/3292147.3292182(443-447)Online publication date: 4-Dec-2018
  • (2017)Question answering systemsInternational Journal of Artificial Intelligence and Soft Computing10.1504/IJAISC.2017.0842166:1(24-42)Online publication date: 1-Jan-2017
  • (2016)Text StemmingACM Computing Surveys10.1145/297560849:3(1-46)Online publication date: 16-Sep-2016
  • (2016)Answering Arabic Why-QuestionsACM Transactions on Information Systems10.1145/295004935:1(1-19)Online publication date: 7-Sep-2016
  • (2014)Integrated Question Classification based on Rules and Pattern MatchingProceedings of the 2014 International Conference on Information and Communication Technology for Competitive Strategies10.1145/2677855.2677894(1-7)Online publication date: 14-Nov-2014
  • (2013)Ontology-Based Question Analysis MethodProceedings of the 10th International Conference on Flexible Query Answering Systems - Volume 813210.1007/978-3-642-40769-7_9(100-111)Online publication date: 18-Sep-2013
  • (2012)Arabic rhetorical relations extraction for answering "why" and "how to" questionsProceedings of the 17th international conference on Applications of Natural Language Processing and Information Systems10.1007/978-3-642-31178-9_52(385-390)Online publication date: 26-Jun-2012
  • (2011)On the improvement of passage retrieval in Arabic question/answering (Q/A) systemsProceedings of the 16th international conference on Natural language processing and information systems10.5555/2026011.2026065(336-341)Online publication date: 28-Jun-2011
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media