
PREDOSE

Published: 01 December 2013

Abstract

Graphical abstract: display omitted.

  • We present a semantic web platform for drug abuse research using social media.
  • Social media texts could be an important resource in identifying new epidemiological trends.
  • Extraction of appropriate semantic information may be beneficial to epidemiological research.

Objectives

The role of social media in biomedical knowledge mining, including clinical, medical and healthcare informatics, prescription drug abuse epidemiology and drug pharmacology, has become increasingly significant in recent years. Social media offers opportunities for people to share opinions and experiences freely in online communities, which may contribute information beyond the knowledge of domain professionals. This paper describes the development of a novel semantic web platform called PREDOSE (PREscription Drug abuse Online Surveillance and Epidemiology), which is designed to facilitate the epidemiologic study of prescription (and related) drug abuse practices using social media. PREDOSE uses web forum posts and domain knowledge, modeled in a manually created Drug Abuse Ontology (DAO, pronounced "dow"), to facilitate the extraction of semantic information from User Generated Content (UGC) through a combination of lexical, pattern-based and semantics-based techniques. In a previous study, PREDOSE was used to obtain the datasets from which new knowledge in drug abuse research was derived. Here, we report on various platform enhancements, including an updated DAO, new components for relationship and triple extraction, and tools for content analysis, trend detection and emerging patterns exploration, which enhance the capabilities of the PREDOSE platform. Given these enhancements, PREDOSE is now better equipped to impact drug abuse research by alleviating traditional labor-intensive content analysis tasks.
Methods

Using custom web crawlers that scrape UGC from publicly available web forums, PREDOSE first automates the collection of web-based social media content for subsequent semantic annotation. The annotation scheme is modeled in the DAO, and includes domain-specific knowledge such as prescription (and related) drugs, methods of preparation, side effects, and routes of administration. The DAO is also used to help recognize three types of data: (1) entities, (2) relationships and (3) triples. PREDOSE then uses a combination of lexical and semantics-based techniques to extract entities and relationships from the scraped content, and a top-down approach for triple extraction that uses patterns expressed in the DAO. In addition, PREDOSE uses publicly available lexicons to identify initial sentiment expressions in text, and then a probabilistic optimization algorithm (from related research) to extract the final sentiment expressions. Together, these techniques enable the capture of fine-grained semantic information, which facilitates search, trend analysis and overall content analysis using social media on prescription drug abuse. Moreover, extracted data are also made available to domain experts for the creation of training and test sets for use in evaluating and refining information extraction techniques.

Results

A recent evaluation of the information extraction techniques applied in the PREDOSE platform indicates 85% precision and 72% recall in entity identification, on a manually created gold standard dataset. In another study, PREDOSE achieved 36% precision in relationship identification and 33% precision in triple extraction, through manual evaluation by domain experts. Given the complexity of the relationship and triple extraction tasks and the abstruse nature of social media texts, we interpret these as favorable initial results.
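
For readers less familiar with the metrics reported above, precision is the fraction of extracted items that are correct, and recall is the fraction of gold standard items that were extracted. The following is a minimal sketch of that computation; the function name and the toy annotations are hypothetical and are not taken from the PREDOSE evaluation.

```python
def precision_recall(predicted, gold):
    """Return (precision, recall) for two collections of annotations."""
    predicted, gold = set(predicted), set(gold)
    true_positives = len(predicted & gold)
    # Guard against empty sets so the ratios are always defined.
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall


# Hypothetical annotations: (post id, entity label) pairs.
gold = {(1, "loperamide"), (1, "withdrawal"), (2, "oxycodone"), (3, "kratom")}
predicted = {(1, "loperamide"), (2, "oxycodone"), (2, "nausea")}

p, r = precision_recall(predicted, gold)
print(f"precision={p:.2f} recall={r:.2f}")
```

In this toy case two of the three predictions are correct and two of the four gold items are found.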
Extracted semantic information is currently in use in an online discovery support system by prescription drug abuse researchers at the Center for Interventions, Treatment and Addictions Research (CITAR) at Wright State University.

Conclusion

A comprehensive platform for entity, relationship, triple and sentiment extraction from such abstruse texts has never been developed for drug abuse research. PREDOSE has already demonstrated the importance of mining social media by providing data from which new findings in drug abuse research were uncovered. Given the recent platform enhancements, including the refined DAO, components for relationship and triple extraction, and tools for content, trend and emerging pattern analysis, it is expected that PREDOSE will play a significant role in advancing drug abuse epidemiology in the future.
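
The lexical entity spotting and top-down, pattern-driven triple extraction described under Methods can be illustrated with a small sketch. This is not the PREDOSE implementation: the ontology fragment, slang terms, relationship triggers and function names below are all hypothetical stand-ins for what the DAO would supply, and the matching is deliberately simplistic.

```python
import re

# Hypothetical ontology fragment: concept -> (class, surface forms incl. slang).
ONTOLOGY = {
    "Loperamide": ("Drug", ["loperamide", "imodium"]),
    "Oxycodone": ("Drug", ["oxycodone", "oxy"]),
    "Withdrawal": ("SideEffect", ["withdrawal", "withdrawals"]),
}

# Hypothetical triggers: phrase -> (predicate, subject class, object class).
RELATIONS = {
    "helps with": ("TREATS", "Drug", "SideEffect"),
    "causes": ("CAUSES", "Drug", "SideEffect"),
}


def find_entities(text):
    """Lexical lookup: return sorted (offset, concept, class) matches."""
    found = []
    for concept, (cls, forms) in ONTOLOGY.items():
        for form in forms:
            pattern = rf"\b{re.escape(form)}\b"
            for m in re.finditer(pattern, text, re.IGNORECASE):
                found.append((m.start(), concept, cls))
    return sorted(found)


def extract_triples(text):
    """Top-down: start from each pattern and look for entities that fit it."""
    entities = find_entities(text)
    triples = []
    for trigger, (predicate, subj_cls, obj_cls) in RELATIONS.items():
        for m in re.finditer(re.escape(trigger), text, re.IGNORECASE):
            # Candidate subjects precede the trigger; objects follow it.
            subjects = [e for e in entities
                        if e[0] < m.start() and e[2] == subj_cls]
            objects = [e for e in entities
                       if e[0] >= m.end() and e[2] == obj_cls]
            if subjects and objects:
                # Take the nearest entity on each side of the trigger.
                triples.append((subjects[-1][1], predicate, objects[0][1]))
    return triples


post = "Imodium really helps with withdrawals after quitting oxy."
print(find_entities(post))
print(extract_triples(post))
```

The class constraints on each pattern are what make the extraction top-down: a trigger phrase yields a triple only when entities of the expected classes appear around it, so "oxy" (a Drug) is never accepted as the object of TREATS in the example post.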

Publisher

Elsevier Science

San Diego, CA, United States

Author Tags

  1. Drug Abuse Ontology
  2. Entity identification
  3. Opioid abuse
  4. Prescription drug abuse
  5. Relationship extraction
  6. Semantic web
  7. Sentiment extraction
  8. Social media
  9. Triple extraction

Qualifiers

  • Research-article

Cited By

  • (2022) Education, Personal Experiences, and Advocacy. Proceedings of the ACM on Human-Computer Interaction 6:CSCW2 (1-28). DOI: 10.1145/3555624. Online publication date: 11-Nov-2022
  • (2022) Machine learning for suicidal ideation identification. Computers in Human Behavior 128:C. DOI: 10.1016/j.chb.2021.107095. Online publication date: 1-Mar-2022
  • (2021) COVID-19 and Mental Health/Substance Use Disorders on Reddit: A Longitudinal Study. Pattern Recognition. ICPR International Workshops and Challenges (20-27). DOI: 10.1007/978-3-030-68790-8_2. Online publication date: 10-Jan-2021
  • (2020) eDarkFind: Unsupervised Multi-view Learning for Sybil Account Detection. Proceedings of The Web Conference 2020 (1955-1965). DOI: 10.1145/3366423.3380263. Online publication date: 20-Apr-2020
  • (2019) Knowledge-aware Assessment of Severity of Suicide Risk for Early Intervention. The World Wide Web Conference (514-525). DOI: 10.1145/3308558.3313698. Online publication date: 13-May-2019
  • (2019) Global trends, local harms. Computational & Mathematical Organization Theory 25:1 (48-59). DOI: 10.1007/s10588-018-09283-0. Online publication date: 1-Mar-2019
  • (2019) Detection and Analysis of Drug Non-compliance in Internet Fora Using Information Retrieval Approaches. Computational Linguistics and Intelligent Text Processing (143-154). DOI: 10.1007/978-3-031-24337-0_11. Online publication date: 7-Apr-2019
  • (2018) "Let Me Tell You About Your Mental Health!" Proceedings of the 27th ACM International Conference on Information and Knowledge Management (753-762). DOI: 10.1145/3269206.3271732. Online publication date: 17-Oct-2018
  • (2018) A Rule-based Approach to Determining Pregnancy Timeframe from Contextual Social Media Postings. Proceedings of the 2018 International Conference on Digital Health (16-20). DOI: 10.1145/3194658.3194679. Online publication date: 23-Apr-2018
  • (2017) Social Media for Opioid Addiction Epidemiology. Proceedings of the 2017 ACM on Conference on Information and Knowledge Management (1259-1267). DOI: 10.1145/3132847.3132857. Online publication date: 6-Nov-2017
