MASSW: A New Dataset and Benchmark Tasks for AI-Assisted Scientific Workflows

Zhang, Xingjian; Xie, Yutong; Huang, Jin; Ma, Jinge; Pan, Zhaoying; Liu, Qijia; Xiong, Ziyang; Ergen, Tolga; Shim, Dongsub; Lee, Honglak; Mei, Qiaozhu

Computer Science > Computation and Language

arXiv:2406.06357 (cs)

[Submitted on 10 Jun 2024]

Title:MASSW: A New Dataset and Benchmark Tasks for AI-Assisted Scientific Workflows

Authors:Xingjian Zhang, Yutong Xie, Jin Huang, Jinge Ma, Zhaoying Pan, Qijia Liu, Ziyang Xiong, Tolga Ergen, Dongsub Shim, Honglak Lee, Qiaozhu Mei

View PDF HTML (experimental)

Abstract:Scientific innovation relies on detailed workflows, which include critical steps such as analyzing literature, generating ideas, validating these ideas, interpreting results, and inspiring follow-up research. However, scientific publications that document these workflows are extensive and unstructured. This makes it difficult for both human researchers and AI systems to effectively navigate and explore the space of scientific innovation. To address this issue, we introduce MASSW, a comprehensive text dataset on Multi-Aspect Summarization of Scientific Workflows. MASSW includes more than 152,000 peer-reviewed publications from 17 leading computer science conferences spanning the past 50 years. Using Large Language Models (LLMs), we automatically extract five core aspects from these publications -- context, key idea, method, outcome, and projected impact -- which correspond to five key steps in the research workflow. These structured summaries facilitate a variety of downstream tasks and analyses. The quality of the LLM-extracted summaries is validated by comparing them with human annotations. We demonstrate the utility of MASSW through multiple novel machine-learning tasks that can be benchmarked using this new dataset, which make various types of predictions and recommendations along the scientific workflow. MASSW holds significant potential for researchers to create and benchmark new AI methods for optimizing scientific workflows and fostering scientific innovation in the field. Our dataset is openly available at \url{this https URL}.

Comments:	arXiv admin note: text overlap with arXiv:1706.03762 by other authors
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2406.06357 [cs.CL]
	(or arXiv:2406.06357v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2406.06357

Submission history

From: Xingjian Zhang [view email]
[v1] Mon, 10 Jun 2024 15:19:09 UTC (3,152 KB)

Computer Science > Computation and Language

Title:MASSW: A New Dataset and Benchmark Tasks for AI-Assisted Scientific Workflows

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:MASSW: A New Dataset and Benchmark Tasks for AI-Assisted Scientific Workflows

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators