Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Searching workflows with hierarchical views

Published: 01 September 2010 Publication History

Abstract

Workflows are prevalent in diverse applications, which can be scientific experiments, business processes, web services, or recipes. With the dramatically growing number of workflows, there is an increasing need for people to search a workflow repository using keywords and to retrieve the relevant ones. A workflow hierarchy is a three dimensional object containing multiple abstraction views of different granularity on the same workflow. This unique structure poses a new set of challenges compared to keyword search on tree or graph structures typically found in relational or XML data.
In this paper, we define an informative, self-contained and concise search result on workflows to be a projection of a workflow hierarchy on a two dimensional viewing plane inferred from user queries. We then design and develop an efficient keyword search engine for workflows. Experimental evaluation demonstrates the effectiveness of our approach.

References

[1]
GEON. http://www.geongrid.org.
[2]
Kepler. http://kepler-project.org/.
[3]
MOML. http://ptolemy.eecs.berkeley.edu/papers/05/ptIIdesign1-intro/ptIIdesign1-intro.pdf.
[4]
myExperiment. http://www.myexperiment.org/.
[5]
Seek. http://seek.ecoinformatics.org.
[6]
Taverna Project. http://taverna.sourceforge.net/.
[7]
Triana. http://www.trianacode.org/collaborations/index.html.
[8]
WordNet: A Lexical Database for English. http://wordnet.princeton.edu/.
[9]
I. Altintas, C. Berkley, E. Jaeger, M. Jones, B. Ludäscher, and S. Mock. Kepler: An Extensible System for Design and Execution of Scientific Workflows. In SSDBM, 2004.
[10]
Z. Bao, S. C. Boulakia, S. B. Davidson, A. Eyal, and S. Khanna. Differencing Provenance in Scientific Workflows. In ICDE, 2009.
[11]
Z. Bao, T. W. Ling, B. Chen, and J. Lu. Effective XML Keyword Search with Relevance Oriented Ranking. In ICDE, 2009.
[12]
C. Beeri, A. Eyal, S. Kamenkovich, and T. Milo. Querying Business Processes. In VLDB, 2006.
[13]
O. Biton, S. C. Boulakia, S. B. Davidson, and C. S. Hara. Querying and Managing Provenance through User Views in Scientific Workflows. In ICDE, 2008.
[14]
O. Biton, S. Cohen-Boulakia, and S. B. Davidson. Zoom*UserViews: Querying Relevant Provenance in Workflow Systems. In VLDB, 2007.
[15]
A. Chebotko, S. Chang, S. Lu, F. Fotouhi, and P. Yang. Scientific Workflow Provenance Querying with Security Views. In WAIM, 2008.
[16]
I.-M. A. Chen and V. M. Markowitz. Modeling Scientific Experiments with an Object Data Model. In ICDE, 1995.
[17]
S. Cohen, S. C. Boulakia, and S. B. Davidson. Towards a Model of Provenance and User Views in Scientific Workflows. In DILS, 2006.
[18]
E. Deelman, S. Callaghan, E. Field, H. Francoeur, R. Graves, N. Gupta, V. Gupta, T. H. Jordan, C. Kesselman, P. Maechling, J. Mehringer, G. Mehta, D. Okaya, K. Vahi, and L. Zhao. Managing Large-Scale Workflow Execution from Resource Provisioning to Provenance tracking: The CyberShake Example. In e-Science, 2006.
[19]
K. Golenberg, B. Kimelfeld, and Y. Sagiv. Keyword Proximity Search in Complex Data Graphs. In SIGMOD, 2008.
[20]
H. He, H. Wang, J. Yang, and P. Yu. BLINKS: Ranked Keyword Searches on Graphs. In SIGMOD, 2007.
[21]
T. Heinis and G. Alonso. Efficient Lineage Tracking for Scientific Workflows. In SIGMOD, 2008.
[22]
A. K. Joshi and Y. Schabes. Tree-Adjoining Grammars and Lexicalized Grammars. In Tree Automata and Languages, pages 409--432. 1992.
[23]
K. Lee, N. W. Paton, R. Sakellariou, and A. A. A. Fernandes. Utility Driven Adaptive Workflow Execution. In CCGRID, 2009.
[24]
G. Li, V. Muthusamy, H.-A. Jacobsen, and S. Mankovski. Decentralized Execution of Event-Driven Scientific Workflows. In SCW, 2006.
[25]
D. T. Liu and M. J. Franklin. The Design of GridDB: A Data-Centric Overlay for the Scientific Grid. In VLDB, 2004.
[26]
Z. Liu and Y. Chen. Identifying Meaningful Return Information for XML Keyword Search. In SIGMOD, 2007.
[27]
Z. Liu and Y. Chen. Reasoning and Identifying Relevant Matches for XML Keyword Search. In VLDB, 2008.
[28]
Z. Liu and Y. Chen. Return Specification Inference and Result Clustering for Keyword Search on XML. ACM Trans. Database Syst., 35(2), 2010.
[29]
Z. Liu, Q. Shao, and Y. Chen. WISE: Searching Workflow Hierarchies. Technical report, Arizona State University, 2010.
[30]
Y. Luo, X. Lin, W. Wang, and X. Zhou. SPARK: Top-k Keyword Query in Relational Databases. In SIGMOD, 2007.
[31]
C. B. Medeiros, J. de Jesús Pérez Alcázar, L. A. Digiampietri, G. Z. P. Jr., A. Santanchè, R. da Silva Torres, E. R. M. Madeira, and E. Bacarin. WOODSS and the Web: Annotating and Reusing Scientific Workflows. SIGMOD Record, 34(3):18--23, 2005.
[32]
M. A. Nieto-Santisteban, J. Gray, A. S. Szalay, J. Annis, A. R. Thakar, and W. O'Mullane. When Database Systems Meet the Grid. In CIDR, 2005.
[33]
T. M. Oinn, M. Addis, J. Ferris, D. Marvin, M. Senger, R. M. Greenwood, T. Carver, K. Glover, M. R. Pocock, A. Wipat, and P. Li. Taverna: A Tool for the Composition and Enactment of Bioinformatics Workflows. Bioinformatics, 20(17):3045--3054, 2004.
[34]
C. E. Scheidegger, H. T. Vo, D. Koop, J. Freire, and C. T. Silva. Querying and Re-Using Workflows with VisTrails. In SIGMOD, 2008.
[35]
S. Shankar, A. Kini, D. J. DeWitt, and J. F. Naughton. Integrating Databases and Workflow Systems. SIGMOD Record, 34(3):5--11, 2005.
[36]
Q. Shao, P. Sun, and Y. Chen. WISE: A Workflow Information Search Engine. In ICDE, 2009.
[37]
P. Sun, Z. Liu, S. B. Davidson, and Y. Chen. Detecting and Resolving Unsound Workflow Views for Correct Provenance Analysis. In SIGMOD, 2009.
[38]
M. Vrhovnik, H. Schwarz, S. Radeschütz, and B. Mitschang. An Overview of SQL Support in Workflow Products. In ICDE, 2008.
[39]
D. L. Wang, C. S. Zender, and S. F. Jenks. Clustered Workflow Execution of Retargeted Data Analysis Scripts. In CCGRID, 2008.
[40]
Y. Xu and Y. Papakonstantinou. Efficient Keyword Search for Smallest LCAs in XML Databases. In SIGMOD, 2005.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Proceedings of the VLDB Endowment
Proceedings of the VLDB Endowment  Volume 3, Issue 1-2
September 2010
1658 pages
ISSN:2150-8097
  • Editors:
  • Elisa Bertino,
  • Paolo Atzeni,
  • Kian Lee Tan,
  • Yi Chen,
  • Y. C. Tay
Issue’s Table of Contents

Publisher

VLDB Endowment

Publication History

Published: 01 September 2010
Published in PVLDB Volume 3, Issue 1-2

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)7
  • Downloads (Last 6 weeks)1
Reflects downloads up to 30 Aug 2024

Other Metrics

Citations

Cited By

View all
  • (2013)Search and result presentation in scientific workflow repositoriesProceedings of the 25th International Conference on Scientific and Statistical Database Management10.1145/2484838.2484847(1-12)Online publication date: 29-Jul-2013
  • (2012)Exploiting and Maintaining Materialized Views for XML Keyword QueriesACM Transactions on Internet Technology10.1145/2390209.239021212:2(1-27)Online publication date: 1-Dec-2012
  • (2011)Search, adapt, and reuseACM SIGMOD Record10.1145/2034863.203486540:2(6-16)Online publication date: 15-Sep-2011
  • (2011)On provenance and privacyProceedings of the 14th International Conference on Database Theory10.1145/1938551.1938554(3-10)Online publication date: 21-Mar-2011
  • (2011)Generating sound workflow views for correct provenance analysisACM Transactions on Database Systems10.1145/1929934.192994036:1(1-35)Online publication date: 18-Mar-2011

View Options

Get Access

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media