research-article

Data is dead... without what-if models

Published: 01 August 2011

Abstract

Current database technology has raised the art of scalable descriptive analytics to a very high level. Unfortunately, what enterprises really need is prescriptive analytics to identify optimal business, policy, investment, and engineering decisions in the face of uncertainty. Such analytics, in turn, rest on deep predictive analytics that go beyond mere statistical forecasting and are imbued with an understanding of the fundamental mechanisms that govern a system's behavior, allowing what-if analyses. The database community needs to put what-if models and data on equal footing, developing systems that use both data and models to make sense of rich, real-world complexity and to support real-world decision-making. This model-and-data orientation requires significant extensions of many database technologies, such as data integration, query optimization and processing, and collaborative analytics. In this paper, we argue that data without what-if modeling may be the database community's past, but data with what-if modeling must be its future.

Published In

Proceedings of the VLDB Endowment, Volume 4, Issue 12
August 2011
303 pages

Publisher

VLDB Endowment

Cited By

  • (2022) A Machine Learning–Enabled Partially Observable Markov Decision Process Framework for Early Sepsis Prediction. INFORMS Journal on Computing 34:4 (2039-2057). DOI: 10.1287/ijoc.2022.1176. Online publication date: 1-Jul-2022
  • (2022) Prescriptive analytics: a survey of emerging trends and technologies. The VLDB Journal — The International Journal on Very Large Data Bases 28:4 (575-595). DOI: 10.1007/s00778-019-00539-y. Online publication date: 10-Mar-2022
  • (2018) A Visual Interaction Framework for Dimensionality Reduction Based Data Exploration. Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems (1-13). DOI: 10.1145/3173574.3174209. Online publication date: 21-Apr-2018
  • (2015) Towards An Info-Symbiotic Decision Support System for Disaster Risk Management. Proceedings of the 19th International Symposium on Distributed Simulation and Real Time Applications (85-91). DOI: 10.1109/DS-RT.2015.26. Online publication date: 14-Oct-2015
  • (2014) γ-DB. Proceedings of the VLDB Endowment 7:11 (959-962). DOI: 10.14778/2732967.2732971. Online publication date: 1-Jul-2014
  • (2014) Towards building wind tunnels for data center design. Proceedings of the VLDB Endowment 7:9 (781-784). DOI: 10.14778/2732939.2732950. Online publication date: 1-May-2014
  • (2014) Model-data Ecosystems. Proceedings of the 33rd ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems (76-87). DOI: 10.1145/2594538.2594562. Online publication date: 18-Jun-2014
