Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Research Directions for Principles of Data Management (Abridged)

Published: 11 May 2017 Publication History
First page of PDF

References

[1]
D. Abadi et al. The Beckman report on database research. Commun. ACM, 59(2):92--99, 2016.
[2]
S. Abiteboul, B. André, and D. Kaplan. Managing your digital life. Commun. ACM, 58(5):32--35, 2015.
[3]
S. Abiteboul, P. Bourhis, and V. Vianu. Comparing workflow specification languages: A matter of views. ACM Trans. Database Syst., 37(2):10, 2012.
[4]
S. Abiteboul et al. Research directions for Principles of Data Management (Dagstuhl perspectives workshop 16151). https://arxiv.org/abs/1701.09007.
[5]
S. Abiteboul et al., editor. Data, Responsibly, volume 16291 of Dagstuhl Seminar Proceedings. Schloss Dagstuhl -- LZI, 2016, forthcoming.
[6]
F. N. Afrati and J. D. Ullman. Optimizing multiway joins in a map-reduce environment. IEEE Trans. Knowl. Data Eng., 23(9):1282--1298, 2011.
[7]
A. Agarwal et al. A reliable effective terascale linear learning system. Journal of Machine Learning Research, 15:1111--1133, 2014.
[8]
R. Agrawal et al. Diversifying search results. In WSDM, pages 5--14. ACM, 2009.
[9]
M. Akdere et al. The case for predictive database systems: Opportunities and challenges. In Conference on Innovative Data Systems Research (CIDR), pages 167--174. www.cidrdb.org, 2011.
[10]
A. Amarilli, P. Bourhis, and P. Senellart. Provenance circuits for trees and treelike instances. In ICALP, pages 56--68. Springer, 2015.
[11]
J. Angwin et al. Machine bias. ProPublica, May 2016.
[12]
M. Aref et al. Design and implementation of the LogicBlox system. In SIGMOD, pages 1371--1382. ACM, 2015.
[13]
M. Arenas, G. Gottlob, and A. Pieris. Expressive languages for querying the semantic web. In PODS, pages 14--26. ACM, 2014.
[14]
M. Arenas et al. Foundations of Data Exchange. Cambridge University Press, 2014.
[15]
M. Arenas et al. Faceted search over RDF-based knowledge graphs. J. Web Sem., 37:55--74, 2016.
[16]
A. Artale et al. A cookbook for temporal conceptual data modelling with description logics. ACM Trans. on Computational Logic, 15(3):25:1--25:50, 2014.
[17]
A. Atserias, M. Grohe, and D. Marx. Size bounds and query plans for relational joins. SIAM J. Comput., 42(4):1737--1767, 2013.
[18]
J.-F. Baget et al. On rules with existential variables: Walking the decidability line. Artificial Intelligence, 175(9--10):1620--1654, 2011.
[19]
S. Barocas and A. D. Selbst. Big data's disparate impact. California Law Review, 104, 2016.
[20]
P. Beame, P. Koutris, and D. Suciu. Communication steps for parallel query processing. In PODS, pages 273--284. ACM, 2013.
[21]
M. Benedikt, W. Fan, and F. Geerts. XPath satisfiability in the presence of DTDs. J. ACM, 55(2), 2008.
[22]
L. Bertossi. Database Repairing and Consistent Query Answering. Morgan&Claypool Publishers, 2011.
[23]
G. J. Bex et al. Inference of concise regular expressions and DTDs. ACM Trans. Database Syst., 35(2), 2010.
[24]
K. Bhattacharya et al. Towards formal analysis of artifact-centric business process models. In BPM, pages 288--304. Springer, 2007.
[25]
M. Bienvenu et al. Ontology-based data access: A study through Disjunctive Datalog, CSP, and MMSNP. ACM Trans. Database Syst., 39(4):33:1--33:44, 2014.
[26]
M. J. Cafarella, D. Suciu, and O. Etzioni. Navigating extracted data with schema discovery. In WebDB, 2007.
[27]
D. Calvanese, G. De Giacomo, and M. Lenzerini. Conjunctive query containment and answering under description logics constraints. ACM Trans. on Computational Logic, 9(3):22.1--22.31, 2008.
[28]
D. Calvanese, G. De Giacomo, and M. Montali. Foundations of data-aware process analysis: a database theory perspective. In PODS, pages 1--12. ACM, 2013.
[29]
D. Calvanese et al. Tractable reasoning and efficient query answering in description logics: The DL-Lite family. J. Autom. Reasoning, 39(3):385--429, 2007.
[30]
S. Cebiric, F. Goasdoué, and I. Manolescu. Query-oriented summarization of RDF graphs. Proc. VLDB Endowment, 8(12):2012--2015, 2015.
[31]
M. Cissé, N. Usunier, T. Artieres, and P. Gallinari. Robust Bloom filters for large multilabel classification tasks. In NIPS, 2013.
[32]
E. F. Codd. Understanding relations (installment #7). FDT - Bulletin of ACM SIGMOD, 7(3):23--28, 1975.
[33]
W. Czerwinski et al. The (almost) complete guide to tree pattern containment. In PODS, pages 117--130. ACM, 2015.
[34]
C. J. Date. Database in Depth -- Relational Theory for Practitioners. O'Reilly, 2005.
[35]
A. Datta, M. C. Tschantz, and A. Datta. Automated experiments on ad privacy settings. PoPETs, 2015(1):92--112, 2015.
[36]
S. B. Davidson and J. Freire. Provenance and scientific workflows: Challenges and opportunities. In SIGMOD, pages 1345--1350. ACM, 2008.
[37]
U. Dayal et al. Data integration flows for business intelligence. In EDBT, pages 1--11. ACM, 2009.
[38]
K. Dembczynski, W. Cheng, and E. Hüllermeier. Bayes optimal multilabel classification via probabilistic classifier chains. In ICML, pages 279--286. Omnipress, 2010.
[39]
D. Deutch and T. Milo. A quest for beauty and wealth (or, business processes for database researchers). In PODS, pages 1--12. ACM, 2011.
[40]
A. Deutsch, R. Hull, and V. Vianu. Automatic verification of database-centric systems. SIGMOD Record, 43(3):5--17, 2014.
[41]
A. Deutsch et al. Automatic verification of data-centric business processes. In ICDT. ACM, 2009.
[42]
M. Drosou and E. Pitoura. DisC diversity: result diversification based on dissimilarity and coverage. Proc. VLDB Endowment, 6(1):13--24, 2012.
[43]
C. Dwork et al. Fairness through awareness. In ITCS, pages 214--226. ACM, 2012.
[44]
T. Eiter, T. Lukasiewicz, and L. Predoiu. Generalized consistent query answering under existential rules. In KR, pages 359--368. AAAI Press, 2016.
[45]
J. Feldman et al. On distributing symmetric streaming computations. In SODA, pages 710--719. SIAM, 2008.
[46]
G. Gottlob, C. Koch, and R. Pichler. Efficient algorithms for processing XPath queries. ACM Trans. Database Syst., 30(2):444--491, 2005.
[47]
G. Gottlob and P. Senellart. Schema mapping discovery from data instances. J. ACM, 57(2), 2010.
[48]
P. J. Haas and J. M. Hellerstein. Ripple joins for online aggregation. In SIGMOD, pages 287--298. ACM, 1999.
[49]
J. M. Hellerstein, P. J. Haas, and H. J. Wang. Online aggregation. In SIGMOD, pages 171--182. ACM, 1997.
[50]
M. Hepp. The web of data for e-commerce: Schema.org and GoodRelations for researchers and practitioners. In ICWE, pages 723--727. Springer, 2015.
[51]
X. Hu and K. Yi. Towards a worst-case i/o-optimal algorithm for acyclic joins. In PODS. ACM, 2016.
[52]
R. Hull and J. Su. NSF Workshop on Data-Centric Workflows, May, 2009. http://dcw2009.cs.ucsb.edu/report.pdf
[53]
T. Imielinski and W. Lipski. Incomplete information in relational databases. J. ACM, 31(4):761--791, 1984.
[54]
K. Jasinska et al. Extreme F-measure maximization using sparse probability estimates. In ICML. JMLR.org, 2016.
[55]
A. K. Jha and D. Suciu. Probabilistic databases with MarkoViews. Proc. VLDB Endowment, 5(11):1160--1171, 2012.
[56]
M. Kaminski and E. V. Kostylev. Beyond well-designed SPARQL. In ICDT, pages 5:1--5:18. Schloss Dagstuhl -- LZI, 2016.
[57]
S. Kandel et al. Enterprise data analysis and visualization: An interview study. IEEE Trans. Vis. Comput. Graph., 18(12):2917--2926, 2012.
[58]
P. Koutris, P. Beame, and D. Suciu. Worst-case optimal algorithms for parallel query processing. In ICDT, pages 8:1--8:18. Schloss Dagstuhl -- LZI, 2016.
[59]
M. Lenzerini. Data integration: a theoretical perspective. In PODS, pages 233--246. ACM, 2002.
[60]
J. Lerman. Big data and its exclusions. Stanford Law Review Online, 66, 2013.
[61]
F. Li, B. Wu, K. Yi, and Z. Zhao. Wander join: Online aggregation via random walks. In International Conference on Management of Data (SIGMOD), pages 615--629. ACM, 2016.
[62]
L. Libkin. Incomplete information: what went wrong and how to fix it. In PODS, pages 1--13. ACM, 2014.
[63]
L. Libkin. SQL's three-valued logic and certain answers. ACM Trans. Database Syst., 41(1):1, 2016.
[64]
R. Liu et al. Business artifact-centric modeling for real-time performance monitoring. In BPM, pages 265--280, 2011.
[65]
M. Marin, R. Hull, and R. Vaculín. Data-centric BPM and the emerging Case Management standard: A short survey. In BPM Workshops, pages 24--30, 2012.
[66]
V. Z. Moffitt et al. Collaborative access control in WebdamLog. In SIGMOD, pages 197--211. ACM, 2015.
[67]
G. D. F. Morales and A. Bifet. SAMOA: Scalable advanced massive online analysis. Journal of Machine Learning Research, 16:149--153, 2015.
[68]
C. Muñoz, M. Smith, and D. Patil. Big data: A report on algorithmic systems, opportunity, and civil rights. Executive Office of the President, The White House, May 2016.
[69]
N. Ngo, M. Ortiz, and M. Simkus. Closed predicates in description logics: Results on combined complexity. In KR, pages 237--246. AAAI Press, 2016.
[70]
H. Q. Ngo et al. Worst-case optimal join algorithms: {extended abstract}. In PODS, pages 37--48. ACM, 2012.
[71]
A. Nigam and N. Caswell. Business Artifacts: An Approach to Operational Specification. IBM Systems Journal, 42(3), 2003.
[72]
Y. Prabhu and M. Varma. FastXML: a fast, accurate and stable tree-classifier for extreme multi-label learning. In KDD, pages 263--272. ACM, 2014.
[73]
M. Riondato et al. The vc-dimension of SQL queries and selectivity estimation through sampling. In ECML/PKDD, pages 661--676. Springer, 2011.
[74]
R. Salakhutdinov and G. E. Hinton. Semantic hashing. Int. Journal of Approximate Reasoning, 50(7):969--978, 2009.
[75]
M. Schleich, D. Olteanu, and R. Ciucanu. Learning linear regression models over factorized joins. In SIGMOD, pages 3--18. ACM, 2016.
[76]
P. G. Selinger et al. Access path selection in a relational database management system. In SIGMOD, pages 23--34. ACM, 1979.
[77]
J. Shin et al. Incremental knowledge base construction using DeepDive. Proc. VLDB Endowment, 8(11):1310--1321, 2015.
[78]
S. Staworko, J. Chomicki, and J. Marcinkowski. Prioritized repairing and consistent query answering in relational databases. Ann. Math. Artif. Intell., 64(2-3):209--246, 2012.
[79]
J. Stoyanovich, S. Abiteboul, and G. Miklau. Data responsibly: Fairness, neutrality and transparency in data analysis. In EDBT, pages 718--719. OpenProceedings.org, 2016.
[80]
D. Suciu and V. Tannen. A query language for NC. In PODS, pages 167--178. ACM, 1994.
[81]
D. Suciu et al. Probabilistic Databases. Morgan&Claypool Publishers, 2011.
[82]
D. Suciu et al. Probabilistic Databases. Synthesis Lectures on Data Management. Morgan & Claypool Publishers, 2011.
[83]
Y. Sun, J. Su, and J. Yang. Universal artifacts. ACM Trans. on Management Information Systems, 7(1), 2016.
[84]
L. Sweeney. Discrimination in online ad delivery. Commun. ACM, 56(5):44--54, 2013.
[85]
B. ten Cate, V. Dalmau, and P. G. Kolaitis. Learning schema mappings. ACM Trans. Database Syst., 38(4):28, 2013.
[86]
L. Valiant. A theory of the learnable. Commun. ACM, 17(11):1134--1142, 1984.
[87]
T. L. Veldhuizen. Triejoin: A simple, worst-case optimal join algorithm. In ICDT, pages 96--106. OpenProceedings.org, 2014.
[88]
K. Weinberger et al. Feature hashing for large scale multitask learning. In ICML, pages 1113--1120. ACM, 2009.
[89]
M. Yannakakis. Algorithms for acyclic database schemes. In VLDB, pages 82--94. IEEE, 1981.

Cited By

View all
  • (2024)Responsible composition and optimization of integration processes under correctness preserving guaranteesInformation Systems10.1016/j.is.2024.102400(102400)Online publication date: Apr-2024
  • (2024)Scientific Workflows Management with Blockchain: A SurveyBlockchain and Smart-Contract Technologies for Innovative Applications10.1007/978-3-031-50028-2_5(131-163)Online publication date: 1-Mar-2024
  • (2023)Evaluation of Economic Development Data Management System Based on Particle Swarm Optimization Algorithm2023 International Conference on Data Science and Network Security (ICDSNS)10.1109/ICDSNS58469.2023.10245900(1-5)Online publication date: 28-Jul-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM SIGMOD Record
ACM SIGMOD Record  Volume 45, Issue 4
December 2016
48 pages
ISSN:0163-5808
DOI:10.1145/3092931
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 May 2017
Published in SIGMOD Volume 45, Issue 4

Check for updates

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)28
  • Downloads (Last 6 weeks)5
Reflects downloads up to 03 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Responsible composition and optimization of integration processes under correctness preserving guaranteesInformation Systems10.1016/j.is.2024.102400(102400)Online publication date: Apr-2024
  • (2024)Scientific Workflows Management with Blockchain: A SurveyBlockchain and Smart-Contract Technologies for Innovative Applications10.1007/978-3-031-50028-2_5(131-163)Online publication date: 1-Mar-2024
  • (2023)Evaluation of Economic Development Data Management System Based on Particle Swarm Optimization Algorithm2023 International Conference on Data Science and Network Security (ICDSNS)10.1109/ICDSNS58469.2023.10245900(1-5)Online publication date: 28-Jul-2023
  • (2023)Polynomial combined first-order rewritings for linear and guarded existential rulesArtificial Intelligence10.1016/j.artint.2023.103936321(103936)Online publication date: Aug-2023
  • (2022)Non-Uniformly Terminating Chase: Size and ComplexityProceedings of the 41st ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems10.1145/3517804.3524146(369-378)Online publication date: 12-Jun-2022
  • (2022)Sampling a Near Neighbor in High Dimensions — Who is the Fairest of Them All?ACM Transactions on Database Systems10.1145/350286747:1(1-40)Online publication date: 6-Apr-2022
  • (2022)Fairness & friends in the data science eraAI & Society10.1007/s00146-022-01472-538:2(721-731)Online publication date: 9-Jun-2022
  • (2021)A survey on semantic schema discoveryThe VLDB Journal10.1007/s00778-021-00717-x31:4(675-710)Online publication date: 27-Nov-2021
  • (2021)Data Management Strategy Based on Edge Computing2021 International Conference on Big Data Analytics for Cyber-Physical System in Smart City10.1007/978-981-16-7466-2_85(761-770)Online publication date: 10-Dec-2021
  • (2020)Fair Near Neighbor Search: Independent Range Sampling in High DimensionsProceedings of the 39th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems10.1145/3375395.3387648(191-204)Online publication date: 14-Jun-2020
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media