research-article

Data modeling in the NoSQL world

Published: 01 January 2020

Abstract

NoSQL systems have gained popularity for many reasons, including the flexibility they provide in organizing data, as they relax the rigidity of the relational model and of other structured models. This flexibility, together with the heterogeneity that has emerged in the area, has led to little use of traditional modeling techniques, in contrast to what has happened with databases for decades.
In this paper, we argue that traditional notions related to data modeling can be useful in this context as well. Specifically, we propose NoAM (NoSQL Abstract Model), a novel abstract data model for NoSQL databases, which exploits the commonalities of various NoSQL systems. We also propose a database design methodology for NoSQL systems based on NoAM, with initial activities that are independent of the specific target system. NoAM is used to specify a system-independent representation of the application data; this intermediate representation can then be implemented in target NoSQL databases, taking into account their specific features. Overall, the methodology aims at supporting scalability, performance, and consistency, as needed by next-generation web applications.

Highlights

NoSQL systems are characterized by high flexibility but also high heterogeneity.
We argue that data modeling notions can be useful in the context of NoSQL databases.
We propose NoAM, an abstract data model for NoSQL systems.
We propose a database design methodology for NoSQL systems, based on NoAM.

Published In

Computer Standards & Interfaces, Volume 67, Issue C
Jan 2020
41 pages

Publisher

Elsevier Science Publishers B. V.

Netherlands

Author Tags

  1. Data models
  2. Database design
  3. NoSQL systems
