Skip Abstract Section
Abstract
Written by the father of the data warehouse concept, this edition provides a comprehensive introduction to building data marts, operational data stores, Corporate Information Factory, exploration warehouses, and Web-enabled warehouses.
Cited By
- Masmoudi M, Ben Abdallah Ben Lamine S, Karray M, Archimede B and Baazaoui Zghal H (2024). Semantic Data Integration and Querying: A Survey and Challenges, ACM Computing Surveys, 56:8, (1-35), Online publication date: 31-Aug-2024.
- Herschel M, Gienger A, Lauer A, Stein C, Skoury L, Lässig N, Ellwein C, Verl A, Wortmann T and Sauer C Putting Co-Design-Supporting Data Lakes to the Test: An Evaluation on AEC Case Studies Big Data Analytics and Knowledge Discovery, (253-268)
- Siciliani L, Taccardi V, Basile P, Di Ciano M and Lops P (2023). AI-based decision support system for public procurement, Information Systems, 119:C, Online publication date: 1-Oct-2023.
- Whairit T, Phadermrod B and Attasena V (2023). JINDEX, Journal of King Saud University - Computer and Information Sciences, 35:8, Online publication date: 1-Sep-2023.
- Ferro M, Silva E and Fidalgo R (2023). AStar, Data & Knowledge Engineering, 145:C, Online publication date: 1-May-2023.
- Ngo T, Sarramia D, Kang M and Pinet F (2023). A New Approach Based on ELK Stack for the Analysis and Visualisation of Geo-referenced Sensor Data, SN Computer Science, 4:3, Online publication date: 23-Mar-2023.
- Gillet A, Leclercq É and Cullot N Lambda+, the Renewal of the Lambda Architecture: Category Theory to the Rescue Advanced Information Systems Engineering, (381-396)
- Menolli A, Coelho R, Silva G and Barbosa E An Agile Data Warehouse Virtualization Framework for ROLAP Server Proceedings of the XVII Brazilian Symposium on Information Systems, (1-8)
- Kumar T and Kumar A (2021). Materialized View Selection Using Swap Operator Based Particle Swarm Optimization, International Journal of Distributed Artificial Intelligence, 13:1, (58-73), Online publication date: 1-Jan-2021.
- Hamdi W and Faiz S Distributing Data in Real Time Spatial Data Warehouse Algorithms and Architectures for Parallel Processing, (3-13)
- Ngo V and Kechadi M Crop Knowledge Discovery Based on Agricultural Big Data Integration Proceedings of the 4th International Conference on Machine Learning and Soft Computing, (46-50)
- Yoo J Crime data warehousing and crime pattern discovery Proceedings of the Second International Conference on Data Science, E-Learning and Information Systems, (1-6)
- Giebler C, Gröger C, Hoos E, Schwarz H and Mitschang B Modeling Data Lakes with Data Vault: Practical Experiences, Assessment, and Lessons Learned Conceptual Modeling, (63-77)
- Azgomi H and Sohrabi M (2019). A novel coral reefs optimization algorithm for materialized view selection in data warehouse environments, Applied Intelligence, 49:11, (3965-3989), Online publication date: 1-Nov-2019.
- Frazzetto D, Nielsen T, Pedersen T and Šikšnys L (2019). Prescriptive analytics: a survey of emerging trends and technologies, The VLDB Journal — The International Journal on Very Large Data Bases, 28:4, (575-595), Online publication date: 1-Aug-2019.
- Kel’manov A and Khandeev V Fast and Exact Algorithms for Some NP-Hard 2-Clustering Problems in the One-Dimensional Case Analysis of Images, Social Networks and Texts, (377-387)
- Ngo V, Le-Khac N and Kechadi M Designing and Implementing Data Warehouse for Agricultural Big Data Big Data – BigData 2019, (1-17)
- Prakash D and Prakash N (2019). A multifactor approach for elicitation of Information requirements of data warehouses, Requirements Engineering, 24:1, (103-117), Online publication date: 1-Mar-2019.
- Letrache K, Beggar O and Ramdani M (2019). OLAP cube partitioning based on association rules method, Applied Intelligence, 49:2, (420-434), Online publication date: 1-Feb-2019.
- (2019). Empirical investigation of dimension hierarchy sharing-based metrics for multidimensional schema understandability, International Journal of Intelligent Engineering Informatics, 7:2-3, (141-163), Online publication date: 1-Jan-2019.
- Çığşar B, Ünal D and Peña A (2019). Comparison of Data Mining Classification Algorithms Determining the Default Risk, Scientific Programming, 2019, Online publication date: 1-Jan-2019.
- Rufino R, Moreira D and de Freitas Neto F Dengue 360 Proceedings of the Euro American Conference on Telematics and Information Systems, (1-8)
- Letrache K, El Beggar O and Ramdani M Green Data warehouse Design and Exploitation Proceedings of the 12th International Conference on Intelligent Systems: Theories and Applications, (1-6)
- Prakash D Direct Conversion of Early Information to Multi-dimensional Model Database and Expert Systems Applications, (119-126)
- Cuzzocrea A, Moussa R and Vercelli G An Innovative Lambda-Architecture-Based Data Warehouse Maintenance Framework for Effective and Efficient Near-Real-Time OLAP over Big Data Big Data – BigData 2018, (149-165)
- Sinha H (2018). Enhancement of TOPSIS for Evaluating the Web-Sources to Select as External Source for Web-Warehousing, International Journal of Rough Sets and Data Analysis, 5:1, (117-130), Online publication date: 1-Jan-2018.
- Bouadi T, Cordier M, Moreau P, Quiniou R, Salmon-Monviola J and Gascuel-Odoux C (2017). A data warehouse to explore multidimensional simulated data from a spatially distributed agro-hydrological model to improve catchment nitrogen management, Environmental Modelling & Software, 97:C, (229-242), Online publication date: 1-Nov-2017.
- Azabou M, Khrouf K, Feki J, Soulé-Dupuy C and Vallès N (2017). Yet Another Multidimensional Model for XML Documents, International Journal of Strategic Information Technology and Applications, 8:3, (73-90), Online publication date: 1-Jul-2017.
- Bimonte S, Sautot L, Journaux L and Faivre B (2017). Multidimensional Model Design using Data Mining, International Journal of Data Warehousing and Mining, 13:1, (1-35), Online publication date: 1-Jan-2017.
- Sinha H (2017). Enhancement of "Technique for Order Preference by Similarity to Ideal Solution" Approach for Evaluating the Web Sources to Select as External Source for Web Warehousing, International Journal of Natural Computing Research, 6:1, (1-16), Online publication date: 1-Jan-2017.
- Arun B and Kumar T (2017). Materialized View Selection using Artificial Bee Colony Optimization, International Journal of Intelligent Information Technologies, 13:1, (26-49), Online publication date: 1-Jan-2017.
- Toddenroth D, Sivagnanasundaram J, Prokosch H and Ganslandt T (2016). Concept and implementation of a study dashboard module for a continuous monitoring of trial recruitment and documentation, Journal of Biomedical Informatics, 64:C, (222-231), Online publication date: 1-Dec-2016.
- Haarbrandt B, Tute E and Marschollek M (2016). Automated population of an i2b2 clinical data warehouse from an openEHR-based data repository, Journal of Biomedical Informatics, 63:C, (277-294), Online publication date: 1-Oct-2016.
- (2016). A multidimensional data model design for building energy management, Advanced Engineering Informatics, 30:4, (619-632), Online publication date: 1-Oct-2016.
- Kang J, Yu Q, Holden E and Oh T Security Requirements Embedded in MS Programs in Information Sciences and Technologies Proceedings of the 17th Annual Conference on Information Technology Education, (77-82)
- Anurag , Arora D and Kumar U Protecting Sensitive Warehouse Data through UML based Modeling Proceedings of the International Conference on Informatics and Analytics, (1-6)
- Love M, Boisvert C, Uruchurtu E and Ibbotson I Nifty with Data Proceedings of the 2016 ACM Conference on Innovation and Technology in Computer Science Education, (344-349)
- Goede R Listening to the affected Proceedings of the Computer Science Education Research Conference 2016, (12-21)
- Haupt R, Scholtz B and Calitz A Using Business Intelligence to Support Strategic Sustainability Information Management Proceedings of the 2015 Annual Research Conference on South African Institute of Computer Scientists and Information Technologists, (1-11)
- Rahman N and Rutz D (2015). Building Data Warehouses Using Automation, International Journal of Intelligent Information Technologies, 11:2, (1-22), Online publication date: 1-Apr-2015.
- Yao Q, Tian Y, Li P, Tian L, Qian Y and Li J (2015). Design and Development of a Medical Big Data Processing System Based on Hadoop, Journal of Medical Systems, 39:3, (1-11), Online publication date: 1-Mar-2015.
- Chalamalla A, Ilyas I, Ouzzani M and Papotti P Descriptive and prescriptive data cleaning Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, (445-456)
- Viswanathan G and Schneider M (2014). Querying Cardinal Directions between Complex Objects in Data Warehouses, Fundamenta Informaticae, 132:2, (177-202), Online publication date: 1-Apr-2014.
- Truong T, Amblard F, Gaudou B, Sibertin-Blanc C, Truong V, Drogoul A, Huynh H and Le M An implementation of framework of business intelligence for agent-based simulation Proceedings of the 4th Symposium on Information and Communication Technology, (35-44)
- Goller M and Berger S Slowly changing measures Proceedings of the sixteenth international workshop on Data warehousing and OLAP, (47-54)
- Appelgren Lara G, Delgado M and Marín N Fuzzy Multidimensional Modelling for Flexible Querying of Learning Object Repositories Proceedings of the 10th International Conference on Flexible Query Answering Systems - Volume 8132, (112-123)
- Ariyan S and Bertossi L A multidimensional data model with subcategories for flexibly capturing summarizability Proceedings of the 25th International Conference on Scientific and Statistical Database Management, (1-12)
- Maurino A, Venturini C and Viscusi G Coopetitive data warehouse Proceedings of the 25th international conference on Advanced Information Systems Engineering, (482-497)
- Baffoe S, Baarah A and Peyton L Inferring state for real-time monitoring of care processes Proceedings of the 5th International Workshop on Software Engineering in Health Care, (57-63)
- Brighen A, Bellatreche L, Slimani H and Faget Z An Economical Query Cost Model in the Cloud Proceedings of the 18th International Conference on Database Systems for Advanced Applications - Volume 7827, (16-30)
- Mrunalini M, Kumar T and Kanth K (2013). Dynamic process model for identifying modified data using mobile agents in real time ETL processes, ACM SIGSOFT Software Engineering Notes, 38:1, (43-46), Online publication date: 23-Jan-2013.
- Mrunalini M, Kumar T and Kanth K (2013). Dynamic process model for identifying modified data using mobile agents in real time ETL processes, ACM SIGSOFT Software Engineering Notes, 37:6, (1-9), Online publication date: 27-Nov-2012.
- Ayhan S, Pesce J, Comitz P, Gerberick G and Bliesner S Predictive analytics with surveillance big data Proceedings of the 1st ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data, (81-90)
- Maté A, Trujillo J, de Gregorio E and Song I Improving the maintainability of data warehouse designs Proceedings of the fifteenth international workshop on Data warehousing and OLAP, (25-32)
- Amanzougarene F, Chachoua M and Zeitouni K Qualitative representation of building sites annoyance Proceedings of the 2012 ACM workshop on City data management workshop, (13-20)
- Zhang B, Xia X, Huang X, Wang M and Le J Query optimization with value path materialization in column-stored DWMS Proceedings of the 3rd International Conference on Computing for Geospatial Research and Applications, (1-6)
- Martínez A, Galvis-Lista E and Florez L Modeling techniques for extraction transformation and load processes Proceedings of the 6th Euro American Conference on Telematics and Information Systems, (41-47)
- Silva Souza V, Mazón J, Garrigós I, Trujillo J and Mylopoulos J Monitoring strategic goals in data warehouses with awareness requirements Proceedings of the 27th Annual ACM Symposium on Applied Computing, (1075-1082)
- Schütz C, Schrefl M, Neumayr B and Sierninger D Incremental integration of data warehouses Proceedings of the ACM 14th international workshop on Data Warehousing and OLAP, (25-30)
- Saga R, Takamizawa S, Kitami K, Tsuji H and Matsumoto K Comparison analysis for text data by using FACT-graph Proceedings of the 1st international conference on Human interface and the management of information: interacting with information - Volume Part II, (75-83)
- Maté A and Trujillo J A trace metamodel proposal based on the model driven architecture framework for the traceability of user requirements in data warehouses Proceedings of the 23rd international conference on Advanced information systems engineering, (123-137)
- Moya L, Kudama S, Cabo M and Llavori R Integrating web feed opinions into a corporate data warehouse Proceedings of the 2nd International Workshop on Business intelligencE and the WEB, (20-27)
- Trujillo J, Pardillo J and Mazón J (2011). An MDA Approach and QVT Transformations for the Integrated Development of Goal-Oriented Data Warehouses and Data Marts, Journal of Database Management, 22:1, (43-68), Online publication date: 1-Jan-2011.
- Viswanathan G and Schneider M The objects interaction Graticule for cardinal direction querying in moving objects data warehouses Proceedings of the 14th east European conference on Advances in databases and information systems, (520-532)
- Wu D and Håkansson A Applying a knowledge based system for metadata integration for data warehouses Proceedings of the 14th international conference on Knowledge-based and intelligent information and engineering systems: Part IV, (60-69)
- Choenni S and Leertouwer E Public safety mashups to support policy makers Proceedings of the First international conference on Electronic government and the information systems perspective, (234-248)
- Marques E, Miani R, De Almeida Gago E and De Souza Mendes L Development of a business intelligence environment for e-gov using open source technologies Proceedings of the 12th international conference on Data warehousing and knowledge discovery, (203-214)
- Schneider S and Frosch-Wilke D Analysis patterns in dimensional data modeling Proceedings of the Second international conference on Data Engineering and Management, (109-116)
- Kalidien S, Choenni S and Meijer R Crime statistics online Proceedings of the 11th Annual International Digital Government Research Conference on Public Administration Online: Challenges and Opportunities, (131-137)
- Plantevit M, Laurent A, Laurent D, Teisseire M and Choong Y (2010). Mining multidimensional and multilevel sequential patterns, ACM Transactions on Knowledge Discovery from Data, 4:1, (1-37), Online publication date: 1-Jan-2010.
- Pitarch Y, Laurent A and Poncelet P A conceptual model for handling personalized hierarchies in multidimensional databases Proceedings of the International Conference on Management of Emergent Digital EcoSystems, (107-111)
- Zhang J, Wen Q and Zhang H The research in improving the quality of DW data Proceedings of the 5th International Conference on Wireless communications, networking and mobile computing, (5404-5407)
- Fasel D A fuzzy data warehouse approach for the customer performance measurement for a hearing instrument manufacturing company Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 7, (285-289)
- Cohen J, Dolan B, Dunlap M, Hellerstein J and Welton C (2009). MAD skills, Proceedings of the VLDB Endowment, 2:2, (1481-1492), Online publication date: 1-Aug-2009.
- Saga R, Tsuji H and Tabata K Loopo Proceedings of the Symposium on Human Interface 2009 on Human Interface and the Management of Information. Information and Interaction. Part II: Held as part of HCI International 2009, (192-200)
- Zaker M, Phon-Amnuaisuk S and Haw S Optimizing the data warehouse design by hierarchical denormalizing Proceedings of the 8th conference on Applied computer scince, (131-138)
- Zaker M, Phon-Amnuaisuk S and Haw S Investigating design choices between Bitmap index and B-tree index for a large data warehouse system Proceedings of the 8th conference on Applied computer scince, (123-130)
- Liu Y, Hsu P, Sheen G, Ku S and Chang K (2008). Simultaneous determination of view selection and update policy with stochastic query and response time constraints, Information Sciences: an International Journal, 178:18, (3491-3509), Online publication date: 20-Sep-2008.
- Salguero A and Araque F Information system architecture for customizing touristic trips Proceedings of the 2nd conference on European computing conference, (349-354)
- Pardillo J, Mazón J and Trujillo J Model-Driven Metadata for OLAP Cubes from the Conceptual Modelling of Data Warehouses Proceedings of the 10th international conference on Data Warehousing and Knowledge Discovery, (13-22)
- Salguero A, Araque F and Delgado C Information system architecture for customizing touristic trips Proceedings of the 8th conference on Applied informatics and communications, (344-349)
- Zhengcai L, Zhu J, Ben-Lin X, Zhong-Hai Y and Huaiyu W (2008). Index transforms of a symmetrical matrix, Computers & Geosciences, 34:4, (301-309), Online publication date: 1-Apr-2008.
- Malinowski E and Zimányi E (2008). A conceptual model for temporal data warehouses and its transformation to the ER and the object-relational models, Data & Knowledge Engineering, 64:1, (101-133), Online publication date: 1-Jan-2008.
- Chen Y and Hsu P (2007). A grain preservation translation algorithm, Information Sciences: an International Journal, 177:18, (3679-3695), Online publication date: 1-Sep-2007.
- Tilg B, Chimiak-Opoka J, Lenz C and Breu R Towards operationalizing strategic alignment of IT by usage of software engineering methods Proceedings of the 10th international conference on Business information systems, (610-625)
- Velásquez J and Palade V (2007). A Knowledge Base for the maintenance of knowledge extracted from web data, Knowledge-Based Systems, 20:3, (238-248), Online publication date: 1-Apr-2007.
- Lee K, Son J and Kim M (2007). Reducing the cost of accessing relations in incremental view maintenance, Decision Support Systems, 43:2, (512-526), Online publication date: 1-Mar-2007.
- Araque F, Salguero A, Carrasco R and Delgado C Fuzzy integration of web data sources for data warehousing Proceedings of the 11th international conference on Computer aided systems theory, (1208-1215)
- Kumar N, Gangopadhyay A and Karabatis G (2007). Supporting mobile decision making with association rules and multi-layered caching, Decision Support Systems, 43:1, (16-30), Online publication date: 1-Feb-2007.
- Sahama T and Croll P A data warehouse architecture for clinical data warehousing Proceedings of the fifth Australasian symposium on ACSW frontiers - Volume 68, (227-232)
- Klimavicius M Data warehouse development with EPC Proceedings of the 5th WSEAS international conference on Data networks, communications and computers, (1-6)
- Malinowski E and Zimányi E A conceptual solution for representing time in data warehouse dimensions Proceedings of the 3rd Asia-Pacific conference on Conceptual modelling - Volume 53, (45-54)
- Borges V, Nogueira B and Barbosa E A multidimensional data model for the analysis of learning management systems under different perspectives 2016 IEEE Frontiers in Education Conference (FIE), (1-8)
Index Terms
- Building the Data Warehouse
Recommendations
Alliance Rules for Data Warehouse Cleansing
ICSPS '09: Proceedings of the 2009 International Conference on Signal Processing SystemsData Cleansing is an activity performed on the data sets of data warehouse to enhance and maintain the quality and consistency of the data. This paper addresses the problems related with dirty data, entrance of dirty data and detection of dirty data in ...