In this thesis we examine the problem of modelling data base contents and data placement on devices. This modelling is necessary in analytic data base performance evaluation studies in order to estimate the number of records of a file that have to be retrieved in response to the user(s) requests, as well as the number of blocks of the file containing these records. The cpu, io, and telecommunication costs of the system are directly or indirectly expressed in terms of these quantities.We first show that certain assumptions used for modelling data base contents, data placement on devices and user requests often are not satisfied in actual data base environments, and that they may lead to errors in model predictions. We examine formally implications of non-uniformity and dependencies of attribute values in data base design and data base performance evaluation. Thereafter, we provide more detailed modelling techniques based on multivariate statistical model, and we demonstrate their use in improving data base performance.
Cited By
- Oommen B and Rueda L (2019). A formal analysis of why heuristic functions work, Artificial Intelligence, 164:1-2, (1-22), Online publication date: 1-May-2005.
- Oommen B and Thiyagarajah M Query Result Size Estimation Using a Novel Histogram-like Technique Proceedings of the 1999 International Symposium on Database Engineering & Applications
- Chu P (2019). A Contingency Approach to Estimating Record Selectivities, IEEE Transactions on Software Engineering, 17:6, (544-552), Online publication date: 1-Jun-1991.
- Muthuswamy B and Kerschberg L A detailed statistical model for relational query optimization Proceedings of the 1985 ACM annual conference on The range of computing : mid-80's perspective: mid-80's perspective, (439-448)
- Kriegel H Performance comparison of index structures for multi-key retrieval Proceedings of the 1984 ACM SIGMOD international conference on Management of data, (186-196)
- Piatetsky-Shapiro G and Connell C Accurate estimation of the number of tuples satisfying a condition Proceedings of the 1984 ACM SIGMOD international conference on Management of data, (256-276)
- Kriegel H (1984). Performance comparison of index structures for multi-key retrieval, ACM SIGMOD Record, 14:2, (186-196), Online publication date: 1-Jun-1984.
- Piatetsky-Shapiro G and Connell C (2019). Accurate estimation of the number of tuples satisfying a condition, ACM SIGMOD Record, 14:2, (256-276), Online publication date: 1-Jun-1984.
- Christodoulakis S (1984). Implications of certain assumptions in database performance evauation, ACM Transactions on Database Systems (TODS), 9:2, (163-186), Online publication date: 3-Jun-1984.
- Christodoulakis S Estimating block transfers and join sizes Proceedings of the 1983 ACM SIGMOD international conference on Management of data, (40-54)
- Rowe N Top-down statistical estimation on a database Proceedings of the 1983 ACM SIGMOD international conference on Management of data, (135-145)
- Christodoulakis S (2019). Estimating block transfers and join sizes, ACM SIGMOD Record, 13:4, (40-54), Online publication date: 1-May-1983.
- Rowe N (1983). Top-down statistical estimation on a database, ACM SIGMOD Record, 13:4, (135-145), Online publication date: 1-May-1983.
- Tsichritzis D and Christodoulakis S Message files Proceedings of the SIGOA conference on Office information systems, (110-112)
- Tsichritzis D and Christodoulakis S (2019). Message files, ACM SIGOA Newsletter, 3:1-2, (110-112), Online publication date: 1-Jun-1982.
Recommendations
Auditing large scale data bases
VLDB '77: Proceedings of the third international conference on Very large data bases - Volume 31. Data bases and data base auditing are briefly defined. 2. The roles of external auditors, internal auditors, and corporate managers in data base auditing are explained. 3. The general approach taken by the auditor in his audit of a data base is ...
Estimating Pre-Placement FPGA Interconnection Requirements
VLSID '04: Proceedings of the 17th International Conference on VLSI DesignWith increasing device and design sizes, InterconnectPlanning is fast becoming an important design issue for largeFPGA based designs. The fundamental requirement for interconnectplanning is the ability to estimate the routing requirementsof a given ...