About: Online aggregation

An Entity of Type: topical concept, from Named Graph: http://dbpedia.org, within Data Space: dbpedia.org

Online aggregation is a technique for improving the interactive behavior of database systems processing expensive analytical queries. Almost all database operations are performed in batch mode, i.e. the user issues a query and waits till the database has finished processing the entire query. On the contrary, using online aggregation, the user gets estimates of an aggregate query in an online fashion as soon as the query is issued. For example, if the final answer is 1000, after k seconds, the user gets the estimates in form of a confidence interval like [990, 1020] with 95% probability. This confidence keeps on shrinking as the system gets more and more samples.

Property	Value
dbo:abstract	Online aggregation is a technique for improving the interactive behavior of database systems processing expensive analytical queries. Almost all database operations are performed in batch mode, i.e. the user issues a query and waits till the database has finished processing the entire query. On the contrary, using online aggregation, the user gets estimates of an aggregate query in an online fashion as soon as the query is issued. For example, if the final answer is 1000, after k seconds, the user gets the estimates in form of a confidence interval like [990, 1020] with 95% probability. This confidence keeps on shrinking as the system gets more and more samples. Online aggregation was proposed in 1997 by Hellerstein, Haas and Wang for group-by aggregation queries over a single table. Later, the authors showed how to evaluate joins in an online fashion. In 2007, Jermaine et al. designed and implemented a prototype database system called Database-Online (or DBO) that computes group-by aggregate query over multiple tables in an online and more importantly in a scalable fashion. All the approaches for online aggregation use random sampling, which is non-trivial in a distributed environment due to inspection paradox of renewal reward theory. In 2011, Pansare et al. proposed a Bayesian model to deal with the inspection paradox and implemented online aggregation for a MapReduce-like environment. (en)
dbo:wikiPageID	33262148 (xsd:integer)
dbo:wikiPageLength	2876 (xsd:nonNegativeInteger)
dbo:wikiPageRevisionID	1046593468 (xsd:integer)
dbo:wikiPageWikiLink	dbr:Bayesian_probability dbc:Database_theory dbr:MapReduce dbr:Aggregate_function dbr:Database dbr:Renewal_theory dbr:Sampling_(statistics) dbr:Database_systems
dbp:wikiPageUsesTemplate	dbt:Reflist dbt:Database-stub
dcterms:subject	dbc:Database_theory
gold:hypernym	dbr:Technique
rdf:type	dbo:TopicalConcept
rdfs:comment	Online aggregation is a technique for improving the interactive behavior of database systems processing expensive analytical queries. Almost all database operations are performed in batch mode, i.e. the user issues a query and waits till the database has finished processing the entire query. On the contrary, using online aggregation, the user gets estimates of an aggregate query in an online fashion as soon as the query is issued. For example, if the final answer is 1000, after k seconds, the user gets the estimates in form of a confidence interval like [990, 1020] with 95% probability. This confidence keeps on shrinking as the system gets more and more samples. (en)
rdfs:label	Online aggregation (en)
owl:sameAs	freebase:Online aggregation wikidata:Online aggregation https://global.dbpedia.org/id/4sm89
prov:wasDerivedFrom	wikipedia-en:Online_aggregation?oldid=1046593468&ns=0
foaf:isPrimaryTopicOf	wikipedia-en:Online_aggregation
is dbo:wikiPageRedirects of	dbr:Online_Aggregation
is dbo:wikiPageWikiLink of	dbr:Joseph_M._Hellerstein dbr:Online_Aggregation
is foaf:primaryTopic of	wikipedia-en:Online_aggregation

This content was extracted from Wikipedia and is licensed under the Creative Commons Attribution-ShareAlike 3.0 Unported License