Software notations and tools

Applied Filters

People

Publications

Publication Date

Searched The ACM Guide to Computing Literature (3,846,486 records)|Limit your search to The ACM Full-Text Collection (775,921 records)

Showing 1 - 20of35 Results

Filters

Select All

Export Citations Save to Binder

per page:

Recency

research-article
December 2024
A graph pattern mining framework for large graphs on GPU: A Graph Pattern Mining...
The VLDB Journal — The International Journal on Very Large Data Bases (VLDB), Volume 34, Issue 1https://doi.org/10.1007/s00778-024-00883-8
Abstract
Graph pattern mining (GPM) is an important problem in graph processing. There are many parallel frameworks for GPM, many of which suffer from low performance. GPU is a powerful option for accelerating graph processing, but parallel GPM algorithms ...
0
Metrics
Total Citations0
research-article
December 2024
A powerful reducing framework for accelerating set intersections over graphs
The VLDB Journal — The International Journal on Very Large Data Bases (VLDB), Volume 34, Issue 1https://doi.org/10.1007/s00778-024-00881-w
Abstract
Given two sets of vertices $S_{a}$ and $S_{b}$ of a graph, computing their common vertices, namely set intersection, is one primitive operation in many graph algorithms such as triangle counting, maximal clique enumeration, and subgraph matching. Therefore, ...
0
Metrics
Total Citations0
research-article
June 2024
Performant almost-latch-free data structures using epoch protection in more depth
The VLDB Journal — The International Journal on Very Large Data Bases (VLDB), Volume 33, Issue 6Pages 1793–1812https://doi.org/10.1007/s00778-024-00859-8
Abstract
Multi-core scalability presents a major implementation challenge for data system designers today. Traditional methods such as latching no longer scale in today’s highly parallel architectures. While the designer can make use of techniques such as ...
0
Metrics
Total Citations0
research-article
June 2024
Parallelization of butterfly counting on hierarchical memory
The VLDB Journal — The International Journal on Very Large Data Bases (VLDB), Volume 33, Issue 5Pages 1453–1484https://doi.org/10.1007/s00778-024-00856-x
Abstract
Butterfly (a cyclic graph motif) counting is a fundamental task with many applications in graph analysis, which aims at computing the number of butterflies in a large graph. With the rapid growth of graph data, it is more and more challenging to ... $\frac{^{}}{}$ $\frac{^{}}{} \frac{}{\sqrt{}}$ $^{}$
0
Metrics
Total Citations0
research-article
December 2023
RCBench: an RDMA-enabled transaction framework for analyzing concurrency control algorithms
The VLDB Journal — The International Journal on Very Large Data Bases (VLDB), Volume 33, Issue 2Pages 543–567https://doi.org/10.1007/s00778-023-00821-0
Abstract
Distributed transaction processing over the TCP/IP network suffers from the weak transaction scalability problem, i.e., its performance drops significantly when the number of involved data nodes per transaction increases. Although quite a few of ...
1
Metrics
Total Citations1
research-article
September 2023
A survey on transactional stream processing
The VLDB Journal — The International Journal on Very Large Data Bases (VLDB), Volume 33, Issue 2Pages 451–479https://doi.org/10.1007/s00778-023-00814-z
Abstract
Transactional stream processing (TSP) strives to create a cohesive model that merges the advantages of both transactional and stream-oriented guarantees. Over the past decade, numerous endeavors have contributed to the evolution of TSP solutions, ...
0
Metrics
Total Citations0
correction
June 2022
Correction to: BugDoc Iterative debugging and explanation of pipeline executions
The VLDB Journal — The International Journal on Very Large Data Bases (VLDB), Volume 32, Issue 2Page 473https://doi.org/10.1007/s00778-022-00751-3
0
Metrics
Total Citations0
research-article
January 2022
Data distribution debugging in machine learning pipelines
The VLDB Journal — The International Journal on Very Large Data Bases (VLDB), Volume 31, Issue 5Pages 1103–1126https://doi.org/10.1007/s00778-021-00726-w
Abstract
Machine learning (ML) is increasingly used to automate impactful decisions, and the risks arising from this widespread use are garnering attention from policy makers, scientists, and the media. ML applications are often brittle with respect to ...
5
Metrics
Total Citations5
research-article
Public Access
November 2021
Parallel mining of large maximal quasi-cliques
The VLDB Journal — The International Journal on Very Large Data Bases (VLDB), Volume 31, Issue 4Pages 649–674https://doi.org/10.1007/s00778-021-00712-2
Abstract
Given a user-specified minimum degree threshold $γ$ , a $γ$ -quasi-clique is a subgraph where each vertex connects to at least $γ$ fraction of the other vertices. Quasi-clique is a natural definition for dense structures, so finding large and hence ...
4
Metrics
Total Citations4
research-article
August 2021
RDFFrames: knowledge graph access for machine learning tools
The VLDB Journal — The International Journal on Very Large Data Bases (VLDB), Volume 31, Issue 2Pages 321–346https://doi.org/10.1007/s00778-021-00690-5
Abstract
Knowledge graphs represented as RDF datasets are integral to many machine learning applications. RDF is supported by a rich ecosystem of data management systems and tools, most notably RDF database systems that provide a SPARQL query interface. ...
2
Metrics
Total Citations2
research-article
Public Access
August 2021
PrefixFPM: a parallel framework for general-purpose mining of frequent and closed patterns
The VLDB Journal — The International Journal on Very Large Data Bases (VLDB), Volume 31, Issue 2Pages 253–286https://doi.org/10.1007/s00778-021-00687-0
Abstract
A frequent pattern is a substructure that appears in a database with frequency (aka. support) no less than a user-specified threshold, while a closed pattern is one that has no super-pattern that has the same support. Here, a substructure can ...
5
Metrics
Total Citations5
research-article
Public Access
August 2021
G-thinker: a general distributed framework for finding qualified subgraphs in a big graph with load balancing
The VLDB Journal — The International Journal on Very Large Data Bases (VLDB), Volume 31, Issue 2Pages 287–320https://doi.org/10.1007/s00778-021-00688-z
Abstract
Finding from a big graph those subgraphs that satisfy certain conditions is useful in many applications such as community detection and subgraph matching. These problems have a high time complexity, but existing systems that attempt to scale them ...
3
Metrics
Total Citations3
research-article
June 2021
Tidy Tuples and Flying Start: fast compilation and fast execution of relational queries in Umbra
The VLDB Journal — The International Journal on Very Large Data Bases (VLDB), Volume 30, Issue 5Pages 883–905https://doi.org/10.1007/s00778-020-00643-4
Abstract
Although compiling queries to efficient machine code has become a common approach for query execution, a number of newly created database system projects still refrain from using compilation. It is sometimes claimed that the intricacies of code ...
19
Metrics
Total Citations19
research-article
May 2021
Formal semantics and high performance in declarative machine learning using Datalog
The VLDB Journal — The International Journal on Very Large Data Bases (VLDB), Volume 30, Issue 5Pages 859–881https://doi.org/10.1007/s00778-021-00665-6
Abstract
With an escalating arms race to adopt machine learning (ML) in diverse application domains, there is an urgent need to support declarative machine learning over distributed data platforms. Toward this goal, a new framework is needed where users ...
3
Metrics
Total Citations3
research-article
Public Access
December 2019
$O R P H E U S$ DB: bolt-on versioning for relational databases (extended version)
The VLDB Journal — The International Journal on Very Large Data Bases (VLDB), Volume 29, Issue 1Pages 509–538https://doi.org/10.1007/s00778-019-00594-5
Abstract
Data science teams often collaboratively analyze datasets, generating dataset versions at each stage of iterative exploration and analysis. There is a pressing need for a system that can support dataset versioning, enabling such teams to ... $^{}$
3
Metrics
Total Citations3
research-article
March 2022
Interleaving with coroutines: a systematic and practical approach to hide memory latency in index joins
The VLDB Journal — The International Journal on Very Large Data Bases (VLDB), Volume 28, Issue 4Pages 451–471https://doi.org/10.1007/s00778-018-0533-6
Abstract
Index joins present a case of pointer-chasing code that causes data cache misses. In principle, we can hide these cache misses by overlapping them with computation: The lookups involved in an index join are parallel tasks whose execution can be ...
3
Metrics
Total Citations3
article
Free
December 2018
Generating custom code for efficient query execution on heterogeneous processors
The VLDB Journal — The International Journal on Very Large Data Bases (VLDB), Volume 27, Issue 6Pages 797–822https://doi.org/10.1007/s00778-018-0512-y

Processor manufacturers build increasingly specialized processors to mitigate the effects of the power wall in order to deliver improved performance. Currently, database engines have to be manually optimized for each processor which is a costly and ...
26
178
Metrics
Total Citations26
Total Downloads178
Last 12 Months35
Last 6 weeks8
View online with eReader
PDF
article
Free
October 2012
SCOPE: parallel databases meet MapReduce
The VLDB Journal — The International Journal on Very Large Data Bases (VLDB), Volume 21, Issue 5Pages 611–636https://doi.org/10.1007/s00778-012-0280-z

Companies providing cloud-scale data services have increasing needs to store and analyze massive data sets, such as search logs, click streams, and web graph data. For cost and performance reasons, processing is typically done on large clusters of tens ...
83
889
Metrics
Total Citations83
Total Downloads889
Last 12 Months104
Last 6 weeks30
View online with eReader
PDF
article
Free
October 2012
On the optimization of schedules for MapReduce workloads in the presence of shared scans
The VLDB Journal — The International Journal on Very Large Data Bases (VLDB), Volume 21, Issue 5Pages 589–609https://doi.org/10.1007/s00778-012-0279-5

We consider MapReduce clusters designed to support multiple concurrent jobs, concentrating on environments in which the number of distinct datasets is modest relative to the number of jobs. In such scenarios, many individual datasets are likely to be ...
6
291
Metrics
Total Citations6
Total Downloads291
Last 12 Months46
Last 6 weeks4
View online with eReader
PDF
article
Free
April 2010
A framework for testing DBMS features
The VLDB Journal — The International Journal on Very Large Data Bases (VLDB), Volume 19, Issue 2Pages 203–230https://doi.org/10.1007/s00778-009-0157-y

Testing a specific feature of a DBMS requires controlling the inputs and outputs of the operators in the query execution plan. However, that is practically difficult to achieve because the inputs/outputs of a query depend on the content of the test ...
14
544
Metrics
Total Citations14
Total Downloads544
Last 12 Months21
Last 6 weeks2
View online with eReader
PDF

Applied Filters

People

Names

Institutions

Authors

Reviewers

Publications

All Publications

Content Type

Media Formats

Publisher

Publication Date

Results

A graph pattern mining framework for large graphs on GPU: A Graph Pattern Mining...

A powerful reducing framework for accelerating set intersections over graphs

Performant almost-latch-free data structures using epoch protection in more depth

Parallelization of butterfly counting on hierarchical memory

RCBench: an RDMA-enabled transaction framework for analyzing concurrency control algorithms

A survey on transactional stream processing

Correction to: BugDoc Iterative debugging and explanation of pipeline executions

Data distribution debugging in machine learning pipelines

Parallel mining of large maximal quasi-cliques

RDFFrames: knowledge graph access for machine learning tools

PrefixFPM: a parallel framework for general-purpose mining of frequent and closed patterns

G-thinker: a general distributed framework for finding qualified subgraphs in a big graph with load balancing

Tidy Tuples and Flying Start: fast compilation and fast execution of relational queries in Umbra

Formal semantics and high performance in declarative machine learning using Datalog

$O R P H E U S$ DB: bolt-on versioning for relational databases (extended version)

Interleaving with coroutines: a systematic and practical approach to hide memory latency in index joins

Generating custom code for efficient query execution on heterogeneous processors

SCOPE: parallel databases meet MapReduce

On the optimization of schedules for MapReduce workloads in the presence of shared scans

A framework for testing DBMS features

Applied Filters

People

Names

Institutions

Authors

Reviewers

Publications

All Publications

Content Type

Media Formats

Publisher

Publication Date

Save to Binder

ORPHEUSDB: bolt-on versioning for relational databases (extended version)

$O R P H E U S$ DB: bolt-on versioning for relational databases (extended version)