Skip to content Skip to footer

Purdue University researchers introduce GTX: A Transactional Graph Data System designed for handling HTAP Workloads.

Researchers from Purdue University have unveiled a new tool, GTX, to address the challenges faced by transactional graph systems in handling large-scale graphs. GTX is designed to be highly efficient in managing dynamic graphs that feature high update arrival rates, temporal localities, and hotspots. Such capabilities are vital for applications including fraud detection, recommendation systems, and graph neural network training.

Many current transactional graph systems use coarse-grained concurrency control mechanisms, resulting in potential performance degradation when handling concurrent workloads with frequent updates. GTX, on the other hand, is designed as a latch-free, write-optimized transactional graph data system using atomic operations. It employs delta-based multiversion storage and implements a hybrid transaction commit protocol to optimize efficiency.

The significant feature of GTX is its capacity to adapt to temporal localities and hotspots during graph updates. It does so while ensuring high throughput read-write transactions and good competition in the realm of graph analytics performance. The architecture of GTX revolves around a latch-free adjacency list-based graph store and a transaction manager providing concurrency control protocol. The system uses a multi-version delta store, in which each delta captures either vertex or edge operations, enabling efficient access and updates.

To optimize concurrency, GTX controls transactions and analytics at the delta-chain level and employs a hybrid group commit protocol. This improves overall throughput of the system. Additionally, GTX uses a delta-chain index for efficient edge lookups and adapts concurrency control based on workload history. These features enhance the tool’s ability to handle real-world power-law graphs with temporal localities and hotspots, maintaining millions of transactions per second throughput and competitive graph analytics performance.

The design of GTX has been prototyped as a graph library and evaluated through experiments using real-world and synthetic datasets. The results bolster the capacity of GTX to outperform existing systems in transaction throughput and robustness across various workloads. The ability of GTX to adapt to various factors, while preserving competitive graph analytics performance, solidifies its promise for applications that require efficient graph management and analysis.

In summary, Purdue University researchers have introduced GTX, a transactional graph data system that performs exceptionally well in managing dynamic graphs featuring high update arrival rates, temporal localities, and hotspots. Outperforming its contemporary systems in transaction throughput and robustness, GTX suggests great potential for applications involving efficient graph management and analysis.

Leave a comment

0.0/5