[1993] Proceedings of the Second International Conference on Parallel and Distributed Information Systems
DOI: 10.1109/pdis.1993.253077
|View full text |Cite
|
Sign up to set email alerts
|

Declustering using fractals

Abstract: We propose a method to achieve declustering for cartesian product les on M units. The focus is on range queries, as opposed to partial match queries that older declustering methods have examined. Our method uses a distance-preserving mapping, namely, the Hilbert curve, to impose a linear ordering on the multidimensional points (buckets); then, it traverses the buckets according to this ordering, assigning buckets to disks in a round-robin fashion. Thanks to the good distance-preserving properties of the Hilber… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

1
66
0

Publication Types

Select...
4
4
1

Relationship

0
9

Authors

Journals

citations
Cited by 101 publications
(67 citation statements)
references
References 23 publications
1
66
0
Order By: Relevance
“…In this work, the assignment of input chunks to the disks was done using a Hilbert curve based declustering algorithm [15]. Hilbert curve algorithms have been shown to achieve good I/O parallelism for multi-dimensional datasets.…”
Section: Datasetsmentioning
confidence: 99%
“…In this work, the assignment of input chunks to the disks was done using a Hilbert curve based declustering algorithm [15]. Hilbert curve algorithms have been shown to achieve good I/O parallelism for multi-dimensional datasets.…”
Section: Datasetsmentioning
confidence: 99%
“…Several methods have been proposed for declustering data, including Disk Modulo [12], Field-wise Exclusive OR [29], Hilbert [13], Near-Optimal Declustering [5], General Multidimensional Data Allocation [27], cyclic allocation schemes [36], [37], Golden Ratio Sequences [7], Hierarchical Declustering [6], and Discrepancy Declustering [9]. Using declustering and replication, approaches including Complete Coloring [20] have optimal performance and Square Root Colors Disk Modulo [20] has one more than optimal.…”
Section: Introductionmentioning
confidence: 99%
“…Given the established bounds on the extra cost and the impossibility result, a large number of declustering techniques have been proposed to achieve performance close to the bounds either on the average case [5], [12], [13], [14], [16], [22], [24], [25], [29], [31], [36], [37] or, in the worst case, [3], [6], [7], [9], [41]. Although initial approaches in the literature were originally for relational databases or Cartesian product files, recent techniques focus more on spatial data declustering.…”
Section: Introductionmentioning
confidence: 99%
“…Efficient access to data also depends on how well the data has been distributed across multiple storage nodes. The goal of declustering [9,14] is to distribute the data across as many storage units as possible so that data elements that satisfy a query can be retrieved from many sources in parallel. Caching is yet another optimization that targets multiple query workloads [1,10,19,21].…”
Section: Introductionmentioning
confidence: 99%