Suppose a directed graph has its arcs stored in secondary memory, and we wish to compute its transitive closure, also storing the result in secondary memory. We assume that an amount of main memory capable of holdings “values” is available, and thats lies betweenn, the number of nodes of the graph, ande, the number of arcs. The cost measure we use for algorithms is theI/O complexity of Kung and Hong, where we count 1 every time a value is moved into main memory from secondary memory, or vice versa.In the dense case, wheree is close ton2, we show that I/O equal toO(n3/√s) is sufficient to compute the transitive closure of ann-node graph, using main memory of sizes. Moreover, it is necessary for any algorithm that is “standard,” in a sense to be defined precisely in the paper. Roughly, “standard” means that paths are constructed only by concatenating arcs and previously discovered paths. For the sparse case, we show that I/O equal toO(n2√e/s) is sufficient, although the algorithm we propose meets our definition of “standard” only if the underlying graph is acyclic. We also show thatΩ(n2√e/s) is necessary for any standard algorithm in the sparse case. That settles the I/O complexity of the sparse/acyclic case, for standard algorithms. It is unknown whether this complexity can be achieved in the sparse, cyclic case, by a standard algorithm, and it is unknown whether the bound can be beaten by nonstandard algorithms.We then consider a special kind of standard algorithm, in which paths are constructed only by concatenating arcs and old paths, never by concatenating two old paths. This restriction seems essential if we are to take advantage of sparseness. Unfortunately, we show that almost another factor ofn I/O is necessary. That is, there is an algorithm in this class using I/OO(n3√e/s) for arbitrary sparse graphs, including cyclic ones. Moreover, every algorithm in the restricted class must useΩ(n3√e/s/log3n) I/O, on some cyclic graphs.
[1]
J. Hopcroft,et al.
Efficient algorithms for graph manipulation
,
1971
.
[2]
Alfred V. Aho,et al.
The Design and Analysis of Computer Algorithms
,
1974
.
[3]
Jeffrey D. Ullman.
The new technologies
,
1989
.
[4]
Don Coppersmith,et al.
Matrix multiplication via arithmetic progressions
,
1987,
STOC.
[5]
H. T. Kung,et al.
I/O complexity: The red-blue pebble game
,
1981,
STOC '81.
[6]
Jeffrey D. Ullman,et al.
Principles of Database and Knowledge-Base Systems, Volume II
,
1988,
Principles of computer science series.
[7]
J. Hopcroft,et al.
Algorithm 447: efficient algorithms for graph manipulation
,
1973,
CACM.
[8]
Raghu Ramakrishnan,et al.
Efficient Transitive Closure Algorithms
,
1988,
VLDB.
[9]
L. Lovász.
Combinatorial problems and exercises
,
1979
.
[10]
Henry S. Warren,et al.
A modification of Warshall's algorithm for the transitive closure of binary relations
,
1975,
Commun. ACM.
[11]
V. Benes,et al.
Mathematical Theory of Connecting Networks and Telephone Traffic.
,
1966
.
[12]
H. V. Jagadish,et al.
Direct Algorithms for Computing the Transitive Closure of Database Relations
,
1987,
VLDB.
[13]
Jeffrey D. Ullman.
Classical database systems
,
1988
.
[14]
Stephen Warshall,et al.
A Theorem on Boolean Matrices
,
1962,
JACM.
[15]
Edward G. Coffman,et al.
Organizing matrices and matrix operations for paged memory systems
,
1969,
Commun. ACM.