Extend code that deduces implied equality clauses to detect whether a

clause being added to a particular restriction-clause list is redundant with those already in the list. This avoids useless work at runtime, and (perhaps more importantly) keeps the selectivity estimation routines from generating too-small estimates of numbers of output rows. Also some minor improvements in OPTIMIZER_DEBUG displays.
author: Tom Lane <tgl@sss.pgh.pa.us> 2001-10-18 16:11:42 +0000
committer: Tom Lane <tgl@sss.pgh.pa.us> 2001-10-18 16:11:42 +0000
commit: 6254465d0621f724cdc9a1e99da24fa8a29f579d (patch)
tree: 84bd08321ce84de9daf6ab5264c889e5b5a92e4e /src/backend/optimizer/README
parent: 50450049581566ed47016cd89ba03b90be7ea1d0 (diff)
download: postgresql-6254465d0621f724cdc9a1e99da24fa8a29f579d.tar.gz
1 files changed, 17 insertions, 0 deletions
diff --git a/src/backend/optimizer/README b/src/backend/optimizer/README
index 520ab1b242..472efbcd9c 100644
--- a/src/backend/optimizer/README
+++ b/src/backend/optimizer/README
@@ -461,6 +461,23 @@ code than it would otherwise have, and reduces the number of tuples
 processed in join stages, so it's a win to make these deductions even
 if we weren't forced to.
 
+When we generate implied equality constraints, we may find ourselves
+adding redundant clauses to specific relations.  For example, consider
+	SELECT * FROM t1, t2, t3 WHERE t1.a = t2.b AND t2.b = t3.c;
+We will generate the implied clause t1.a = t3.c and add it to the tree.
+This is good since it allows us to consider joining t1 and t3 directly,
+which we otherwise wouldn't do.  But when we reach the stage of joining
+all three relations, we will have redundant join clauses --- eg, if we
+join t1 and t2 first, then the path that joins (t1 t2) to t3 will have
+both t2.b = t3.c and t1.a = t3.c as restriction clauses.  This is bad;
+not only is evaluation of the extra clause useless work at runtime,
+but the selectivity estimator routines will underestimate the number
+of tuples produced since they won't know that the two clauses are
+perfectly redundant.  We fix this by detecting and removing redundant
+clauses as the restriction clause list is built for each join.  (We
+can't do it sooner, since which clauses are redundant will vary depending
+on the join order.)
+
 Yet another implication of all this is that mergejoinable operators
 must form closed equivalence sets.  For example, if "int2 = int4"
 and "int4 = int8" are both marked mergejoinable, then there had better
author	Tom Lane <tgl@sss.pgh.pa.us>	2001-10-18 16:11:42 +0000
committer	Tom Lane <tgl@sss.pgh.pa.us>	2001-10-18 16:11:42 +0000
commit	6254465d0621f724cdc9a1e99da24fa8a29f579d (patch)
tree	84bd08321ce84de9daf6ab5264c889e5b5a92e4e /src/backend/optimizer/README
parent	50450049581566ed47016cd89ba03b90be7ea1d0 (diff)
download	postgresql-6254465d0621f724cdc9a1e99da24fa8a29f579d.tar.gz