migration guide

author: Mike Bayer <mike_mp@zzzcomputing.com> 2013-10-13 17:16:36 -0400
committer: Mike Bayer <mike_mp@zzzcomputing.com> 2013-10-13 17:16:36 -0400
commit: 5e894798b6492ac0a8209b6dfa08554ec1dc3a8f (patch)
tree: 5c6b0b9037728ac0c19efc47db09c911ce917030
parent: ee0302161a85e8c7e1066642c2c99ccb3170c7a1 (diff)
download: sqlalchemy-5e894798b6492ac0a8209b6dfa08554ec1dc3a8f.tar.gz
2 files changed, 45 insertions, 0 deletions
diff --git a/doc/build/changelog/changelog_08.rst b/doc/build/changelog/changelog_08.rst
index b9fb941a8..38da8b916 100644
--- a/doc/build/changelog/changelog_08.rst
+++ b/doc/build/changelog/changelog_08.rst
@@ -29,6 +29,10 @@
         None in 0.9 (e.g. automatic by default).   Thanks to Alexander Koval
         for help with this.
 
+        .. seealso::
+
+            :ref:`change_2836`
+
     .. change::
         :tags: bug, mysql
         :tickets: 2515
diff --git a/doc/build/changelog/migration_09.rst b/doc/build/changelog/migration_09.rst
index da01dc5c8..969e2b5da 100644
--- a/doc/build/changelog/migration_09.rst
+++ b/doc/build/changelog/migration_09.rst
@@ -802,6 +802,47 @@ and SQL expression values inline with the INSERT or UPDATE.  The feature takes
 place automatically when the target backend and :class:`.Table` supports
 "implicit returning".
 
+.. _change_2836:
+
+Subquery Eager Loading will apply DISTINCT to the innermost SELECT for some queries
+------------------------------------------------------------------------------------
+
+In an effort to reduce the number of duplicate rows that can be generated
+by subquery eager loading when a many-to-one relationship is involved, a
+DISTINCT keyword will be applied to the innermost SELECT when the join is
+targeting columns that do not comprise the primary key, as in when loading
+along a many to one.
+
+That is, when subquery loading on a many-to-one from A->B::
+
+    SELECT b.id AS b_id, b.name AS b_name, anon_1.b_id AS a_b_id
+    FROM (SELECT DISTINCT a_b_id FROM a) AS anon_1
+    JOIN b ON b.id = anon_1.a_b_id
+
+Since ``a.b_id`` is a non-distinct foreign key, DISTINCT is applied so that
+redundant ``a.b_id`` are eliminated.  The behavior can be turned on or off
+unconditionally for a particular :func:`.relationship` using the flag
+``distinct_target_key``, setting the value to ``True`` for unconditionally
+on, ``False`` for unconditionally off, and ``None`` for the feature to take
+effect when the target SELECT is against columns that do not comprise a full
+primary key.  In 0.9, ``None`` is the default.
+
+The option is also backported to 0.8 where the ``distinct_target_key``
+option defaults to ``False``.
+
+While the feature here is designed to help performance by eliminating
+duplicate rows, the ``DISTINCT`` keyword in SQL itself can have a negative
+performance impact.  If columns in the SELECT are not indexed, ``DISTINCT``
+will likely perform an ``ORDER BY`` on the rowset which can be expensive.
+By keeping the feature limited just to foreign keys which are hopefully
+indexed in any case, it's expected that the new defaults are reasonable.
+
+The feature also does not eliminate every possible dupe-row scenario; if
+a many-to-one is present elsewhere in the chain of joins, dupe rows may still
+be present.
+
+:ticket:`2836`
+
 
 .. _migration_1068:
author	Mike Bayer <mike_mp@zzzcomputing.com>	2013-10-13 17:16:36 -0400
committer	Mike Bayer <mike_mp@zzzcomputing.com>	2013-10-13 17:16:36 -0400
commit	5e894798b6492ac0a8209b6dfa08554ec1dc3a8f (patch)
tree	5c6b0b9037728ac0c19efc47db09c911ce917030
parent	ee0302161a85e8c7e1066642c2c99ccb3170c7a1 (diff)
download	sqlalchemy-5e894798b6492ac0a8209b6dfa08554ec1dc3a8f.tar.gz