FIX: Cardinality Estimates Are Too Low When Value Is Outside Histogram (319173)



The information in this article applies to:

  • Microsoft SQL Server 2000 (all editions)

This article was previously published under Q319173
BUG #: 356671 (SHILOH_BUGS)

SYMPTOMS

The cardinality estimate for a predicate that uses a literal or a parameter value may be too low (typically 1.0) under the following conditions:
  • There are more rows in the table than there were when the statistics were last updated.
  • The value from the predicate is less than the first value or greater than the last value stored in the existing histogram.
  • The pattern of inserts is such that for this index or statistics collection, the values that correspond to newly added rows always tend to go at either the beginning or the end of the existing histogram steps as opposed to being evenly distributed through the existing range of data values. Some examples of this are an index on a column with the IDENTITY property, or on a datetime column that stores the time the row was inserted.
You may observe some of the following behaviors as a result of this problem:
  • This table might be accessed early in the join order because the cardinality estimates are low, where a later order may be more appropriate.
  • In scenarios where there is a multi-column index and the filter predicate is the leading column of this index, a seek that uses that index may seek by using only the first column when it can use additional columns in the seek predicate.

RESOLUTION

To resolve this problem, obtain the latest service pack for Microsoft SQL Server 2000. For additional information, click the following article number to view the article in the Microsoft Knowledge Base:

290211 INF: How To Obtain the Latest SQL Server 2000 Service Pack

NOTE: The following hotfix was created before the release of Microsoft SQL Server 2000 Service Pack 3.

The English version of this fix should have the following file attributes or later:
   Version      File name
   ---------------------------

   8.00.0592    Sqlservr.exe
				
NOTE: Because of file dependencies, the most recent hotfix or feature that contains the files may also contain additional files.


Note that this fix is enabled only when you use trace flag 9205. For more information about the use of trace flags, refer to SQL Server 2000 Books Online. The meaning of trace flag 9205 may change in the next major product version, or its use may be unsupported. Only use trace flag 9205 on systems that experience this problem. You need to review the use of the trace flag prior to any major version upgrade.

WORKAROUND

To work around this behavior, schedule a job that runs update statistics against the affected index so that the statistics remain up to date.

STATUS

Microsoft has confirmed that this is a problem in the Microsoft products that are listed at the beginning of this article.
This problem was first corrected in Microsoft SQL Server 2000 Service Pack 3.

MORE INFORMATION

SQL Server statistics include:
  • A count of the number of rows that were in the table when a set of statistics were created.

  • Density information.

  • A histogram that represents the distribution of the values in the table.
When the optimizer uses a histogram, it compares the number of rows currently in the table with the number of rows that existed when the statistics were created. If the numbers differ, it tries to update what the histogram might look like given the current number of rows in the table. The changes to the histogram include the creation of two new histogram steps at the low and high ends of the histogram, which reflect the new rows that have been added to the table.

The exact distribution of values after a series of updates is not known, so SQL Server guesses as to how many rows would have been inserted in the middle of the existing data distribution versus at the beginning or end of the histogram steps. If the nature of the data is such that new rows for this index typically appear at or before or after the existing values, the cardinality estimate will be too low, typically 1.0.

With this fix, the method for estimating how many rows fall outside the histogram changes so that the estimates are higher. Because the cardinality estimate affects the estimate cost of the query, this also influences the types of query plans that the optimizer considers and what order tables may be accessed.

Modification Type:MinorLast Reviewed:9/27/2005
Keywords:kbHotfixServer kbQFE kbSQLServ2000sp3fix kbbug kbfix kbSQLServ2000preSP3fix KB319173