Search: English Language Meta Tag Does Not Get Crawled Correctly (258231)



The information in this article applies to:

  • Microsoft Site Server 3.0

This article was previously published under Q258231

SYMPTOMS

When you specify languages through the language meta tag, English-language documents are not crawled correctly. When you search against language-tagged documents, English-language documents do not appear in the results, even though documents in all other languages are returned as expected.

CAUSE

The HTML filter maps content-language strings into locale identifiers (LCIDs). Before Site Server 3.0 Service Pack 3 (SP3), both EN and EN-US were mapped to 0x409, which was incorrect for EN.

In Site Server 3.0 SP3, the mappings have been corrected. EN now maps to 0x9 and EN-US maps to 0x409. EN-GB has always correctly mapped to 0x809.
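The mapping change can be modeled in a few lines. The following Python sketch is only an illustration of the values described above. The table names and the lcid_for helper are hypothetical and are not part of Site Server or the HTML filter.

```python
# Illustrative model of the content-language to LCID mappings described above.
# The tables and the helper name are hypothetical, not Site Server APIs.

PRE_SP3_LCIDS = {"EN": 0x409, "EN-US": 0x409, "EN-GB": 0x809}
SP3_LCIDS = {"EN": 0x9, "EN-US": 0x409, "EN-GB": 0x809}


def lcid_for(content_language, table):
    """Look up the LCID for a content-language string, ignoring case."""
    return table[content_language.strip().upper()]


# Before SP3, a document tagged "EN" was stored under 0x409.
# With SP3 it is stored under 0x9, so a query that asks only for 0x409
# would no longer match it.
assert lcid_for("en", PRE_SP3_LCIDS) == 0x409
assert lcid_for("en", SP3_LCIDS) == 0x9
```

Under this model, only the EN entry differs between the two tables, which is consistent with English being the only language affected.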

RESOLUTION

To resolve this issue, modify your query scripts so that they search for 0x9 (EN), 0x409 (EN-US), and 0x809 (EN-GB) when searching for EN. The following example demonstrates the change that is needed in the Search.asp page of the Search language detection sample:

Original line:

Q.Query = Q.Query & " & @DetectedLanguage ^a " & LocaleID
					
New line:

Q.Query = Q.Query & " & (@DetectedLanguage = ^s 9 OR @DetectedLanguage = ^s 1033 OR @DetectedLanguage = ^s 2057 )"
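If the script must keep serving other languages, the English expansion can be applied conditionally. The following Python sketch shows one way to build the clause, using the decimal values 9, 1033, and 2057 for 0x9, 0x409, and 0x809. The language_clause helper is hypothetical, and treating any of the three English LCIDs as a request for all English documents is an assumption, not something the sample does. The query syntax is copied from the two lines above.

```python
# Hypothetical helper that builds the language restriction for a search query.
# Assumption: any of the three English LCIDs means "search all English".

ENGLISH_LCIDS = (0x9, 0x409, 0x809)  # EN, EN-US, EN-GB


def language_clause(locale_id):
    """Return the query fragment that restricts results by language."""
    if locale_id in ENGLISH_LCIDS:
        # English: match any of the three English LCIDs (written in decimal).
        terms = " OR ".join(
            f"@DetectedLanguage = ^s {lcid}" for lcid in ENGLISH_LCIDS
        )
        return f" & ({terms})"
    # Other languages: keep the form used by the original line.
    return f" & @DetectedLanguage ^a {locale_id}"


print(language_clause(0x409))
print(language_clause(1031))
```

The same branching could be written in the VBScript of Search.asp by testing LocaleID before appending to Q.Query.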
					


Modification Type: Major
Last Reviewed: 3/29/2000
Keywords: kbprb KB258231