Licensed third-party word breakers are not installed by default in SQL Server 2005 Community Technology Preview September 2005 (908441)



The information in this article applies to:

  • Microsoft SQL Server 2005 Developer Edition Community Technology Preview
  • Microsoft SQL Server 2005 Developer Edition for Itanium-based Systems Community Technology Preview
  • Microsoft SQL Server 2005 Developer x64 Edition Community Technology Preview
  • Microsoft SQL Server 2005 Enterprise Edition Community Technology Preview
  • Microsoft SQL Server 2005 Express Edition Community Technology Preview
  • Microsoft SQL Server 2005 Standard Edition Community Technology Preview
  • Microsoft SQL Server 2005 Standard Edition for Itanium-based Systems Community Technology Preview
  • Microsoft SQL Server 2005 Standard x64 Edition Community Technology Preview
  • Microsoft SQL Server 2005 Workgroup Edition Community Technology Preview

Beta Information

This article discusses a Beta release of a Microsoft product. The information in this article is provided as-is and is subject to change without notice.

No formal product support is available from Microsoft for this Beta product. For information about how to obtain support for a Beta release, see the documentation that is included with the Beta product files, or check the Web location where you downloaded the release.

SYMPTOMS

Microsoft SQL Server 2005 Community Technology Preview (CTP) September 2005 includes licensed third-party word breakers for the following languages:
  • Danish
  • Polish
  • Portuguese-Brazilian
  • Portuguese-Iberian
  • Russian
  • Turkish
These word breakers are available. However, they are not installed by default in SQL Server 2005 CTP September 2005. Therefore, you must manually register these word breakers.

Note We recommend that you set the Microsoft Full-Text Engine for SQL Server (MSFTESQL) service to run under a low-privileged account.

RESOLUTION

Register the word breakers

To register a word breaker, you must use the following method:
  • Add the following information to the <InstanceRoot>\MSSearch\CLSID registry subkey:
    • The Microsoft Component Object Model (COM) Class ID or Class IDs for the word breaker
    • The stemmer interfaces for the language that you are using
    Note <InstanceRoot> represents the instance root.
  • Add a subkey to the <InstanceRoot>\MSSearch\Language registry subkey for the language.
  • Add configuration values that specify the location of the lexicon, noise word, and thesaurus files for the language.

Obtain instance IDs for multiple instances of SQL Server

The registry paths that are listed in this article are for the first instance of SQL Server 2005. This instance has instance ID MSSQL.1. If multiple instances of SQL Server exist, you must modify the registry paths by substituting the instance ID for that instance instead of MSSQL.1. To obtain the instance ID for an instance, follow these steps:
  1. Click Start, click Run, type regedit, and then click OK.
  2. In Registry Editor, locate the following registry key for the first instance of SQL Server:

    HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Microsoft SQL Server\Instance Names\SQL

  3. In the right pane, the instance names and the corresponding instance IDs appear.

Obtain the instance-specific FTData folder path

After you obtain the instance IDs, you must obtain the appropriate instance-specific path of the FTData folder. You will use this path when you add configuration values that specify the lexicon, noise word, and thesaurus files for a language. To obtain the instance-specific FTData folder path, follow these steps:
  1. Click Start, click Run, type regedit, and then click OK.
  2. In Registry Editor, locate and then click the following registry key for an instance of SQL Server:

    HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Microsoft SQL Server\<Instance ID>\MSSQLServer

    Note <Instance ID> is MSSQL.1 for the first instance of SQL Server. Therefore, the registry key will be the following:

    HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Microsoft SQL Server\MSSQL.1\MSSQLServer

  3. In the right pane, the FullTextDefaultPath registry value appears. This value contains the instance-specific path of the FTData folder. For example, the instance-specific path for the first instance may be C:\Program Files\Microsoft SQL Server\MSSQL.1\MSSQL\FTData.

Install the word breakers

The following section describes how to install the word breakers for the following languages:
  • Danish
  • Polish
  • Portuguese-Brazilian
  • Portuguese-Iberian
  • Russian
  • Turkish
Note The following steps use the Danish word breaker as an example. The values that you must use to install other language word breakers are listed after each step.

Add the COM Class ID or Class IDs for the word breaker, and add the stemmer interfaces for the language that you want to use

To add a COM Class ID or Class IDs for these components, follow these steps:
  1. Click Start, click Run, type regedit, and then click OK.
  2. In Registry Editor, locate and then click the following registry key for the first instance of SQL Server:

    HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Microsoft SQL Server\MSSQL.1\MSSearch\CLSID

  3. On the Edit menu, point to New, and then click Key.
  4. Type {16BC5CE4-2C78-4CB9-80D5-386A68CC2B2D}, and then press ENTER.
  5. In the right pane, right-click the Default registry value, and then click Modify.
  6. In the Edit String dialog box, type danlr.dll in the Value data box, and then click OK.
  7. On the Edit menu, point to New, and then click Key.
  8. Type {83BC7EF7-D27B-4950-A743-0F8E5CA928F8}, and then press ENTER.
  9. In the right pane, right-click the Default registry value, and then click Modify.
  10. In the Edit String dialog box, type danlr.dll in the Value data box, and then click OK.
For other languages, use the registry key values that are listed in the following table. In step 6 and in step 10, replace danlr.dll with the DLL for the language that you want.
LanguageRegistry key value for step 4DLL for step 6 and step 10Registry key value for step 8
Polish{B8713269-2D9D-4BF5-BF40-2615D75723D8}lrpolish.dll{CA665B09-4642-4C84-A9B7-9B8F3CD7C3F6}
Portuguese-Brazilian{25B7FD48-5404-4BEB-9D80-B6982AF404FD}ptblr.dll{D5FCDD7E-DBFF-473F-BCCD-3AFD1890EA85}
Portuguese-Iberian{5D5F3A69-620C-4952-B067-4D0126BB6086}ptslr.dll{D4171BC4-90BE-4F70-8610-DAB1C17F063C}
Russian{20036404-F1AF-11D2-A57F-006052076F32}ruslr.dll{20036414-F1AF-11D2-A57F-006052076F32}
Turkish{23A9C1C3-3C7A-4D2C-B894-4F286459DAD6}trklr.dll{8DF412D1-62C7-4667-BBEC-38756576C21B}

Add a subkey to the <InstanceRoot>\MSSearch\Language registry subkey for the language

To add a subkey to this registry subkey, follow these steps:
  1. Locate and then click the following registry key for the first instance of SQL Server:

    HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Microsoft SQL Server\MSSQL.1\MSSearch\Language

  2. Repeat steps 3 through 4 in the "Add the COM Class ID or Class IDs for the word breaker, and add the stemmer interfaces for the language that you want to use" section. Replace the key name in step 4 with dan.

    For other languages, replace the key name in step 4 with the value that is listed in the following table.
    LanguageKey name
    Polishplk
    Portuguese-Brazilianptb
    Portuguese-Iberianpts
    Russianrus
    Turkishtrk

Add configuration values that set the location of the lexicon, noise word, and thesaurus files for the language

To add configuration values for these components, follow these steps:
  1. Locate and then click the registry key that you added in the "Add a subkey to the <InstanceRoot>\MSSearch\Language registry subkey for the language" section. For the first instance of SQL Server, this registry key would be the following:

    HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Microsoft SQL Server\MSSQL.1\MSSearch\Language\dan

  2. On the Edit menu, point to New, and then click String Value.
  3. Type NoiseFile, and then press ENTER.
  4. Right-click the NoiseFile registry value, and then click Modify.
  5. In the Edit String dialog box, type Instance-SpecificFTDataPath\noisedan.txt in the Value data box.

    Note InstanceSpecificFTDataPath represents the path that you obtained by following the steps in the "Obtain the instance-specific FTData folder path" section.
  6. Click OK.
  7. Repeat steps 2 through 6 to add the values that are listed in the following table for the Danish language.
    Value type for step 2Value names for step 3 and step 4Value type for step 5
    String valueTsaurusFileInstance-SpecificFTDataPath\tsdan.xml
    DWORD valueLocale00000406
    String valueWBreakerClass{16BC5CE4-2C78-4CB9-80D5-386A68CC2B2D}
    String valueStemmerClass{83BC7EF7-D27B-4950-A743-0F8E5CA928F8}
For languages that differ from Danish, use the following values.Polish language values Locate and then click the registry key that you added in the "Add a subkey to the <InstanceRoot>\MSSearch\Language registry subkey for the language" section. For the first instance of SQL Server, this registry key would be the following:

HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Microsoft SQL Server\MSSQL.1\MSSearch\Language\plk

Follow the steps in the "Add configuration values that set the location of the lexicon, noise word, and thesaurus files for the language" section by using the values in the following table.
Value type for step 2Value names for steps 3 and 4Value data for step 5
String valueNoiseFileInstance-SpecificFTDataPath\noiseplk.txt
String valueTsaurusFileInstance-SpecificFTDataPath\tsplk.xml
DWORD valueLocale00000415
String valueWBreakerClass{CA665B09-4642-4C84-A9B7-9B8F3CD7C3F6}
String valueStemmerClass{B8713269-2D9D-4BF5-BF40-2615D75723D8}
Portuguese-Brazilian language values Locate and then click the registry key that you added in the "Add a subkey to the <InstanceRoot>\MSSearch\Language registry subkey for the language" section. For the first instance of SQL Server, this registry key would be the following:

HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Microsoft SQL Server\MSSQL.1\MSSearch\Language\ptb

Follow the steps in the "Add configuration values that set the location of the lexicon, noise word, and thesaurus files for the language" section by using the values in the following table.
Value type for step 2Value names for steps 3 and 4Value data for step 5
String valueNoiseFileInstance-SpecificFTDataPath\noiseptb.txt
String valueTsaurusFileInstance-SpecificFTDataPath\tsptb.xml
DWORD valueLocale00000416
String valueWBreakerClass{25B7FD48-5404-4BEB-9D80-B6982AF404FD}
String valueStemmerClass{D5FCDD7E-DBFF-473F-BCCD-3AFD1890EA85}
Portuguese-Iberian language valuesLocate and then click the registry key that you added in the "Add a subkey to the <InstanceRoot>\MSSearch\Language registry subkey for the language" section. For the first instance of SQL Server, this registry key would be the following:

HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Microsoft SQL Server\MSSQL.1\MSSearch\Language\pts

Follow the steps in the "Add configuration values that set the location of the lexicon, noise word, and thesaurus files for the language" section by using the values in the following table.
Value type for step 2Value names for steps 3 and 4Value data for step 5
String valueNoiseFileInstance-SpecificFTDataPath\noisepts.txt
String valueTsaurusFileInstance-SpecificFTDataPath\tspts.xml
DWORD valueLocale00000816
String valueWBreakerClass{5D5F3A69-620C-4952-B067-4D0126BB6086}
String valueStemmerClass{D4171BC4-90BE-4F70-8610-DAB1C17F063C}
Russian language valuesLocate and then click the registry key that you added in the "Add a subkey to the <InstanceRoot>\MSSearch\Language registry subkey for the language" section. For the first instance of SQL Server, this registry key would be the following:

HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Microsoft SQL Server\MSSQL.1\MSSearch\Language\rus

Follow the steps in the "Add configuration values that set the location of the lexicon, noise word, and thesaurus files for the language" section by using the values in the following table.
Value type for step 2Value names for steps 3 and 4Value data for step 5
String valueNoiseFileInstance-SpecificFTDataPath\noiserus.txt
String valueTsaurusFileInstance-SpecificFTDataPath\tsrus.xml
DWORD valueLocale00000419
String valueWBreakerClass{20036404-F1AF-11D2-A57F-006052076F32}
String valueStemmerClass{20036414-F1AF-11D2-A57F-006052076F32}
Turkish language valuesLocate and then click the registry key that you added in the "Add a subkey to the <InstanceRoot>\MSSearch\Language registry subkey for the language" section. For the first instance of SQL Server, this registry key would be the following:

HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Microsoft SQL Server\MSSQL.1\MSSearch\Language\trk

Follow the steps in the "Add configuration values that set the location of the lexicon, noise word, and thesaurus files for the language" section by using the values in the following table.
Value type for step 2Value names for steps 3 and 4Value data for step 5
String valueNoiseFileInstance-SpecificFTDataPath\noisetrk.txt
String valueTsaurusFileInstance-SpecificFTDataPath\tstrk.xml
DWORD valueLocale0000041f
String valueWBreakerClass{8DF412D1-62C7-4667-BBEC-38756576C21B}
String valueStemmerClass{23A9C1C3-3C7A-4D2C-B894-4F286459DAD6}

Modification Type:MinorLast Reviewed:3/10/2006
Keywords:kbsql2005fts kbtshoot kbprb KB908441 kbAudDeveloper kbAudITPRO