Thursday, July 24, 2008

Chinese, Japanese, and Korean characters in SQL Server SSIS


Our SSIS source, a SQL Server 2005 table, had customer names written in Chinese, Korean, and Japanese characters. The package performed a sort (with duplicate removal set to true) and a conversion to upper case.

The package worked well in development, but in QA the destination table, which was supposed to hold those Chinese, Korean, and Japanese names, contained unreadable characters and question marks.

We checked the database properties and found that the collation was set to SQL_Latin1_General_CP1_CI_AS. The fix, however, turned out to be an operating system setting. In Control Panel, under Regional and Language Options, there is a Languages tab. When we checked "Install files for East Asian languages", some OS files were installed, and after a restart the Chinese and other characters were visible in the table.
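Question marks like the ones we saw typically show up when text is pushed through a code page that has no mapping for the characters. The short Python sketch below only illustrates that general effect with code page 1252 (the one behind the Latin1 collation above); it is not what SSIS does internally, and the function name and sample names are made up for the example.

```python
def to_code_page(text: str, code_page: str = "cp1252") -> str:
    """Round-trip text through a single-byte code page.

    Characters the code page cannot represent are replaced with '?',
    which is the same symptom we saw in the QA table.
    """
    return text.encode(code_page, errors="replace").decode(code_page)


# Hypothetical customer names: one Western European, three East Asian.
names = ["Müller", "李小龍", "김철수", "たなか"]

# Map each original name to what survives the round trip.
# "Müller" survives unchanged; each East Asian character becomes "?".
round_tripped = {name: to_code_page(name) for name in names}
```

If the data in the table itself looks like the right-hand side of this mapping, the characters were lost on the way in. If the data is intact and only the display is wrong, missing fonts or language files on the machine are a more likely cause, which would be consistent with the fix that worked for us.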

Note: Converting to upper case had no effect on the Chinese, Korean, and Japanese characters, but we have still not been able to figure out the sort order.
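The upper case behaviour is expected, since these scripts have no upper or lower case forms. A small Python sketch shows the same thing, along with one possible ordering: plain Unicode code point order. This is only an assumption for illustration; the SSIS Sort transformation applies its own comparison rules, so its output may or may not match. The sample names are made up.

```python
# Hypothetical customer names in Latin, Chinese, Korean, and Japanese scripts.
names = ["smith", "李小龍", "김철수", "たなか"]

# Upper-casing only changes characters that have a case distinction.
upper_names = [name.upper() for name in names]
# upper_names == ["SMITH", "李小龍", "김철수", "たなか"]

# Python's default string sort compares Unicode code points:
# Latin letters first, then hiragana, then CJK ideographs, then Hangul.
by_code_point = sorted(names)
# by_code_point == ["smith", "たなか", "李小龍", "김철수"]
```

Comparing the order in the destination table with a code point order like this one can at least tell you whether the sort is binary or driven by linguistic rules.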