Subject: Re: Wrong sorting order in croat.conf (spouse all version)
From: Alexander Barkov (ba@mysql.com)
Date: 03/31/2004 02:37:30 AM
List: com.mysql.lists.bugs
Attachments:

Hello!

Dubravko Penezic wrote:

Hi!

What you said is completely wrong, because Ss in win-1250 is at positions 9A and 8A, and in ISO-8859-2 it is at positions B9 and A9.

Yes, I agree. The above pages state this too. And this is what I wrote in my previous letter to Sinisa:

latin2:

0xA9 0x0160 #LATIN CAPITAL LETTER S WITH CARON
0xB9 0x0161 #LATIN SMALL LETTER S WITH CARON

cp1250:

0x8A 0x0160 #LATIN CAPITAL LETTER S WITH CARON
0x9A 0x0161 #LATIN SMALL LETTER S WITH CARON
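Editorial sketch: the two mappings above can be checked independently with Python's built-in codecs. This is only an illustration, not part of the original message; the names `CHARS`, `CODECS`, and `byte_positions` are hypothetical, and it assumes Python's `iso8859_2` and `cp1250` codecs match the character sets MySQL calls latin2 and cp1250.

```python
# Characters under discussion: S with caron, capital and small.
CHARS = ["\u0160", "\u0161"]  # U+0160 and U+0161

# Python codec names assumed to correspond to MySQL's latin2 and cp1250.
CODECS = {"latin2": "iso8859_2", "cp1250": "cp1250"}


def byte_positions(chars, codecs):
    """Map each charset name to the hex byte that encodes each character."""
    return {
        name: [f"0x{ch.encode(codec)[0]:02X}" for ch in chars]
        for name, codec in codecs.items()
    }


positions = byte_positions(CHARS, CODECS)
for name, hexes in positions.items():
    print(name, hexes)
```

The printed byte values should agree with the two listings quoted above.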

I'm 100% sure about these three facts:

1. "croat" was created for the cp1250 character set, and it provides the correct sort order for the Croatian language for cp1250.

Wrong ... check the repository of code pages you have in the source dir ./sql/share/charsets/

Did you try croat.conf on a Windows machine? Did it produce wrong Croatian sort order?

croat.conf win1250.conf

What you are talking about is win1250, in some cases called cp1250.

Yes, it is fine for Croatian too (and for some other languages). That means we had two cp1250+Croatian compatible configurations, and didn't have a single one for latin2+Croatian.

I'm attaching a new sort order array for the latin2+Croatian configuration. Can you please replace the old one with this new array and test whether the sort order is fine?

There is also an HTML file attached; it demonstrates the sort order in a clear manner.

Thank you!

The Croatian language has only one code page standard that is recommended by law, and that is ISO Latin 2 under ISO code 8859-2.

CP1250 or win1250, or whatever Micro$oft would like to call their "standard", is only a forced standard, but the situation has changed radically in the last 5 years.

Also, feel free to check the linux/unix implementations of Croatian characters; you will find only latin2 under the ISO-8859-2 code scheme.

Also, a very simple ordering test will show you that croat.conf sorts according to the ISO-8859-2 code page, except for the wrong Ss position; inserting characters with win1250/cp1250 codes will destroy that order.
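Editorial sketch: one way to see why mixing the two encodings breaks the ordering is to decode the four disputed byte values under each character set. This is an illustration only, assuming Python's `iso8859_2` and `cp1250` codecs; the names `DISPUTED_BYTES` and `interpret` are hypothetical and not from the thread.

```python
import unicodedata

# The four byte values mentioned in the thread for S with caron.
DISPUTED_BYTES = [0x8A, 0x9A, 0xA9, 0xB9]


def interpret(byte, codec):
    """Return the Unicode name of a byte decoded with the given codec."""
    ch = bytes([byte]).decode(codec)
    # C1 control characters have no Unicode name, so fall back to a label.
    return unicodedata.name(ch, f"CONTROL U+{ord(ch):04X}")


for b in DISPUTED_BYTES:
    print(f"0x{b:02X}",
          "| latin2:", interpret(b, "iso8859_2"),
          "| cp1250:", interpret(b, "cp1250"))
```

A byte that is a letter in one character set is a control code or an unrelated character in the other, so a sort-order array written for one set cannot place S with caron correctly for data stored in the other.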

2. "croat" was renamed into "latin2_croatian_ci" by mistake in 4.1; it should have been "cp1250_croatian_ci" instead.

Once again wrong, see above.

3. We didn't have Croatian sort order for latin2 character set in MySQL so far, and it should be just added now.

Partially true: you do have one, but with the wrong Ss position :)

Your version could be used as a template for latin2 Croatian, but in my opinion the patch fixes only half of the problem, and an additional fix is required: putting CAPITAL LETTER Z WITH CARON at the same position as SMALL LETTER Z WITH CARON. Now they are sorted differently.

I will check tomorrow morning what I can do about the other part; maybe I will need some help with the meaning of the other parts of the table.

Please confirm this. Or am I mistaken? Please send the complete table for testing in this case.

I will send a table for testing tomorrow.
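Editorial sketch: the point about giving capital and small Z WITH CARON the same position can be illustrated with a toy 256-entry weight table. This is not the array attached to the message and not MySQL's actual format; `ALPHABET`, `build_weights`, and `sort_key` are hypothetical names, the weights are arbitrary integers, and the digraphs dž, lj, nj are deliberately ignored.

```python
# Assumed single-letter Croatian order; digraphs (dž, lj, nj) are left out.
ALPHABET = "abcčćdđefghijklmnoprsštuvzž"


def build_weights(encoding="iso8859_2"):
    """Build a 256-entry table where upper and lower case share one weight."""
    weights = list(range(256))  # non-letters keep their byte value as weight
    for rank, lower in enumerate(ALPHABET):
        weight = 256 + rank  # arbitrary: letters sort after everything else
        for ch in (lower, lower.upper()):
            weights[ch.encode(encoding)[0]] = weight
    return weights


def sort_key(raw, weights):
    """Translate a byte string into its list of weights for comparison."""
    return [weights[b] for b in raw]


WEIGHTS = build_weights()
words = ["žaba", "Zagreb", "Žabac", "zub", "šuma", "Sava"]
encoded = [w.encode("iso8859_2") for w in words]
ordered = [b.decode("iso8859_2")
           for b in sorted(encoded, key=lambda b: sort_key(b, WEIGHTS))]
print(ordered)
```

Because both cases of each letter receive one weight, words starting with the capital and the small form of Z WITH CARON end up next to each other instead of in separate places.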

Thank you too. I understand that people outside Croatia, and many inside, don't understand what is going on, and the government does nothing to bring order to the code page standards.

P.S.: Also, I will try to make a test table for checking order and lower/upper case.

P.P.S.: We use a table that may help in understanding which code pages are in use in Croatia: http://www.open.hr/hiz/kodsus/primjena.html