8 messages in com.mysql.lists.dotnetRe: Problem when inserting unicode re...
FromSent OnAttachments
Duncan Paul Attard11 Nov 2004 06:05 
Reggie Burnett11 Nov 2004 07:39 
Jordan Sparks11 Nov 2004 08:39 
Reggie Burnett11 Nov 2004 11:24 
Reggie Burnett11 Nov 2004 11:44 
Duncan Paul Attard12 Nov 2004 01:21 
Duncan Paul Attard13 Nov 2004 02:07 
Duncan Paul Attard14 Nov 2004 00:35 
Subject:Re: Problem when inserting unicode records
From:Duncan Paul Attard (dunc@euroweb.net.mt)
Date:11/12/2004 01:21:26 AM
List:com.mysql.lists.dotnet

So...

This is the show create output:

# Host: localhost # Database: corpus # Table: 'kbase' # CREATE TABLE `kbase` ( `word` varchar(100) NOT NULL default '', `changes` varchar(100) NOT NULL default '', `status` enum('unknown','accept','reject','change') NOT NULL default 'unknown', PRIMARY KEY (`word`) ) ENGINE=InnoDB DEFAULT CHARSET=utf8;

and these are the character codes in C# format (to get an idea of how the characters look like, take a look at the character map of windowsXP).

Small a with a an accent on top of its head: \u00e0 (U+00E0) Small c with a dot on top of its head: \u010b (U+010B) Small h with a small line through its head: \u0127 (U+0127) Small z with a small dot on top of its head: \u017c (U+017C)

-----------------------

Pls note the following:

If I insert 'accessibli' with the small dots on the c's and afterwards I insert 'accessibli' without the dots on the c's (i.e. normal english 'c'), the second insertion is not successful (duplicate key).

If I insert 'adezjoni' with the small dot on the z, and afterwards I insert 'adezjoni' without the dot on the z, the latter insertion is unsuccessful (duplicate key).

If I insert 'behsiebna' with the h having a small line through its head, and, afterwards I insert 'behsiebna' with a normal 'h' (i.e. as in the quotes), the insertion IS successful.

If I insert 'appogg' with the dots on the g's and afterwards I insert 'appogg' with a normal english 'g', the latter insertion is unsuccessful.

If I insert 'realta' with the accent on the 'a', and then insert 'realta' without an accent on the 'a', the second insertion is unsuccessful.

Duncan