The string "𠀎a" is wrongly shown as a difference when comparing a UTF-8 version of the string to a UTF-16 version:
"�a" <-> "𠀎a" (UTF-8 on left and UTF-16 on right)
On my screen it shows one ? for the UTF-8 and two ?s for the UTF-16 [in beyond compare] (You might be able to see the character on one side in this post).
This code point (U+2000E) takes 4 bytes in UTF-16, i.e. a surrogate pair. Note that other code points are correctly matched between UTF-8 and UTF-16 when the UTF-16 representation is 2 bytes.
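A quick check of the encoded lengths (a Python sketch just to illustrate the encodings; Beyond Compare itself is not involved) shows why this code point is special: it sits outside the Basic Multilingual Plane, so both encodings need 4 bytes for it.

```python
# U+2000E (𠀎) is outside the BMP, so it takes a 4-byte sequence in
# UTF-8 and a surrogate pair (also 4 bytes) in UTF-16.
s = "\U0002000Ea"

utf8 = s.encode("utf-8")
utf16 = s.encode("utf-16-le")  # little-endian, no BOM

print(len(utf8))   # 5 bytes: 4 for U+2000E + 1 for 'a'
print(len(utf16))  # 6 bytes: surrogate pair (4) + 2 for 'a'
```

If the comparison tool decodes the UTF-16 surrogate pair as two separate 16-bit units instead of one code point, the two sides will not match, which would explain the one-?-versus-two-?s rendering.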
Jon