Obscure unicode characters not quite right

Collapse
X
 
  • Time
  • Show
Clear All
new posts
  • SuperflyJon
    Visitor
    • Nov 2008
    • 3

    Obscure unicode characters not quite right

    This string "𠀎a" is wrongly shown as a difference when comparing a UTF-8 to UTF-16 version of the string:

    "�a" <-> "𠀎a" (UTF-8 on left and UTF-16 on right)

    On my screen it shows one ? for the UTF-8 and two ?s for the UTF-16 [in beyond compare] (You might be able to see the character on one side in this post).

    This code point uses 4 bytes in UTF-16. Note that other code points are correctly matched UTF-8 <-> UTF-16 (when it's a 2 byte UTF-16 representation).

    Jon
  • Aaron
    Team Scooter
    • Oct 2007
    • 16002

    #2
    Hello Jon,

    Would it be possible for you to send in a pair of sample files to [email protected] and your BCSupport.zip from the Help menu -> Support; Export? Please also include a link to this forum thread. Thanks.
    Aaron P Scooter Software

    Comment

    Working...