Lines with CJK letters are deemed equal by length only, since the characters seem to be ignored.
I understand this is due to locale.
But, it would be nice if a simple flag would do a locale-free comparison (i.e. equal = all bytes are equal).