Lines with CJK letters are deemed equal by length only, since the characters seem to be ignored. I understand this is due to locale. But, it would be nice if a simple flag would do a locale-free comparison (i.e. equal = all bytes are equal).