Why would you ever want to access a string by a code point index and not a byte offset is absolutely beyond me. Let alone the fact that this article ignores the presence of grapheme clusters (aka user-perceived characters).
If memory is no object, and you expect to use emojis, and you want fast random access in long strings, I think that UTF-32 is superior.
No. – “fast random access” to what?
UTF-32 also isn’t a silver bullet here as they suppose because of combining characters and zwj’s.
Exactly.
Why would you ever want to access a string by a code point index and not a byte offset is absolutely beyond me. Let alone the fact that this article ignores the presence of grapheme clusters (aka user-perceived characters).
No. – “fast random access” to what?
UTF-32 also isn’t a silver bullet here as they suppose because of combining characters and zwj’s.
Exactly.