1. 4

  2. 4

    Why you would ever want to access a string by code point index rather than by byte offset is absolutely beyond me. Not to mention that this article ignores grapheme clusters (aka user-perceived characters) entirely.

    1. 2

      “If memory is no object, and you expect to use emojis, and you want fast random access in long strings, I think that UTF-32 is superior.”

      No. “Fast random access” to what?

      1. 1

        UTF-32 also isn’t the silver bullet they suppose, because of combining characters and ZWJs (zero-width joiners): a single user-perceived character can still span several code points.
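
To make that concrete, here is a rough Python sketch (Python's `str` is indexed by code point, so it behaves like UTF-32 for this purpose). The sample strings and the `describe` helper are made up for illustration; they are not from the article.

```python
import unicodedata

# Hypothetical sample strings, chosen only to illustrate the point.
family = "\U0001F468\u200D\U0001F469\u200D\U0001F467"  # man, ZWJ, woman, ZWJ, girl
accented = "e\u0301"                                    # "e" + combining acute accent


def describe(s: str) -> dict:
    """Report how many code points and bytes a string occupies."""
    return {
        "code_points": len(s),                      # what a UTF-32 index counts
        "utf8_bytes": len(s.encode("utf-8")),
        "utf32_bytes": len(s.encode("utf-32-le")),  # "-le" variant adds no BOM
    }


# Each string usually renders as ONE user-perceived character,
# yet neither is a single code point, even in UTF-32.
print(describe(family))    # 5 code points
print(describe(accented))  # 2 code points

# Code point indexing lands inside the cluster: this is only the "man" emoji.
first = family[0]
print(first == "\U0001F468")

# The accent is a separate combining code point (nonzero combining class).
print(unicodedata.combining(accented[1]) != 0)

# Normalization can merge some pairs (e + accent), but there is no
# precomposed form for a ZWJ sequence like the family emoji.
print(len(unicodedata.normalize("NFC", accented)))  # 1
print(len(unicodedata.normalize("NFC", family)))    # still 5
```

So O(1) access to the n-th code point does not give you O(1) access to the n-th thing a user would call a character; for that you need grapheme cluster segmentation, whatever the encoding.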

        1. 0