Hi all I was wondering what's the name of the Unicode Encoding Scheme whereby:
Code Points U+1 to U+7F is represented using one byte:
Byte 1: 0xxxxxxx
Code Points U+80 to U+3FFF is represented using two bytes:
Byte 1: 0xxxxxxx
Byte 2: 1xxxxxxx
Code Points U+4000 to U+10FFFF is represented using three bytes:
Byte 1: 0xxxxxxx
Byte 2: 1xxxxxxx
Byte 3: 1xxxxxxx
From what I can see, this scheme beats UTF-8 because only one to three bytes is required to represent each CodePoint (UTF-8 requires 1 to 4 bytes to represent a single CodePoint), and I was wondering if there's an official name for this encoding.
Your actual question appears to be, "what's the name" of this encoding. While it's hard tp prove a negative, I'll go out on a limb and say, no-one else has published and named this encoding. You just invented it. Thus, your answer is that this encoding has no name. Invent a name for it, and that will be its name.
You also add, "From what I can see, this scheme beats UTF-8...". There's been a discussion about that claim in the comments. If you want to post this as a separate question, e.g. "Is this encoding scheme better than UTF-8", that would be another matter. Hint: if you do so, consider which criteria you use to measure "better". Do you think your new encoding scheme is better than UTF-8 according to UTF-8's own design criteria?