Wednesday, June 1, 2016

Unicode Tutorial - Part 7: Visualization of the UTF-16 Decoding Algorithm


UTF-16 Decoding Algorithm


Below is an interactive visualization of the decoding algorithm that takes as input a byte sequence representing a single Unicode character in the UTF-16, UTF-16BE, or UTF-16LE encoding scheme and converts it to the corresponding Unicode code point in hex notation.

Step 1: Enter a byte sequence for ONE Unicode character in
hex notation (for example, D8 12 DC 34 or FFFE3412):
0x
Pick one of the following encoding schemes:
UTF-16BE
UTF-16LE
UTF-16


The algorithm visualized in this post is described in plain text in this RFC.

No comments:

Post a Comment