UTF16

Summary

Provides UTF-16 encoding and decoding.

Description

This class provides UTF-16 encoding and decoding methods.

UTF-16 is a variable-width character encoding defined by the Unicode Standard. Each character is represented using one to two 16-bit code units.

While this class supports both little endian and big endian decoding of UTF-16 encoded byte sequences, the encoding method will encode using little endian encoding, and the encoded result will always be prefixed with the byte order mark (BOM) 0xFF + 0xFE to indicate little endian encoding.

Examples

Basic Usage
1
2
3
4
5
6
7
8
9
10
11
import System;
import System.Encoding;
  
byte[] encoded = UTF16.encode("�");
  
string toHex = "";
foreach(byte b in encoded) {
    toHex += "\\x" + b.toHex().toUpperCase();
}
  
Console.log(toHex); // "\xFF\xFE\xAC\x20"

Methods

  • decode

    Decodes UTF-16 encoded text.

  • encode

    Encodes a string of text using UTF-16 encoding.

Share

HTML | BBCode | Direct Link