UTF-8

UTF-8 is the name of the charset used by Gambas for coding non-ASCII characters.

It is a gift to humanity made by Ken Thompson.

In this charset, each ASCII character keeps its code, and each non-ASCII character is coded with two or more bytes. Consequently, you cannot use standard string functions to deal with UTF-8 strings. You must use the static methods of the String.

For more information about UTF-8, look at this F.A.Q.

See also