UTF-8 is the most popular unicode encoding format that can represent text in any language. In UTF-8, ASCII characters are encoded using their raw byte equivalents. Each ASCII character results in a single byte in the output. Unicode characters such as © get encoded to two bytes.
Comments 0