An XML file contains an encoding attribute that indicates the code page in the file. The most common encodings are UTF-8 and UTF-16. UTF-8 represents a character with one to four bytes, depending on the Unicode symbol. UTF-16 represents a character as a 16-bit word.
The following example shows a UTF-8 attribute in an XML file: