“
byte[] utf8Bytes = System.Text.Encoding.UTF8.GetBytes ("0123456789"); byte[] utf16Bytes = System.Text.Encoding.Unicode.GetBytes ("0123456789"); byte[] utf32Bytes = System.Text.Encoding.UTF32.GetBytes ("0123456789"); Console.WriteLine (utf8Bytes.Length); // 10 Console.WriteLine (utf16Bytes.Length); // 20 Console.WriteLine (utf32Bytes.Length); // 40
”
”
Joseph Albahari (C# 5.0 in a Nutshell: The Definitive Reference)
“
In an ideal world, the only character encoding (or, loosely, “character set”) that you’d ever see would be UTF-8 (utf-8), and Latin-1 (iso-8859-1) for all those legacy documents. However, the encodings mentioned below exist and can be found on the Web.
”
”
Sean M. Burke (Perl & LWP: Fetching Web Pages, Parsing HTML, Writing Spiders & More)