<?xml version="1.0" encoding="UTF-8"?><!-- generator="wordpress/2.0.3" -->
<rss version="2.0" 
	xmlns:content="http://purl.org/rss/1.0/modules/content/">
<channel>
	<title>Comments on: Unicode, wxWidgets, and I</title>
	<link>http://www.wilcoxd.com/blog/unicode-wxwidgets-and-i.html</link>
	<description>Blog for project status and lab notes</description>
	<pubDate>Thu, 20 Nov 2008 23:29:20 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.0.3</generator>

	<item>
		<title>by: Ryan Wilcox</title>
		<link>http://www.wilcoxd.com/blog/unicode-wxwidgets-and-i.html#comment-21</link>
		<pubDate>Sun, 05 Sep 2004 14:23:20 +0000</pubDate>
		<guid>http://www.wilcoxd.com/blog/unicode-wxwidgets-and-i.html#comment-21</guid>
					<description>Yves,

Thanks for the information. As I said in the prefix to this article, I'm not a Uicode expert, so I know I missed things. Thanks for the little bit of extra information.

Like many other North American, English speaking programmers, I often 'forget' about encodings like Chinese... but I did say '8 bit encoded characters typically...' (and left the non-8-bit encoded characters unspecified).

A blog is (often times) about learning. Sometimes it's my learning :).

Thanks,
_Ryan Wilcox
</description>
		<content:encoded><![CDATA[<p>Yves,</p>
<p>Thanks for the information. As I said in the prefix to this article, I&#8217;m not a Uicode expert, so I know I missed things. Thanks for the little bit of extra information.</p>
<p>Like many other North American, English speaking programmers, I often &#8216;forget&#8217; about encodings like Chinese&#8230; but I did say &#8216;8 bit encoded characters typically&#8230;&#8217; (and left the non-8-bit encoded characters unspecified).</p>
<p>A blog is (often times) about learning. Sometimes it&#8217;s my learning :).</p>
<p>Thanks,<br />
_Ryan Wilcox
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Yves M</title>
		<link>http://www.wilcoxd.com/blog/unicode-wxwidgets-and-i.html#comment-20</link>
		<pubDate>Sat, 04 Sep 2004 20:46:44 +0000</pubDate>
		<guid>http://www.wilcoxd.com/blog/unicode-wxwidgets-and-i.html#comment-20</guid>
					<description>Several things:

Multibyte codepages (MBCS) don't have to be 8 bits per character at all.  Check for example the Japanese Shift JS and the various encodings for Chinese.  They also have several bytes for a character.

You seem to think that Unicode is equal to UTF-16.  This is a common misconception unfortunately. Unicode in itself is only a big table that relates characters to numbers.  UTF stands for Unicode Transformation Format and it means that it's a means of encoding Unicode.  There are several UTF encodings, like UTF-16 and UTF-8.  Not all characters in UTF-16 are two bytes, some are four bytes (check up on surrogates at Unicode.org)

You're on the right track, but please read more before you post slightly confused articles.</description>
		<content:encoded><![CDATA[<p>Several things:</p>
<p>Multibyte codepages (MBCS) don&#8217;t have to be 8 bits per character at all.  Check for example the Japanese Shift JS and the various encodings for Chinese.  They also have several bytes for a character.</p>
<p>You seem to think that Unicode is equal to UTF-16.  This is a common misconception unfortunately. Unicode in itself is only a big table that relates characters to numbers.  UTF stands for Unicode Transformation Format and it means that it&#8217;s a means of encoding Unicode.  There are several UTF encodings, like UTF-16 and UTF-8.  Not all characters in UTF-16 are two bytes, some are four bytes (check up on surrogates at Unicode.org)</p>
<p>You&#8217;re on the right track, but please read more before you post slightly confused articles.
</p>
]]></content:encoded>
				</item>
</channel>
</rss>
