<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xml:base="http://typophile.com" xmlns:dc="http://purl.org/dc/elements/1.1/">
<channel>
 <title>Typophile - Difference between character set and codepage? - Comments</title>
 <link>http://typophile.com/node/39726</link>
 <description>Comments for &quot;Difference between character set and codepage?&quot;</description>
 <language>en</language>
<item>
 <title>Unicode is a single very</title>
 <link>http://typophile.com/node/39726#comment-247685</link>
 <description>&lt;p&gt;Unicode is a single very large (and still growing) character set and encoding, which encompasses essentially all the standard computer character sets that predated it.&lt;/p&gt;
&lt;p&gt;Most any computer codepage can be mapped to Unicode and back. However, in computer systems Unicode is largely replacing codepage based approaches, and for good reasons. Instead of having dozens of codepages each using (and re-using) the same numbered slots for different characters, each character gets its own unique numbered slot in Unicode. &lt;/p&gt;
&lt;p&gt;T&lt;/p&gt;
</description>
 <pubDate>Wed, 19 Dec 2007 23:50:26 -0800</pubDate>
 <dc:creator>Thomas Phinney</dc:creator>
 <guid isPermaLink="false">comment 247685 at http://typophile.com</guid>
</item>
<item>
 <title>Maybe it helps understanding</title>
 <link>http://typophile.com/node/39726#comment-246147</link>
 <description>&lt;p&gt;Maybe it helps understanding how codepages  work if you are aware that a single character can be represented in various codepages. So for example, the lowercase &amp;#8217;a&amp;#8217; is represented the MacRoman codepage as well as in the Windows 1252 codepage (and in many more).&lt;/p&gt;
&lt;p&gt;Think of Unicode as a label attached to the character via which the character can be accessed by app&amp;#8217;s and OS&amp;#8217;s. Some apps and OS&amp;#8217;s address characters through their name, some through their unicode.&lt;/p&gt;
&lt;p&gt;Hope this helps,&lt;/p&gt;
&lt;p&gt;Artur&lt;/p&gt;
</description>
 <pubDate>Wed, 12 Dec 2007 09:42:43 -0800</pubDate>
 <dc:creator>Artur Schmal</dc:creator>
 <guid isPermaLink="false">comment 246147 at http://typophile.com</guid>
</item>
<item>
 <title>On second thought, maybe</title>
 <link>http://typophile.com/node/39726#comment-246138</link>
 <description>&lt;p&gt;On second thought, maybe &amp;#8220;meta-codepage&amp;#8221; works, if you take it to mean &amp;#8220;beyond codepage&amp;#8221;.&lt;/p&gt;
</description>
 <pubDate>Wed, 12 Dec 2007 09:01:16 -0800</pubDate>
 <dc:creator>Mark Simonson</dc:creator>
 <guid isPermaLink="false">comment 246138 at http://typophile.com</guid>
</item>
<item>
 <title>Unicode is not a codepage,</title>
 <link>http://typophile.com/node/39726#comment-246135</link>
 <description>&lt;p&gt;Unicode is not a codepage, and I&amp;#8217;m not sure if &amp;#8220;meta-codepage&amp;#8221; would be a useful description, either. &lt;/p&gt;
&lt;p&gt;Think of Unicode as a set of all possible character codes. A codepage is a subset of character codes in a particular order, usually limited to 256 characters.&lt;/p&gt;
&lt;p&gt;A little background: A &amp;#8220;page&amp;#8221; is a block or section of computer memory. In early personal computer systems, a page of memory was 256 bytes, the largest number that can be represented with 8 bits. Before Unicode, most standard character code systems used 8-bit encoding, so it was not possible to have more than 256 characters. So, an 8-bit character set could be thought of as a &amp;#8220;page&amp;#8221;, and a codepage usually refers to any standard pre-Unicode 8-bit character set.&lt;/p&gt;
</description>
 <pubDate>Wed, 12 Dec 2007 08:57:42 -0800</pubDate>
 <dc:creator>Mark Simonson</dc:creator>
 <guid isPermaLink="false">comment 246135 at http://typophile.com</guid>
</item>
<item>
 <title>Thanks, Thomas, so UNICODE</title>
 <link>http://typophile.com/node/39726#comment-246059</link>
 <description>&lt;p&gt;Thanks, Thomas, so UNICODE is a codepage? or a meta-codepage?&lt;/p&gt;
</description>
 <pubDate>Wed, 12 Dec 2007 01:33:20 -0800</pubDate>
 <dc:creator>cursusductus</dc:creator>
 <guid isPermaLink="false">comment 246059 at http://typophile.com</guid>
</item>
<item>
 <title>All codepages are character</title>
 <link>http://typophile.com/node/39726#comment-245026</link>
 <description>&lt;p&gt;All codepages are character sets, but not all character sets are codepages.&lt;/p&gt;
&lt;p&gt;A character set is any specific collection of characters. You could consider any given font to have its own character set, which may or may not be the same as some externally-defined one.&lt;/p&gt;
&lt;p&gt;A codepage is a character set used by a computer, usually OS specific, usually to support a specific language or set of languages. For example, MacRoman is a codepage. Windows codepage 1250 (Eastern European) is a codepage. Many codepages are single-byte character sets - that is, they contain no more than 256 characters.&lt;/p&gt;
&lt;p&gt;Regards,&lt;/p&gt;
&lt;p&gt;T&lt;/p&gt;
</description>
 <pubDate>Thu,  6 Dec 2007 20:36:08 -0800</pubDate>
 <dc:creator>Thomas Phinney</dc:creator>
 <guid isPermaLink="false">comment 245026 at http://typophile.com</guid>
</item>
<item>
 <title>Thank you, I’m reading all</title>
 <link>http://typophile.com/node/39726#comment-244543</link>
 <description>&lt;p&gt;Thank you, I&amp;#8217;m reading all this material (I knew Unicode, but somethings are not very well explained). I apreciate specially the simplification of Mark&amp;#8217;s answer, it helps to clarify. I&amp;#8217;ll go on studing.&lt;/p&gt;
</description>
 <pubDate>Wed,  5 Dec 2007 09:28:16 -0800</pubDate>
 <dc:creator>cursusductus</dc:creator>
 <guid isPermaLink="false">comment 244543 at http://typophile.com</guid>
</item>
<item>
 <title>As to Unicode in particular</title>
 <link>http://typophile.com/node/39726#comment-244339</link>
 <description>&lt;p&gt;As to Unicode in particular I&amp;#8217;d recommend The Unicode Consortium FAQ at &lt;a href=&quot;http://www.unicode.org/faq/&quot; title=&quot;http://www.unicode.org/faq/&quot;&gt;http://www.unicode.org/faq/&lt;/a&gt; as well as the &amp;#8220;What is Unicode&amp;#8221; discussion at &lt;a href=&quot;http://www.unicode.org/standard/WhatIsUnicode.html&quot; title=&quot;http://www.unicode.org/standard/WhatIsUnicode.html&quot;&gt;http://www.unicode.org/standard/WhatIsUnicode.html&lt;/a&gt;.&lt;/p&gt;
</description>
 <pubDate>Tue,  4 Dec 2007 14:57:39 -0800</pubDate>
 <dc:creator>j.hadley</dc:creator>
 <guid isPermaLink="false">comment 244339 at http://typophile.com</guid>
</item>
<item>
 <title>Jukka Korpela’s Tutorial</title>
 <link>http://typophile.com/node/39726#comment-244167</link>
 <description>&lt;p&gt;Jukka Korpela&amp;#8217;s &lt;cite&gt;Tutorial on character code issues&lt;/cite&gt; is quite an extensive explanation of the subject: &lt;a href=&quot;http://www.cs.tut.fi/~jkorpela/chars.html&quot; title=&quot;http://www.cs.tut.fi/~jkorpela/chars.html&quot;&gt;http://www.cs.tut.fi/~jkorpela/chars.html&lt;/a&gt;&lt;/p&gt;
</description>
 <pubDate>Tue,  4 Dec 2007 06:23:40 -0800</pubDate>
 <dc:creator>Tim Ahrens</dc:creator>
 <guid isPermaLink="false">comment 244167 at http://typophile.com</guid>
</item>
<item>
 <title>A character set is all the</title>
 <link>http://typophile.com/node/39726#comment-244151</link>
 <description>&lt;p&gt;A character set is all the characters in a font. It might be a few hundred or many thousands.&lt;/p&gt;
&lt;p&gt;A codepage is the set of characters (or a subset in a large font) that can be typed directly from the keyboard for a particular keyboard layout.&lt;/p&gt;
&lt;p&gt;A codepage and &amp;#8220;an encoding&amp;#8221; are essentially the same thing. Most codepages correspond to character sets in older 8-bit (256-character) font encoding schemes, such as ASCII.&lt;/p&gt;
&lt;p&gt;Unicode is a standard system for assigning unique codes to semantically distinct characters in most of the world&amp;#8217;s languages.&lt;/p&gt;
&lt;p&gt;In modern OpenType Unicode-based fonts: Unicode &amp;gt; character sets &amp;gt;= codepages&lt;/p&gt;
&lt;p&gt;(This is a bit of a simplification, and I have not even talked about glyphs vs. characters...)&lt;/p&gt;
</description>
 <pubDate>Tue,  4 Dec 2007 05:30:26 -0800</pubDate>
 <dc:creator>Mark Simonson</dc:creator>
 <guid isPermaLink="false">comment 244151 at http://typophile.com</guid>
</item>
<item>
 <title>Difference between character set and codepage?</title>
 <link>http://typophile.com/node/39726</link>
 <description>&lt;p&gt;Hi everybody, I&amp;#8217;m trying to translate these two terms to spanish; They are often used interchangeably. As I can understand, character set is a collection of characters, &amp;#8220;suerte&amp;#8221; in spanish, and a codepage is a coded character set (or multiple sets) used by an operative system.&lt;br /&gt;
So, UNICODE is a codepage or an encoding? I&amp;#8217;m getting crazy!&lt;br /&gt;
thanks&lt;/p&gt;
</description>
 <comments>http://typophile.com/node/39726#comments</comments>
 <category domain="http://typophile.com/taxonomy/term/6">Build</category>
 <pubDate>Tue,  4 Dec 2007 03:34:08 -0800</pubDate>
 <dc:creator>cursusductus</dc:creator>
 <guid isPermaLink="false">39726 at http://typophile.com</guid>
</item>
</channel>
</rss>
