[Lvlug] UTF Embedding of Eight-Bit Chars

The Artist Formerly Known as Fingolfin fingolfin at thelinuxlink.net
Wed Jun 23 20:52:28 EDT 2004


On Wed, 23 Jun 2004, Ricardo SIGNES wrote:

> * Chris Hever <fingolfin at thelinuxlink.net> [2004-06-23T20:08:59]
> > > They can't... but seven-bit characters can.  Seven bit characters are
> > > seven-bit characters.  Higher level characters are multibyte.
> >
> > Eight-bit is multibyte??
>
> UTF-8 says: characters from 0 to 127 are encoded as is.  Those are, of
> course, the 7-bit characters.
>
> Characters above that are multibyte.  So, for example, U+00FF is
> \307\102 (those numbers are made up, but you get the point).

So what you are saying is that characters which would normally be
eight-bit in other encodings are multi-byte in UTF-8, such as the ú
in Númenor (0xFA in iso-8859-1)?
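
A quick way to check this is to encode the character both ways and
compare the bytes. The following is only a minimal sketch, assuming
Python 3; the variable names are made up for illustration. It also
encodes U+00FF, the character from the quoted example, to show its
actual UTF-8 bytes.

```python
# Compare how the same characters are stored in ISO-8859-1 and UTF-8.
u_acute = "\u00fa"  # the accented u in Numenor (U+00FA)

latin1_bytes = u_acute.encode("iso-8859-1")  # single byte 0xFA
utf8_bytes = u_acute.encode("utf-8")         # two bytes 0xC3 0xBA

print(len(latin1_bytes), latin1_bytes)  # 1 b'\xfa'
print(len(utf8_bytes), utf8_bytes)      # 2 b'\xc3\xba'

# U+00FF, the character from the quoted example, is also two bytes.
y_diaeresis_bytes = "\u00ff".encode("utf-8")
print(y_diaeresis_bytes)  # b'\xc3\xbf', i.e. \303\277 in octal

# A seven-bit character is encoded as is: one byte, same value.
ascii_bytes = "N".encode("utf-8")
print(ascii_bytes)  # b'N'
```

So an eight-bit iso-8859-1 character such as 0xFA does become two
bytes in UTF-8, while anything from 0 to 127 stays a single byte.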

--
Mardil the Hunter bought the Horn of Gondor for sixty-five cents

