[Lvlug] UTF Embedding of Eight-Bit Chars
The Artist Formerly Known as Fingolfin
fingolfin at thelinuxlink.net
Wed Jun 23 20:52:28 EDT 2004
On Wed, 23 Jun 2004, Ricardo SIGNES wrote:
> * Chris Hever <fingolfin at thelinuxlink.net> [2004-06-23T20:08:59]
> > > They can't... but seven-bit characters can. Seven-bit characters are
> > > seven-bit characters. Higher level characters are multibyte.
> >
> > Eight-bit is multibyte??
>
> UTF-8 says: characters from 0 to 127 are encoded as is. Those are, of
> course, the 7-bit characters.
>
> Characters above that are multibyte. So, for example, U+00FF is
> \307\102 (those numbers are made up, but you get the point).
So what you are saying is that characters which would normally be a
single eight-bit byte in other encodings (iso-8859-1, for example) are
multi-byte in UTF-8, such as the ú in Númenor?
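One way to check this is a minimal Python 3 sketch like the one below. The
helper name encode_both is made up for illustration; it only wraps the
standard str.encode call. It encodes one character in both iso-8859-1 and
UTF-8 so the byte counts can be compared.

```python
def encode_both(ch):
    """Return (iso-8859-1 bytes, UTF-8 bytes) for a single character.

    Hypothetical helper, for illustration only.
    """
    return ch.encode("iso-8859-1"), ch.encode("utf-8")


# "A" is seven-bit, "\u00fa" is the u-acute in Numenor, "\u00ff" is U+00FF.
for ch in ("A", "\u00fa", "\u00ff"):
    latin1, utf8 = encode_both(ch)
    print("U+%04X  iso-8859-1: %s (%d byte)  utf-8: %s (%d byte(s))"
          % (ord(ch), latin1.hex(), len(latin1), utf8.hex(), len(utf8)))
```

Under that sketch, the seven-bit "A" is the same single byte in both
encodings, while U+00FA (one byte, FA, in iso-8859-1) becomes the two
bytes C3 BA in UTF-8, and U+00FF becomes C3 BF.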
--
Mardil the Hunter bought the Horn of Gondor for sixty-five cents