From czyborra@dds.nl Thu Apr 9 23:34:07 1998 Newsgroups: comp.os.linux.announce,comp.std.internat Date: Thu, 9 Apr 1998 23:33:54 +0200 From: Roman Czyborra To: Greg Hankins , Unicode Experts bcc: Roman Czyborra Subject: Unicode-HOWTO Message-ID: Link: PGP-Fingerprint: 2708E38751D3FB90456AC169A49BE6E6 (1024/87329995) User-Agent: Pine/3.96 (private offline X notebook; Linux 2.0.32 i586) Organization: =?UTF-8?Q?Technische_Universit=C3=A4t_Berlin?= Followup-To: poster Accept-Language: de,en,nl Accept-Charset: * MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII Content-Length: 2127 Lines: 49 Status: RO X-Status: Greetings, girls :) In the Linux Documentation Project we already have HOWTO documents for Chinese, Cyrillic, Danish, Finnish, French, German, Hebrew, Italian, Slovenian, Spanish, and Thai. Wouldn't it be nice if there was also a HOWTO for Unicode? Yes, I mean that funny Windoze NT code page for all languages of the world :) I would like to do this, compile and maintain a Unicode-HOWTO. After all, I am also writing my master thesis on Unicode use in Unix. I am currently collecting all sorts of information and test-driving some programs and give myself one month time to publish and a first version of the HOWTO and submit it for inclusion. Feel free to mail me any tips and questions you would like to see addressed in this HOWTO. I was thinking about answering the follwing questions: - What is Unicode? - What is it good for? - What problems does it bring? - How far is the Unicode support in Unix? - How can I make use of Unicode in Unix? - How can I input Unicode text? - Which transcoding tools are there? - Which programming languages and libraries support Unicode? - Which fonts can be used to display Unicode text? - Which editors and applications can handle Unicode? - Which terminals can display Unicode? - How can I print Unicode text on paper? - Where can I find out more about all of this? I will be mentioning 2utf, 9term, a2ps, BDF, C, CJK, cxterm, Cyberbit, Emacs, EUC, glibc, groff, HBF, HTML, iconv, ISO 8859, Java, JIS, Langbox, Lucida, lynx, MIME, Mosaic, MSIE, MULE, MUTT, ncurses, Netscape/Mozilla, Omega, PDF, Perl, Pine, Plan 9, PostScript, recode, rxvt, SCSU, SuSE, stty, tcs, Tcl, TeX, troff, TrueType, Type1, UTF-7, UTF-8, UTF-16, uxterm, VT100, wchar.h, X11R6, XLocale, XIM, XOM, XLFD, XML, xhterm, and yudit, just to give you an idea of what I already came across. If you happen to notice any Unicode-relevant Unix subsystems missing from this list or think that you know some interesting details about any of the items, send me your findings so they can be included. Cheerio, Roman http://wwwwbs.cs.tu-berlin.de/~czyborra/charsets/ From czyborra@cs.tu-berlin.de Mon Jun 15 23:57:37 1998 Newsgroups: comp.std.internat,comp.mail.mime,comp.software.international,comp.fonts,de.comp.standards,relcom.postmasters.d,netscape.public.mozilla.i18n Date: Mon, 15 Jun 1998 23:57:31 +0000 From: Roman Czyborra Reply-To: Roman Czyborra To: unicode@unicode.org cc: ietf-charsets@iana.org, i18n@dkuug.dk, mozilla-i18n@mozilla.org, recode-forum@iro.umontreal.ca, emacs-unicode@gnu.org, a2ps@inf.enst.fr, sc2wg3@dkuug.dk bcc: Roman Czyborra Subject: ANNOUNCE: new charset soups (ISO 8859, Cyrillic, Unicode, Euro, ...) Message-ID: Link: PGP-Fingerprint: 2708E38751D3FB90456AC169A49BE6E6 (1024/87329995) User-Agent: Pine/3.96 (private offline X notebook; Linux 2.0.32 i586) Organization: =?koi8-r?Q?Technische_Universit=E4t_Berlin?= Followup-To: poster Accept-Language: de,en,nl,ru,pl Content-Language: en Distribution: world Accept-Charset: * MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII Status: RO X-Status: This is to announce that I have moved and spiced up my webpage on ISO 8859 and now I can proudly offer you: The ISO 8859 Alphabet Soup with all the latest news up until ISO-8859-15 special guest: the Euro sign http://czyborra.com/charsets/iso8859.html The Cyrillic Charset Soup NEW! HOT! UNIQUE! BLINK! the history of KOI-8 (the Russian ASCII) & Co. slightly opinionated towards KOI8 Unified and Unicode http://czyborra.com/charsets/cyrillic.html Good ole' ASCII a few words on ISO-646 (US-ASCII and its national variants) and the development towards ISO-10646 (Unicode) http://czyborra.com/charsets/iso646.html Codepage & Co. a superficial introduction to Bill's code page jungle http://czyborra.com/charsets/codepages.html Encoding Vietnamese a graphical illustration of VISCII (RFC 1456) http://czyborra.com/charsets/vietnamese.html Please update your links to the old locations http://www.cs.tu-berlin.de/~czyborra/charsets/ and http://wwwwbs.cs.tu-berlin.de/~czyborra/charsets/. http://czyborra.com/ is there to stay permanently. That was the good news. Enjoy! The bad news is that my promised Unicode-HOWTO for Linux is horribly delayed. It is still far from having grown to a publishable state. My original time frame turned out to be a bit too optimistic. I haven't even gotten back to all the celebrities who sent suggestions. Good things may come to those who wait, though. Cheers, Roman