Turlkic Cyrillic KOI8 extension proposal

The following is my modest proposal for on extension to KOI-8, which I call KOI8-T, that would allow one to write in all major Turkic languages using the Cyrillic script: Azeri, Turkmen, Uzbek, Kirghiz, Kazakh, Uighur, and others. It does not support the Iranian language Tadzhik. Unfortunately, it is also not compatible with my earlier Kazakh Internet Font Kit, which I designed with Kazakh only in mind, and without being familiar with ISO-IR-111. To make a long story short, my first attempt was a bit inelegant and nonconformant.

My Kazakh kit will most likely very soon be updated... watch this space!!

KOI8-T is based on ISO-IR-111, which is a KOI8-based character set for use with all modern Slavic languages that use the Cyrillic script, registered by ECMA.

There are two minor "problems" with the present KOI8-T.

  1. There is no capital n-hook (eng). This letter never appears at the beginning of a word, but someone might want to use "all caps" in which case an upper case version would be needed I sacrificed the capital for the non-breaking space.
  2. Location $AD (173) contains a letter other than "soft hyphen," I have read however, that few applications actually interpret this character as anything other than a regular hyphen anyway.

Please tell me what you think at aeulenbe@indiana.edu.

(Last modified Aug. 13, 1996) Back Home