Character Encodings

scribbu uses iconv for character encoding. For convenience, here is the list of identifiers used to name them:

  1. European & Russian languages

    ASCII, ISO_8859_1, ISO_8859_2, ISO_8859_3, ISO_8859_4, ISO_8859_5, ISO_8859_7, ISO_8859_9, ISO_8859_10, ISO_8859_13, ISO_8859_14, ISO_8859_15, ISO_8859_16, KOI8_R, KOI8_U, KOI8_RU, CP1250, CP1251, CP1252, CP1253, CP1254, CP1257, CP850, CP866, CP1131, MacRoman, MacCentralEurope, MacIceland, MacCroatian, MacRomania, MacCyrillic, MacUkraine, MacGreek, MacTurkish, Macintosh

  2. Semitic languages ISO_8859_6, ISO_8859_8, CP1255, CP1256, CP862, MacHebrew, MacArabic
  3. Japanese EUC_JP, SHIFT_JIS, CP932, ISO_2022_JP, ISO_2022_JP_2, ISO_2022_JP_1, ISO_2022_JP_MS
  4. Chinese EUC_CN, HZ, GBK, CP936, GB18030, EUC_TW, BIG5, CP950, BIG5_HKSCS, BIG5_HKSCS_2004, BIG5_HKSCS_2001, BIG5_HKSCS_1999, ISO_2022_CN, ISO_2022_CN_EXT
  5. Korean EUC_KR, CP949, ISO_2022_KR, JOHAB
  6. Armenian ARMSCII_8
  7. Georgian Georgian_Academy, Georgian_PS
  8. Tajik KOI8_T
  9. Kazakh PT154, RK1048
  10. Thai TIS_620, CP874, MacThai
  11. Laotian MuleLao_1, CP1133
  12. Vietnamese VISCII, TCVN, CP1258
  13. Platform specifics HP_ROMAN8, NEXTSTEP
  14. Full Unicode UTF_8, UCS_2, UCS_2BE, UCS_2LE, UCS_4, UCS_4BE, UCS_4LE, UTF_16, UTF_16BE, UTF_16LE, UTF_32, UTF_32BE, UTF_32LE, UTF_7, C99, JAVA