summaryrefslogtreecommitdiff
path: root/charset (follow)
Commit message (Collapse)AuthorAge
* I've apparently had this lying around for months but forgotten toSimon Tatham2006-06-13
| | | | | | | | | commit it. Add `-i' option to cstable, which causes charset names to be output as CS_* constants where meaningful. (Doesn't apply to MBCS base charsets, because CS_* constants identify _encodings_.) [originally from svn r6728] [this svn revision also touched charset,filter,timber]
* Remove an outright lie I've just noticed in the comment at the topSimon Tatham2006-05-18
| | | | | | | of this file! [originally from svn r6705] [this svn revision also touched charset,filter,timber]
* sbcsgen.pl was giving different results on different machines in the caseJacob Nevins2006-04-26
| | | | | | | | | | | where two SBCS code points mapped to a single Unicode point. Changed so that by default it favours the lower SBCS code point. On ixion, this highlighted ambiguities in CS_MAC_THAI, CS_MAC_SYMBOL, and CS_VISCII. Guessed at a preference for the first two and added "sortpriority" directives. (No idea about VISCII.) [originally from svn r6641] [this svn revision also touched charset,filter,putty,timber]
* CP866 is popular and small. Add it to both the general and PuTTYJacob Nevins2005-12-18
| | | | | | | | implementations of libcharset, since we've had at least one request for it in PuTTY. [originally from svn r6499] [this svn revision also touched charset,filter,putty,timber]
* Reinstate the DEPLANARISE macros, this time in what I believe is aSimon Tatham2005-11-15
| | | | | | | | | genuinely portable form. (Thanks to IWJ for ideas.) While I'm here, add a couple of explicit `unsigned' casts and U suffixes to prevent more pedantic compilers from warning. [originally from svn r6463] [this svn revision also touched charset,filter,timber]
* Fix various compiler warnings and errors. In particular, my cunningSimon Tatham2005-11-13
| | | | | | | | auto-type-checking DEPLANARISE and REPLANARISE macros have turned out to only work in gcc, which is a shame. [originally from svn r6455] [this svn revision also touched charset,filter,timber]
* write_utf8() is used in iso2022.c as of r6378; declare it.Jacob Nevins2005-10-23
| | | | | | | | (Fixes a warning in iso2022.c. There are lots more.) [originally from svn r6424] [r6378 == 41e50e9f2e3e67da805c5d9037cc650f363e5279] [this svn revision also touched charset,filter,timber]
* Working ISO 2022 output function. Outputs full ISO 2022 (not sureSimon Tatham2005-10-07
| | | | | | | | | | | | | | | what that's useful for but it seemed a pity not to do it) and compound text. I've completely removed the compound text implementation from iso2022s.c in favour of using the more flexible iso2022.c, meaning we can cope with nastiness such as DOCS. This is largely untested: I've checked it on small examples as I went along, but it lacks anything resembling a proper test suite. [originally from svn r6378] [this svn revision also touched charset,filter,timber]
* PostScript StandardEncoding might occasionally come in handy. WhileSimon Tatham2005-10-06
| | | | | | | I'm here, I've updated the URL to the Adobe Glyph List. [originally from svn r6376] [this svn revision also touched charset,filter,timber]
* Correct the URL of the X Registry to the one given in the Registry, whichBen Harris2005-09-26
| | | | | | | works (unlike our old one). [originally from svn r6358] [this svn revision also touched charset,filter,timber]
* Never loop up to _and including_ lenof(array).Simon Tatham2005-09-26
| | | | | [originally from svn r6357] [this svn revision also touched charset,filter,timber]
* Correct copy and paste error.Simon Tatham2005-09-24
| | | | | [originally from svn r6354] [this svn revision also touched charset,filter,timber]
* EUC-TW implementation, plus an explanation of why ISO-2022-CN is difficult.Ben Harris2005-09-24
| | | | | [originally from svn r6353] [this svn revision also touched charset,filter,timber]
* Space-saving restructure of the CNS 11643 data tables. Reduces theSimon Tatham2005-09-24
| | | | | | | | | | | | | | | RO data size in cns11643.o from 400k to 240k. Relies on there being at most seven planes (7*94*94 <= 64k) and on the character set not encoding any Unicode code point above U+40000; if either of these becomes untrue later on we can always fall back to the previous approach, or to somewhere between that and here. The new version passes all the same tests as the old one did, and generates the same output under the new `cstable -v'. I'm confident that I haven't broken it. [originally from svn r6351] [this svn revision also touched charset,filter,timber]
* Fix a couple of warnings.Ben Harris2005-09-24
| | | | | [originally from svn r6350] [this svn revision also touched charset,filter,timber]
* Introduce the -v flag which outputs the actual index of each codeSimon Tatham2005-09-24
| | | | | | | point in every charset. [originally from svn r6349] [this svn revision also touched charset,filter,timber]
* Add support for CNS 11643.Ben Harris2005-09-24
| | | | | [originally from svn r6348] [this svn revision also touched charset,filter,timber]
* Include CNS 11643 in the cstable diagnostic utility.Simon Tatham2005-09-24
| | | | | [originally from svn r6347] [this svn revision also touched charset,filter,timber]
* IRG source T3 includes not only plane 3 of CNS 11643, but also "some additionalBen Harris2005-09-24
| | | | | | | characters". We now filter out the latter from our mapping table. [originally from svn r6345] [this svn revision also touched charset,filter,timber]
* CNS 11643 goes above the BMP, so the test code should take that intoSimon Tatham2005-09-24
| | | | | | | | account when checking the reverse mapping for every potentially relevant Unicode character. [originally from svn r6343] [this svn revision also touched charset,filter,timber]
* Add a mapping table for CNS 11643-1992. It's a bit big, and nothingBen Harris2005-09-24
| | | | | | | uses it yet. [originally from svn r6342] [this svn revision also touched charset,filter,timber]
* Support for the ESC $ ( 0 and ESC $ ( 1 sets that Emacs uses to embedBen Harris2005-09-21
| | | | | | | | Big5 in COMPOUND_TEXT. Emacs does lots of other rude things to COMPOUND_TEXT, but this one is supported by XLib as well. [originally from svn r6336] [this svn revision also touched charset,filter,timber]
* Add support for COMPOUND_TEXT extended segments encoding ISO 98859-14,Ben Harris2005-09-21
| | | | | | | ISO 8859-15, and BIG5. [originally from svn r6335] [this svn revision also touched charset,filter,timber]
* Add two new SBCSes: BS 4730 (alias UK-ASCII) and DEC graphics (alias VT100Ben Harris2005-09-18
| | | | | | | | line-drawing). I think this means that libcharset supports all the character sets that PuTTY supports, which is nice. [originally from svn r6330] [this svn revision also touched charset,filter,timber]
* When documenting s0 and s1, get then the right way around.Ben Harris2005-09-18
| | | | | [originally from svn r6329] [this svn revision also touched charset,filter,timber]
* 1: Better documentation of how read_iso2022() stores its state.Ben Harris2005-09-18
| | | | | | | | 2: Minimal write_iso2022(): it can't encode anything, but promises not to segfault. [originally from svn r6328] [this svn revision also touched charset,filter,timber]
* Ben points out that ESC ( J in ISO-2022-JP should encode theSimon Tatham2005-09-18
| | | | | | | | | _bottom_ half of JIS X 0201 (the one that's almost identical to ASCII, equivalent to the bottom half of Shift-JIS), not the top half. [originally from svn r6327] [this svn revision also touched charset,filter,timber]
* Make read_utf8(), like read_sbcs(), accessible to the rest of the library,Ben Harris2005-09-18
| | | | | | | so it can be used directly in iso2022.c. [originally from svn r6326] [this svn revision also touched charset,filter,timber]
* Undo another change that leaked through with the ISO-2022 commit.Ben Harris2005-09-18
| | | | | [originally from svn r6325] [this svn revision also touched charset,filter,timber]
* Update comment to reflect state of DOCS support.Ben Harris2005-09-18
| | | | | [originally from svn r6324] [this svn revision also touched charset,filter,timber]
* Undo accidental change in previous commit.Ben Harris2005-09-18
| | | | | [originally from svn r6323] [this svn revision also touched charset,filter,timber]
* Support for using DOCS to switch to and from UTF-8 mode.Ben Harris2005-09-17
| | | | | [originally from svn r6321] [this svn revision also touched charset,filter,timber]
* Reasonably complete ISO 2022 support. Huge and hairy, but it seems toBen Harris2005-09-17
| | | | | | | largely work. It might even be useful for something. [originally from svn r6320] [this svn revision also touched charset,filter,timber]
* Use standard "WILD" markers for unregistered Big 5 aliases.Ben Harris2005-09-17
| | | | | [originally from svn r6319] [this svn revision also touched charset,filter,timber]
* Fix stupid typo.Ben Harris2005-09-17
| | | | | [originally from svn r6318] [this svn revision also touched charset,filter,timber]
* Names for ASCII and JIS X 0201 that appear both in the X registry and inBen Harris2005-09-17
| | | | | | | the usual X fonts. [originally from svn r6317] [this svn revision also touched charset,filter,timber]
* TIS-620 is equivalent to ISO 8859-11, so map the MIME name for the former toBen Harris2005-09-17
| | | | | | | the latter. [originally from svn r6316] [this svn revision also touched charset,filter,timber]
* Substantial overhaul of the UTF-8 decoder. It now uses 26 bits of stateBen Harris2005-09-16
| | | | | | | | rather than 32, which might make it possible to use it inside another decoder. All the tests still pass. [originally from svn r6315] [this svn revision also touched charset,filter,timber]
* Bring utf8.c's internal tests up to date in the (somewhat belated)Simon Tatham2005-09-16
| | | | | | | | wake of r3713. [originally from svn r6314] [r3713 == 9392c7ecb6da984e144ce00dfb8c7cb725ad0a61 in timber repository] [this svn revision also touched charset,filter,timber]
* I've apparently had this todo-list comment sitting on stormhawk forSimon Tatham2005-09-13
| | | | | | | nearly a year and not checked it in. [originally from svn r6309] [this svn revision also touched charset,filter,timber]
* Explicitly constify a bunch of static data declarations which wereSimon Tatham2005-03-10
| | | | | | | | | conceptually const but not declared as such. Halibut is now back to the practically-speaking-pointless but rather satisfying status of having no global writable data whatsoever :-) [originally from svn r5476] [this svn revision also touched charset,filter,timber]
* Add a `--list-charsets' option to Halibut to enumerate canonical names of knownJacob Nevins2005-02-18
| | | | | | | | | character sets. (Also make libcharset `return_in_enum' values saner.) [originally from svn r5341] [this svn revision also touched charset,filter,timber]
* Move MODULE files out of individual project directories into aSimon Tatham2004-11-18
| | | | | | | | MODULES top-level directory, which is where the Tartarus website scripts will (hopefully) start reading them from. [originally from svn r4813] [this svn revision also touched charset,enigma,filter,putty,puzzles,pycee,sdlgames,timber]
* Remove .cvsignore files on all active branches.Simon Tatham2004-11-16
| | | | | [originally from svn r4788] [this svn revision also touched bmbm,caltrap,charset,enigma,filter,fonts,golem,grunge,html,lj,local,misc,polyhedra,putty,putty-website,putty-wishlist,puzzles,pycee,sdlgames,svn-tools,timber,tweak]
* Couple of fiddly fixes in libcharset.Simon Tatham2004-10-26
| | | | | [originally from svn r4701] [this svn revision also touched charset,filter,timber]
* Cross-testing the libcharset compound text implementation againstSimon Tatham2004-09-25
| | | | | | | | | | Xutf8TextListToTextProperty reveals that the latter supports JIS X 0212 via the escape sequences ESC $ ( D and ESC $ ) D, although this is not listed in my copy of ctext.ps. It's easy enough to support it, though, so now we do. [originally from svn r4581] [this svn revision also touched charset,filter,timber]
* Fix first two bugs in compound text support: escape sequences wereSimon Tatham2004-09-25
| | | | | | | mis-ordered, and initial charset state failed to specify 8859-1 in GR. [originally from svn r4579] [this svn revision also touched charset,filter,timber]
* The COMPOUND_TEXT encoding used by some X applications to transferSimon Tatham2004-09-25
| | | | | | | | | | | | | internationalised text in selections is a subset of ISO 2022 containing no base character sets which libcharset doesn't already support. As such, it isn't too hard to add direct compound text support into libcharset, so here it is. With any luck I should eventually be able to integrate this into Unix PuTTY, to deal with the fact that the useful Xutf8 functions we currently use are specific to XFree86. [originally from svn r4578] [this svn revision also touched charset,filter,timber]
* Preferred MIME name for ASCII is "US-ASCII", not "ANSI_X3.4-1968". Oops.Simon Tatham2004-06-03
| | | | | [originally from svn r4269] [this svn revision also touched charset,filter,timber]
* Add charset_from_locale(), a best-effort attempt to return theSimon Tatham2004-04-22
| | | | | | | | libcharset CS_* identifier for the character set indicated by the active locale. Uses code from Markus Kuhn's website. [originally from svn r4115] [this svn revision also touched charset,timber]