halibut - My halibut tree

	Commit message (Collapse)	Author	Age
*	General configurability upgrade for the info back end.	Simon Tatham	2004-04-23
\| \| \| \|	[originally from svn r4131]
*	Ahem; that only didn't cause a segfault by sheer luck.	Simon Tatham	2004-04-23
\| \| \| \|	[originally from svn r4130]
*	Oh, and now the text backend warns about too-long code lines, I'd	Simon Tatham	2004-04-23
\| \| \| \| \| \|	better remove the one too-long code line in the manual :-) [originally from svn r4129]
*	Enhance the text backend to support configurable quote characters,	Simon Tatham	2004-04-23
\| \| \| \| \| \| \| \| \| \| \|	configurable emphasis characters, various other configurable bits which have been marked FIXME in the code for a while, and also to warn when a code paragraph line is too long (because that was the only other thing labelled FIXME). Fallback options are implemented, and defaults set accordingly. A UTF-8 text output file now looks like proper UTF-8. [originally from svn r4128]
*	Rewrite ustrftime(), so that (a) it uses wcsftime() where available,	Simon Tatham	2004-04-22
\| \| \| \| \| \| \|	and (b) it doesn't trip over strange Unicode characters in the format string. [originally from svn r4120]
*	Use wcscoll(), if available, when sorting index terms. (In a	Simon Tatham	2004-04-22
\| \| \| \| \| \| \| \|	somewhat roundabout and arse-backwards sort of way, due to some other properties of the sort that I rather wanted to maintain. But I hope it should still do some good.) [originally from svn r4119]
*	Use iswalpha and towlower if they're available.	Simon Tatham	2004-04-22
\| \| \| \|	[originally from svn r4118]
*	Instead of traversing a list of paragraphs, mark_attr_ends() now	Simon Tatham	2004-04-22
\| \| \| \| \| \| \| \| \|	merely traverses a list of words, and main() takes responsibility for applying it to each paragraph in the document. This is so that it can _also_ be applied to the display form of each index entry, which Jacob spotted wasn't previously being done. [originally from svn r4117]
*	bk_text and bk_info both need to know the on-screen width of	Simon Tatham	2004-04-22
\| \| \| \| \| \| \| \| \| \| \| \|	characters in order to wrap and align them properly. Therefore, they should be using wcwidth(). So here are a couple of wrappers on wcwidth(), one which filters out the Unicode characters not representable in the target charset, and one which converts _from_ a charset to Unicode before calling wcwidth(). bk_text and bk_info should now align correctly even in the face of unsupported characters and Japanese. [originally from svn r4116]
*	Add charset_from_locale(), a best-effort attempt to return the	Simon Tatham	2004-04-22
\| \| \| \| \| \| \| \|	libcharset CS_* identifier for the character set indicated by the active locale. Uses code from Markus Kuhn's website. [originally from svn r4115] [this svn revision also touched charset,timber]
*	Support the locale-supplied character set where appropriate. It's	Simon Tatham	2004-04-22
\| \| \| \| \| \| \| \| \| \|	used for converting command-line -C directives into Unicode; it's used for outputting Unicode strings to stderr in error messages; and it's used as the default character set for input files (although I'd be inclined to recommend everyone use \cfg{input-charset} in all their source files to ensure their portability). [originally from svn r4114]
*	Charset support for the man page backend (\cfg{man-charset}).	Simon Tatham	2004-04-21
\| \| \| \|	[originally from svn r4112]
*	Charset support for the info backend (\cfg{info-charset}). (This	Simon Tatham	2004-04-21
\| \| \| \| \| \| \|	checkin touches other files because a function in bk_text.c turned out to be of more general use so I moved it out into ustring.c.) [originally from svn r4111]
*	I typed `Win1252' today and libcharset didn't recognise it. Fixed.	Simon Tatham	2004-04-20
\| \| \| \| \|	[originally from svn r4107] [this svn revision also touched charset,timber]
*	Don't forget to mention the default setting for \cfg{input-charset}.	Simon Tatham	2004-04-20
\| \| \| \|	[originally from svn r4106]
*	Acroread's PDF cut and paste appears to depend on the order in which	Simon Tatham	2004-04-20
\| \| \| \| \| \| \| \|	text fragments appear in the page graphics stream. Therefore, I should take care to display the leaders _before_ the page number in each contents entry. [originally from svn r4105]
*	Add some tests of Unicode in the PDF document outline.	Simon Tatham	2004-04-20
\| \| \| \|	[originally from svn r4104]
*	Ahem. Fix stupid string handling error in new command-line config	Simon Tatham	2004-04-20
\| \| \| \| \| \|	mechanism. [originally from svn r4103]
*	The Windows Help backend now uses libcharset to the maximum extent	Simon Tatham	2004-04-20
\| \| \| \| \| \| \| \| \| \|	it can. (Which isn't much, since I currently don't know how to indicate that a help file is encoded in anything other than Win1252, and so the only use I found for libcharset was to replace the previous pathetic 8859-1 conversion so that extra things like the trademark sign are supported.) [originally from svn r4102]
*	PDF outline headings, it turns out, should be encoded using either	Simon Tatham	2004-04-20
\| \| \| \| \| \| \| \| \| \| \| \|	PDFDocEncoding or UTF-16BE. (The PDF specification's index is terribly bad; I looked under various obvious things such as `character set' and `string literal' with no success, and I didn't manage to find out what character set metadata string literals were intended to be interpreted in until I discovered from another source that the encoding was called PDFDocEncoding, and _then_ I was able to look that up in the index. They should have been using Halibut! :-) [originally from svn r4101]
*	Text backend is now charset-enabled: the entire text file is built	Simon Tatham	2004-04-20
\| \| \| \| \| \| \|	from the ground up in Unicode, and a single charset conversion pass is done over the data as it's output. [originally from svn r4100]
*	Add some tests of input charset switching.	Simon Tatham	2004-04-20
\| \| \| \|	[originally from svn r4099]
*	Forgot to add .ps and .pdf to the `make clean' list!	Simon Tatham	2004-04-20
\| \| \| \|	[originally from svn r4098]
*	Infrastructure changes for character set support. ustrtoa,	Simon Tatham	2004-04-20
\| \| \| \| \| \| \| \| \| \| \|	ustrfroma, utoa_dup and ufroma_dup now take a charset parameter, and also have a variety of subtly distinct forms. Also, when a \cfg directive is seen in the input file, the precise octet strings for each parameter are kept in their original form as well as being translated into Unicode, so that when they represent filenames they can be used verbatim. [originally from svn r4097]
*	Support for \cfg{input-charset}. Input files can now be in ASCII,	Simon Tatham	2004-04-19
\| \| \| \| \| \| \|	8859-*, UTF-8, or a variety of more fun encodings including various multibyte ones. [originally from svn r4095]
*	Better robustness when backends encounter a Unicode character not	Simon Tatham	2004-04-19
\| \| \| \| \| \|	representable in the output character set. [originally from svn r4094]
*	Fix an oddity in PDFDocEncoding.	Simon Tatham	2004-04-18
\| \| \| \| \|	[originally from svn r4093] [this svn revision also touched charset,timber]
*	Now that I've renamed the `test' program to `convcs', fix .cvsignore.	Simon Tatham	2004-04-17
\| \| \| \| \|	[originally from svn r4091] [this svn revision also touched charset,timber]
*	`gcc -Wall' points out some signed/unsigned comparisons. Fixed.	Simon Tatham	2004-04-17
\| \| \| \| \|	[originally from svn r4090] [this svn revision also touched charset,timber]
*	Now this is a top-level CVS module, it should have LICENCE and	Simon Tatham	2004-04-17
\| \| \| \| \| \| \|	MODULE files of its own. [originally from svn r4089] [this svn revision also touched charset,timber]
*	Link libcharset into Halibut. (This involved faffing with	Simon Tatham	2004-04-17
\| \| \| \| \| \| \| \| \|	CVSROOT/modules, so anyone with a checked-out copy of Halibut will unfortunately need to do `cvs co' again.) [originally from svn r4088] [added anachronistic .gitmodules so that past revisions can be checked out] [this svn revision also touched charset,timber]
*	In preparation for using libcharset in Halibut, I've added	Simon Tatham	2004-04-17
\| \| \| \| \| \| \| \| \|	PDFDocEncoding to the SBCS list; this is a custom superset of ISO-8859-1 used in PDF files to store user-visible text that isn't printed on a page (such as metadata and the document outline). [originally from svn r4087] [this svn revision also touched charset,timber]
*	Include libcharset into both the Timber and Halibut checkouts.	Simon Tatham	2004-04-17
\| \| \| \| \| \| \| \| \|	Unfortunately this means people will have to do `cvs co' again to get this update, but that appears to be the price I pay for being able to conveniently share a single source base in this way. [originally from svn r4086] [this svn revision also touched charset,timber]
*	Add a TODO entry.	Simon Tatham	2004-04-16
\| \| \| \|	[originally from svn r4084]
*	Compacted PS and PDF output files by removing redundant reiterations	Simon Tatham	2004-04-16
\| \| \| \| \| \| \| \|	of the same font and position designations. Reduced the size of the Halibut manual PDF to less than half what it started out as, and the PS one to more like a third of its original size. [originally from svn r4083]
*	Standardise on using double quotes in HTML attributes. It turns out	Simon Tatham	2004-04-15
\| \| \| \| \| \| \| \|	that the GNOME help browser is perfectly happy with Halibut's HTML _except_ that it doesn't approve of single quotes in places such as <a href='here'>. [originally from svn r4082]
*	Support the \W{...}\i\c{...} combination used in the NASM manual.	Simon Tatham	2004-04-15
\| \| \| \| \| \| \| \|	Halibut is now capable of reading the old NASM manual with only minor modifications, and probably doing a much better job of formatting it than rdsrc.pl ever did. (Certainly much _faster_!) [originally from svn r4081]
*	Prevent a tight loop. Oops.	Simon Tatham	2004-04-15
\| \| \| \|	[originally from svn r4080]
*	Put the document's version IDs into comments in the PS and PDF	Simon Tatham	2004-04-15
\| \| \| \| \| \|	output files. [originally from svn r4079]
*	Ahem. If an indexable term appears in a section heading, the index	Simon Tatham	2004-04-15
\| \| \| \| \| \|	should not also point to a page in the contents! :-) [originally from svn r4078]
*	Administrivia: update a TODO and a .cvsignore.	Simon Tatham	2004-04-14
\| \| \| \|	[originally from svn r4077]
*	After printing out the PDF manual and reading it through, here's a	Simon Tatham	2004-04-14
\| \| \| \| \| \| \| \|	collection of minor edits for clarity, and also quite a few intended to keep down the length of lines in code paragraphs (because Courier is not just a thoroughly ugly font but is also WAY TOO WIDE). [originally from svn r4076]
*	... oh, except that minimal documentation might be helpful.	Simon Tatham	2004-04-14
\| \| \| \|	[originally from svn r4075]
*	Display the page numbers on every page. Right. I think this is now	Simon Tatham	2004-04-14
\| \| \| \| \| \| \| \|	basically usable, and certainly it entirely includes the level of functionality which was provided by our ancestor Perl script. So I think I'll stop coding frantically and have a rest! [originally from svn r4074]
*	And now the page numbers in the index are PDF cross-references too.	Simon Tatham	2004-04-14
\| \| \| \| \| \| \|	Funny, I thought that would be as hard again as the main index processing, and it turned out to be nearly trivial. [originally from svn r4073]
*	Implemented an index. Good _grief_, that was hard work to get all	Simon Tatham	2004-04-14
\| \| \| \| \| \|	the fine details right. [originally from svn r4072]
*	Typo during restructuring caused all code paragraphs to be	Simon Tatham	2004-04-14
\| \| \| \| \| \|	permanently bold. Fixed. [originally from svn r4071]
*	The contents section now contains PDF cross-references.	Simon Tatham	2004-04-14
\| \| \| \|	[originally from svn r4070]
*	Support for a contents section.	Simon Tatham	2004-04-14
\| \| \| \|	[originally from svn r4069]
*	Restructuring to remove the requirement for a printed paragraph to	Simon Tatham	2004-04-14
\| \| \| \| \| \| \| \| \| \|	correspond exactly to a source paragraph. Should allow me to create multiple printed paragraphs from the same source paragraph (i.e. a contents entry for each heading in addition to the heading itself), and invent entirely new printed paragraphs of my own (e.g. for index entries). [originally from svn r4068]