Monday, February 12, 2007

Unix text processing

My new task is to read the relevant chapters of UTP (1.7MB; source available here) and TUM. In particular, I need to get my head around troff, the Unix typesetting system (now mostly known as groff). I'm convinced that I can secure groff and use it in the typesetting chain for readng.

Here's what I think the sequence will be: (1) ASCII or HTML; (2) groff mm or mom format; (3) PostScript/PDF; (4) PNG image.
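As a rough sketch of stages (2) to (4), here is how the groff and Ghostscript calls might be driven from Python. It assumes groff and Ghostscript (gs) are installed and that the input is already in mm or mom markup; stage (1), converting ASCII or HTML into groff markup, isn't covered. The function names, the file naming scheme and the 150 dpi default are my own placeholders, not anything settled.

```python
import subprocess
from pathlib import Path


def build_commands(source, macro_flag="-mm", dpi=150):
    """Build the argv lists for groff source -> PostScript -> PNG pages.

    macro_flag is "-mm" or "-mom"; the output names are hypothetical.
    """
    src = Path(source)
    ps_path = src.with_suffix(".ps")
    # Ghostscript expands %03d to the page number: ch1-001.png, ch1-002.png, ...
    png_pattern = src.with_name(src.stem + "-%03d.png")

    # -S asks for groff's safer mode, which disables requests such as .sy
    groff_cmd = ["groff", "-S", macro_flag, "-Tps", str(src)]

    # -dSAFER restricts file access from within the PostScript program
    gs_cmd = [
        "gs", "-dSAFER", "-dBATCH", "-dNOPAUSE",
        "-sDEVICE=png16m", f"-r{dpi}",
        f"-sOutputFile={png_pattern}",
        str(ps_path),
    ]
    return groff_cmd, gs_cmd, ps_path


def render(source, macro_flag="-mm", dpi=150):
    """Run both stages; groff writes PostScript to stdout, so capture it."""
    groff_cmd, gs_cmd, ps_path = build_commands(source, macro_flag, dpi)
    with open(ps_path, "wb") as out:
        subprocess.run(groff_cmd, stdout=out, check=True)
    subprocess.run(gs_cmd, check=True)
```

Calling render("ch1.mm") should leave ch1.ps plus one PNG per page next to the source file. Building the commands separately from running them makes it easy to inspect exactly what will be executed before trusting it with untrusted input.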

I know it's a little nasty, using an image format to display pages of text, but that's just tough. HTML in any flavour doesn't give the necessary control for pretty typesetting, and readers want books to look pretty.