Some notes on the preparation of a F2-style output text and associated transcriber's notes.
Use UNIX line endings, convert to utf-8 handling simple transliterations (eg., --). Results: text and good word list.
Fix any mistakes missed by P3/F2. Results: text, or as a diff.
Fix printer's errors as noted by proofreaders. Notes detailing fixed errors are transformed to <corr> form; notes detailing unfixed errors are left untransformed. Results: text.
Validate the resulting text. Result: log.
Identify quotation marks and replace them with their curly forms. Apostrophes are left unchanged. Inputs: PG #10688, PG #36485, PG #36833, good words, bad words. Result: text, word analysis log.
Rejoin words split across pages. Identify -*ed words and resolve them. Inputs: good words, Result: text, resolution log.
Remove notes that we've handled. Inputs: handled notes, Result: text.
Generate a word list, and a matching base word by eliding diacritical marks and hyphens. Replace words identified as bad. Inputs: bad words, Result: text, log.
Use the <corr> markers to generate a list of changes made to the original text and remove the markers from the main text. Result: notes, text.
Use DOS line endings. Final result: notes and text.
Strip page markers & blank pages, and join blocks that cross pages. Result: text.
Convert to ASCII by transliterating unicode. Result: text.
Convert to plain text by indenting /# and /p blocks, and handling <tb>, <sc>, <i>, & <b> (<f> and <g> are discarded). Result: text.
Re-wrap text to 72 coloumns honouring /*, /$ and /p no-wrap blocks. Result: text.
Discard /*, /$, /p, and /# block markers. Result: text.
Use DOS line endings. Result: text.
Run gutcheck. Result: log.
Run jeebies. Result: log.
Strip page markers & blank pages, and join blocks that cross pages. Result: text.
Convert to plain text by indenting /# and /p blocks, and handling <tb>, <sc>, <i>, & <b> (<f> and <g> are discarded). Result: text.
Re-wrap text to 72 coloumns honouring /*, /$ and /p no-wrap blocks. Result: text.
Discard /*, /$, /p, and /# block markers. Result: text.
Run a spell check honouring the project's good words list. Result: list of possible misspellings.