g4tweak - Handle G4-compressed black-and-white scans. Holger Blasum 2006-07-01 License GPL v. 2 or any higher version (at your choice) for any program file in C or python, GPL v. 2 or any higher version or LGPL or any higher version (at your choice) for tiff.c (de facto a mini-library), GLP v. 2 or any higher version or BSD (at your choice) for any shell script. Stuff for handling G4 compressed scans. Experimental, may require tweaking. (Feedback and questions see email at http://www.blasum.net/holger/) Some of it uses Sam Leffler's libtiff. pdf2c4: Inverse functionality of c42pdf (see below): get TIFF data out of PDFs. The binaries are compiled on gnu/linux (gcc4), but the stuff should be somewhat portable (complain if not). g4rotate and g4split run on any g4-compressed TIFF. pdf2c4 does not even uncompress the data, g4rotate and g4split keep the 8-pixel-per-bit representation. The main purpose the author uses the pdf2c4 programs for is to quickly manipulate scanned g4-compressed PDFs he has on file, eg to split a page into two which comes helpful if one has a pile of 2-on-1 page copies of eg journal articles. (eg run script/srupisright on the 1 page rotated pdf2c4/sample-big.pdf gives the usable 2-page pdf2c4/sample-big-transformed.pdf - of course this also works on multipage documents) raw2pdf: python scripts to dump raw g4-compressed raw format scans into PDF (only some scanners, eg Fujitsu M3093, support g4-compressed "raw" format). script: bash shell scripts that use the programs. Use at your own risk, read before using (they do contain rm statements), maybe helpful for inspiration. For conversion of TIFF to PDF see http://c42pdf.ffii.org/ Related: http://g4tool.sourceforge.net/ , http://sourceforge.net/users/eichholz/ , http://www.vips.ecs.soton.ac.uk/ .