Linux Journal ran an article on Tesseract (code.google.com/p/tesseract-ocr). The scanned files need to be cleaned up first, though (black text against a white background, with good contrast), so you need to know GIMP or some other equally capable image editor, preferably one you can drive from the command line. The suggested alternative for image editing was netpbm (netpbm.sourceforge.net), and for OCR the alternative was ocrad. It was also suggested that the images be scanned at 150 dpi or greater.

-- 
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php
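
To make the cleanup step concrete, here is a rough sketch of what "black text against a white background" amounts to: thresholding a grayscale scan into pure black and white. It is plain Python rather than GIMP or netpbm, the function names and the threshold of 128 are my own assumptions, and the pixel values are made up for illustration. The output is a plain PBM (P1) image, one of the netpbm formats.

```python
def binarize(pixels, threshold=128):
    """Map grayscale rows (0-255) to 1 (black ink) or 0 (white paper).

    The default threshold of 128 is an assumption; real scans usually
    need a value tuned to the page.
    """
    return [[1 if value < threshold else 0 for value in row] for row in pixels]


def to_plain_pbm(bits):
    """Serialize rows of 0/1 values as a plain-text PBM (P1) image.

    In PBM, 1 means black and 0 means white. The format asks for lines
    of at most 70 characters, so this sketch only suits narrow images.
    """
    height = len(bits)
    width = len(bits[0]) if height else 0
    lines = ["P1", f"{width} {height}"]
    lines += [" ".join(str(bit) for bit in row) for row in bits]
    return "\n".join(lines) + "\n"


# Made-up 4x2 "scan": dark text pixels on a light, slightly uneven page.
scan = [
    [250, 30, 40, 245],
    [200, 20, 235, 90],
]
pbm_text = to_plain_pbm(binarize(scan))
print(pbm_text)
```

Saved to a file, the result is the kind of high-contrast input an OCR program works best with. As far as I know ocrad reads the netpbm formats directly, while Tesseract may need the image converted to a format it accepts first.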