Might be able to drive this down on a machine with more cores. On my desktop machine (only dual core), with my chosen inputs, every package required at least 3 seconds to process a page some took more. The accuracy was very good, on par with Acrobat. ![]() Stops and demands user input on an invalid file processes without further complaint all protected documents apparently by OCR-ing the image. It will process folders and subfolders, one output for each input, and by default the output directory structure matches the input directory structure. Batch feature is invoked by the "Batch OCR" item which is revealed by clicking on the "From Files" button on the main screen. The batch processor writes ( shock) a plain-text log file to the output directory. Bad or fuzzy files did not cause it to hang. Accuracy is excllent, as good or better than OmniPage. Will use multiple cores, but not aggressively what that means is I could not get it to saturate a multi-core host. It will process folders and subfolders, preserving the input structure in the output. Batch feature is called "Batch Converter" and it's reachable from the main program under the Advanced Processing tab. Nuance PowerPDF Advanced v1.1 (successor to OmniPage Ultimate): $150. But stability of the batch processor is poor a fuzzy document will stop it in its tracks, never to recover, derailing a batch with ease. Seems to take excellent advantage of multi-core processors to run tasks in parallel. Stops and demands a password for a protected file. It will process folders and subfolders if you select the features just right, it will preserve the input directory tree in the output area. Batch feature is called "DocuDirect" and it's a separate program that comes with the package. ![]() Nuance OmniPage Ultimate (aka v19): $500. Does not preserve input directory tree by default can do so by writing output to same folder as input. Stops and puts up a prompt if it finds a password-protected file. Processes subfolders, one output for each input. Batch feature is called "Text Recognition/In Multiple Files" which can be found by clicking on Tools (third toolbar, top right side of the main screen). The accuracy was high in my tests, yet still the lowest of the packages I've listed here.Īdobe Acrobat XI: $300. It does not seem capable of preserving the input folder hierarchy all output files went to the same output folder. It will process files from a folder, including subfolders it will happily create a separate output file for each input file. Batch feature is called the "Task Manager" and it's on the Tools menu. Take my comments about accuracy with a grain of salt your inputs will not be the same as my inputs so your mileage will certainly vary.ĪBBYY Finereader 12 Corporate: $400. Prices shown below are list but discounts abound. I was looking for true batch: one output file for each input file, unattended operation, don't stop for anything, give me a detailed report at the end. IMHO this is dead wrong, I have no idea who would want that. Many software packages want to load all the input files, do OCR and coalesce the mess into a single output. ![]() If you're willing to spend a few hundred dollars you have many options trial versions can get you thru if you only need to convert a few hundred pages. I evaluated OCR software in Dec 2014 in prep for a big project - OCR on millions of English-language pages done in batches. Tl dr? Start with Nuance PowerPDF Advanced. But the OCR is only PDF to text, not PDF to PDF! Note: In the past, if a PDF file was password protected, I removed the password with another batch (paying) tool: "pwdremover" įinereader 6 pro on xp at the time, but there was no batch processor included. My goal : I give the program a folder and it search alone in the subfolders the PDF files that need to be converted into PDF-OCRed files. So I would like to OCR many of my PDF files that are not already OCRed. It is normal as " PDF files that contain scanned images" are not indexed ( ) ![]() I use Google Desktop Search (I am on Vista) and not all my PDF files are recognized in my archive folder.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |