Text Capture in scanned documents

General Macro Scheduler discussion

Moderators: Dorian (MJT support), JRL

nodochau
Pro Scripter
Posts: 131
Joined: Wed Jan 16, 2019 12:59 pm

Text Capture in scanned documents

Post by nodochau » Wed Feb 06, 2019 12:10 pm

Hello all,
I have a problem capturing text in scanned documents. I would like to write a script that can copy all the text in a scanned document of about 20 pages. Because the pages are scanned, they do not contain real text, so I cannot copy and paste it with Ctrl+C and Ctrl+V. I have tried the OCR in MS and Microsoft Office Document Imaging, but it takes a long time and cannot do 20 pages at a time. Do you have any solutions? Please help.
I am new to MS but willing to learn.
Thanks a lot

Grovkillen
Automation Wizard
Posts: 1009
Joined: Fri Aug 10, 2012 2:38 pm
Location: Bräcke, Sweden

Re: Text Capture in scanned documents

Post by Grovkillen » Wed Feb 06, 2019 4:30 pm

Split the PDF (or how do you store them?) and scan multiple files at a time.

https://portableapps.com/apps/office/pd ... r_portable

PDFtk is a good command line compatible program for splitting PDF files.
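For illustration, here is a minimal sketch of the splitting step, assuming pdftk is installed and on the PATH. It is written in Python only to show the command line; the same command could be launched from a Macro Scheduler script instead. The function names and the page_%03d.pdf naming pattern are made up for this example.

```python
import subprocess
from pathlib import Path


def build_burst_command(pdf_path, out_dir):
    """Return the pdftk argument list that writes one PDF per page."""
    # pdftk's "burst" operation splits a document into single-page files;
    # the printf-style pattern controls the output file names.
    pattern = str(Path(out_dir) / "page_%03d.pdf")
    return ["pdftk", str(pdf_path), "burst", "output", pattern]


def split_pdf(pdf_path, out_dir):
    """Run pdftk and return the per-page files in page order."""
    Path(out_dir).mkdir(parents=True, exist_ok=True)
    subprocess.run(build_burst_command(pdf_path, out_dir), check=True)
    return sorted(Path(out_dir).glob("page_*.pdf"))
```

Calling split_pdf("scan.pdf", "pages") would leave one file per page in the pages folder, which can then be handed to an OCR tool a few at a time.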
Let>ME=%Script%

Running: 15.0.24
version history

nodochau
Pro Scripter
Posts: 131
Joined: Wed Jan 16, 2019 12:59 pm

Re: Text Capture in scanned documents

Post by nodochau » Wed Feb 06, 2019 5:33 pm

We scan multiple files at a time and store them under one file name. If I use OCR in MS, how can I set it to scan the entire file? The OCR Capture gives us only three options and cannot do all pages.

Marcus Tettmar
Site Admin
Posts: 7378
Joined: Thu Sep 19, 2002 3:00 pm
Location: Dorset, UK

Re: Text Capture in scanned documents

Post by Marcus Tettmar » Sun Feb 10, 2019 12:20 pm

Macro Scheduler's OCR functions are designed for scanning the screen. There are no built-in document OCR functions. You would need to use a third-party tool. Many offer command-line options you can use from a Macro Scheduler script.
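As one possible example of such a third-party route (not something named in this thread), the sketch below assumes the free tools pdftoppm (from Poppler) and Tesseract are installed and on the PATH. It renders each page of the PDF to a PNG and then runs OCR on every image. It is written in Python purely to show the command lines involved; the helper names, the 300 dpi setting, and the "page" file prefix are assumptions made for this illustration.

```python
import subprocess
from pathlib import Path


def build_render_command(pdf_path, image_prefix, dpi=300):
    """pdftoppm command that renders every page to <prefix>-<n>.png."""
    return ["pdftoppm", "-r", str(dpi), "-png", str(pdf_path), str(image_prefix)]


def build_ocr_command(image_path, lang="eng"):
    """tesseract command that writes the recognised text to <image name>.txt."""
    # Tesseract takes an output base name and appends ".txt" itself.
    out_base = str(Path(image_path).with_suffix(""))
    return ["tesseract", str(image_path), out_base, "-l", lang]


def ocr_pdf(pdf_path, work_dir):
    """Render all pages, OCR each one, and return the combined text."""
    work = Path(work_dir)
    work.mkdir(parents=True, exist_ok=True)
    subprocess.run(build_render_command(pdf_path, work / "page"), check=True)
    texts = []
    # pdftoppm zero-pads page numbers, so a plain sort keeps page order.
    for image in sorted(work.glob("page-*.png")):
        subprocess.run(build_ocr_command(image), check=True)
        texts.append(image.with_suffix(".txt").read_text(encoding="utf-8"))
    return "\n".join(texts)
```

The two command lines built here are the part that matters; they could equally be run from a Macro Scheduler script, one call per step.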
Marcus Tettmar
http://mjtnet.com/blog/ | http://twitter.com/marcustettmar

Did you know we are now offering affordable monthly subscriptions for Macro Scheduler Standard?

nodochau
Pro Scripter
Posts: 131
Joined: Wed Jan 16, 2019 12:59 pm

Re: Text Capture in scanned documents

Post by nodochau » Tue Feb 26, 2019 12:04 pm

Hi Marcus,
Please tell me which command line options from the OS I can use.
Thanks

Marcus Tettmar
Site Admin
Posts: 7378
Joined: Thu Sep 19, 2002 3:00 pm
Location: Dorset, UK

Re: Text Capture in scanned documents

Post by Marcus Tettmar » Wed Mar 06, 2019 9:16 pm

Marcus Tettmar
http://mjtnet.com/blog/ | http://twitter.com/marcustettmar
