PDF Content
Moderators: Dorian (MJT support), JRL
-
- Junior Coder
- Posts: 25
- Joined: Wed Jul 20, 2011 3:07 pm
PDF Content
I need to find some content in a PDF doc. Anyone have any ideas?
-
- Junior Coder
- Posts: 25
- Joined: Wed Jul 20, 2011 3:07 pm
Re: PDF Content
I found a command line utility at https://pdfbox.apache.org/commandline/#extractText that will extract the text into a text file which I then search for the content I'm looking for. This will work for my needs, is probably not perfect for needs.[email protected] wrote:I need to find some content in a PDF doc. Anyone have any ideas?
- Marcus Tettmar
- Site Admin
- Posts: 7380
- Joined: Thu Sep 19, 2002 3:00 pm
- Location: Dorset, UK
- Contact:
Re: PDF Content
If the PDF contains text that text can be extracted and there are a number of free/open source tools you can use. However, some PDFs are all images, e.g. Documents that have been scanned ... so won't have any extractable text.
Sent from my iPad using Tapatalk
Sent from my iPad using Tapatalk
Marcus Tettmar
http://mjtnet.com/blog/ | http://twitter.com/marcustettmar
Did you know we are now offering affordable monthly subscriptions for Macro Scheduler Standard?
http://mjtnet.com/blog/ | http://twitter.com/marcustettmar
Did you know we are now offering affordable monthly subscriptions for Macro Scheduler Standard?
Re: PDF Content
hi,
you can first open de pdf-file with the Adobe reader,
then use the "save as" txt to a file,
then use msched to find your string in this file, you can find all the text.
kind regards,
Djek
you can first open de pdf-file with the Adobe reader,
then use the "save as" txt to a file,
then use msched to find your string in this file, you can find all the text.
kind regards,
Djek