I'm copying and pasting text from pdfs and word documents into a notepad file, so i can parse all the data and make a set of ini files.
The data in the documents is in a table, and when i paste it into notepad, i get the raw text with no formatting.
I want to extract that raw text, and create an ini file, with the values in each line separated by a |
So, how do i use string commands to search for a tab, or a carriage return?
If there's a simpler way of extracting the data, I'd like to know about it, of course. But for now, my way should work.
Below is a stripped down version of a similar script I use to take names from names.txt, and make ini files. It reads a second file too, and combines them both, but I removed all that for the sake of simplicity.
Can I use
Code: Select all
VBEval>Replace("%linedash%","SEARCH","REPLACE"),linedash
Code: Select all
VBSTART
VBEND
// READ names.txt
Let>k=1
Label>start
ReadLn>c:\names.txt,k,line
If>line=ENDFILE,finish
VBEval>UCase("%line%"),linecaps
let>linedash=line
//CREATE OUTPUT
//LINE 1 - NAME WITH DASH
VBEval>LCase("%linedash%"),linedash
VBEval>Replace("%linedash%","SEARCH","REPLACE"),linedash
WriteLn>c:\ini-generated\%line%.txt,result,%linedash%
//LINE 2 - NAME CAPS
WriteLn>c:\ini-generated\%line%.txt,result,%linecaps%
//LINE 3 - 6 ARTIST URLS
WriteLn>c:\ini-generated\%line%.txt,result,
WriteLn>c:\ini-generated\%line%.txt,result,2012-%linedash%1.htm
WriteLn>c:\ini-generated\%line%.txt,result,2012-%linedash%2.htm
WriteLn>c:\ini-generated\%line%.txt,result,2012-%linedash%3.htm
WriteLn>c:\ini-generated\%line%.txt,result,2012-%linedash%4.htm
WriteLn>c:\ini-generated\%line%.txt,result,2012-%linedash%5.htm
WriteLn>c:\ini-generated\%line%.txt,result,2012-%linedash%6.htm
WriteLn>c:\ini-generated\%line%.txt,result,2012-%linedash%-biography.htm
WriteLn>c:\ini-generated\%line%.txt,result,
Let>k=k+1
Goto>start
Label>finish