I am trying to convert pdf form data to something useful (read SAS data sets). I am using the batch capabilities under Adobe Acrobat Pro to convert to pages/form to xml.
I've come with something, well I hestitate to use the term new, because it's all new to me, but, well, new...
Each one of these things start off with the following:
<================ start insert ==================>
xmlns:pdf="http://ns.adobe.com/pdf/1.3/">
Adobe Designer 6.0
xmlns:xap="http://ns.adobe.com/xap/1.0/">
2005-09-07T10:16:31-05:00
2005-09-06T14:58:03-05:00
2005-09-06T15:26:56-05:00
Adobe Designer 6.0
xmlns:xapMM="http://ns.adobe.com/xap/1.0/mm/">
uuid:6df01935-27c9-4f55-9188-685f73b9397c
uuid:0b4b1c5f-06f7-4fda-88f7-d13da1540150
xmlns:dc="http://purl.org/dc/elements/1.1/">
xml
<================ end insert ==================>
it then continues with a lot of white space, then a bunch of stuff. (I'll happily fill in the details if you're interested)
I went through the online XML trying to come up with something, but, hey, here I am, so what does that say?
Does anybody (I repeat, Chevell, are you listening) know how to read this?
And, just to throw a followup, monkey wrench into things - I have over 2000 of these. If possible, I'd like to append them all into one file. Yes/No/Are you crazy?
TIA