on 04-06-2018 02:19 PM
on 04-06-2018 10:30 PM
I don't see a link to download a zip file either.
Hmmm, perhaps some kind SAS programmer will be kind enough to post a little data step that will crawl through the web page and download all the linked pdf files, then zip them. Then present it as a late-breaking coders corner fill-in. : )
on 04-07-2018 12:36 AM
@Quentin, that request is simple enough...
filename foo url "https://www.sas.com/content/sascom/en_us/events/sas-global-forum/program/proceedings/jcr:content/par/styledcontainer_1306083555/par/tabwrapper/tabwrapperpar/tab/tabpar/styledcontainer/par/listgrouppdf.ajaxlist.html"; %macro getPaper(url); %local big small; %let big=%scan(&url,-1,/); %let small=%scan(&big,1,-); filename _&small "%sysfunc(pathname(work,l))\&big"; proc http url="&url" method='get' out=_&small; run; ods package(pckg) add file=_&small; %mend; *filename pckg "%sysfunc(pathname(work,l))\sgf2018.zip"; ods package(pckg) open nopf; data _null_; length pdf $1024.; infile foo dsd dlm=' '; input @'<a href=' pdf :; if substr(pdf,max(lengthn(pdf)-2,1))='pdf'; call execute('%nrstr(' || '%getPaper(' || trim(pdf) || ')' || ')'); run; ods package(pckg) publish archive properties( archive_name='sgf2018.zip' archive_path="%sysfunc(pathname(work,l))" ); ods package(pckg) close; filename _all_ clear;
In case anyone was wondering... by uncompressed file size, the biggest paper is:
Predictive Modeling of the Journey from H-1B to Permanent US Work Visa by Shibbir Khan, Clark University, Graduate School of Management, Worcester, MA, USA
and the smaller is:
A Macro for Ensuring Data Integrity When Converting SAS® Data Sets by Richard D. Langston, SAS Institute Inc.
and the highest compression ratio
Weather Data Cleansing for Electrical Load Forecasting by Aubrey Condor, University of Central Florida
and the largest compressed file size
Getting Started with Survival Analysis Course Notes by Marc Huber, SAS Institute, Inc.
The resulting zip file is is about 311Mb and contains 427 pdfs
on 04-07-2018 06:55 AM
Thanks much @FriedEgg !
Simple enough for you, I suppose. I had thought about taking the time to re-read some of @ChrisHemedinger's blog posts on SAS web scrapers and give it a try myself, but even if I had started last night I probably wouldn't have managed to get it working before my flight tomorrow. : )
Hope to catch up with you in Denver.
on 04-27-2018 10:08 AM
Sorry for the delay in posting this. A "download all papers" link has been added to the proceedings page: