We've recently installed SAS 9.4.TS1M4. When trying to GEOCODE 4 MM rows of data the proc hangs up. It reached 5% at one point and appears to get increasingly slow between % readouts in the log. Eventually it hangs altogether. We are using StreetLookupData_94-2016 address files and Proc coding log and results are below. If I add obs then things seem to finish relatively fast. I've seen board posting about freeze issues with prior SAS releases; just wondering if this release has the same issues. Any help would be appreciated.
23 proc GEOCODE
24 method=street /* Geocoding method */
25 data=geoData.PAT_ADDRESS (obs=5000) /* Input address data */
26 out=geoData.PAT_ADDRESS_GEO /* Geocoded output data set*/
27 lookupstreet=streetdl.usm /* Tiger 2016 data set */
28 /*attribute_var=(tracte00) /* Added output variables */
29 ;
30 run;
NOTE: Address data set GEODATA.PAT_ADDRESS has 3,891,171 observations.
_________________ Geocoding Progress _____________________
_____ Completed ____ _____________ Time _______________
Percent Obs Current Remaining Completion
0% 0 07:09:20 -- --
100% 5,000 07:12:39 00:00:00 --
_________ Geocoding Summary _____________________________
Address data: GEODATA.PAT_ADDRESS
Output data: GEODATA.PAT_ADDRESS_GEO
STREET lookup data: STREETDL.USM
CITY lookup data: MAPSGFK.USCITY_ALL
ZIP lookup data: SASHELP.ZIPCODE
Geocoding method: Street level
Run date: 09Mar2017
Obs processed: 5,000
Elapsed time: 00:03:19
Obs per minute: 1,505
Street matches: 2,846
ZIP matches: 1,954
City matches: 87
Not matched: 113
Given your progress of 1505 obs / min, the whole dataset would need 43 hours.
Either something is seriously wrong with the lookup data, or your SAS machine is massively underpowered for this.
Thanks for the response; the machine is an I7 8 core with 32 Gig of RAM; The 2019 Tiger files were downloaded and built using scripts at SAS Maps Online (http://support.sas.com/rnd/datavisualization/mapsonline/html/geocode.html) with no errors. When I run 100 or 5000 or even 50,000 things work. Eventually the thing hangs up. Can't really see any issues with horse power or Tiger files.
Join us for SAS Innovate 2025, our biggest and most exciting global event of the year, in Orlando, FL, from May 6-9. Sign up by March 14 for just $795.
Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.
Find more tutorials on the SAS Users YouTube channel.
Ready to level-up your skills? Choose your own adventure.