We've recently installed SAS 9.4.TS1M4. When trying to GEOCODE 4 MM rows of data the proc hangs up. It reached 5% at one point and appears to get increasingly slow between % readouts in the log. Eventually it hangs altogether. We are using StreetLookupData_94-2016 address files and Proc coding log and results are below. If I add obs then things seem to finish relatively fast. I've seen board posting about freeze issues with prior SAS releases; just wondering if this release has the same issues. Any help would be appreciated.
23 proc GEOCODE
24 method=street /* Geocoding method */
25 data=geoData.PAT_ADDRESS (obs=5000) /* Input address data */
26 out=geoData.PAT_ADDRESS_GEO /* Geocoded output data set*/
27 lookupstreet=streetdl.usm /* Tiger 2016 data set */
28 /*attribute_var=(tracte00) /* Added output variables */
29 ;
30 run;
NOTE: Address data set GEODATA.PAT_ADDRESS has 3,891,171 observations.
_________________ Geocoding Progress _____________________
_____ Completed ____ _____________ Time _______________
Percent Obs Current Remaining Completion
0% 0 07:09:20 -- --
100% 5,000 07:12:39 00:00:00 --
_________ Geocoding Summary _____________________________
Address data: GEODATA.PAT_ADDRESS
Output data: GEODATA.PAT_ADDRESS_GEO
STREET lookup data: STREETDL.USM
CITY lookup data: MAPSGFK.USCITY_ALL
ZIP lookup data: SASHELP.ZIPCODE
Geocoding method: Street level
Run date: 09Mar2017
Obs processed: 5,000
Elapsed time: 00:03:19
Obs per minute: 1,505
Street matches: 2,846
ZIP matches: 1,954
City matches: 87
Not matched: 113
Given your progress of 1505 obs / min, the whole dataset would need 43 hours.
Either something is seriously wrong with the lookup data, or your SAS machine is massively underpowered for this.
Thanks for the response; the machine is an I7 8 core with 32 Gig of RAM; The 2019 Tiger files were downloaded and built using scripts at SAS Maps Online (http://support.sas.com/rnd/datavisualization/mapsonline/html/geocode.html) with no errors. When I run 100 or 5000 or even 50,000 things work. Eventually the thing hangs up. Can't really see any issues with horse power or Tiger files.
April 27 – 30 | Gaylord Texan | Grapevine, Texas
Walk in ready to learn. Walk out ready to deliver. This is the data and AI conference you can't afford to miss.
Register now and lock in 2025 pricing—just $495!
Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.
Find more tutorials on the SAS Users YouTube channel.
Ready to level-up your skills? Choose your own adventure.