About wernie

wernie · ‎08-15-2019

@ballardw It's stored in my work library - I can see it there and know I ran it in the current session. What you have listed for the code is what I tried and it didn't work, which is why I'm confused because I thought that should work.

wernie · ‎08-15-2019

The format looks like this: proc format; value $Temp_Agg "CT_Case_Year1"="Annual" "CT_Case_Year2"="Annual" "CT_Case_Year3"="Annual" "CT_Case_Year4"="Annual" "CT_Case_Year5"="Annual" "CT_Case_Year6"="Annual" "CT_Case_Year7"="Annual" "CT_Case_Year8"="Annual" "CT_Case_Year9"="Annual" "CT_Case_Year10"="Annual" "_3YearPeriod_1"="3-Year Period" "_3YearPeriod_2"="3-Year Period" "_3YearPeriod_3"="3-Year Period" "_3YearPeriod_4"="3-Year Period" "_3YearPeriod_5"="3-Year Period" "_5YearPeriod_1"="5-Year Period" "_5YearPeriod_2"="5-Year Period" "_10YearPeriod"="10-Year Period"; run; Sorry, I'm not sure what you mean by the decode values, so I don't know what to provide there. When I'm looking at the first dataset (where it does apply the format properly), it's showing _5YearPeriod_1 and gets formatted to 5-Year Period. Thank you!

wernie · ‎08-15-2019

I have a few datasets where I have a temporal aggregation variable formatted (see screenshot showing temporal aggregation of "5-Year Period"). I then use the following to combine multiple datasets: data final; set Output_: ; /* There are 4 datasets in this case, so to combine Output_1 through Output_4 */ run; After I do that, I then lose the format (see screenshot now showing temporal aggregation _5YearPeriod_). I did try to add a line in the above code to say format Temp_Agg $Temp_Agg.; but that did not change the format. I also tried to apply that format in a separate data step to the final dataset and that did not work. Thanks!

wernie · ‎08-14-2019

Thanks! It's showing COL1 and COL2 for the new columns, which makes me think it's not doing what I want it to be doing because there should be 3 variables for the different years. I did proc transpose data = in out = new; by CT; run; I think I'm missing something?

wernie · ‎08-14-2019

I have data that I summarized to get cases by year and tract. The example looks like this: Cases Year CT 2 2010 020300 1 2010 020400 2 2011 020500 1 2012 020300 I want to transform the data so I end up with something like this: CT Case2010 Case2011 Case2012 020300 2 0 1 020400 1 0 0 020500 0 2 0 How do I do this?? I'm really stuck. Thanks!

wernie · ‎07-17-2019

Thanks everyone! I was replicating a process that someone else used and they did that, so I was trying to do the same thing. I'm not entirely sure why they did that though as it looks like they ended up putting it back into a regular date and just used the format that I used way back when I imported the data. I'll keep looking through to see if it's really necessary, as mentioned.

wernie · ‎07-17-2019

I have a dataset where I use an informat of anydtdte21. and format MMDDYYs10. when I read in the file. The date field is displayed as something like 10/30/2008. I want to convert that to a character field that would look like 10302008 for a char_date field. After doing that, I want to pull out the month, day, and year, so I have separate fields (char_mo, char_day, char_yr) that would be like 10, 30, and 2008 for this example. I've tried a few things with put, input, and the datepart function, but I can't seem to get anything to work. Any help is appreciated. Thanks!

wernie · ‎07-09-2019

I did try doing something, but I don't have the code in front of me so can't quite recall what I did. I do know I was trying to do proc SQL to merge and I don't think I did it on county and date. I'll try this code and see if that gives what I need. Good point on the value columns and renaming them. Thanks!

wernie · ‎07-09-2019

I have two datasets that I want to merge - one has data from monitors, so it may not include all dates or all counties, and the other has modeled data, so it covers all dates and all counties. I want to merge the data so that the modeled data will fill in for the dates where there are no monitoring data or for the counties where there are no monitoring data. For example, the monitoring data may look like this (note 04Jan14 missing for county 05391 and county 05392 has no observations listed): Date County Value 01Jan14 05391 5.4 02Jan14 05391 4.9 03Jan14 05391 5.1 05Jan14 05391 5.8 01Jan14 05393 10.3 02Jan14 05393 12.1 And the modeled data might look like this (note the observations with * denote those observations that I want to merge into the monitoring dataset to get a complete dataset with all dates and all counties): Date County Value 01Jan14 05391 4.8 02Jan14 05391 5.0 03Jan14 05391 4.9 04Jan14 05391 5.4* 05Jan14 05391 5.8 01Jan14 05392 7.6* 02Jan14 05392 6.7* 03Jan14 05392 6.9* 04Jan14 05392 7.1* 01Jan14 05393 10.3 02Jan14 05393 12.1 I'm not sure how to merge these to get a full set of observations where I keep the monitoring values and then fill in the dates and/or counties that aren't in the monitoring datasets with the values from the modeled data. Thanks!

wernie · ‎06-27-2019

Thanks! That's what I tried after @Reeza posted and didn't get to come back to say that worked. That did what I needed it to do. Now I'm trying to find out how to calculate a population-weighted mean based on all census tracts within a given county for each day. I tried doing that in proc sql, but it made SAS quit (I'm dealing with ~26 million records). Not sure if I can incorporate that into my code posted here for the other purposes.

wernie · ‎06-27-2019

I did try that first and just tried it again, but for some reason, that still gives me more than one row per county (looks like it's still displaying by tract even though that variable isn't there anymore). It's not collapsing it down to one row per county per day. I also tried that using different orders of the variables in the group by statement

wernie · ‎06-27-2019

Hi all, I know I did this before, but can't seem to get this to work right now. I have a dataset that looks something like this: Date State County CensusTract Value StdErr 01Jan2011 01 1001 1001020200 50.40 4.23 01Jan2011 01 1001 1001020300 29.47 1.39 01Jan2011 01 1001 1001020400 68.51 2.77 01Jan2011 01 1003 1003010200 5.38 8.47 01Jan2011 01 1003 1003010300 18.78 6.24 So I have daily values for each census tract within each county within each state. What I want is something that summarizes the maximum value (as well as the mean and the median) for each county in a day. So I want to end up with something like this, which results in a dataset that just has one observation per county per day: Date State County Max_DailyValue Mean_DailyValue Med_DailyValue 01Jan2011 01 1001 100.84 80.11 64.58 01Jan2011 01 1003 68.53 47.22 33.46 01Jan2011 01 1005 85.66 55.81 45.28 01Jan2011 01 1006 74.82 53.27 47.63 I also want to calculate the population-weighted average value for the county, but haven't figured that out yet. I've been trying with variations on this code, but it's not quite right. Any advice is appreciated. Thanks! proc sql; create table county_2011 as select year, date, statefips, countyfips, max(value) as max_value, median(value) as med_value, mean(value) as mean_value from tractfinal_2011 order by date, countyfips; quit;

wernie · ‎06-13-2019

Thank you! I thought I had tried that, but I guess not. That worked.

wernie · ‎06-13-2019

I can't seem to figure this out for some reason. I'm trying to import a .CSV file into SAS. It's a very large file (~20 million + records), so I can't easily change something in Excel, etc. The date appears in the format of MON-DD-YYYY, so an example is Jan-01-2008. How do I use an informat to make sure this gets into SAS properly? I used Proc Import and then tried editing the code in the log to change the informat and nothing I'm doing gets the proper format. The closest I've gotten is just something like JAN2008, but no day. Thanks!

wernie · ‎06-13-2019

Thanks @ballardw! It looks like I do have SAS/Graph (plus I see something called SAS/Graph NV Workshop that I've never heard of/used). I was able to import the tract boundaries. I can see the projection (Albers) and the projected coordinate system (USA Contiguous Albers Equal Area Conic) in ArcGIS. Based on that, I'm not sure what I need to do to my lat/long values (?). I didn't try using GProject because I wasn't sure what to do for that, but tried GINSIDE and it keeps saying the DATA = dataset must have x and y variables, so I'm not sure if that has to do with the GProject step that I'm confused about.

Online Status	Offline
Date Last Visited	‎04-28-2020 03:20 PM

Extracting word(s) before comma delimiter

Re: Transforming a dataset and calculating variables with dates

Transforming a dataset and calculating variables with dates

How to create new dataset with daily dates for each state?

Re: How to add value from variable of one observation to another obser...

Re: How to add value from variable of one observation to another obser...

How to add value from variable of one observation to another observati...

Add leading 0 to census tract value (character var)

Re: Need help merging two files to fill in values

Need help merging to expand dataset

Re: Extracting word(s) before comma delimiter

Re: How to create new dataset with daily dates for each state?

Re: How to create new dataset with daily dates for each state?

Re: How to add value from variable of one observation to another obser...

Re: How to add value from variable of one observation to another obser...

Re: How to add value from variable of one observation to another obser...

Re: Losing character format when setting multiple datasets?

Re: Losing character format when setting multiple datasets?

Losing character format when setting multiple datasets?

Re: How to summarize with new variable names?

How to summarize with new variable names?

Re: Converting date to numeric and separating values

Converting date to numeric and separating values

Re: Join to fill in missing dates and counties

Join to fill in missing dates and counties

Re: Proc SQL to summarize data by county and date

Re: Proc SQL to summarize data by county and date

Proc SQL to summarize data by county and date

Re: Informat for JAN-01-2008 date?

Informat for JAN-01-2008 date?

Re: Assigning census tract to lat/long coordinates?