About Walternate

Walternate · ‎08-13-2019

That worked perfectly! Thank you

Walternate · ‎08-13-2019

Hi, I have a file with almost 1,000 variables. I want a single report that lists every var in my file, the # of missing values, and the number of non-missing values: Varname nmiss n var1 100 0 var2 0 100 var3 50 50 .... etc. I usually use proc iml for this, but I received an error message about not having enough memory, so I am looking for an alternate methodology. I know that missingness for character and numeric variables can be displayed using PROC FREQ, but I really want the output all together instead of separate for each variable. Note that I have too many vars to list all of them by hand, and that I have both numeric and character variables. Any help is much appreciated!

Walternate · ‎07-11-2019

I'm looking for one process to zip and one to unzip files using SAS programming. The trick is that I need to maintain the original file creation date for each file that is zipped/unzipped (ie, not the date of zipping/unzipping). I have tried ODS PACKAGE and FILENAME ZIP, but neither of these maintained the file date (although maybe there is some sort of special way to set it up that would allow me to do so). Any help is much appreciated. Thanks!

Walternate · ‎07-02-2019

That's perfect, thank you!

Walternate · ‎07-02-2019

Hi, I have two data files. Each one has an ID variable, var1-var7 (which are the same variables on each file), and then the first file has some additional variables not found on the other file. The ID variable is unique on both files, BUT on File1 can sometimes be missing. Some of the IDs match across files, but there are also IDs that are unique to each. If the ID is missing, var1 - var7 will be missing, but other vars can be populated. If the IDs match across both files, var1-var7 will only be populated on File2. FILE1 ID var1 var2 var3....var7 other_vars... 1 xyz 2 d e f xyz . xyz FILE2 ID var1 var2 var3....var7 1 a b c d 3 a b c d What I want is to combine the files such that: 1. All records from File1 and File2 are kept (even those with no ID) 2. If a record is in both files, the var1-var7 values come from File2 3. If a record is just in File1 or just in File2, all values should remain intact for all variables. I considered using UPDATE to update File1 using File2 (as the update process is essentially what I want to do), but the missing ID variables seemed to be problematic when I tried. I also tried using a data step merge and then a PROC SQL join, but in both of those cases, either the File1-only or File2-only values were being overwritten.

Walternate · ‎06-26-2019

filename zip1 zip "&dir./&file..zip"; filename ds "%sysfunc(getoption(work))/test.sas7bdat"; /* Read the "members" (files) from the ZIP file */ data contents (keep=memname isfolder); length memname $200 isfolder 8; fid=dopen("zip1"); if fid=0 then stop; memcount=dnum(fid); do i=1 to memcount; memname=dread(fid,i); isFolder = (first(reverse(trim(memname)))='/'); output; end; rc=dclose(fid); run; data _null_; set contents; if index(memname, "arc") > 0 then call symput('fname', memname); run; data _null_; infile zip1(&fname) lrecl=256 recfm=F length=length eof=eof unbuf; file ds lrecl=256 recfm=N; input; put _infile_ $varying256. length; return; eof: stop; run;

Walternate · ‎06-26-2019

Hi everyone, I'm having a very strange problem. I have a program which refers to a zip file using a filename statement: filename zip1 zip "&dir./&file..zip"; Then later, I try to read in a SAS file from within the zip file using the infile statement: data _null_; infile zip1(C:/dirname/filename.sas7bdat) lrecl=256 recfm=F length=length eof=eof unbuf; -other unrelated stuff- run; This works just fine. The issue is that I wanted to automatically generate the name of the SAS file. I set up a step to do this and it produces exactly the value I wanted (C:/dirname/filename.sas7bdat). The issue is that the data _null_ step above will work if I use the value of the macro variable written out (C:/dirname/filename.sas7bdat), but will generate an error message if I substitute that with &macrovar. I tried putting " " around &macrovar (no help), and macrotizing just parts of the filepath (which worked fine, but I want to macrotize the whole thing). The error message I get says that the SAS file does not exist within the zip file. Any help is much appreciated!

Walternate · ‎05-12-2017

Hi, I have gotten to the point where I'm running so many programs that it would take too much time to go through and document each run by hand. I'm looking to automate part or all of the process of documenting programs as (or after) I run them. Ideally, this is the information I would like (but if not all of it is possible, I will take what I can get): -Date program was run -Name and filepath of program -Input files used by program -Output files produced by program -N of output files produced by program Any help is much appreciated.

Walternate · ‎12-16-2016

Hi, I am using proc sql to pull the variable names and labels from a datafile I have and put them into macro vars (one each for varnames and labels). The issue is that I only want to pull in the varname+label pairs for those variables that do not have year values in the labels. This is what I've tried: proc sql noprint; select name, label into :varnames separated by ' ', :varlabels separated by '#' from dictionary.columns where upcase(libname)='MYLIB' and upcase(memname)='DATAFILE1' and label not contains ('%20%'); quit; It runs without any error messages, but it pulls in all variables even if the labels do have years in them. Any help is much appreciated!

Walternate · ‎11-11-2016

Hi, I have two text files. Each of them has a continuous numeric variable (the same variable across both, ContVar); one of them also has a categorical var. Basically, I want to create a format in which all values of ContVar in either text file are valid values for Var1 in my SAS dataset. Textfile 1 ContVar CategVar 12345 abc 23456 def 34567 ghi etc. Textfile 2 Contvar 890123 874321 etc. I know the simple but inefficient way to do this (1. Create SAS dataset from textfile 1, 2. Create SAS dataset from textfile2, 3. Stack them and then create the format file). I'm hoping there is a more efficient way. When I thought I only needed the values from Textfile 1, this is the formatting step I came up with: data want (keep=start label hlo fmtname); infile "path/Textfile1.txt" end=eof; input start CategVar $; retain fmtname '$fmt_name'; retain label 'VALID'; output; if eof; start=' '; label='NOT VALID'; hlo='O'; output; run; proc format cntlin=want;run; I'm hoping Textfile 2 can just somehow be incorporated into that (or similar) rather than the process I outlined above. Any help is much appreciated.

Walternate · ‎09-23-2016

I've only used a proc glm so far. I have just read that it is possible to get robust SEs in proc reg using some option (it's /white or something to that effect). What I've done so far is a proc glm statement like this: proc glm data=mydata; absorb categ_var; model dependent=indep_var1 indep_var2/solution noint; run; This gives me estimates of the coefficients of the indep_vars, it does NOT give me coefficients for all the values of categ_var (which I don't need anyway), but takes categ_var into account in the calculations. The only difference between the output from that step and what I need is that from what I can tell, there's no way to get robust SEs in proc glm. PROC REG and PROC SURVEYREG have class statements but not absorb statements, so the output includes coefficients for all the levels of the categorical variable, which I don't need and which makes the model take a much longer time to run.

Walternate · ‎09-23-2016

Thank you for your response! I tried both of the methods you suggested, but the models were taking forever to run because unlike proc glm, they couldn't "absorb" the categorical variable with hundreds of values, so SAS is struggling to calculate coefficients for all of the dummies from the categorical variable even though I don't need to know the coefficents for each level of the categorical var.

Walternate · ‎09-23-2016

Hi, I have a dataset with a categorical variable with hundreds of values, many dummy variables, and a continuous variable. I'm trying to create a regression model with the continuous variable as the dependent variable and the dummies/categorical variable as the independent variables, and include robust standard errors in the output. I know two ways to create linear regression models in SAS: proc glm can convert the categorical var to dummies and suppress the output of the different levels, but from what I can tell it can't produce robust standard errors. Proc reg can get me the robust SEs, but can't deal with the categorical variable. Is there some sort of workaround, or even an alternative procedure that would yield everything I need in one step? Any help is much appreciated.

Walternate · ‎09-14-2016

Hi, I have a dataset with many categorical variables and one continuous numeric variable. I am trying to make a table summarizing descriptive statistics for the continuous variable across the levels of each of the categorical variables. I used the following proc tabulate code: proc tabulate data=mydata; var continuous_var; class categ_var1 categ_var2 categ_var3 categ_var4 categ_var5; table categ_var1 categ_var2 categ_var3 categ_var4 categ_var5, continouous_var*(median etc.); run; This creates the table I want: Continuous_var Median...etc. Categ_var1 Level1 123 Level2 345 Categ_var2 Level1 etc. The issue is that I would like to have a subtotal row for each of the categorical variables giving the values of the descriptive stastistics for everyone with a value for that caetgorical variable, like this: Continuous_var Median...etc. Categ_var1 Level1 123 Level2 345 Subtotal 234 Categ_var2 etc. I tried inserting "all" after each categ_var, and this added a row called all to the end of each categ_var section, but the numbers in each of the all segments are the same, so I think it's calculating some kind of total rather than a subtotal specific to each categorical var. Any help is much appreciated.

Walternate · ‎06-30-2016

It was a one-to-one merge. Thanks!

Online Status	Offline
Date Last Visited	‎02-27-2025 04:00 PM

ODS Excel - building within-document hyperlink using a numeric row var...

Reading in SAS program and not seeing the formatting the way it shoul...

Weird characters messing up directory/file name macros

Using libname to create directories when some directories not represen...

How to build an output indicating which numbers in a range are not pre...

Re: Possible to remove carriage returns from a string and leave the re...

Re: Pattern matching to two different patterns

Pattern matching to two different patterns

Re: Possible to remove carriage returns from a string and leave the re...

Possible to remove carriage returns from a string and leave the rest o...

Re: Parsing a character string based on format

Re: Residuals in logistic regression

Merge step overwriting shared vars?

Transposing multiple variables

Re: Missing values in infile statement

Re: Missingness report for all vars (char and numeric) in a file

Missingness report for all vars (char and numeric) in a file

Maintaining file creation dates while zipping and unzipping in SAS

Re: Combining two files with matching variables

Combining two files with matching variables

Re: Macrotizing file name in infile file reference

Macrotizing file name in infile file reference

SAS programs to automatically document programs/datafiles

PROC SQL NOT CONTAINS

Combining two text files

Re: Linear regression in SAS with robust SEs and large categorical var...

Re: Linear regression in SAS with robust SEs and large categorical var...

Linear regression in SAS with robust SEs and large categorical vars

Subtotals on multiple different variables using PROC TABULATE

Re: Merge step overwriting shared vars?