BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
PavelD
Obsidian | Level 7

Hi!

I know informat is using for reading raw data. How can I read data from other datasets (normally with a set statement) using a custom informat? I am looking for a permanent modification of the data, not just formatting the values. Also why predefined informats work with the set statement as seen below?

Here is an example:

 

data test;
  input CC $2.;
  datalines;
LU
AT
BE
;run;

proc format ;
  invalue $tform 
    "LU"  = "XX"
    other = "ZZ";
run;

/* 
Output from a:
Expected  Actual
XX        LU
ZZ        AT
ZZ        BE
 */
data a;
  informat CC $tform.; /* custom informat does not work? */
  set test;
run;

/* What about predefined informat?
Output from b:
Expected  Actual
LU        L
AT        A
BE        B
 */
data b;
  informat CC $1.;    /* predefined informat works? */
  set test;
run;

 

1 ACCEPTED SOLUTION

Accepted Solutions
Astounding
PROC Star

A couple of things that work ...   First run PROC FORMAT.  Then:

 

data test;
  input CC $tform.;
datalines;

LU

AT

BE

;

 

Or:

 

data test;
  input CC $2.;
datalines;

LU

AT

BE

;

 

data want;

set test;

CC = input(CC, $tform.);

run;

View solution in original post

6 REPLIES 6
RW9
Diamond | Level 26 RW9
Diamond | Level 26

??

 

An informat is for reading raw data in a certain way.  Datasets already have formats attached to them, to alter that you change the format of the variable?

data a;
  set test;
  format CC $tform.; 
run;
Astounding
PROC Star

A couple of things that work ...   First run PROC FORMAT.  Then:

 

data test;
  input CC $tform.;
datalines;

LU

AT

BE

;

 

Or:

 

data test;
  input CC $2.;
datalines;

LU

AT

BE

;

 

data want;

set test;

CC = input(CC, $tform.);

run;

PavelD
Obsidian | Level 7

Thank you for clarifying answers and explanations. I am still battling with the basics.

I knew I am using INFORMAT incorrectly, just did not know how exactly.

 

Here is the part I was looking for:


@Astounding wrote:

 

data want;

set test;

CC = input(CC, $tform.);

run;


I.e. using put or input function to rewrite existing data. I wrongly thought I could achieve the same thing with informat. I did not want to use format, as it only changes the visual appearance of the data, not the data itself.

Thanks all!

Tom
Super User Tom
Super User

An INFORMAT converts text to stored values. You use it with an INPUT statement or an INPUT() function.

A FORMAT converts stored values to text. You use it with a PUT statement or a PUT() function.

 

In your case since you are translating character variables you could either use an INFORMAT with an INPUT() function call or a FORMAT with a PUT() function call.

 

In PROC FORMAT you use a VALUE statement to create a FORMAT and an INVALUE statement to create an INFORMAT.

Kurt_Bremser
Super User

The informat does not "work" here:

data b;
  informat CC $1.;    /* predefined informat works? */
  set test;
run;

as you rightfully stated, it only works when reading from raw data with an input statement (or in an input function).

Since here the informat statement sets the attributes before the incoming variables are determined by the set statement, CC is defined with a length of 1, and the values are truncated.

Now run this:

data a;
  informat CC $tform1.;
  set test;
run;

Note that both data steps cause a WARNING because of the truncation. Maxim 2: read the log!

Tom
Super User Tom
Super User

Changing the INFORMAT associated with a dataset really does nothing.  Perhaps if you opened the dataset in FSEDIT and tried to manually add records it might have an impact.

 

You state that this "works".

data b;
  informat CC $1.;    /* predefined informat works? */
  set test;
run;

What you have done is forced SAS to define the variable CC as character with a length on one byte.  Then when it reads in the existing dataset it truncates the data to fit.

 

Remember that FORMAT and INFORMAT are statements are instructions to SAS about what default format or informat to use when translating values to text or the reverse.  An INFORMAT or FORMAT statement only has an impact on the definition of a variable's type and/or length if it is the first place the SAS code references the variable.  In general SAS makes a decision about the variable type/length when you first reference the variable.  So if the first place you reference it is in a INFORMAT statement then SAS will define the type of the variable to match the informat type.  And for character variables it will set the length to match the width of the format.

 

If you want to set the type and/or length of a variable you should use either a LENGTH or ATTRIB statement.

* Truncate CC to one character ;
data b;
  length CC $1;
  set test;
run;

Also note that letting SAS guess at how to define a variable can result in lengths that are probably not what you intended.  For example if you define a character format that formats one letter codes to longer descriptions and then use a FORMAT statement before defining the variable SAS will set the length to match the length of the display values instead of the actual stored values.

proc format ;
value $testf 'L'='LONG' ;
run;

data test;
  length a $1 ;
  format a b $testf. ;
run;

proc contents data=test;
run;
The CONTENTS Procedure

Alphabetic List of Variables and Attributes

#    Variable    Type    Len    Format

1    a           Char      1    $TESTF.
2    b           Char      4    $TESTF.

sas-innovate-2024.png

Don't miss out on SAS Innovate - Register now for the FREE Livestream!

Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.

 

Register now!

What is Bayesian Analysis?

Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.

Find more tutorials on the SAS Users YouTube channel.

Click image to register for webinarClick image to register for webinar

Classroom Training Available!

Select SAS Training centers are offering in-person courses. View upcoming courses for:

View all other training opportunities.

Discussion stats
  • 6 replies
  • 1055 views
  • 4 likes
  • 5 in conversation