BookmarkSubscribeRSS Feed
☑ This topic is solved. Need further help from the community? Please sign in and ask a new question.
rmacarthur
Pyrite | Level 9

Hi SAS Friends, 

I'm importing text definitions where there are multiple rows of text within each definition. 

Every definition ends with a unique delimiter '^'.   

For each definition, need all the text captured as a single variable, collecting multiple rows of text in a single variable, but ending with the unique delimiter '^'.

The sample data below should create 5 observations and 1 variable, but the code provided instead creates 18 observations and 1 variable.

 

Here is a sample:

data TD;
infile  datalines  delimiter= '^'  ;
	length 		Text	$5000.;
	informat	Text	$5000.;
	input 		Text		; 
  	datalines;
Bayesian approaches: Approaches to data analysis that provide a posterior probability distribution for some parameter 
(e.g., treatment effect), derived from the observed data and a prior probability distribution for the parameter. 
The posterior distribution is then used as the basis for statistical inference.^
Bias (statistical and operational): The systematic tendency of any factors associated with the
design, conduct, analysis and evaluation of the results of a clinical trial to make the estimate of a
treatment effect deviate from its true value. Bias introduced through deviations in conduct is
referred to as operational bias. The other sources of bias listed above are referred to as
statistical bias.^
Blind review: The checking and assessment of data during the period of time between trial
completion (the last observation on the last subject) and the breaking of the blind, for the purpose
of finalizing the planned analysis.^
Content validity: The extent to which a variable (e.g., a rating scale) measures what it is
supposed to measure.^
Double dummy: A technique for retaining the blind when administering supplies in a clinical
trial, when the two treatments cannot be made identical. Supplies are prepared for Treatment A
(active and indistinguishable placebo) and for Treatment B (active and indistinguishable placebo). 
Subjects then take two sets of treatment either A (active) and B (placebo), or A (placebo) and B
(active).^
;
run;

Really appreciate your help with this !, 

R.

 

1 ACCEPTED SOLUTION

Accepted Solutions
Tom
Super User Tom
Super User

Since you want the whole line just use the automatic _INFILE_ variable to get the text.

There is no need to include a period in a length specification.  Lengths are always integers.

There is no need to attach the $ informat to the variable, SAS does not need special instructions for reading text.

data TD;
  length Text $5000;
  do until(index(_infile_,'^'));
    input;
    text = catx(' ',text,scan(_infile_,1,'^'));
  end;
datalines4;
Bayesian approaches: Approaches to data analysis that provide a posterior probability distribution for some parameter 
(e.g., treatment effect), derived from the observed data and a prior probability distribution for the parameter. 
The posterior distribution is then used as the basis for statistical inference.^
Bias (statistical and operational): The systematic tendency of any factors associated with the
design, conduct, analysis and evaluation of the results of a clinical trial to make the estimate of a
treatment effect deviate from its true value. Bias introduced through deviations in conduct is
referred to as operational bias. The other sources of bias listed above are referred to as
statistical bias.^
Blind review: The checking and assessment of data during the period of time between trial
completion (the last observation on the last subject) and the breaking of the blind, for the purpose
of finalizing the planned analysis.^
Content validity: The extent to which a variable (e.g., a rating scale) measures what it is
supposed to measure.^
Double dummy: A technique for retaining the blind when administering supplies in a clinical
trial, when the two treatments cannot be made identical. Supplies are prepared for Treatment A
(active and indistinguishable placebo) and for Treatment B (active and indistinguishable placebo). 
Subjects then take two sets of treatment either A (active) and B (placebo), or A (placebo) and B
(active).^
;;;;

Screenshot 2025-02-01 at 11.40.11 AM.png

View solution in original post

2 REPLIES 2
Tom
Super User Tom
Super User

Since you want the whole line just use the automatic _INFILE_ variable to get the text.

There is no need to include a period in a length specification.  Lengths are always integers.

There is no need to attach the $ informat to the variable, SAS does not need special instructions for reading text.

data TD;
  length Text $5000;
  do until(index(_infile_,'^'));
    input;
    text = catx(' ',text,scan(_infile_,1,'^'));
  end;
datalines4;
Bayesian approaches: Approaches to data analysis that provide a posterior probability distribution for some parameter 
(e.g., treatment effect), derived from the observed data and a prior probability distribution for the parameter. 
The posterior distribution is then used as the basis for statistical inference.^
Bias (statistical and operational): The systematic tendency of any factors associated with the
design, conduct, analysis and evaluation of the results of a clinical trial to make the estimate of a
treatment effect deviate from its true value. Bias introduced through deviations in conduct is
referred to as operational bias. The other sources of bias listed above are referred to as
statistical bias.^
Blind review: The checking and assessment of data during the period of time between trial
completion (the last observation on the last subject) and the breaking of the blind, for the purpose
of finalizing the planned analysis.^
Content validity: The extent to which a variable (e.g., a rating scale) measures what it is
supposed to measure.^
Double dummy: A technique for retaining the blind when administering supplies in a clinical
trial, when the two treatments cannot be made identical. Supplies are prepared for Treatment A
(active and indistinguishable placebo) and for Treatment B (active and indistinguishable placebo). 
Subjects then take two sets of treatment either A (active) and B (placebo), or A (placebo) and B
(active).^
;;;;

Screenshot 2025-02-01 at 11.40.11 AM.png

rmacarthur
Pyrite | Level 9
Excellent, and thanks for adding "DATALINES4 ", since there are ;'s in the text.
Runs like a charm, much appreciated !

hackathon24-white-horiz.png

The 2025 SAS Hackathon has begun!

It's finally time to hack! Remember to visit the SAS Hacker's Hub regularly for news and updates.

Latest Updates

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 2 replies
  • 933 views
  • 3 likes
  • 2 in conversation