12-18-2015 12:56 PM
data test; do i = 1 to 10; N=_n_; output; end; run; data test; do i = 1 to 10; output; N=_n_; end; run;
I am comparing these programs. The second program has first missing N. My impression is that do loop is executing on the first implicit loop of the data test. That is why N is 1 for all observations. I am not sure why second progam creates first missing N.
12-18-2015 01:19 PM
From the documentation: _N_ is initially set to 1. Each time the DATA step loops past the DATA statement, the variable _N_ increments by 1. The value of _N_ represents the number of times the DATA step has iterated.
Since you do not have a set statment bringing in data then the Data step is only interated once so _n_ is 1.
The rest is timing.
12-18-2015 01:38 PM - edited 12-18-2015 04:03 PM
Thanks, Reeza and ballardw. Is there any exmple of explicit loop and implicit loop working together? I have heard about dow-loop, I wonder if it is example of interplay of inner and outer loop.
12-18-2015 04:02 PM
12-18-2015 05:02 PM
It shows that SET is an executeable statment that can be conditionally executed, for instead use one data set and condtionally bring in values from another data set, interleave two or more datasets, use a variable in one data set to controll how many records are brought from another while repeating the values from the first set.
All kinds of ways to mix things up. Also the reason why you'll find cautions about using _n_ as counter for number of records. _n_ will work fine for single datasets or a single Set with multiple data sets. But as soon as you have 2 or more Set statements the reliability of _n_ to tell you the number of records goes way down.
Need further help from the community? Please ask a new question.