turn on suggestions

Auto-suggest helps you quickly narrow down your search results by suggesting possible matches as you type.

Showing results for

Find a Community

Topic Options

- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

04-09-2013 02:10 PM

How can I do a two sample KS test in SAS IML of the null hypothesis that the two empirical distributions were drawn from the same *continuous* distribution?

Accepted Solutions

Solution

04-09-2013
03:10 PM

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

Posted in reply to SidBajpai

04-09-2013 03:10 PM

Probably the easiest way is to leverage PROC NPAR1WAY, like this:

proc iml;

/* the data */

x = T(1:10);

y = {2,2,3,5,5,6,6,8,9,10};

/* concatenate data and add ID variable; write to data set */

ID = repeat(1, nrow(x)) // repeat(2, nrow(x));

z = x // y;

create KSData var {ID z};

append;

close KSData;

quit;

proc npar1way data=KSData edf;

class ID;

var z;

run;

If you need to stay inside PROC IML, you can use the SUBMIT and ENDSUBMIT statements to call PROC NPAR1WAY and then read in whatever statistics you need from the procedure output. For details and an example, see Video: Calling SAS procedures from the SAS/IML language - The DO Loop

All Replies

Solution

04-09-2013
03:10 PM

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

Posted in reply to SidBajpai

04-09-2013 03:10 PM

Probably the easiest way is to leverage PROC NPAR1WAY, like this:

proc iml;

/* the data */

x = T(1:10);

y = {2,2,3,5,5,6,6,8,9,10};

/* concatenate data and add ID variable; write to data set */

ID = repeat(1, nrow(x)) // repeat(2, nrow(x));

z = x // y;

create KSData var {ID z};

append;

close KSData;

quit;

proc npar1way data=KSData edf;

class ID;

var z;

run;

If you need to stay inside PROC IML, you can use the SUBMIT and ENDSUBMIT statements to call PROC NPAR1WAY and then read in whatever statistics you need from the procedure output. For details and an example, see Video: Calling SAS procedures from the SAS/IML language - The DO Loop

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

Posted in reply to Rick_SAS

04-10-2013 09:49 AM

Thanks Dr Wicklin,

This is really helpful.

However if in the code above,

x is a nxm matrix,

y is a px1 vector,

where n≠m≠p

The value n,m and p can change each time I run the code.

Now if I have to get the P value of the KS test for each column of matrix x by the column vector y, how do you suggest I modify this code above in IML.

Thanks

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

Posted in reply to SidBajpai

04-10-2013 10:13 AM

It's almost the same, you just need to replicate y and run PROC NPAR1WAY on x1-xm. For example:

proc iml;

/* the data */

x = T(1:10)|| T(0:9) || T(2:11);

y = {2,5,3,8,7,6,6,9};

/* concatenate data and add ID variable; write to data set */

z = x // repeat(y, 1, ncol(x));

ID = repeat(1, nrow(x)) // repeat(2, nrow(y));

Q = ID || z;

varNames = "ID" || ("x1""x"+strip(char(ncol(x)))));

create KSData from Q[c=varNames];

append from Q;

close KSData;

/* submit; */

proc npar1way data=KSData plots=none noprint;

class ID;

var x:;

output out=KSOUT edf;

run;

/* endsubmit; */

Now read the p-values, which are in the KSOUT data set.

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

Posted in reply to Rick_SAS

04-10-2013 01:43 PM

That works,

Thanks