01-25-2011 06:50 PM

I have 68 plots, each 2 km x 2 km, for which the proportions of 5 land cover classes (the proportions sum to 1.0 within each plot) have been determined by 2 methods. How can I determine whether the two methods generate significantly different results?

I can use a paired t-test to determine whether the proportions generated by the two methods for a single class are significantly different. What I am looking for is an overall assessment of whether the two methods generate significantly different results across all 5 land cover classes.

Thanks in advance for any input.


Posted in reply to Frank_PFC

01-26-2011 04:35 PM

The table to be analyzed here is a 5x5 table (method1 x method2), and each of the 68 plots falls into one of the 25 cells. As stated, it sounds like you want a test of marginal homogeneity, that is, a test of whether the 5 row proportions (the marginal proportions for one method) are the same as the 5 column proportions (the marginal proportions for the other method).

You can test for marginal homogeneity as shown in the example titled "Repeated Measures, 4 Response Levels, 1 Population" in the PROC CATMOD documentation. You will need to create a data set with one observation per plot and with METHOD1 and METHOD2 variables containing the plots' classes under each method. Then METHOD1 and METHOD2 are used like the RIGHT and LEFT variables in the example. See this usage note for additional interpretation:

http://support.sas.com/kb/39243
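To make the table layout concrete, here is a minimal sketch in Python (not SAS) of the 5x5 cross-classification and the two sets of marginal proportions that a marginal homogeneity test compares. It assumes each plot has been assigned a single class under each method; the function names and the six-plot data are made up for illustration, and the sketch only builds the table, it does not perform the CATMOD test.

```python
from collections import Counter


def cross_table(method1, method2, k=5):
    """Return a k x k table of plot counts.

    Rows are the classes assigned by method 1, columns the classes
    assigned by method 2. Classes are coded 1..k.
    """
    counts = Counter(zip(method1, method2))
    return [[counts[(i, j)] for j in range(1, k + 1)] for i in range(1, k + 1)]


def marginal_proportions(table):
    """Return (row proportions, column proportions) of a square count table."""
    n = sum(sum(row) for row in table)
    row_props = [sum(row) / n for row in table]
    col_props = [sum(col) / n for col in zip(*table)]
    return row_props, col_props


# Hypothetical class assignments for six plots (illustration only).
method1 = [1, 2, 2, 3, 5, 4]
method2 = [1, 2, 3, 3, 5, 5]

table = cross_table(method1, method2)
row_props, col_props = marginal_proportions(table)

print("method 1 marginals:", row_props)
print("method 2 marginals:", col_props)
```

Marginal homogeneity is the hypothesis that the two printed lists are equal in the population; the off-diagonal cells of the table are the plots on which the methods disagree.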


Posted in reply to StatDave_sas

01-27-2011 01:56 PM

Thank you for your suggestion.

The approach you suggest works for the case where the 2 classifications are intersected spatially in a geographic information system and the area common to each of the 5 land cover classes in the two methods is evaluated in a contingency table to calculate a user's and producer's accuracy and an overall accuracy.

In my case, I would like to test the similarity of the two sets of proportions as if they are describing two independent populations. In this case, there are two sets of proportions that add up to 1.0 in each plot. I want to know if the two sets of proportions are significantly different from one another.


Posted in reply to Frank_PFC

01-28-2011 12:50 PM

Frank,

You can use the MIXED procedure to test multiple correlated land cover class differences. First, set up your data as within-plot differences for the first four cover classes. (The fifth cover class is an exact linear combination of the first four, so it can be excluded from use. If you had just two cover classes, you would only use the within-plot difference of the first land cover class as your response variable.) You will need each within-plot difference as a separate result with a label indicating which land cover class the difference belongs to. That is, you want your data to be constructed as shown below:

| plot | CoverClass | diff |
|------|------------|------|
| 1 | 1 | diff[1,1] |
| 1 | 2 | diff[1,2] |
| 1 | 3 | diff[1,3] |
| 1 | 4 | diff[1,4] |
| 2 | 1 | diff[2,1] |
| 2 | 2 | diff[2,2] |
| 2 | 3 | diff[2,3] |
| 2 | 4 | diff[2,4] |
| ... | ... | ... |
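As a rough illustration of this layout, the sketch below (in Python rather than SAS) turns per-plot proportions into one row per plot and cover class. The input structure, the function name `to_long_differences`, and the two example plots are assumptions made up for the example; the difference is taken as method 1 minus method 2.

```python
def to_long_differences(plots, n_classes_used=4):
    """Build (plot, CoverClass, diff) rows from per-plot proportions.

    plots: list of (plot_id, proportions_method1, proportions_method2),
    where each proportions list has one entry per cover class and sums to 1.
    Only the first n_classes_used classes are kept, because the last
    difference is determined by the others.
    """
    rows = []
    for plot_id, p1, p2 in plots:
        for c in range(n_classes_used):
            rows.append((plot_id, c + 1, p1[c] - p2[c]))
    return rows


# Hypothetical proportions for two plots (illustration only).
plots = [
    (1, [0.50, 0.20, 0.10, 0.15, 0.05], [0.45, 0.25, 0.10, 0.10, 0.10]),
    (2, [0.30, 0.30, 0.20, 0.10, 0.10], [0.35, 0.25, 0.20, 0.15, 0.05]),
]

long_rows = to_long_differences(plots)
for row in long_rows:
    print(row)
```

Because both sets of proportions sum to 1.0 in each plot, the five differences sum to zero, so the fifth difference is minus the sum of the four that are kept.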

With your data in this form, the following PROC MIXED code would be appropriate:

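    proc mixed data=mydata;
      class plot CoverClass;
      /* noint: fit one mean difference per cover class; s prints the estimates */
      model diff = CoverClass / s noint;
      /* type=un: unstructured covariance among the four differences within a plot */
      repeated CoverClass / subject=plot type=un;
    run;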
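For intuition about what a joint test of the four mean differences looks like, here is a minimal sketch in Python (not SAS) of a closely related calculation: a one-sample Hotelling T² test that the mean difference vector is zero, using the sample covariance of the within-plot differences. This is a swapped-in textbook technique, not a reproduction of the PROC MIXED output, whose F statistic and denominator degrees of freedom may differ. The function names and the small two-class data set are made up for illustration.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    aug = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(aug[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (aug[r][n] - s) / aug[r][r]
    return x


def hotelling_t2(diffs):
    """One-sample Hotelling T^2 test that the mean difference vector is zero.

    diffs: one list per plot holding the within-plot differences for the
    cover classes in use. Returns (T^2, F statistic, (df1, df2)).
    Requires more plots than classes and a non-singular covariance matrix.
    """
    n, p = len(diffs), len(diffs[0])
    mean = [sum(row[j] for row in diffs) / n for j in range(p)]
    # Sample covariance matrix of the differences (divisor n - 1).
    cov = [[sum((row[i] - mean[i]) * (row[j] - mean[j]) for row in diffs) / (n - 1)
            for j in range(p)] for i in range(p)]
    x = solve(cov, mean)  # x = cov^{-1} * mean
    t2 = n * sum(m * xi for m, xi in zip(mean, x))
    f_stat = t2 * (n - p) / (p * (n - 1))
    return t2, f_stat, (p, n - p)


# Hypothetical differences for four plots and two classes (illustration only).
example_diffs = [[2, 1], [1, 2], [3, 1], [2, 0]]
t2, f_stat, dfs = hotelling_t2(example_diffs)
print(t2, f_stat, dfs)
```

With the real data there would be 68 rows of four differences each, and the F statistic would be referred to an F distribution with (4, 64) degrees of freedom. Dropping the fifth class matters here for the same reason as in the MIXED setup: with all five differences the covariance matrix is singular.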


Posted in reply to Dale

02-01-2011 12:38 PM

Thanks very much for your suggestion, Dale. I haven't had a chance yet to try it out but my colleagues think it should do the trick.