BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
Goffy123
Calcite | Level 5

Compute Descriptive Statistics of score by review_weekday?

 

This is my question any help into what to do in code? 

1 ACCEPTED SOLUTION

Accepted Solutions
Reeza
Super User

Assuming the variable is numeric you would use PROC MEANS, here's a fully worked example, copy and paste into SAS and run to see the example. If you have categorical data, it's a different process.

 

*Create summary statistics for a dataset by a 'grouping' variable and store it in a dataset;

*Generate sample fake data;
data have;
	input ID          feature1         feature2         feature3;
	cards;
1               7.72               5.43              4.35
1               5.54               2.25              8.22 
1               4.43               6.75              2.22
1               3.22               3.21              7.31
2               6.72               2.86              6.11
2               5.89               4.25              5.25 
2               3.43               7.30              8.21
2               1.22               3.55              6.55

;
run;

*ensure sort before means;
proc sort data=have;
by id;
run;

*Create summary data;
proc means data=have noprint;
	by id;
	var feature1-feature3;
	output out=want median= var= mean= /autoname;
run;

*Show for display;
proc print data=want;
run;

*First done here:https://communities.sas.com/t5/General-SAS-Programming/Getting-creating-new-summary-variables-longitudinal-data/m-p/347940/highlight/false#M44842;
*Another way to present data is as follows;

proc means data=have stackods nway n min max mean median std p5 p95;
    by id;
    var feature1-feature3;
    ods output summary=want2;
run;

*Show for display;
proc print data=want2;
run;

 

https://github.com/statgeek/SAS-Tutorials/blob/master/proc_means_basic.sas

 


@Goffy123 wrote:

Compute Descriptive Statistics of score by review_weekday?

 

This is my question any help into what to do in code? 


 

View solution in original post

8 REPLIES 8
Reeza
Super User

Assuming the variable is numeric you would use PROC MEANS, here's a fully worked example, copy and paste into SAS and run to see the example. If you have categorical data, it's a different process.

 

*Create summary statistics for a dataset by a 'grouping' variable and store it in a dataset;

*Generate sample fake data;
data have;
	input ID          feature1         feature2         feature3;
	cards;
1               7.72               5.43              4.35
1               5.54               2.25              8.22 
1               4.43               6.75              2.22
1               3.22               3.21              7.31
2               6.72               2.86              6.11
2               5.89               4.25              5.25 
2               3.43               7.30              8.21
2               1.22               3.55              6.55

;
run;

*ensure sort before means;
proc sort data=have;
by id;
run;

*Create summary data;
proc means data=have noprint;
	by id;
	var feature1-feature3;
	output out=want median= var= mean= /autoname;
run;

*Show for display;
proc print data=want;
run;

*First done here:https://communities.sas.com/t5/General-SAS-Programming/Getting-creating-new-summary-variables-longitudinal-data/m-p/347940/highlight/false#M44842;
*Another way to present data is as follows;

proc means data=have stackods nway n min max mean median std p5 p95;
    by id;
    var feature1-feature3;
    ods output summary=want2;
run;

*Show for display;
proc print data=want2;
run;

 

https://github.com/statgeek/SAS-Tutorials/blob/master/proc_means_basic.sas

 


@Goffy123 wrote:

Compute Descriptive Statistics of score by review_weekday?

 

This is my question any help into what to do in code? 


 

Goffy123
Calcite | Level 5
I have categoric data does this mean it is a different way?
ballardw
Super User

@Goffy123 wrote:
I have categoric data does this mean it is a different way?

What descriptive statistics do you want?

Goffy123
Calcite | Level 5
The average of variable score by variable weekday
Reeza
Super User

Provide more details and I can answer it then. For example, you can calculate the mean of a categorical variable so not sure how that would work, given your other response.

 

EDIT - added last sentence.


@Goffy123 wrote:
I have categoric data does this mean it is a different way?

 

Goffy123
Calcite | Level 5
My dataset is named Lasvegas and from that dataset I am trying to find the average of score by Review_weekday. The Score variable is 1-5 and the Review_ weekday is Monday, Tuesday.... Sunday.
Reeza
Super User

If your only categorical variable is the grouping variable, ie weekday, that's fine since you're not summarizing the categorical data.

Did you run the example above? Did it work?

 

You can align your variables the same way and it should work fine. 

 

If you're having issues post the code you're using and detail any issues. 

 


@Goffy123 wrote:
My dataset is named Lasvegas and from that dataset I am trying to find the average of score by Review_weekday. The Score variable is 1-5 and the Review_ weekday is Monday, Tuesday.... Sunday.

 

hackathon24-white-horiz.png

2025 SAS Hackathon: There is still time!

Good news: We've extended SAS Hackathon registration until Sept. 12, so you still have time to be part of our biggest event yet – our five-year anniversary!

Register Now

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 8 replies
  • 2023 views
  • 0 likes
  • 4 in conversation