Data visualization with SAS programming

summarized report and graph help

Posts: 46

summarized report and graph help

summarized report and graph help

I have a SAS dataset that has following the variables like following example:

Cust_ID               State     City                         Income             Profit           age

1                         TX         Houston                   15000             400               24

5                         OR       Portland                   20000             300             40

7                         TX         Houston                   7500                 100             23

8                         VA       Arlington                     4000               340             38

9                         OR       Portland                   17000             120               19

15                       CA         Sacremento             9000                 300             45

3                         CA       San francisco           14000             240               56


The actual dataset is 20 million records with unique cust_ids, however i want a summarization done at state level, age level, city level and represent the report in the form of a bar chart or the best possible graph to make sense and make reading easy.


Q1. For state, i want the the top 10 states contributing to the maximum profit starting left most to right most.


Q2. For city,, I want the top 10 cities contributing to the maximum profit same as above (BUT "NOT" CITY GROUPED AMONG STATES, THAT IS CITY IS CONSIDERED A DIFFERENT ENITITY AND NOT WITHIN STATE)


Q3. For age, i want to group 0-20, 20-30,30-40,40-50 age ranges and create a bar charts to visualise the age ranges that are top contributors to the profit.


Can anyone help me with the use of SAS summrization and graph example please? In excel i know well, but SAS i need help please

Super User
Posts: 12,148

Re: summarized report and graph help

Posted in reply to Allaluiah

Since this look a lot like homework here's some hints. Post back with your attempts it you don't get solutions that work.


Q1 Summarize the data using state as a grouping variable to create an output data set with the desired totals. The sort descending by the total variable. In your graphing procedure you can indicate the sorted data set and use the data set option OBS=10 to get the first 10 records into the plot.


Q2.Summarize as above with city as the grouping variable.


Q3. Create a custom format that assigns age group values to the desired range and associate that format with the age variable in the graphing procedure.


Ask a Question
Discussion stats
  • 1 reply
  • 2 in conversation