declanjohn
Calcite | Level 5

Hi there, these are the tasks I need to complete using SAS tasks. A template for the code I need to use to perform each task would be very helpful. Thanks in advance. 

 

1) Import the data from air_pollution.csv to SAS (3)
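
For task 1, a minimal template might look like the following. The file path and the output data set name (work.air_pollution) are placeholders I am assuming, not something given in the assignment.

```sas
/* Assumption: replace the path with the real location of your CSV file. */
proc import datafile="/path/to/air_pollution.csv"
    out=work.air_pollution   /* hypothetical data set name */
    dbms=csv
    replace;
    getnames=yes;            /* first row holds the column names */
    guessingrows=max;        /* scan all rows before choosing column types */
run;

/* Check that the measurement columns were read as numeric. */
proc contents data=work.air_pollution;
run;
```

If a measurement column such as pm2_5 comes in as character, the later procedures will not accept it as an analysis variable, so the PROC CONTENTS output is worth a look before moving on.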

Weather in Beijing

  2)  Produce the frequencies of all values of combined wind direction (cbwd) (from most frequent to least frequent) (2)
  3)  What is the most frequently occurring wind direction? (1)
  4)  Calculate the correlation coefficient between temperature (TEMP) and pressure (PRES) (3)
  5)  Is the correlation positive or negative? What is its strength (very weak/weak/moderate/strong/very strong)? (2)
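
Tasks 2 and 4 could be sketched as below, assuming the imported data sits in a data set called work.air_pollution (a name I am assuming). Tasks 3 and 5 are then answered by reading the output.

```sas
/* Task 2: ORDER=FREQ lists the values from most to least frequent. */
proc freq data=work.air_pollution order=freq;
    tables cbwd;
run;

/* Task 4: Pearson correlation between temperature and pressure. */
proc corr data=work.air_pollution pearson;
    var TEMP PRES;
run;
```

The first row of the PROC FREQ table is the most frequent wind direction, and the sign and size of the coefficient in the PROC CORR matrix give the direction and strength of the relationship.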

How does air pollution vary over months?

  6)  Compute the descriptive statistics of pm2_5 by month (3)
  7)  On average, in which month is the pollution level highest, and in which month is it lowest? (2)
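
For task 6, one possible template uses PROC MEANS with month as a CLASS variable. The data set name work.air_pollution is an assumption, and the list of statistics is only a suggestion.

```sas
/* Task 6: descriptive statistics of pm2_5 for each month. */
proc means data=work.air_pollution n mean std min median max maxdec=2;
    class month;   /* one row of statistics per month */
    var pm2_5;
run;
```

Comparing the Mean column across the twelve rows answers task 7.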

Relationship between air pollution and weather

  8)  Build a linear regression model where pm2_5 is a dependent variable, and month, DEWP, TEMP, PRES, cbwd, Iws, Is and Ir are independent variables (8)
  9)  What is the R-squared of this model? Based on the R-squared, does the model fit the data well? (4)
  10)  Create a new variable (high_pm2_5) that takes value 1 if pm2_5 is greater than 150 ug/m^3 and value 0 otherwise (3)
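
A rough sketch for tasks 8 and 10 follows. It assumes the data set is called work.air_pollution, and it treats month and cbwd as categorical (CLASS) variables, which is my reading of the task rather than something the assignment states. PROC GLM is used because it accepts CLASS variables directly.

```sas
/* Task 8: linear model with categorical month and wind direction. */
proc glm data=work.air_pollution;
    class month cbwd;
    model pm2_5 = month DEWP TEMP PRES cbwd Iws Is Ir / solution;
run;
quit;

/* Task 10: flag observations with pm2_5 above 150.                  */
/* Assumption: rows with missing pm2_5 stay missing rather than 0.   */
data work.air_pollution;
    set work.air_pollution;
    if missing(pm2_5) then high_pm2_5 = .;
    else high_pm2_5 = (pm2_5 > 150);   /* comparison returns 1 or 0 */
run;
```

The R-Square value for task 9 appears in the fit statistics table of the PROC GLM output.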

11) Develop a logistic regression model where high_pm2_5 is a dependent variable, and month, DEWP, TEMP, PRES, cbwd, Iws, Is and Ir are independent variables (8)

12) What is the AUC (c statistic) of this model? Based on the AUC, does the model separate high and low pollution levels well? (4)
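
Task 11 might be set up as below, assuming high_pm2_5 has already been created as described in task 10 and the data set is called work.air_pollution (an assumed name). As with the linear model, treating month and cbwd as CLASS variables is my assumption.

```sas
/* Task 11: logistic regression modelling the probability that high_pm2_5 = 1. */
proc logistic data=work.air_pollution;
    class month cbwd / param=ref;
    model high_pm2_5(event='1') = month DEWP TEMP PRES cbwd Iws Is Ir;
run;
```

The c statistic for task 12 is reported in the "Association of Predicted Probabilities and Observed Responses" table of the output.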

The worst smog in Beijing

  13)  Create a new data set (avg_air_pollution) and calculate an average pollution level (avg_pm2_5) for each month of each year. Select the variables year, month and avg_pm2_5 (10)
  14)  In which of the 60 analysed months was the average pollution level (avg_pm2_5) highest? This was the worst smog Beijing has experienced for over 50 years (2)
  15)  Based on the new data set, create a pdf report (highest_avg_air_pollution.pdf) that contains the list of those months in which the average pollution level (avg_pm2_5) was greater than 100 ug/m^3. Use the Moonflower style and add a title ("Months With Highest Average Air Pollution") (5)
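
Tasks 13 and 15 could be approached roughly as follows. The input data set name (work.air_pollution), the presence of a year variable in it, and the output folder are assumptions on my part.

```sas
/* Task 13: average pm2_5 for each month of each year. */
proc sql;
    create table work.avg_air_pollution as
    select year, month, mean(pm2_5) as avg_pm2_5
    from work.air_pollution
    group by year, month
    order by avg_pm2_5 desc;   /* highest average first, which helps with task 14 */
quit;

/* Task 15: PDF report of months with an average above 100. */
/* Assumption: replace the folder with one you can write to. */
ods pdf file="/path/to/highest_avg_air_pollution.pdf" style=Moonflower;
title "Months With Highest Average Air Pollution";
proc print data=work.avg_air_pollution noobs;
    where avg_pm2_5 > 100;
run;
title;          /* clear the title afterwards */
ods pdf close;
```

Because the table is sorted in descending order of avg_pm2_5, its first row is the month asked for in task 14.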
1 REPLY
andreas_lds
Jade | Level 19

So you posted the same thing in three different sections of the communities. To remain polite: this was not necessary.

 

Links:

https://communities.sas.com/t5/Community-Matters/Help-SAS-coding/m-p/524054

https://communities.sas.com/t5/SAS-Procedures/Coding-help/m-p/524055

 

SAS Studio has a task for importing text files; you should start with that. Good luck.
