BookmarkSubscribeRSS Feed
hockeydata16
Calcite | Level 5

Hello,

I am needing help determining which are the appropriate predictive statistical tests to run to prepare my dataset for predicting future sales for each location. The dataset consists of 24 observations, a month column (independent variable), and 4 different City columns (dependent variables). The values recorded for each city during the observation is the total sales (in dollars). 

 

The dataset essentially looks like:

 

Month     Dallas     Boston     NYC      Chicago

1              34245     58745     62145     12345

.......

24            87451     58472     21451     64125

 

I think a correlation test will be needed to see if there is a relationship between any of the cities and then finally a linear regression model to predict future values of sales at each location. thank you in advance for your assistance!

1 REPLY 1
PaigeMiller
Diamond | Level 26

This seems to me like any prediction ought to take into account the auto-correlation between the different time points, and so linear regression would not work well if there is such auto-correlation. You probably ought to consider time-series modeling, if you have SAS/ETS then there are a number of procedures that can perform such modeling, including PROC ARIMA.

--
Paige Miller

hackathon24-white-horiz.png

The 2025 SAS Hackathon Kicks Off on June 11!

Watch the live Hackathon Kickoff to get all the essential information about the SAS Hackathon—including how to join, how to participate, and expert tips for success.

YouTube LinkedIn

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 1 reply
  • 394 views
  • 0 likes
  • 2 in conversation