BookmarkSubscribeRSS Feed
☑ This topic is solved. Need further help from the community? Please sign in and ask a new question.
sanchez_dan
Calcite | Level 5

Hello,

 

I'm working with a dataset produced from a 5-variable full factorial screening with 3 centerpoints. The raw data is heavily skewed with an exponential distribution. I've tried log, log10, square root and various box-cox transformations and can't seem to get anything even nearly approaching a normal distribution. 

 

ConditionPatterndata1
1+−−+−0.59
2−+−−−1.6
3−−−+−1.78
4+−++−0.45
5+−−−−1.37
6−++−−0.87
7++−+−0.05
8−−−−−4.46
9++−−−0.14
10−+−+−0.36
11+++−−0.11
12+−+−−0.8
13−+++−0.33
14++++−0.05
15−−+−−2.43
16−−++−1.53
1700.86
1800.79
1900.9
20−+−−+0.94
21+−+++0.87
22−−+++0.91
23−−−++0.72
24−+−++0.05
25−−+−+2.74
26−++−+0.72
27−++++0.08
28+−+−+2.92
29−−−−+4.08
30++−−+0.88
31+−−−+3.98
32+−−++0.82
33+++++0.08
34+++−+0.78
35++−++0.06

 

 

 

1. What kind of transform is appropriate to handle the data set?

2. If there aren't any appropriate methods of transforming the data, how can it be modeled? (using Fit Model, etc)

1 ACCEPTED SOLUTION

Accepted Solutions
PaigeMiller
Diamond | Level 26

I think you would be better off asking in the JMP community.

--
Paige Miller

View solution in original post

1 REPLY 1
PaigeMiller
Diamond | Level 26

I think you would be better off asking in the JMP community.

--
Paige Miller

SAS Innovate 2025: Call for Content

Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 25. Read more here about why you should contribute and what is in it for you!

Submit your idea!

How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 1 reply
  • 701 views
  • 0 likes
  • 2 in conversation