SAS Data Science

Building models with SAS Enterprise Miner, SAS Factory Miner, SAS Viya (Machine Learning), SAS Visual Text Analytics, with point-and-click interfaces or programming
BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
Pritish
Quartz | Level 8

Hi,

I am trying to understand and find out what are the different techniques commonly used to Caps and floor outliers present in a dataset? Any guidance/research paper link is greatly appreciated?

1 ACCEPTED SOLUTION

Accepted Solutions
M_Maldonado
Barite | Level 11

Hi Pritish,

Take a look at the Replacement node in SAS Enterprise miner. It does not remove outliers the way the Filter node would.

By default the limit method is based on a standard deviation from the mean. Take a look at the flooring and capping that happens by default. There is also a brief example of how to use the replacement editor in the reference help (press F1 and go to the Replacement node under the Modify sections).

Good luck!

Miguel

View solution in original post

4 REPLIES 4
Ksharp
Super User

Usually , you can define outliers as   if  >  mean+2*std  or  < mean-2*std

Pritish
Quartz | Level 8

Thanks! How do you treat those outlier? I don't want them to exclude from my dataset.

M_Maldonado
Barite | Level 11

Hi Pritish,

Take a look at the Replacement node in SAS Enterprise miner. It does not remove outliers the way the Filter node would.

By default the limit method is based on a standard deviation from the mean. Take a look at the flooring and capping that happens by default. There is also a brief example of how to use the replacement editor in the reference help (press F1 and go to the Replacement node under the Modify sections).

Good luck!

Miguel

Ksharp
Super User

I will Winsorted .  Replace them with 90th or 95th percentile .

sas-innovate-white.png

Our biggest data and AI event of the year.

Don’t miss the livestream kicking off May 7. It’s free. It’s easy. And it’s the best seat in the house.

Join us virtually with our complimentary SAS Innovate Digital Pass. Watch live or on-demand in multiple languages, with translations available to help you get the most out of every session.

 

Register now!

How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 4 replies
  • 7291 views
  • 0 likes
  • 3 in conversation