About kisumsam

kisumsam · ‎08-15-2020

Is there any option in Proc Discrim (or another KNN procedure) that can do n-fold cross validation? Just some quick background. I'm trying to use KNN to classify the fishes in the SASHelp.Fish data set. Below is the code: data analysis; set sashelp.fish; where species in ('Bream', 'Perch'); run; data train test; set analysis; rand = ranuni(100); if rand <= 0.8 then output train; else output test; run; Above code splits the FISH data set into training and testing data set. In the training set, I want to do n-fold cross-validation to get the optimal k for KNN. See link below: https://medium.com/@svanillasun/how-to-deal-with-cross-validation-based-on-knn-algorithm-compute-auc-based-on-naive-bayes-ff4b8284cff4 I can't seem to find the Proc Discrim options that enable me to do this easily. proc discrim data = train test = test testout = _score1 method = npar k = 5 testlist crossvalidate crosslist; class species; var weight height; run; Does anyone know whether this cross-validation feature is available in Proc Discrim (or any other procedure)? If not, what's the better way to find the optimal k for KNN?

kisumsam · ‎08-13-2020

No. Is that a better procedure than Proc Discrim for KNN?

kisumsam · ‎08-13-2020

Hi there, I'm learning KNN. I found that my Proc Discrim procedure gives me a much better results than me doing the manual calculation for the KNN algorithm. I'm wondering if there is any expert here who can explain why Proc Discrim does so much better. For example, below is the code that I use to classify the fish species in the SASHelp.fish data set. ** Standardize Columns **; proc standard data=sashelp.fish out=fish mean=0 std=1; var weight length1 length2 length3 height width; run; data fish_train fish_test; set fish; rand = ranuni(100); if rand <= 0.5 then output fish_train; else output fish_test; run; ** Using Built-in Proc Discrim **; proc discrim data = fish_train test = fish_test testout = _score1 method = npar k = 9 testlist; class species; var weight height length1 length2 length3 width; run; The error rate is very low: Now, I'm doing it manually by calculating the distance between the points and find the K nearest neighbor (k=9). ** Manually build do KNN **; data train1 train2 (drop=num); set fish_train; num = _n_; run; proc sql; create table train_combine as select a.num, a.species as species_a, b.species as species_b, sqrt((a.weight - b.weight)**2 + (a.height - b.height)**2 + (a.length1 - b.length1)**2 + (a.length2 - b.length2)**2 + (a.length3 - b.length3)**2 + (a.width - b.width)**2 ) as distance from train1 a, train2 b order by a.num, distance; quit; data train_combine2; set train_combine; by num distance; if first.num then i = 0; i + 1; if i <= 9; run; proc freq data=train_combine2 noprint; table species_b / out = fish_freq; by num species_a; run; proc sort data=fish_freq; by num count; run; data fish_freq2; set fish_freq; by num count; if last.num; if species_a = species_b then match = "Y"; else match = "N"; run; proc sql; select species_a, match, count(*) as cnt from fish_freq2 group by species_a, match order by species_a, match; quit; I did the Euclidean distance. And the results are not even close to being as good as Proc Discrim. For example, my manual model classified it all wrong for Parkki. It got only one right for Roach. In contrast, Proc Discrim classifies 4 Parkki and 9 Roach correctly. How does the Proc Discrim algorithm work that gives the better classification results?

kisumsam · ‎07-05-2020

Great thanks!

kisumsam · ‎07-05-2020

Hi, I have learned that the default length of a numeric variable is 8. However, when I use the length statement on a numeric variable, it returns 12. data test; a = 1; len = length(a); run; The result is 12: Does anyone know why? Thanks, Sam

kisumsam · ‎08-28-2018

Unfortunately it is not my call to stop using SAS 9.1.3 at my work place. From my understanding, they will be using it for the next little while and they are going to want to run programs on the older version of SAS with data set created in Viya. I understand we are going to run into security issues but if the management insists on doing this, do you think SAS 9.1.3 can handle data sets created in Viya?

kisumsam · ‎08-28-2018

Thanks Kurt. Our SAS 9.1.3 runs on Windows but Viya runs on a new Linux server. They aren't the same OS. Is this going to cause issues when using the data sets across the two versions of SAS?

kisumsam · ‎08-27-2018

Hi there, my team is migrating from SAS 9.1.3 to SAS Viya and it's a fairly big upgrade. My boss is concerned about SAS Viya having issues running some of the older data sets created in SAS 9.1.3. In addition, another department here will still be using SAS 9.1.3 and they are expected to read data sets created by the new machine (Viya). I know that it is still a .sas7bdat file but does anyone have any issues running the data sets across the two very different versions of SAS? Thanks in advance.

kisumsam · ‎06-04-2018

Hello, I'm given the task to build decision tree using SAS 9.1 (yes, very old, I know). There is no Proc DTREE in SAS 9.1 and I was told to try doing it with just base SAS (data step and proc step). I tried looking up some documentations on online but I can't seem to find any. Does anyone have any resources that can guide me on how to build the decision tree model using just SAS base (no Proc Dtree)? Thanks!

kisumsam · ‎04-06-2018

Thanks all! I actually found a solution that I haven't come across on this forum. The Work library path can be set in the SAS properties: Looks like this is a good option since we don't have to mess around any cfg or system file.

kisumsam · ‎04-04-2018

Thanks Oligolas. So your suggestion is to improve the working speed? I'm hoping to save the working data sets in different folders based on the server user. E.g. User ABC101 --> C:\Work Other Users --> H:\Work Do you think that's possible?

kisumsam · ‎04-04-2018

Hello, I need to change the WORK library folder location based on user. I'm wondering if anyone can help 🙂 We installed SAS on a server with roughly 3-5 people accessing the software. My boss wants her Work location to be in C: drive but the rest of the users in H: drive. I opened the SASV9.CFG file but I can't seem to find a way to set the WORK location based on user. The best I could do is to use: -WORK "H:\WORK\!USERNAME" However, all of the data sets are stored in the H: drive. Does anyone have any suggestions? Thanks so much.

kisumsam · ‎02-22-2018

Would love to upgrade it but whether to upgrade it is not my call unfortunately 😞

kisumsam · ‎02-22-2018

Hello, I got a quick question. I don't know why SAS doesn't recognize the format B8601DA8. (or B8601DA.) on my system (SAS 9.1.3). For example, I run the code below: data test; a = 10000; format a b8601da8.; b = put(a, b8601da8.); run; I got the following error message: 25 data test; 26 a = 10000; 27 format a b8601da.; -------- 48 28 b = put(a, b8601da.); -------- 48 ERROR 48-59: The format B8601DA was not found or could not be loaded. I triple checked the code and I remember it worked before on the exact same system. Are the ISO 8601 format not supported by SAS 9.1.3?

kisumsam · ‎01-01-2018

Thanks so much!

Online Status	Offline
Date Last Visited	‎01-25-2022 02:09 AM

How to do n-fold cross-validation in KNN

Re: How to replicate KNN results from Proc Discrim

How to replicate KNN results from Proc Discrim

Re: Default length of a numeric variable?

Default length of a numeric variable?

Re: Is there any issue reading old SAS data sets on SAS Viya?

Re: Is there any issue reading old SAS data sets on SAS Viya?

Is there any issue reading old SAS data sets on SAS Viya?

Building Decision Tree Model in SAS 9.1

Re: How to change the WORK library folder location based on user

Re: Is there any issue reading old SAS data sets on SAS Viya?

Re: Is there any issue reading old SAS data sets on SAS Viya?

Re: Is there any issue reading old SAS data sets on SAS Viya?

Re: Identifying AR and MA terms

Re: Question about ACF plot

Proc ttest not working on Enterprise Guide

Re: How to change the WORK library folder location based on user

How to do n-fold cross-validation in KNN

Re: How to replicate KNN results from Proc Discrim

How to replicate KNN results from Proc Discrim

Re: Default length of a numeric variable?

Default length of a numeric variable?

Re: Is there any issue reading old SAS data sets on SAS Viya?

Re: Is there any issue reading old SAS data sets on SAS Viya?

Is there any issue reading old SAS data sets on SAS Viya?

Building Decision Tree Model in SAS 9.1

Re: How to change the WORK library folder location based on user

Re: How to change the WORK library folder location based on user

How to change the WORK library folder location based on user

Re: B8601DA8. does not work on SAS 9.1.3

B8601DA8. does not work on SAS 9.1.3

Re: Identifying AR and MA terms