SAS Programming

DATA Step, Macro, Functions and more
BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
windlove
Fluorite | Level 6

I am trying to encrpt a set of IDs using SHA256 in SAS 9.4. 

 

However, when I finished that I found the length of the hash does not always the 64 characters. 

 

My code is:

 

data B;

  length hashID $64;

 format hashID $hex64.;

 set A;

 catxID = catx("XXX", ID, "XXXX");

hashID = sha256(catxID);

drop catxID ID;

run;

 

XXX are some random salts. 

 

However, when I examined the hashID, most of them were 64 characters, but some were only 22 characters and some even only contained 6 or 9 or othe number of characters. 

 

Can anyone advice whether it is normal or not? and What would result in such variations?

 

Thanks.

 

1 ACCEPTED SOLUTION

Accepted Solutions
Patrick
Opal | Level 21

@windlove

SHA256() returns always a 256Bit hash as per documentation.

 

I've done some testing and it appears that what you observe has to do with the table viewer in EG.

I've used below code:

data test;
  length hashid1Test $32 l_hashid1Test 8 hashID1 hashID2 $64;
  format hashid1Test $hex64.;

  do id=21 to 21;
    catxID = catx("XXX", ID, "XXXX");
    hashid1Test=sha256(catxID);
    l_hashid1Test=lengthn(hashid1Test);
    hashID1 = put(sha256(catxID),$hex64.);
    hashID2 = sha256hex(catxID);
    output;
  end;

  drop catxID ID;
run;

proc print data=test;
run;

That's what you see in the EG table viewer:

undefined

Even though variable hashid1Test doesn't show the whole string, it still has a length of 32 characters. The length of 32 indicates that the values as such are still there internally; they just don't print in the data grid.

 

Having done a bit more analysis it appears that this always happens for HEX00.

 

If you print the values to an output destination then everything shows up.

undefined

 

So... No serious issue and your data is o.k. It's just a glitch in the EG data grid viewer.

 

View solution in original post

4 REPLIES 4
Patrick
Opal | Level 21

@windlove

SHA256() returns always a 256Bit hash as per documentation.

 

I've done some testing and it appears that what you observe has to do with the table viewer in EG.

I've used below code:

data test;
  length hashid1Test $32 l_hashid1Test 8 hashID1 hashID2 $64;
  format hashid1Test $hex64.;

  do id=21 to 21;
    catxID = catx("XXX", ID, "XXXX");
    hashid1Test=sha256(catxID);
    l_hashid1Test=lengthn(hashid1Test);
    hashID1 = put(sha256(catxID),$hex64.);
    hashID2 = sha256hex(catxID);
    output;
  end;

  drop catxID ID;
run;

proc print data=test;
run;

That's what you see in the EG table viewer:

undefined

Even though variable hashid1Test doesn't show the whole string, it still has a length of 32 characters. The length of 32 indicates that the values as such are still there internally; they just don't print in the data grid.

 

Having done a bit more analysis it appears that this always happens for HEX00.

 

If you print the values to an output destination then everything shows up.

undefined

 

So... No serious issue and your data is o.k. It's just a glitch in the EG data grid viewer.

 

windlove
Fluorite | Level 6

Thanks for the detailed reply. 

Yes, I am using EG too. 

I will check as adviced tomorrow whether those cases with only 6 or 9 characters hold the same story. 

Rafall
Calcite | Level 5

We encounter similar difficulties with working with digests. Please check if the situation with shorter digest is related to 0x20 bytes (space character) at the end of digest. Similar situation is when you have

hashID = md5('a98');
l = length (hashid);
output;
hashID = md5('a99');
l = length (hashid);
output;

 

First case generates digest with 0x20 at last character and this causes some issues with e.g. length function or further exporting.

andreas_lds
Jade | Level 19

You should not use md5 at all, see a blog-post by @ChrisHemedinger from 2014.

sas-innovate-white.png

Our biggest data and AI event of the year.

Don’t miss the livestream kicking off May 7. It’s free. It’s easy. And it’s the best seat in the house.

Join us virtually with our complimentary SAS Innovate Digital Pass. Watch live or on-demand in multiple languages, with translations available to help you get the most out of every session.

 

Register now!

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 4 replies
  • 4302 views
  • 0 likes
  • 4 in conversation