In standard definition for UTF-8 is used 1-4 bytes per character, if UTF-8 is used. It works like this calculator:
https://mothereff.in/byte-counter
But how it's in SAS? Does SAS used 3bytes per character by default and it doesn't matter what character is stored?
Thank you,
Marian
UTF-8 in bytes is a function of language not of SAS.
Special characters will use one byte each.
Letters will use - latin (A a B b etc.) 1 byte each.
language specific characters (french, german, nordic, chinese, hebrew, arabic etc.) will use at least 2 bytes each.
Don't miss out on SAS Innovate - Register now for the FREE Livestream!
Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.
Learn how use the CAT functions in SAS to join values from multiple variables into a single value.
Find more tutorials on the SAS Users YouTube channel.