In standard definition for UTF-8 is used 1-4 bytes per character, if UTF-8 is used. It works like this calculator: https://mothereff.in/byte-counterBut how it's in SAS? Does SAS used 3bytes per character by default and it doesn't matter what character is stored?
Thank you,
Marian
UTF-8 in bytes is a function of language not of SAS.
Special characters will use one byte each.
Letters will use - latin (A a B b etc.) 1 byte each.
language specific characters (french, german, nordic, chinese, hebrew, arabic etc.) will use at least 2 bytes each.
SAS Innovate 2025 is scheduled for May 6-9 in Orlando, FL. Sign up to be first to learn about the agenda and registration!
Save the date!
Learn how use the CAT functions in SAS to join values from multiple variables into a single value.
Find more tutorials on the SAS Users YouTube channel.
Ready to level-up your skills? Choose your own adventure.
Browse our catalog!