- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
Posted 01-31-2017 12:58 PM
(538 views)
In standard definition for UTF-8 is used 1-4 bytes per character, if UTF-8 is used. It works like this calculator:
https://mothereff.in/byte-counter
But how it's in SAS? Does SAS used 3bytes per character by default and it doesn't matter what character is stored?
Thank you,
Marian
1 REPLY 1
- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
UTF-8 in bytes is a function of language not of SAS.
Special characters will use one byte each.
Letters will use - latin (A a B b etc.) 1 byte each.
language specific characters (french, german, nordic, chinese, hebrew, arabic etc.) will use at least 2 bytes each.