How much space pre character is taken when UTF-8 is used in SAS

Reply
Occasional Learner
Posts: 1

How much space pre character is taken when UTF-8 is used in SAS

In standard definition for UTF-8 is used 1-4 bytes per character, if UTF-8 is used. It works like this calculator: 
https://mothereff.in/byte-counter

But how it's in SAS? Does SAS used 3bytes per character by default and it doesn't matter what character is stored?

 

Thank you,

Marian

Trusted Advisor
Posts: 1,566

Re: How much space pre character is taken when UTF-8 is used in SAS

[ Edited ]

UTF-8 in bytes is a function of language not of SAS.

 

Special characters will use one byte each.

Letters will use - latin (A a B b etc.) 1 byte each.

language specific characters (french, german, nordic, chinese, hebrew, arabic etc.) will use at least 2 bytes each.

Ask a Question
Discussion stats
  • 1 reply
  • 95 views
  • 0 likes
  • 2 in conversation