BookmarkSubscribeRSS Feed
Aspoyam
Calcite | Level 5

In standard definition for UTF-8 is used 1-4 bytes per character, if UTF-8 is used. It works like this calculator: 
https://mothereff.in/byte-counter

But how it's in SAS? Does SAS used 3bytes per character by default and it doesn't matter what character is stored?

 

Thank you,

Marian

1 REPLY 1
Shmuel
Garnet | Level 18

UTF-8 in bytes is a function of language not of SAS.

 

Special characters will use one byte each.

Letters will use - latin (A a B b etc.) 1 byte each.

language specific characters (french, german, nordic, chinese, hebrew, arabic etc.) will use at least 2 bytes each.

sas-innovate-2026-white.png



April 27 – 30 | Gaylord Texan | Grapevine, Texas

Registration is open

Walk in ready to learn. Walk out ready to deliver. This is the data and AI conference you can't afford to miss.
Register now and lock in 2025 pricing—just $495!

Register now

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 1 reply
  • 697 views
  • 0 likes
  • 2 in conversation