Hi,
When using SAS OnDemand, the é character is automatically interpreted as 'C3A9'x i.e. requires 2 bytes.
When using the exact same code with SAS Studio from SAS Analytics, it is interpreted as 'E9'x from the extended ASCII table and therefore only requires one byte (which is what I would expect).
Do you know why SAS OnDemand goes for a 2-byte value ?
Cheers
Study this: https://en.wikipedia.org/wiki/UTF-8
Run this in your environments.
proc options option=encoding;
run;
and compare the logs.
Because On Demand runs with UTF encoding, and E9 is not available there, as it is a start byte for a UTF sequence.
Thanks @Kurt_Bremser
I'm missing background encoding knowledge to understand what "as it is a start byte for a UTF sequence." means.
But on my local SAS Studio installation, I'm also using UTF-8. Did you mean this setting or something else?
Study this: https://en.wikipedia.org/wiki/UTF-8
Run this in your environments.
proc options option=encoding;
run;
and compare the logs.
After reading the you comment I've start reading the Wikipedia page and came up with more questions (I will create a distinct post for one of them).
Thanks for the code. I didn't thought about looking in the global options.
So also SAS Studio shows UTF-8, WLATIN1 is actually used.
Thanks!
Good news: We've extended SAS Hackathon registration until Sept. 12, so you still have time to be part of our biggest event yet – our five-year anniversary!
Learn how use the CAT functions in SAS to join values from multiple variables into a single value.
Find more tutorials on the SAS Users YouTube channel.
Ready to level-up your skills? Choose your own adventure.