Text mining and content categorization

text parsing for a mix of Japanese and English language

New Contributor
Posts: 2

Hi,

I want to run text parsing in SAS Text Miner on text that mixes Japanese and English words. For example:

カーネリアンネックレスオレンジ45cm/天然石シルバー925銀/

I ran text parsing on this observation, but it parsed only the term '45cm', which is in English, and none of the Japanese characters. Can anybody help?

Grand Advisor
Posts: 9,447

Re: text parsing for a mix of Japanese and English language

What kind of output would you like to see?
Check the functions whose names start with K (Kxxxx()), which are used for Chinese, Japanese, and Korean text:
KSCAN()
KSUBSTR()
.........
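
To make the "what kind of output" question concrete, here is a rough sketch in Python (not SAS, and not what Text Miner does internally) that splits the sample string into runs of the same script. The function name script_runs and the labels "kana", "kanji", and "latin" are made up for this illustration. It only groups characters by Unicode name, so it is an assumption-laden approximation of tokenization rather than real Japanese parsing.

```python
import unicodedata
from itertools import groupby


def script_of(ch):
    """Return a coarse script label for one character.

    The labels are hypothetical names chosen for this sketch only.
    """
    name = unicodedata.name(ch, "")
    if name.startswith(("KATAKANA", "HIRAGANA")):
        return "kana"
    if name.startswith("CJK UNIFIED IDEOGRAPH"):
        return "kanji"
    if ch.isascii() and ch.isalnum():
        return "latin"  # ASCII letters and digits, e.g. '45cm'
    return "other"      # punctuation, whitespace, anything else


def script_runs(text):
    """Split text into (label, run) pairs of consecutive same-script characters.

    Runs labelled 'other' (such as the '/' separators) are dropped.
    """
    runs = []
    for label, chars in groupby(text, key=script_of):
        if label != "other":
            runs.append((label, "".join(chars)))
    return runs


if __name__ == "__main__":
    sample = "カーネリアンネックレスオレンジ45cm/天然石シルバー925銀/"
    for label, run in script_runs(sample):
        print(label, run)
```

Note that the first katakana run stays in one piece even though it contains several words, because Japanese has no spaces between words. Splitting it further would need a dictionary-based morphological analyzer, which this sketch does not attempt.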

New Contributor
Posts: 2

Re: text parsing for a mix of Japanese and English language

Hi KSharp,

I am using the Text Miner interface, not SAS code. I think the functions you are referring to are for text processing, whereas I want to do text parsing.

Grand Advisor
Posts: 9,447

Re: text parsing for a mix of Japanese and English language

That is very hard, since SAS comes from America. But you could ask SAS Support whether SAS offers Japanese text parsing.