
text parsing for a mix of Japanese and English language

New Contributor
Posts: 2

Hi,

I want to do text parsing in SAS Text Miner on text that mixes Japanese and English words, e.g.:

カーネリアンネックレスオレンジ45cm/天然石シルバー925銀/

I tried running text parsing on this observation, but it parses only the term '45cm', which is in English, and ignores the Japanese characters. Can anybody help?

Super User
Posts: 10,621

Re: text parsing for a mix of Japanese and English language

Posted in reply to sumitthakral
What kind of output would you like to see?
Have a look at the functions whose names start with K, which handle double-byte text such as Chinese, Japanese, and Korean:
KSCAN()
KSUBSTR()
...
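
To make the "what kind of output" question concrete, here is a minimal sketch in Python (not SAS, and not what Text Miner or the K functions do internally) that splits the sample string from the first post on "/" and then into runs of the same script. The helper names `script_of` and `split_runs` are made up for this illustration. It does not separate individual Japanese words inside a katakana run; that would need a real morphological analyzer.

```python
import unicodedata
from itertools import groupby

def script_of(ch: str) -> str:
    """Coarse script class of one character, based on its Unicode name."""
    name = unicodedata.name(ch, "")
    if name.startswith("CJK UNIFIED IDEOGRAPH"):
        return "kanji"
    if name.startswith(("KATAKANA", "HIRAGANA")):
        return "kana"
    if ch.isascii() and ch.isalnum():
        return "latin"
    return "other"

def split_runs(text: str, delimiter: str = "/") -> list[str]:
    """Split on the delimiter, then group adjacent characters of the same script."""
    tokens = []
    for segment in text.split(delimiter):
        for script, chars in groupby(segment, key=script_of):
            if script != "other":  # drop punctuation, spaces, symbols
                tokens.append("".join(chars))
    return tokens

sample = "カーネリアンネックレスオレンジ45cm/天然石シルバー925銀/"
print(split_runs(sample))
# ['カーネリアンネックレスオレンジ', '45cm', '天然石', 'シルバー', '925', '銀']
```

Note that the first token is still three words run together (carnelian, necklace, orange), which is why script-based splitting alone is only a rough stand-in for proper Japanese parsing.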

New Contributor
Posts: 2

Re: text parsing for a mix of Japanese and English language

Hi KSharp,

I am using Text Miner, not SAS code. I think the functions you mention are for text processing, whereas I want to do text parsing.

Super User
Posts: 10,621

Re: text parsing for a mix of Japanese and English language

Posted in reply to sumitthakral
That is very hard, since SAS comes from America. But you could ask SAS Support whether SAS offers Japanese text parsing.