Language detect function

parmis · Posted 05-02-2019 09:08 AM

Hello,

I have a dataset similar to the following that contains a text variable (a single word or phrase). The strings are either in English or French.

Is there a way to flag the English words?

data list;
input name $20.;
datalines;
Côté
Boucher
Fournier
Cats
how to register
morning
Thibeault
Martin
Vaudron
Girard
Hello
;
run;

Thank you!

art297 · Posted 05-02-2019 09:29 AM

This may not be possible with just words out of context, but you could try incorporating Python. Take a look at: https://www.probytes.net/blog/python-language-detection/

Art, CEO, AnalystFinder.com

Ksharp · Posted 05-02-2019 10:00 AM

data list;
input name $20.;
/* flag=1 when name contains a letter outside a-z (for example an accented letter), 0 otherwise */
flag=prxmatch('/[^a-z]/i',compress(name,,'ka'))>0;
datalines;
Côté
Boucher
Fournier
Cats
how to register
morning
Thibeault
Martin
Vaudron
Girard
Hello
;
run;
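
A note on reading the result, offered as a hedged sketch rather than as part of the code above: flag is 1 when the name contains a letter outside a-z, such as an accented letter, so it marks some of the French entries, and the English candidates are the rows where flag is 0. Unaccented French names such as Boucher or Martin also get 0, so this is a first filter and not real language detection. The dataset name flagged and the variable maybe_english are illustrative, and whether accented letters survive the compress call may depend on the session encoding.

```sas
/* Assumes the data step above has already created work.list with flag. */
data flagged;
  set list;
  /* flag=0: no letter outside a-z was found, so English or unaccented French */
  maybe_english = (flag = 0);
run;

proc print data=flagged;
run;
```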

ballardw · Posted 05-02-2019 01:18 PM

My French is pretty rusty but I do remember that a moderate number of nouns are the same in both French and English.

So without the articles (the / a, or le / la / les / un / une) or a similar clue, those are going to be very problematic.

Some adjectives, grand for example, are going to be worse.

I would hesitate to assign any name to a specific language: the French and English have been interacting for so long that names go both ways (and spelling gets butchered).
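
To make the point about articles concrete, here is a minimal sketch that looks for a standalone article as a language clue. It assumes the list dataset from the original post. The names clues, has_en, has_fr and guess are made up for illustration, and the two word lists are only the articles mentioned above, not a real vocabulary. Note that "a" is also a French verb form, so even this clue is ambiguous.

```sas
data clues;
  set list;
  length guess $8;
  /* (^|\s) and (\s|$) require the article to stand alone as a word; i ignores case */
  has_en = prxmatch('/(^|\s)(the|a)(\s|$)/i', name) > 0;
  has_fr = prxmatch('/(^|\s)(le|la|les|un|une)(\s|$)/i', name) > 0;
  /* only commit to a language when exactly one kind of clue is present */
  if has_en and not has_fr then guess = 'English';
  else if has_fr and not has_en then guess = 'French';
  else guess = 'unknown';
run;
```

On the sample data in this thread no row contains an article, so every row comes back as unknown, which is exactly the difficulty described above.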

Sundaresh1 · Posted 07-16-2021 09:00 PM

Hi @parmis ,

I know this answer comes two years late :), but I felt you might still get some benefit from it, knowledge at the least. In January of this year, SAS released a language identification action as part of its Viya platform. Here are details on how it works:

https://go.documentation.sas.com/doc/en/sasstudiocdc/v_009/pgmsascdc/casanpg/cas-textmanagement-iden...

regards,

Sundaresh
