BookmarkSubscribeRSS Feed

TRANSLATION SYSTEM FOR AFRICAN LANGUAGES

Started ‎04-02-2021 by
Modified ‎10-20-2022 by
Views 1,216
    

TEAM LANGTECH STORY

African languages have been neglected for far too long, I am Swi Innocent Che closely assisted by Mafor Anna Ngwenyi and together we are Team LangTech.

PROBLEMS0

As an African I like to protect my culture and preserve our cultural expression, Statistics from UNESCO tell us that Africa has more than 1000 languages and at least 200 of these languages have less than 500 speakers. Now for a language to be alive, we need at least 100000 speakers. Ethnologue on the other hand, presents Cameroon as having over 283 languages and 274 of these languages are living, 17 are dying and 9 are already extinct.

The electronic use of African languages is not encouraging, as you find very few languages actually implemented in devices like computers or smart phones.

International experts find it difficult to collaborate with local experts because they do not understand the languages the local population speaks.

Very little Documentation and structuration of many of these languages causing the digitalization extremely challenging

Most African state policies do not encourage the national integration of local languages making youths lose interest in learning at such a gradual decline in the language and in the culture.

METHODOLOGY

Step 1

Data collection and preparation

First data, that is parallel sentences was collected locally, inputted into an excel sheet then profiled beforehand with the help SAS visual analytics tool to obtain some critical information like the uniqueness percentage between each data column, percentage of null values, pattern count, data length, minimum length, maximum length, and others.

 

 

Building the Model

From the SAS Boot monitor, we gained access to the Jupyter notebook, it permitted us to write code to train translation model for the different language pairs. The principal Natural language processing library used was Keras. But we can update when need is to SAS deep learning model.

 

Application

We have built a simple web application using Python flask framework and for the purpose of the demo, the models are hosted with a flask API.  As a future plan, we intend to use host our models using SAS API to better manage the numerous translation models for each unique language pair.

Again we have created a diagram using SAS Decision Manager modeling our intension on how to manage a multi-head translation system. Behind each block will be code to govern its functionality.

 

Demo

As a demonstration of the system, we are going to test a few sentences and observe the results.

  1. I am a gentleman
  2. We are going to school
  3. What is the name of your mother

Notice that for each of the sentences we get a corresponding translation and observing the rest API, we notice the translation time and some other information produced during the time of inferencing the translation models.

Team Name LangTech
  Track: Startups
Use Case  Translation System based on African Languages
Technology Artificial Intelligence and Machine Learning
Region EMEA
Team lead SWI INNOCENT CHE
Team members MAFOR ANNA NGWENYI

 

Comments

Dear Lang tech Team,

 

I love your idea. This could be a game changer for communities who do not have access to online resources because of language.

I will follow your project and let me know if there is an way I can support you.

I am from Zimbabwe and speak Ndebele and Shona zimbabwe's two main langauges

I'm dancing with joy, Swi!!!

@Swi 

Can you please replace your YouTube links with direct uploads of your videos instead? You can do this by clicking the Video button in the toolbar:

Unknown.png

 

Version history
Last update:
‎10-20-2022 12:25 PM
Updated by:
Contributors

sas-innovate-2024.png

Don't miss out on SAS Innovate - Register now for the FREE Livestream!

Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.

 

Register now!

Article Tags