The AI Tongue-Twister: Disentangling the Algorithmic Underpinnings of Multilingual AI

dc.contributor.author: Benfareh Aicha
dc.date.accessioned: 2025-06-29T11:57:47Z
dc.date.available: 2025-06-29T11:57:47Z
dc.date.issued: 2025-06-12
dc.description.abstract: We first examined the representation of the 20 languages in M-BERT by deriving language identity representations from 1000 labeled corpora. The high performance of the language identification model in distinguishing the languages in M-BERT (mean F1 score 0.999) indicated that BERT models use strong language-specific information in their pretraining process. We then tested the M-BERT model's ability to differentiate between pairs of languages. We fed the model prompts that include the name of a language and a token from one of the two languages, and used the model's output probability to determine which language the input was expressed in. This is effectively a language disambiguation task, and it can be used to measure the model's ability to differentiate and understand pairs of languages. This simple disambiguation setup, combined with the model's ability to perform probability judgment, could serve as a test to reveal what exchanges the model is capable of processing for any given pair of languages. [ar_AR]
dc.identifier.issn: EISSN 2507-721X
dc.identifier.uri: http://ddeposit.univ-alger2.dz/handle/20.500.12387/9009
dc.language.iso: en [ar_AR]
dc.publisher: Algerian Journal of Language Sciences - Faculty of Arabic Language and Literature - University of Algiers 2 Abou El Kacem Saadallah [ar_AR]
dc.rights: Attribution-NonCommercial-NoDerivs 3.0 United States [*]
dc.rights.uri: http://creativecommons.org/licenses/by-nc-nd/3.0/us/ [*]
dc.subject: M-BERT [ar_AR]
dc.subject: Language identification [ar_AR]
dc.subject: Language disambiguation [ar_AR]
dc.subject: Probability judgment [ar_AR]
dc.subject: Language-specific information [ar_AR]
dc.title: The AI Tongue-Twister: Disentangling the Algorithmic Underpinnings of Multilingual AI [ar_AR]
dc.type: Article [ar_AR]
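
The pairwise disambiguation setup described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the article's implementation: the prompt template, the `Scorer` interface, and the toy probabilities are all hypothetical, and a real run would replace `toy_scorer` with a call that returns M-BERT's output probability for the prompt.

```python
from typing import Callable, List, Tuple

# Hypothetical interface: maps a prompt string to a probability.
# In a real experiment this would wrap a call to M-BERT.
Scorer = Callable[[str], float]


def build_prompt(language: str, token: str) -> str:
    """Hypothetical prompt template; the abstract does not give the wording."""
    return f"The word '{token}' is written in {language}."


def disambiguate(score: Scorer, token: str,
                 lang_a: str, lang_b: str) -> Tuple[str, float]:
    """Pick the more probable of two languages for a token.

    Returns the chosen language and its probability normalized over the pair.
    """
    p_a = score(build_prompt(lang_a, token))
    p_b = score(build_prompt(lang_b, token))
    total = p_a + p_b
    if total <= 0:
        raise ValueError("scorer returned no probability mass for either prompt")
    if p_a >= p_b:
        return lang_a, p_a / total
    return lang_b, p_b / total


def pair_accuracy(score: Scorer, labeled_tokens: List[Tuple[str, str]],
                  lang_a: str, lang_b: str) -> float:
    """Fraction of (token, gold_language) items assigned the gold language."""
    correct = sum(
        disambiguate(score, token, lang_a, lang_b)[0] == gold
        for token, gold in labeled_tokens
    )
    return correct / len(labeled_tokens)


# Toy stand-in for the model, with made-up probabilities for illustration only.
_TOY_TABLE = {
    build_prompt("French", "bonjour"): 0.9,
    build_prompt("Spanish", "bonjour"): 0.1,
    build_prompt("French", "hola"): 0.2,
    build_prompt("Spanish", "hola"): 0.8,
}


def toy_scorer(prompt: str) -> float:
    """Look up a made-up probability; unknown prompts get a small default."""
    return _TOY_TABLE.get(prompt, 0.01)


if __name__ == "__main__":
    data = [("bonjour", "French"), ("hola", "Spanish")]
    print(disambiguate(toy_scorer, "bonjour", "French", "Spanish"))
    print(pair_accuracy(toy_scorer, data, "French", "Spanish"))
```

Normalizing over the two candidate languages keeps the decision a relative judgment within the pair, which matches the abstract's framing of the task as distinguishing between two languages rather than identifying one out of all 20.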

Files

Original bundle

Name: the-ai-tongue-twister_-disentangling-the-algorithmic-underpinnings-of-multilingual-ai (3).pdf
Size: 336.66 KB
Format: Adobe Portable Document Format
