dc.contributor.author | Benfareh Aicha![]() |
|
dc.date.accessioned | 2025-06-29T11:57:47Z | |
dc.date.available | 2025-06-29T11:57:47Z | |
dc.date.issued | 2025-06-12 | |
dc.identifier.issn | EISSN 2507-721X | |
dc.identifier.uri | http://ddeposit.univ-alger2.dz:8080/xmlui/handle/20.500.12387/9009 | |
dc.description.abstract | We first examined the representation of the 20 languages in M-BERT by deriving language identity representations in 1000 labeled corpora. The high performance of the language identification model in distinguishing the languages in M-BERT (mean F1 score 0.999) indicated that BERT models use strong language-specific information in their pretraining process. We then tested the M-BERT model's capability of differentiating between pairs of languages. By feeding modeling prompts that include the name of the language and a token from one of the two languages to the model, we used the model's output probability to determine which language the input was expressed in. This is effectively a language disambiguation task, and we should be able to use it to measure the model's ability to differentiate and understand pairs of languages. This simple disambiguation setup, combined with the model's ability to perform probability judgment, could serve as a test to reveal what exchanges the model is capable of processing for any given pair of languages. | ar_AR |
dc.language.iso | en | ar_AR |
dc.publisher | المجلة الجزائرية لعلوم اللسان - كلية اللغة العربية وأدابها - جامعة الجزائر 02 أبو القاسم سعد الله | ar_AR |
dc.rights | Attribution-NonCommercial-NoDerivs 3.0 United States | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/us/ | * |
dc.subject | M-BERT | ar_AR |
dc.subject | Langage identification | ar_AR |
dc.subject | Langage disambiguation | ar_AR |
dc.subject | Probability judgment | ar_AR |
dc.subject | Language-specific information | ar_AR |
dc.title | The AI Tongue-Twister: Disentangling the Algorithmic Underpinnings of Multilingual AI | ar_AR |
dc.type | Article | ar_AR |
dc.Access |
Les fichiers de licence suivants sont associés à ce document :