Search
Now showing items 1-10 of 23
ArbDialectID at MADAR Shared Task 1: Language Modelling and Ensemble Learning for Fine Grained Arabic Dialect Identification
(Association for Computational Linguistics, 2019-08)
In this paper, we present a Dialect Identification system (ArbDialectID) that competed at Task 1 of the MADAR shared task, MADARTravel Domain Dialect Identification. We build a course and a fine-grained identification model ...
LSTM-CNN Deep Learning Model for Sentiment Analysis of Dialectal Arabic
(Springer Science and Business Media LLC, 2019)
In this paper we investigate the use of Deep Learning (DL) methods for Dialectal Arabic Sentiment Analysis. We propose a DL model that combines long-short term memory (LSTM) with convolutional neural networks (CNN). The ...
Alignment of comparable documents: Comparison of similarity measures on French–English–Arabic data
(Cambridge University Press, 2018)
The objective, in this article, is to address the issue of the comparability of documents, which are extracted from different sources and written in different languages. These documents are not necessarily translations of ...
Shami: A Corpus of Levantine Arabic Dialects
(2018)
Modern Standard Arabic (MSA) is the official language used in education and media across the Arab world both in writing and formal speech. However, in daily communication several dialects depending on the country, region ...
A Lexical Distance Study of Arabic Dialects
(Elsevier BV, 2018)
Diglossia is a very common phenomenon in Arabic-speaking communities, where the spoken language is different from both Classical Arabic (CA) and Modern Standard Arabic (MSA). The spoken language is characterised as a number ...
WikiDocsAligner: An Off-the-Shelf Wikipedia Documents Alignment Tool
(Institute of Electrical and Electronics Engineers (IEEE), 2017-05)
Wikipedia encyclopedia is an attractive source for comparable corpora in many languages. Most researchers develop their own script to perform document alignment task, which requires efforts and time. In this paper, we ...
Neural network-based minutiae extraction for fingerprint verification system
(IEEE, 2017)
Fingerprint is one of the most important biometrics that has been employed for verification systems. Fingerprint is characterized by two fundamental properties; Easy to acquire, and it is unique for each person. This paper ...
Mining Documents and Sentiments in Cross-lingual Context
(2015)
The aim of this thesis is to study sentiments in comparable documents. First, we collect English, French and Arabic comparable corpora from Wikipedia and Euronews, and we align each corpus at the document level. We further ...
Cross-dialectal arabic processing
(Springer, Cham, 2015)
We present, in this paper an Arabic multi-dialect study including dialects from both the Maghreb and the Middle-east that we compare to the Modern Standard Arabic (MSA). Three dialects from Maghreb are concerned by this ...
Fouille de documents et d’opinions multilingue
(2015)
L’objectif de cette thèse est d’étudier les sentiments dans les documents comparables. Premièrement, nous avons recueillis des corpus comparables en anglais, français et arabe de Wikipédia et d’Euronews, et nous avons ...