Natural Language Processing
Our focus on natural language Processing branches into several problems of interest:
- Low-resource languages, specifically developing Arabic dialect resources
- Opinion mining
- Conversational systems and chatbots
- Language Modeling
Opinion Mining for Arabic Project (OMA)
Project Website: http://oma-project.com/
Mission:
Building state-of-the-art automated sentiment analysis and opinion mining for Arabic (OMA).
A joint work with Qatar University, American University of Beirut, Columbia University and New York University. Supported by the Qatar National Research Fund (a member of Qatar Foundation)
Members:
Name | Role | Title/Affiliation | Contact Info | |
---|---|---|---|---|
Dr. Hazem El Hajj | Lead Principal Investigator | Associate Professor, Electrical and Computer Engineering, American University of Beirut | ||
Dr. Khaled Bashir Shaaban | Co-lead Principal Investigator | Associate Professor, Computer Science and Engineering Department, Qatar University | ||
Dr. Wassim El Hajj | Principal Investigator | Associate Professor and Chairman of Computer Science, American University of Beirut | ||
Dr. Nizar Habash | Principal Investigator | Associate Professor of Computer Science, New York University Abu Dhabi (NYUAD) | ||
Dr. Shady Elbassuoni | Collaborator | Assistant Professor of Computer Science at the American University of Beirut | ||
Dr. Kathy McKeown | Collaborator | Henry and Gertrude Rothschild Professor of Computer Science. Director, Data Science Institute |
Papers and Projects
1st place in Arabic Sentiment Analysis 2021 @ KAUST
We are pleased to announce that Wissam Antoun ranked 1st in the Arabic Sentiment Analysis 2021 @ KAUST competition. Arabic Sentiment Analysis is one of the most popular tasks in Arabic Natural Language Processing (ANLP), with this competition being one of the largest...
WANLP 2021: Arabic Empathetic Conversational Agents and Pre-Trained Language Models
We are pleased to announce that we have 3 papers accepted to The Sixth Arabic Natural Language Processing Workshop (WANLP 2021) co-located with EACL 2021. Authored by our talented team members: Tarek Naous, Wissam Antoun, Reem Mahmoud, Fady Baly under the supervision...
Paper Abstract: Empathy-driven Arabic Conversational Chatbot
We are excited to share a preview of the paper titled “Empathy-driven Arabic Conversational Chatbot” by Tarek Naous, Christian Hokayem, and Prof. Hazem Hajj. The paper will be published at WANLP 2020 at COLING'2020, Barcelona, Spain, 12 Dec. 2020. Abstract:...
Paper Abstract: A Link Prediction Approach for Accurately Mapping a Large-Scale Arabic Lexical Resource to English WordNet
We are excited to share a preview of Gilbert Badaro's (Ph.D.) paper titled "A Link Prediction Approach for Accurately Mapping a Large-Scale Arabic Lexical Resource to English WordNet". The paper will be published at the prestigious ACM Transactions on Asian and...
AUB MIND Lab at OSACT 4 Shared Task on Offensive Language Detection
The use of social media platforms has become more prevalent, which has provided tremendous opportunities for people to connect but has also opened the door for misuse with the spread of hate speech and offensive language. This phenomenon has been driving more and more...
Resources and End-to-End Neural Network Models for Arabic Image Captioning
An end-to-end model that directly transcribes images into Arabic text and an annotated dataset for Arabic image captioning (AIC)
AraBERT : Pre-training BERT for Arabic Language Understanding
AraBERT is an Arabic pretrained language model based on Google’s BERT architecture
State of the Art Models for Fake News Detection Tasks
A paper detailing the winning models and methods in the Qatar International Fake News Detection and Annotation Contest
hULMonA: tHe first Universal Language MOdel iN Arabic
hULMonA is an Arabic universal language model based on ULMFit that can be fine-tuned for any text classification task