Tools & Resources

| Tool ID | Reference ID | Tool Name | Descriptive Keywords | Purpose | Category | Sub-category | Languages | Link |
|---|---|---|---|---|---|---|---|---|
| 1_1 | 1 | Tagger (POS) | Part of speech tagging | POS Tagging | Basic Language Analysis | Syntax Parsing | Egyptian, Gulf, Maghrebi, Levantine | https://github.com/qcri/dialectal_arabic_pos_tagger |
| 1_2 | 1 | Orthography Guideline | Conventional orthography | Orthographic Consistency | Basic Language Analysis | Orthographic analysis | Arabic Dialects | https://github.com/CAMeL-Lab/camel-guidelines/blob/master/docs/orthography.md |
| 1_3 | 1 | ADAM | Morphological analyzer | Morphological Analysis | Basic Language Analysis | Morphological Analysis | Levantine, Egyptian | https://github.com/WaelSalloum/adam |
| 1_4 | 1 | CALIMA STAR | Morphological analyzer | Morphological Analysis | Basic Language Analysis | Morphological Analysis | Gulf | https://calimastar.abudhabi.nyu.edu/analyzer/ |
| 1_5 | 1 | Tunisian Arabic morphological analyzer evaluation corpus | Morphological Analyzer evaluation corpus | Morphological Analysis | Lexical Resources | Corpus | Tunisian | https://github.com/NadiaBMKarmani/Intelligent-Tunisian-Arabic-Morphological-Analyzer-evaluation-corpus |
| 1_6 | 1 | morphological, POS annotation | Morphological analyzer Corpus | Morphological Analysis | Lexical Resources | Corpus | Gulf | https://camel.abudhabi.nyu.edu/gumar/ |
| 1_7 | 1 | morphological disambiguation | Morphological analyzer | Morphological Analysis | Basic Language Analysis | Morphological Analysis | Egyptian, MSA | https://camel.abudhabi.nyu.edu/madamira/ |
| 1_8 | 1 | Automatic Arabic Dialect Detection Task | dialect identification | Language Identification | Language Identification | Language Identification | MSA, Levantine, North African, Egyptian | https://github.com/qcri/dialectID |
| 1_9 | 1 | Arabic Corpora | Habibi, Kalimat and others | Multipurpose | Lexical Resources | Database | Multi | http://www.lancaster.ac.uk/staff/elhaj/corpora.htm |
| 1_10 | 1 | Detecting Arabic Dialects | dialect identification | Language Identification | Language Identification | Language Identification | Arabic Dialects, MSA | https://github.com/drelhaj/ArabicDialects |
| 1_11 | 1 | End-to-end Dialect Identification | dialect identification | Language Identification | Language Identification | Language Identification | MSA, Levantine, North African, Egyptian | https://github.com/swshon/dialectID_e2e |
| 1_12 | 1 | DarijaBERT | Moroccan Model, Language identification, sentiment analysis | Language Identification | Language Identification | Language Identification | Maghrebi, MSA | https://github.com/AIOXLABS/DBert |
| 1_13 | 1 | arabic-dialect-identification | Arabic dialect speech language identification | Language Identification | Language Identification | Language Identification | Arabic Dialects | https://github.com/swshon/arabic-dialect-identification |
| 1_14 | 1 | BOLT Egyptian Arabic Treebank - Discussion Forum | Multipurpose | Multipurpose | Lexical Resources | Treebank | Egyptian | https://catalog.ldc.upenn.edu/LDC2018T23 |
| 1_15 | 1 | Masader | online catalogue for Arabic NLP datasets | Multipurpose | Lexical Resources | Database | Arabic Dialects | https://github.com/ARBML/masader |
| 1_16 | 1 | Saudi corpus - SDC | Saudi dialect corpus | Multipurpose | Lexical Resources | Corpus | Saudi | https://github.com/TaghreedT/SDC |
| 1_17 | 1 | PADIC | Parallel arabic dialect, Dialect translation | Translation | Lexical Resources | Corpus | Arabic Dialects | http://smart.loria.fr/pmwiki/pmwiki.php/PmWiki/Corpora |
| 1_18 | 1 | WikiDocsAligner | Lexical resources | Multipurpose | Lexical Resources | Corpus | Egyptian, MSA | https://github.com/motazsaad/WikiDocsAligner |
| 1_19 | 1 | Comparable corpus | Lexical resources | Multipurpose | Lexical Resources | Corpus | Egyptian, MSA | https://github.com/motazsaad/comparableWikiCoprus |
| 1_20 | 1 | Madar parallel corpus and lexicon | Lexical resources | Translation | Lexical Resources | Corpus | Multi | https://camel.abudhabi.nyu.edu/madar-parallel-corpus/? |
| 1_21 | 1 | DART corpus | Lexical resources | Multipurpose | Lexical Resources | Corpus | Gulf, Iraqi, Levantine, Maghrebi, Egyptian | https://www.dropbox.com/s/jslg6fzxeu47flu/DART.zip?dl=0 |
| 1_22 | 1 | Arabic Multidialectal Word Embeddings | word embeddings learned from different dialects of Arabic | Feature Engineering | Feature engineering | Word embeddings | Arabic Dialects | https://camel.abudhabi.nyu.edu/arabic-multidialectal-embeddings/ |
| 1_23 | 1 | darija-dictionary | Dictionary | Translation | Lexical Resources | Dictionary | Maghrebi, English | https://github.com/darija-open-dataset/dataset |
| 1_24 | 1 | Comparable Algerian corpus | Algerian corpus | Multipurpose | Lexical Resources | Corpus | Algerian, MSA, French, English | https://github.com/abidikarima/CALYOU |
| 1_25 | 1 | Corpus | Palestinian corpus | Multipurpose | Lexical Resources | Corpus | Palestinian | http://portal.sina.birzeit.edu/curras/download.html |
| 1_26 | 1 | ArabicWeb16 | Arabic Web collection | Multipurpose | Lexical Resources | Corpus | Arabic Dialects, MSA | http://qufaculty.qu.edu.qa/telsayed/arabicweb16/ |
| 1_27 | 1 | Extraction tweet code | Sentiment analysis on Arabic tweets | Sentiment Analysis | Semantic Analysis | Text classification | Arabic Dialects, MSA | https://github.com/alazraq/arabic-nlp |
| 1_28 | 1 | super-parallel corpora | Parallel corpus | Translation | Lexical Resources | Corpus | Multi | https://github.com/ehsanasgari/1000Langs |
| 1_29 | 1 | Tunisian Sentiment Analysis Corpus - TSAC | Sentiment Analysis Corpus | Sentiment Analysis | Lexical Resources | Corpus | Tunisian | https://github.com/fbougares/TSAC |
| 1_30 | 1 | DzSenti | code for sentiment analysis, corpus | Sentiment Analysis | Semantic Analysis | Text classification | Algerian | https://github.com/adelabdelli/DzSentiA |
| 1_31 | 1 | Sentiment-analysis-of-riyadh-season-events | Sentiment Analysis | Sentiment Analysis | Lexical Resources | Corpus | Saudi | https://github.com/Yasalm/Sentiment-analysis-of-riyadh-season-events |
| 1_32 | 1 | oeadalg | Sentiment analysis | Sentiment Analysis | Lexical Resources | Corpus | Algerian | https://github.com/kinmokusu/oea_algd |
| 1_33 | 1 | ARLSTem | Stemmer algorithm | Morphological Analysis | Basic Language Analysis | Morphological Analysis | MSA | https://github.com/xprogramer/Arabic-Stemmers/blob/master/ARLSTem.py |
| 1_34 | 1 | YAMAMA | Dialect Arabic Morphological Analyzer | Morphological Analysis | Basic Language Analysis | Morphological Analysis | Arabic Dialects | https://nyuad.nyu.edu/en/research/centers-labs-and-projects/computational-approaches-to-modeling-language-lab/resources.html |
| 1_35 | 1 | Parser | Arabic Syntactic Analysis and Morphological Disambiguation | Multipurpose | Basic Language Analysis | Syntax Parsing | MSA | https://camel.abudhabi.nyu.edu/camelparser/ |
| 1_36 | 1 | NUDAR | treebank of texts annotated in the Universal Dependency syntactic representation | Syntactic Analysis | Lexical Resources | Treebank | MSA | https://nyuad.nyu.edu/en/research/faculty-labs-and-projects/computational-approaches-to-modeling-language-lab/research/arabic-natural-language-processing.html |
| 1_37 | 1 | CONLL-UL | Universal Morphological Lattices for Universal Dependency Parsing | Syntactic Analysis | Basic Language Analysis | Morphological Analysis | MSA | https://github.com/conllul/conllul.github.io |
| 1_38 | 1 | Arabic News Article Classification | corpus, text classification | Text Classification | Semantic Analysis | Text classification | MSA | https://github.com/saidziani/Arabic-News-Article-Classification |
| 1_39 | 1 | Sinai Corpus | tagged sentences | Multipurpose | Lexical Resources | Corpus | MSA | https://github.com/mohabmes/Sinai-corpus |
| 1_40 | 1 | Prague Arabic Dependency Treebank | multi-level linguistic annotations over the language of Modern Standard Arabic | Multipurpose | Lexical Resources | Treebank | MSA | https://ufal.mff.cuni.cz/padt/PADT_1.0/docs/index.html |
| 1_41 | 1 | United Nations Parallel Corpus | six-language parallel corpus, Machine Translation | Translation | Lexical Resources | Corpus | Multi | https://conferences.unite.un.org/UNCorpus |
| 1_42 | 1 | Parallel corpus (60 languages including MSA) | 1165 languages | Translation | Lexical Resources | Corpus | Multi | http://opus.nlpl.eu/OpenSubtitles2016.php |
| 1_43 | 1 | TUFS Media Corpus | Parallel corpus | Translation | Lexical Resources | Corpus | Multi | http://el.tufs.ac.jp/tufsmedia-corpus/ |
| 1_44 | 1 | Sentiment corpus | Labeled corpus | Sentiment Analysis | Lexical Resources | Corpus | MSA | https://github.com/iamaziz/ar-embeddings/tree/master/datasets |
| 1_45 | 1 | LABR: A Large-Scale Arabic Book Reviews Dataset | Large-Scale Arabic Book Reviews Dataset | Multipurpose | Lexical Resources | Corpus | MSA | https://github.com/mahmoudnabil/labr |
| 1_46 | 1 | Character-Aware Neural Language Models | Character-Aware Neural Language Models | Language Modeling | Language Modeling | Language Model | Multi | https://github.com/yoonkim/lstm-char-cnn |
| 1_47 | 1 | Quranic corpus | Classical Arabic Corpus | Multipurpose | Lexical Resources | Corpus | CA | http://corpus.quran.com/ |
| 1_48 | 1 | Shamela corpus | Online Library | Multipurpose | Lexical Resources | Database | CA, MSA | https://shamela.ws/ |
| 1_49 | 1 | Quranic corpus | corpus | Multipurpose | Lexical Resources | Corpus | CA | http://textminingthequran.com/ |
| 1_50 | 1 | QuranAnalysis (QA) Project | Semantic Search and Intelligence System for the Quran | Information retrieval | Semantic Analysis | Information retrieval | CA | https://github.com/karimouda/qurananalysis |
| 1_51 | 1 | Translation of Quran | corpus | Translation | Lexical Resources | Corpus | CA | http://tanzil.net/trans/ |
| 3_1 | 3 | Arabic emoticon lexicon | Sentiment analysis lexicon | Sentiment Analysis | Lexical Resources | Lexicon | Multi | https://www.saifmohammad.com/WebPages/NRC-Emotion-Lexicon.htm |
| 3_2 | 3 | ArSenL | Sentiment analysis lexicon | Sentiment Analysis | Lexical Resources | Lexicon | MSA | http://oma-project.com/ArSenL/download_intro |
| 3_3 | 3 | NileULex | Sentiment analysis lexicon | Sentiment Analysis | Lexical Resources | Lexicon | Egyptian, MSA | https://github.com/NileTMRG/NileULex |
| 3_4 | 3 | ASTD + python code | Sentiment analysis dataset | Sentiment Analysis | Lexical Resources | Corpus | MSA | https://github.com/mahmoudnabil/ASTD |
| 3_5 | 3 | Large Multi-Domain Resources for Arabic Sentiment Analysis | Sentiment analysis dataset | Sentiment Analysis | Lexical Resources | Database | MSA | https://github.com/hadyelsahar/large-arabic-sentiment-analysis-resouces |
| 3_6 | 3 | ArTwitter | Sentiment analysis dataset | Sentiment Analysis | Lexical Resources | Corpus | MSA, Jordanian | https://archive.ics.uci.edu/ml/datasets/Twitter+Data+set+for+Arabic+Sentiment+Analysis |
| 4_1 | 4 | Tim Buckwalter’s morphological analyzer | Morphological analyzer | Morphological Analysis | Basic Language Analysis | Morphological Analysis | MSA | http://www.qamus.org |
| 4_2 | 4 | Sarf engine | engine that can generate Arabic verbs, nouns, gerunds, and adjectives from their roots | Morphological Analysis | Basic Language Analysis | Morphological Analysis | MSA | http://sourceforge.net/projects/sarf |
| 5_1 | 5 | Al-Hayat Arabic Corpus | for information retrieval purposes | Information retrieval | Lexical Resources | Corpus | MSA | http://catalog.elra.info/product_info.php?products_id=632 |
| 5_2 | 5 | Amara | open multilingual collection of subtitles for educational videos | Translation | Lexical Resources | Corpus | Multi | http://alt.qcri.org/resources/qedcorpus/ |
| 5_3 | 5 | Arab-Acquis | machine translation between Arabic and 22 European languages | Translation | Lexical Resources | Corpus | Multi | https://camel.abudhabi.nyu.edu/arabacquis/ |
| 5_4 | 5 | Arabic English Parallel News | parallel corpus, Machine translation | Translation | Lexical Resources | Corpus | MSA, English | https://catalog.ldc.upenn.edu/LDC2004T18 |
| 5_5 | 5 | Arabic Treebank | automatic content extraction, cross-lingual information retrieval, information detection | Multipurpose | Lexical Resources | Treebank | MSA | https://catalog.ldc.upenn.edu/LDC2005T20 |
| 5_6 | 5 | Aranea Web Corpora | corpora | Multipurpose | Lexical Resources | Database | Multi | http://unesco.uniba.sk/guest/ |
| 5_7 | 5 | arTenTen | POS, part of speech tagging | Multipurpose | Lexical Resources | Corpus | Multi | https://www.sketchengine.co.uk/artenten-corpus/ |
| 5_8 | 5 | KSUCCA | 50 million tokens annotated corpus of Classical Arabic | Multipurpose | Lexical Resources | Corpus | CA | https://sourceforge.net/projects/ksucca-corpus/ |
| 5_9 | 5 | Multilingual Multi-Document Summarization Corpus | summarization corpus | Text Summarization | Lexical Resources | Corpus | Multi | http://multiling.iit.demokritos.gr/pages/view/1540/task-mms-multi-document-summarization-data-and-information |
| 5_10 | 5 | NEMLAR Corpus | Arabic text from 13 different domains | Multipurpose | Lexical Resources | Corpus | MSA | https://old.linguistlist.org/issues/17/17-2368.html |
| 5_11 | 5 | OntoNotes | Multilingual Corpora | Multipurpose | Lexical Resources | Corpus | Multi | https://catalog.ldc.upenn.edu/LDC2013T19 |
| 5_12 | 5 | The International Corpus of Arabic | corpora | Multipurpose | Lexical Resources | Corpus | MSA | http://www.bibalex.org/ica |
| 5_13 | 5 | WIT3 | Web Inventory of Transcribed and Translated Talks | Multipurpose | Lexical Resources | Database | Multi | https://wit3.fbk.eu/ |
| 6_1 | 6 | Ajdir Corpora | Monolingual Corpora | Multipurpose | Lexical Resources | Corpus | MSA | http://aracorpus.e3rab.com/argistestsrv.nmsu.edu/AraCorpus/ |
| 6_2 | 6 | OSAC | Monolingual Corpora | Multipurpose | Lexical Resources | Corpus | MSA | https://sourceforge.net/projects/ar-text-mining/files/Arabic-Corpora/ |
| 6_3 | 6 | Alwatan | Monolingual Corpora | Multipurpose | Lexical Resources | Corpus | MSA | http://sourceforge.net/projects/arabiccorpus/ |
| 6_4 | 6 | Tashkeela | Monolingual Corpora | Diacritization | Lexical Resources | Corpus | MSA | http://sourceforge.net/projects/tashkeela/ |
| 6_5 | 6 | Al Khaleej | Monolingual Corpora | Multipurpose | Lexical Resources | Corpus | MSA | http://sourceforge.net/projects/arabiccorpus/ |
| 6_6 | 6 | KACST Arabic Newspaper Corpus | Monolingual Corpora | Multipurpose | Lexical Resources | Corpus | MSA | http://sourceforge.net/projects/kacst-acptool/files/?source=navbar |
| 6_7 | 6 | Arabic Words Corpora | Monolingual Corpora | Multipurpose | Lexical Resources | Corpus | MSA | http://sourceforge.net/projects/arabicwordcorpu/files/ |
| 6_8 | 6 | Corpus of Contemporary Arabic | Monolingual Corpora | Multipurpose | Lexical Resources | Corpus | MSA | http://shachi.org/resources/4051 |
| 6_9 | 6 | CRI KACST Arabic Corpus | Monolingual Corpora | Multipurpose | Lexical Resources | Corpus | MSA | http://cri.kacst.edu.sa/Resources/TRN_DB.rar |
| 6_10 | 6 | Arabic Learners Written Corpus | Monolingual Corpora | Multipurpose | Lexical Resources | Database | MSA | https://cercll.arizona.edu/arabic-corpus/ |
| 6_11 | 6 | MEEDAN Translation Memory | Multilingual Corpora | Translation | Lexical Resources | Corpus | MSA, English | https://github.com/anastaw/Meedan-Memory |
| 6_12 | 6 | Tunisian Dialect Corpus (TuDiCoI) | Dialectal Corpora | Multipurpose | Lexical Resources | Corpus | Tunisian | https://sites.google.com/site/anlprg/outils-et-corpus-realises/TuDiCoIV1.xml?attredirects=0 |
| 6_13 | 6 | KACST Arabic Corpus | Web based Corpora | Multipurpose | Lexical Resources | Database | MSA | https://corpus.kacst.edu.sa/ |
| 6_14 | 6 | Leeds Arabic Internet Corpus | Web based Corpora | Multipurpose | Lexical Resources | Database | Multi | http://corpus.leeds.ac.uk/query-ar.html |
| 6_15 | 6 | ArabiCorpus | Web based Corpora | Multipurpose | Lexical Resources | Corpus | MSA | http://arabicorpus.byu.edu/ |
| 6_16 | 6 | QURANY | Web based Corpora | Multipurpose | Lexical Resources | Corpus | CA | https://corpus.quran.com/ |
| 6_17 | 6 | ANERCorp | Named Entity Corpora | Named Entity Recognition | Lexical Resources | Corpus | MSA | http://curtis.ml.cmu.edu/w/courses/index.php/ANERcorp |
| 6_18 | 6 | AQMAR Named Entity Corpus | Named Entity Corpora | Named Entity Recognition | Lexical Resources | Corpus | MSA | http://www.ark.cs.cmu.edu/ArabicNER/ |
| 6_19 | 6 | Named Entity Translation Lexicon | Named Entity Corpora | Named Entity Recognition | Lexical Resources | Corpus | MSA | http://nlp.qatar.cmu.edu/resources/NETLexicon/ |
| 6_20 | 6 | Named Entities List | Named Entity Corpora | Named Entity Recognition | Lexical Resources | Gazetteer | MSA | https://sourceforge.net/projects/arabicnes/ |
| 6_21 | 6 | ANERGazet | Named Entity Corpora | Named Entity Recognition | Lexical Resources | Gazetteer | MSA | http://curtis.ml.cmu.edu/w/courses/index.php/ANERgazet |
| 6_22 | 6 | Qatar Arabic Language Bank (QALB) | Error-Annotated Corpora | Orthographic Consistency | Lexical Resources | Corpus | MSA | http://nlp.qatar.cmu.edu/qalb/sharedtask/shared_task.html |
| 6_23 | 6 | Arabic Learner Corpus | Error-Annotated Corpora | Orthographic Consistency | Lexical Resources | Corpus | MSA, Saudi | https://www.arabiclearnercorpus.com/ |
| 6_24 | 6 | The Quranic Arabic Corpus | Miscellaneous Annotated Corpora | Multipurpose | Lexical Resources | Corpus | CA | http://corpus.quran.com/download/ |
| 6_25 | 6 | AQMAR Arabic Wiki. Supersense Corpus | Miscellaneous Annotated Corpora | Nominal Supersenses | Lexical Resources | Corpus | MSA | http://www.ark.cs.cmu.edu/ArabicSST/ |
| 6_26 | 6 | Khoja POS tagged corpus | Miscellaneous Annotated Corpora | Multipurpose | Lexical Resources | Corpus | MSA | http://zeus.cs.pacificu.edu/shereen/research.htm#corpora |
| 6_27 | 6 | Arabic Wikipedia Dependency Corpus | Miscellaneous Annotated Corpora, syntax dependency | Multipurpose | Lexical Resources | Corpus | MSA | http://www.ark.cs.cmu.edu/ArabicDeps/ |
| 6_28 | 6 | BAMA 1.0 English-Arabic Lexicon | Lexical Databases List, POS tagging dataset | Multipurpose | Lexical Resources | Lexicon | Multi | http://catalog.ldc.upenn.edu/LDC2002L49 |
| 6_29 | 6 | Arabic-English Learner's Dictionary | Lexical Databases List | Multipurpose | Lexical Resources | Dictionary | Multi | http://www.perseus.tufts.edu/hopper/opensource/download |
| 6_30 | 6 | Unitex Arabic Package | Lexical Databases List | Multipurpose | Lexical Resources | Corpus | MSA | http://www-igm.univ-mlv.fr/~unitex/index.php?page=3&htm |
| 6_31 | 6 | ARALEX Online | Lexical Databases List | Multipurpose | Lexical Resources | Lexicon | MSA | https://aralex.mrc-cbu.cam.ac.uk/aralex.online/login.jsp |
| 6_32 | 6 | AraComLex Arabic Lexical Database | Lexical Databases List | Morphological Analysis | Lexical Resources | Database | MSA | http://sourceforge.net/projects/aracomlex/files/ |
| 6_33 | 6 | Arabic VerbNet | Lexical Databases List | Multipurpose | Lexical Resources | VerbNet | MSA | https://github.com/JaouadMousser/Arabic-Verbnet |
| 6_34 | 6 | Arabic WordNet | Lexical Databases List | Multipurpose | Lexical Resources | WordNet | MSA, English | http://sourceforge.net/projects/awnbrowser/ |
| 6_35 | 6 | NOOJ Arabic Dictionary | Lexical Databases List | Multipurpose | Lexical Resources | Database | MSA | https://site-nooj.blogspot.com/p/arabic-tutorials.html |
| 6_36 | 6 | Word Count of Modern Standard Arabic | List of Words Lists | Multipurpose | Lexical Resources | List | MSA | http://arabicwordcount.sourceforge.net/ |
| 6_37 | 6 | Arabic Wordlist for Spellchecking | List of Words Lists | Orthographic Consistency | Lexical Resources | List | MSA | http://sourceforge.net/projects/arabic-wordlist/ |
| 6_38 | 6 | Multiword Expressions | List of Words Lists | Preprocessing | Lexical Resources | List | MSA | https://sourceforge.net/projects/arabicmwes/ |
| 6_39 | 6 | Arabic Unknown Words | List of Words Lists | Preprocessing | Lexical Resources | List | MSA | http://arabic-unknowns.sourceforge.net/ |
| 6_40 | 6 | Arabic Stop words | List of Words Lists | Preprocessing | Lexical Resources | List | MSA | http://sourceforge.net/projects/arabicstopwords/ |
| 6_41 | 6 | Obsolete Arabic Words | List of Words Lists | Preprocessing | Lexical Resources | List | MSA | http://obsoletearabic.sourceforge.net/ |
| 6_42 | 6 | Arabic Broken Plurals | List of Words Lists | Multipurpose | Lexical Resources | List | MSA | http://broken-plurals.sourceforge.net/ |
| 6_43 | 6 | AFEWC and Enews Comparable Corpora | Miscellaneous Corpora Types | Multipurpose | Lexical Resources | Corpus | MSA, French, English | http://sourceforge.net/projects/crlcl/ |
| 6_44 | 6 | InAra (a corpus for Arabic Intrinsic plagiarism detection evaluation) | plagiarism detection corpus | Content Moderation | Lexical Resources | Corpus | MSA | https://sourceforge.net/projects/inaracorpus/ |
| 6_45 | 6 | Essex Arabic Summaries Corpus | Miscellaneous Corpora Types | Text Summarization | Lexical Resources | Corpus | MSA | http://sourceforge.net/projects/easc-corpus/ |
| 6_46 | 6 | KALIMAT Multi-Purpose Corpus | Miscellaneous Corpora Types | Multipurpose | Lexical Resources | Corpus | MSA | http://sourceforge.net/projects/kalimat/ |
| 8_1 | 8 | Arabic text recognition competition of ICDAR 2013 | text recognition | Computer Vision | Lexical Resources | Corpus | MSA | https://diuf.unifr.ch/main/diva/APTI/download.html |
| 8_2 | 8 | Arabic handwritten ancient manuscripts called AVICENNA | Handwriting Recognition Corpora | Computer Vision | Lexical Resources | Corpus | CA | http://www.causality.inf.ethz.ch/ul_data/AVICENNA.html |
| 8_3 | 8 | the IIIT Arabic scene text dataset | Recognizing Arabic text in videos | Computer Vision | Lexical Resources | Corpus | MSA | https://cvit.iiit.ac.in/research/projects/cvit-projects/arabic-text-recognition |
| 8_4 | 8 | Tesseract | OCR | Computer Vision | NLP Toolkit | NLP Toolkit | MSA | https://github.com/tesseract-ocr/tesseract |
| 8_5 | 8 | Script identification | Scene Text Script Identification | Computer Vision | NLP Toolkit | NLP Toolkit | Multi | https://github.com/lluisgomez/script_identification |
| 8_6 | 8 | Video Script Identification Competition (CVSI-2015) dataset | Video script | Computer Vision | Lexical Resources | Corpus | Multi | http://www.ict.griffith.edu.au/cvsi2015/ |
| 8_7 | 8 | 2016 Arabic multi-genre broadcast (MGB) challenge | audio, Multipurpose lexical resources | Multipurpose | Lexical Resources | Corpus | MSA | http://www.mgb-challenge.org/before20190909/arabic_download.html |
| 8_8 | 8 | character-level NN for the Arabic dialects identification task of the DSL challenge | Dialect identification | Language Identification | Language Identification | Language Identification | Multi | https://github.com/boknilev/dsl-char-cnn |
| 8_9 | 8 | dialect datasets | Dialect identification, POS | Multipurpose | Lexical Resources | Corpus | Arabic Dialects | http://alt.qcri.org/resources/da_resources/ |
| 8_10 | 8 | sentiment analysis using word embeddings | sentiment analysis code | Sentiment Analysis | Semantic Analysis | Text Classification | MSA | https://github.com/iamaziz/ar-embeddings |
| 8_11 | 8 | sentiment analysis dataset comparison | sentiment analysis dataset | Sentiment Analysis | Lexical Resources | Corpus | MSA | http://saifmohammad.com/WebPages/ArabicSA.html |
| 9_1 | 9 | ANT | Multipurpose corpora | Multipurpose | Lexical Resources | Corpus | MSA | https://antcorpus.github.io/ |
| 9_2 | 9 | CANER | Multipurpose corpora | Multipurpose | Lexical Resources | Corpus | CA | https://github.com/RamziSalah/Classical-Arabic-Named-Entity-Recognition-Corpus |
| 9_3 | 9 | PAAD | Multipurpose corpora | Multipurpose | Lexical Resources | Corpus | MSA | https://data.mendeley.com/datasets/spvbf5bgjs/2 |
| 9_4 | 9 | RCATS | Multipurpose corpora | Multipurpose | Lexical Resources | Corpus | MSA | https://fstf.fst-usmba.ac.ma/laboratoires/lsia/RCATSS/index.html |
| 9_5 | 9 | ANS | Multipurpose corpora | Multipurpose | Lexical Resources | Corpus | MSA | https://github.com/latynt/ans |
| 9_6 | 9 | DZDC12 | Multipurpose corpora | Multipurpose | Lexical Resources | Corpus | MSA | https://bit.ly/3uqX6bb |
| 9_7 | 9 | Kunuz | Multipurpose corpora | Multipurpose | Lexical Resources | Corpus | CA | http://jarir.tn/kunuzcorpus |
| 9_8 | 9 | OpenITI-proc corpus | Multipurpose corpora | Multipurpose | Lexical Resources | Corpus | CA | https://zenodo.org/record/2535593#.Yvo5EXZByHt |
| 9_9 | 9 | N/A | Multipurpose corpora | Multipurpose | Lexical Resources | Corpus | MSA | http://www.cs.cmu.edu/~fraisi/arabic/arparallel/ |
| 9_10 | 9 | N/A | Multipurpose corpora | Multipurpose | Lexical Resources | Lexicon | MSA, English | https://github.com/Hkiri-Emna/Named_Entities_Lexicon_Project |
| 9_11 | 9 | The Saudi Dialect Irony Dataset (Sa`7r ساخر) | Multipurpose corpora | Multipurpose | Lexical Resources | Lexicon | Saudi | https://github.com/iwan-rg/Saudi-Dialect-Irony-Dataset |
| 9_12 | 9 | MADAR | Dialectal Corpora | Multipurpose | Lexical Resources | Corpus | Multi | https://sites.google.com/nyu.edu/madar/ |
| 9_13 | 9 | Arabic Hate Speech Dataset | Hate speech detection | Content Moderation | Lexical Resources | Corpus | Arabic Dialects | https://github.com/sbalsefri/ArabicHateSpeechDataset |
| 9_14 | 9 | MADAR | Spelling correction corpus | Orthographic Consistency | Lexical Resources | Corpus | Multi | https://nyuad.nyu.edu/en/research/faculty-labs-and-projects/computational-approaches-to-modeling-language-lab/resources.html |
| 9_15 | 9 | DAICT | Arabic irony corpus | Irony Detection | Lexical Resources | Corpus | MSA, Arabic Dialects | https://www.hbku.edu.qa/en/DAICT |
| 9_16 | 9 | Shami | Dialectal Corpora | Multipurpose | Lexical Resources | Corpus | Syrian | https://github.com/GU-CLASP/shami-corpus |
| 9_17 | 9 | N/A | Sentiment Analysis Corpora | Sentiment Analysis | Lexical Resources | Corpus | MSA | https://rb.gy/vea9g7 |
| 9_18 | 9 | RSAC | Sentiment Analysis Corpora | Sentiment Analysis | Lexical Resources | Lexicon | MSA | https://github.com/asooft/Sentiment-Analysis-Hotel-Reviews-Dataset |
| 9_19 | 9 | Moarlex | Sentiment Analysis Corpora | Sentiment Analysis | Lexical Resources | Lexicon | MSA, Egyptian | https://github.com/Mohabyoussef09/MoArLex |
| 9_20 | 9 | AraSenti Lexicon | Sentiment Analysis Corpora | Sentiment Analysis | Lexical Resources | Lexicon | MSA | https://github.com/nora-twairesh/AraSenti |
| 9_21 | 9 | Multi-domain Arabic Sentiment Corpus (MASC) | Sentiment Analysis Corpora | Sentiment Analysis | Lexical Resources | Lexicon | MSA | https://github.com/almoslmi/masc |
| 9_22 | 9 | The Arabic Dialect Identification for 17 countries (ADI17) Dataset | Speech Corpora | Language Identification | Lexical Resources | Corpus | Arabic Dialects | https://bit.ly/3kon1vo |
| 9_23 | 9 | Arabic Dialect Identification Corpora | Speech Corpora | Language Identification | Lexical Resources | Corpus | Multi | https://www.kaggle.com/datasets/corpora4research/arpod-corpus-based-on-arabic-podcasts |
| 9_24 | 9 | SmartATID | Image Corpora | Computer Vision | Lexical Resources | Corpus | MSA | https://sites.google.com/site/smartatid/ |
| 9_25 | 9 | ASAYAR | Image Corpora | Computer Vision | Lexical Resources | Corpus | Maghrebi | https://vcar.github.io/ASAYAR/ |
| 11_1 | 11 | HARD: Hotel Arabic-Reviews Dataset | hotel reviews dataset | Multipurpose | Lexical Resources | Corpus | MSA, Arabic Dialects | https://github.com/elnagara/HARD-Arabic-Dataset |
| 11_2 | 11 | BRAD: Books Reviews in Arabic Dataset | Book reviews dataset | Multipurpose | Lexical Resources | Corpus | MSA, Arabic Dialects | https://github.com/elnagara/BRAD-Arabic-Dataset |
| 11_3 | 11 | AOC dataset | Corpus | Multipurpose | Lexical Resources | Corpus | MSA | https://github.com/sjeblee/AOC/blob/master/stuff-from-omar/AOC_readme.txt |
| 11_4 | 11 | Nuanced Arabic Dialect Identification Shared Task Series (NADI) | Dialect identification | Language Identification | Language Identification | Language Identification | MSA, Arabic Dialects | https://github.com/UBC-NLP/nadi |
| 11_5 | 11 | BBN/AUB DARPA Babylon Levantine Arabic Speech and Transcripts | Speech recognition, speech to speech translation | Multipurpose | Lexical Resources | Corpus | Levantine | https://catalog.ldc.upenn.edu/LDC2005S08 |
| 11_6 | 11 | Spoken Arabic Regional Archive (SARA) | Spoken Arabic dialects | Multipurpose | Lexical Resources | Database | Arabic Dialects | https://data.mendeley.com/datasets/btfx5pw2rm/2 |
| 13_1 | 13 | The open parallel corpus | collection of translated texts from the web | Multipurpose | Lexical Resources | Corpus | Multi | http://opus.nlpl.eu/ |
| 13_2 | 13 | OpenNMT | An open source neural machine translation system | Translation | NLP Toolkit | NLP Toolkit | Multi | https://opennmt.net/ |
| 13_3 | 13 | Fairseq | a sequence modeling toolkit | Multipurpose | NLP Toolkit | NLP Toolkit | Multi | https://github.com/facebookresearch/fairseq |
| 13_4 | 13 | Tensor2Tensor | a library of deep learning models and datasets | Multipurpose | NLP Toolkit | NLP Toolkit | Multi | https://github.com/tensorflow/tensor2tensor |
| 13_5 | 13 | Moses | statistical machine translation system | Translation | NLP Toolkit | NLP Toolkit | Multi | http://www2.statmt.org/moses/ |
| 13_6 | 13 | Phrasal | A statistical machine translation system | Translation | NLP Toolkit | NLP Toolkit | Multi | https://github.com/stanfordnlp/phrasal |
| 13_7 | 13 | Subword-nmt | Subword Neural Machine Translation | Morphological Analysis | NLP Toolkit | NLP Toolkit | Multi | https://github.com/rsennrich/subword-nmt |
| 13_8 | 13 | SentencePiece | an unsupervised text tokenizer and detokenizer | Morphological Analysis | NLP Toolkit | NLP Toolkit | Multi | https://github.com/google/sentencepiece |
| 13_9 | 13 | GIZA++ | Statistical Machine Translation | Translation | NLP Toolkit | NLP Toolkit | Multi | https://github.com/moses-smt/giza-pp |
| 13_10 | 13 | FastText Multilingual | fastText vectors of 78 languages | Multipurpose | NLP Toolkit | NLP Toolkit | Multi | https://github.com/babylonhealth/fastText_multilingual |
| 13_11 | 13 | Gensim | Topic modelling | Multipurpose | NLP Toolkit | NLP Toolkit | Multi | https://radimrehurek.com/gensim/ |
| 13_12 | 13 | FastText | Pre-trained word vectors | Multipurpose | NLP Toolkit | NLP Toolkit | Multi | https://github.com/facebookresearch/fastText/blob/master/docs/crawl-vectors.md |
| 13_13 | 13 | NLG evaluation | Evaluation code for various unsupervised automated metrics for NLG | Multipurpose | NLP Toolkit | NLP Toolkit | Multi | https://github.com/Maluuba/nlg-eval |
| 20_1 | 20 | Adawat | NLP software | Multipurpose | NLP Toolkit | NLP Toolkit | Arabic Dialects | http://adawat.sourceforge.net/ |
| 20_2 | 20 | Salma AI | Financial Chatbot | Chatbot | Semantic Analysis | Dialogue system | MSA | https://salma.ai/home |
| 20_3 | 20 | Adam AI | Islam Chatbot | Chatbot | Semantic Analysis | Dialogue system | MSA | https://iadam.ai/ |
| 20_4 | 20 | Arabic Tools | NLP software | Multipurpose | NLP Toolkit | NLP Toolkit | Multi | https://www.arabitools.com/ |
| 20_5 | 20 | UIMA | NLP software | Multipurpose | NLP Toolkit | NLP Toolkit | Multi | https://uima.apache.org/d/uimaj-current/ |
| 20_6 | 20 | Safar | NLP software | Multipurpose | NLP Toolkit | NLP Toolkit | Multi | http://arabic.emi.ac.ma/safar/ |
| 25_1 | 25 | ACE 2004 Multilingual Training Corpus | Multilingual Corpora | Multipurpose | Lexical Resources | Corpus | Multi | https://catalog.ldc.upenn.edu/LDC2005T09 |
| 25_2 | 25 | Arabic NLP Lexicons | Arabic lexical resources | Multipurpose | Lexical Resources | Corpus | MSA | https://www.cjk.org/data/arabic/nlp/ |
| 25_3 | 25 | The General Architecture for Text Engineering (GATE) | NLP software | Multipurpose | NLP Toolkit | NLP Toolkit | Multi | http://gate.ac.uk/ |
| 25_4 | 25 | LingPipe | A toolkit for text engineering and processing, NLP software | Multipurpose | NLP Toolkit | NLP Toolkit | Multi | http://alias-i.com/lingpipe/ |
| 25_5 | 25 | Yasmet | maximum entropy models | Multipurpose | NLP Toolkit | NLP Toolkit | Multi | http://www-i6.informatik.rwth-aachen.de/web/Software/YASMET.html |
| 25_6 | 25 | CRF++ | NLP software | Multipurpose | NLP Toolkit | NLP Toolkit | Multi | https://taku910.github.io/crfpp/ |
| 25_7 | 25 | Yamcha | Multipurpose CHunk Annotator | Multipurpose | NLP Toolkit | NLP Toolkit | Multi | http://chasen.org/~taku/software/yamcha/ |
| 25_8 | 25 | Weka | Machine Learning Software in Java | Multipurpose | NLP Toolkit | NLP Toolkit | Multi | https://www.cs.waikato.ac.nz/ml/weka/ |
| 25_9 | 25 | NetOwl Extractor | Named Entity Extractor | Multipurpose | NLP Toolkit | NLP Toolkit | Multi | https://www.netowl.com/entity-extraction |
| 25_10 | 25 | About Gazetteer | Lexical resources | Named Entity Recognition | Lexical Resources | Gazetteer | Multi | https://dbpedia.org/page/Gazetteer |
| 29_1 | 29 | MADAMIRA | Package, NLP Toolkit | Multipurpose | NLP Toolkit | NLP Toolkit | MSA | https://camel.abudhabi.nyu.edu/madamira/ |
| 29_2 | 29 | FARASA | Package, NLP Toolkit | Multipurpose | NLP Toolkit | NLP Toolkit | MSA | https://farasa.qcri.org/ |
| 29_3 | 29 | CAMeL | Package, NLP Toolkit | Multipurpose | NLP Toolkit | NLP Toolkit | MSA | https://camel-tools.readthedocs.io/en/latest/ |
| 29_4 | 29 | ARBML | Package, NLP Toolkit | Multipurpose | NLP Toolkit | NLP Toolkit | MSA | https://github.com/ARBML/ARBML |
| 29_5 | 29 | CoreNLP | Package, NLP Toolkit | Multipurpose | NLP Toolkit | NLP Toolkit | MSA | https://stanfordnlp.github.io/CoreNLP/ |
| 29_6 | 29 | UDPipe | Package, NLP Toolkit | Multipurpose | NLP Toolkit | NLP Toolkit | MSA | https://cran.r-project.org/web/packages/udpipe/index.html |
| 29_7 | 29 | Stanza | Package, NLP Toolkit | Multipurpose | NLP Toolkit | NLP Toolkit | MSA | https://stanfordnlp.github.io/stanza/ |
| 29_8 | 29 | Trankit | Package, NLP Toolkit | Multipurpose | NLP Toolkit | NLP Toolkit | MSA | https://github.com/nlp-uoregon/trankit |
| 30_1 | 30 | AraGPT2 | language generation model | Multipurpose | Language Modeling | Language Model | MSA, Arabic Dialects | https://github.com/aub-mind/arabert/tree/master/aragpt2 |
| 30_2 | 30 | Arabert | Language Understanding and Generation | Multipurpose | Feature engineering | Word embeddings | MSA, Arabic Dialects | https://github.com/aub-mind/arabert |
| 36_1 | 36 | Lemur Toolkit | NLP software | Multipurpose | NLP Toolkit | NLP Toolkit | Multi | https://www.lemurproject.org/lemur.php |
| 36_2 | 36 | Arabic Wordnet | wordnet | Multipurpose | Lexical Resources | WordNet | MSA | http://globalwordnet.org/resources/arabic-wordnet/awn-browser/ |
| 36_3 | 36 | Arabic Q&A Dataset | question answering dataset | Question Answering | Lexical Resources | Corpus | MSA | http://xminers.club/2017/07/22/arabic-qa-dataset/ |
| 36_4 | 36 | AR-ASAG-Dataset | The ARabic Dataset for Automatic Short Answer Grading Evaluation | Question Answering | Lexical Resources | Corpus | MSA | https://data.mendeley.com/datasets/dj95jh332j/1 |
| 36_5 | 36 | DAWQAS | A Dataset for Arabic Why Question Answering System | Question Answering | Lexical Resources | Corpus | MSA | https://github.com/masun/DAWQAS |
| 36_6 | 36 | Arabic AskFM Dataset | Islamic question answering dataset | Question Answering | Lexical Resources | Corpus | MSA | https://github.com/Omarito2412/ASKFM |