Ιατρικά Άρθρα: Unicode-8 based linguistics data set of annotated Sindhi text

Τετάρτη 4 Ιουλίου 2018

Unicode-8 based linguistics data set of annotated Sindhi text

Publication date: August 2018
Source:Data in Brief, Volume 19
Author(s): Mazhar Ali Dootio, Asim Imdad Wagan
Sindhi Unicode-8 based linguistics data set is multi-class and multi-featured data set. It is developed to solve the natural languages processing (NLP) and linguistics problems of Sindhi language. The data set presents information on grammatical and morphological structure of Sindhi language text as well as sentiment polarity of Sindhi lexicons. Therefore, data set may be used for information retrieving, machine translation, lexicon analysis, language modeling analysis, grammatical and morphological analysis, Semantic and sentiment analysis.

https://ift.tt/2u4ivsl

Ιατρικά Άρθρα

Ετικέτες

Τετάρτη 4 Ιουλίου 2018

Unicode-8 based linguistics data set of annotated Sindhi text

Δεν υπάρχουν σχόλια:

Δημοσίευση σχολίου

Multifunctional Two-Dimensional Bi<sub>2</sub>Se<sub>3</sub> Nanodiscs for Anti-Inflammatory Therapy of Inflammatory Bowel Diseases

Αναζήτηση αυτού του ιστολογίου

Αναφορά κατάχρησης