SIRATa : a Real-Time Indexing Arabic Text Editor Based on the Extraction of Keywords

dc.contributor.authorDilekh, Tahar
dc.contributor.authorBenharzallah, Saber
dc.contributor.authorMokeddem, Ayoub
dc.date.accessioned2024-03-12T18:40:22Z
dc.date.available2024-03-12T18:40:22Z
dc.date.issued2021-05-25
dc.description.abstractIndexing stage in information retrieval process has a great importance as an essential tool for the performance of recall and precision. Despite the many studies that have been done on the indexing conducted in the last few decades, to our knowledge, no study has investigated whether indexing realtime based on keywords extraction is efficient to perform of recall and precision. Moreover, relatively fewer Arabic text indexing studies are currently available despite the enormous efforts put together to satisfy the needs of the growing number of Arabic internet users. This paper suggests a method for Arabic text indexing based on keywords extraction. The proposed method consists of two stages. The first stage conducts a real-time indexing. The second stage is a keywords extraction and updating of initial index taking into account the output of keywords extraction process. We illustrate application and the performance of this method of indexing using an Arabic text editor (SIRAT) developed and designed for this aim. We also illustrate the process of building a new form of Arabic corpus appropriate to conduct the necessary experiments. Our findings show that SIRAT successfully identifies the keywords most relevant to the document. Finally, the main contribution of this experiment is to demonstrate the effectiveness of this method compared to other methods. In addition, the paper proposes a solution to issues and deficiencies Arabic language processing suffers from, especially regarding corpora building and keywords extraction evaluation systems.
dc.identifier.isbn978-9931-9788-0-0
dc.identifier.urihttp://dspace.univ-oeb.dz:4000/handle/123456789/18734
dc.language.isoen
dc.publisherUniversity of Oum El Bouaghi
dc.subjectNLPb; Arabic text indexing; real-time indexing; Arabic keywords extraction; Arabic information retrieval system.
dc.titleSIRATa : a Real-Time Indexing Arabic Text Editor Based on the Extraction of Keywords
dc.typeArticle
Files
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
SIRATa ; a Real-Time Indexing Arabic Text Editor Based on the Extraction of Keywords.pdf
Size:
614.48 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: