Minimum redundancy and maximum relevance for single and multi-document Arabic text summarization
dc.contributor.author | Oufaida, Houda | |
dc.contributor.author | Nouali, Omar | |
dc.contributor.author | Blache, Philippe | |
dc.date.accessioned | 2023-10-04T18:55:27Z | |
dc.date.available | 2023-10-04T18:55:27Z | |
dc.date.issued | 2014-12 | |
dc.description.abstract | Automatic text summarization aims to produce summaries for one or more texts using machine techniques. In this paper, we propose a novel statistical summarization system for Arabic texts. Our system uses a clustering algorithm and an adapted discriminant analysis method: mRMR (minimum redundancy and maximum relevance) to score terms. Through mRMR analysis, terms are ranked according to their discriminant and coverage power. Second, we propose a novel sentence extraction algorithm which selects sentences with top ranked terms and maximum diversity. Our system uses minimal language-dependant processing: sentence splitting, tokenization and root extraction. Experimental results on EASC and TAC 2011 MultiLingual datasets showed that our proposed approach is competitive to the state of the art systems. | |
dc.identifier.doi | https://doi.org/10.1016/j.jksuci.2014.06.008 | |
dc.identifier.issn | 1319-1578 | |
dc.identifier.uri | https://dl.cerist.dz/handle/CERIST/982 | |
dc.publisher | Elsevier | |
dc.relation.ispartofseries | Journal of King Saud University - Computer and Information Sciences; 26(4) | |
dc.relation.pages | 450-461 | |
dc.structure | Web Sémantique et Langue Arabe | |
dc.subject | Arabic text summarization | |
dc.subject | Sentence extraction | |
dc.subject | mRMR | |
dc.subject | Minimum redundancy | |
dc.subject | Maximum relevance | |
dc.title | Minimum redundancy and maximum relevance for single and multi-document Arabic text summarization | |
dc.type | Article |