
This project aims to extract keywords from a text document.
Every text document is written to comment on, cite, confirm, or deny an idea. Each document has its own ideas that distinguish it from others, and it contains a set of words that signal its general content to readers. These words or phrases are called keywords. Keywords are the core of a text: they convey its main content in a few words, so the reader does not have to read the whole text.
In this project, after the text is entered, stop words are removed first, and the remaining words are weighted according to how often they are repeated in the text. A word that is repeated several times is a likely keyword.
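The weighting step described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the function name `word_weights`, the tiny `STOP_WORDS` set, and the regex-based tokenizer are all assumptions made for the example.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list for illustration only;
# a real list for English or Persian would be much longer.
STOP_WORDS = {"the", "a", "an", "is", "of", "and", "to", "in", "it", "on"}

def word_weights(text, stop_words=STOP_WORDS):
    """Return a Counter mapping each non-stop word to its number of occurrences."""
    # Lowercase the text and split it into runs of word characters.
    # This is a crude tokenizer; Persian text in particular may need
    # extra handling (for example, of the zero-width non-joiner).
    tokens = re.findall(r"\w+", text.lower())
    # Drop stop words and count what remains; the count is the weight.
    return Counter(token for token in tokens if token not in stop_words)

weights = word_weights("The cat sat on the mat. The cat is happy.")
print(weights["cat"])  # 2
```

Here the weight is simply the raw repetition count, which matches the description above; any normalization by document length would be a further design choice.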
In the next step, you choose how many keywords to extract; the keywords are then shown to you so that you can evaluate the system.
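Selecting the requested number of keywords could look something like the sketch below. It assumes the weights are already available as a mapping from word to count; the function name `top_keywords` and the alphabetical tie-breaking rule are assumptions for the example, not something the project specifies.

```python
def top_keywords(weights, n):
    """Return the n words with the highest weights, highest first."""
    # Sort by descending weight; break ties alphabetically so the
    # result is deterministic (an assumption made for this sketch).
    ranked = sorted(weights.items(), key=lambda item: (-item[1], item[0]))
    return [word for word, _ in ranked[:n]]

example_weights = {"keyword": 4, "text": 3, "document": 3, "idea": 1}
print(top_keywords(example_weights, 2))  # ['keyword', 'document']
```

If fewer than `n` distinct words are available, the function simply returns all of them.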
This project is written for both Persian and English, so you can use it to extract keywords in either language.