Paper Title
NLTK-based Processing of Sanskrit Text
Abstract
This paper discusses the use and significance of the Natural Language Toolkit (NLTK), an open source library for
computational linguistics and Natural Language Processing (NLP). Extracting and processing text is a challenging
task in NLP because it requires large corpora to be processed with various NLP tools and repositories.
Research is being carried out on most Indian and foreign languages by analyzing their grammatical aspects
and then implementing that analysis using natural language repositories. This paper focuses on processing
Indian languages, specifically Sanskrit, which possesses a definite rule-based structure given by Panini and has great
potential in the field of semantic extraction.
Index Terms - Natural Language Processing, Python, tokenizers, part-of-speech taggers, parsers, WordNet, Ashtadhyayi