Ethnic Language Processing and Cross Language Processing
WANG Lianxi, LIN Nankai, JIANG Shengyi, DENG Zhiyan
Journal of Chinese Information Processing.
2023, 37(5):
53-69.
Compared with western languages, Hindi is a low resource language in Southeast Asia. Due to the lack of corpus, annotation specifications and computational modeling practices, the studies on Hindi natural language processing have not been well addressed. This paper reviews the research progresses in Hindi natural language processing in terms of the resource construction, part of speech tagging, named entity recognition, syntactic analysis, word sense disambiguation, as well as information retrieval, machine translation, sentiment analysis and automatic summarization. This paper also reveals the issues and challenges in Hindi natural language processing, and outlooks the future development trend.