Article

The Problem of Words Undergoing Sound Changes in Uzbek Stemmers

Central Asian Journal of Literature, Philosophy and Culture, 4 (6): 107-114 (June 2023)

Abstract

Stemming is one of the most common preprocessing steps and applies to almost all Natural Language Processing (NLP) projects. It removes part of a word or reduces the word to its root. Several stemming algorithms exist for deciding where to cut a word. Determining the stem of Uzbek words raises several problems: homonymy of roots and suffixes, sound changes that occur when a suffix is added to a word, and the stemming of neologisms and named entities. This article presents models for handling sound changes in words when stemming texts of the Uzbek language corpus.
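
As a rough illustration of the sound-change problem named in the abstract, the sketch below strips a possessive suffix and then tries to undo two well-known Uzbek alternations: stem-final k/q voicing to g/g' before a vowel-initial suffix (yurak + im = yuragim), and loss of the second stem vowel (burun + i = burni). It is not the model proposed in the article. The suffix list, the exception lexicon and the function name `stem` are all assumptions chosen for the example.

```python
# Illustrative sketch only; NOT the models presented in the article.
# The suffix list and exception lexicon are small, hand-picked assumptions.

# Possessive suffixes to strip, ordered so longer suffixes are tried first.
SUFFIXES = ["imiz", "ingiz", "im", "ing", "i"]

# Stem-final voicing to undo after stripping: g' -> q, g -> k.
# "g'" is listed first so it is not mistaken for a plain "g".
RESTORE_FINAL = [("g'", "q"), ("g", "k")]

# Hypothetical exception lexicon for stems that lose a vowel before a suffix.
VOWEL_DROP = {"burn": "burun", "o'g'l": "o'g'il", "shahr": "shahar"}


def stem(word: str) -> str:
    """Strip one possessive suffix and repair the resulting stem."""
    for suffix in SUFFIXES:
        # Require at least three characters to remain after stripping.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            base = word[: -len(suffix)]
            break
    else:
        return word  # no suffix matched; leave the word unchanged

    # Restore a dropped vowel if the truncated stem is a known exception.
    if base in VOWEL_DROP:
        return VOWEL_DROP[base]

    # Undo consonant voicing at the end of the stem.
    for voiced, voiceless in RESTORE_FINAL:
        if base.endswith(voiced):
            return base[: -len(voiced)] + voiceless
    return base


for w in ["yuragim", "qishlog'i", "burni", "kitobim"]:
    print(w, "->", stem(w))
```

The voicing rule here overgeneralizes: a stem that genuinely ends in g (for example barg, which gives bargi) would be wrongly restored to a form ending in k. A practical stemmer would need to check candidate stems against a lexicon.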
