I am writing an open-source library for analyzing and synthesizing Russian word forms. That is, given a word, phrase, or sentence, it can report the grammatical form of each lexeme in the text (analysis) and, conversely, convert a given word or phrase into a specified grammatical form (synthesis).

For nominals (i.e., nouns, adjectives, numerals, participles, etc.: everything except verbs), there is an excellent work by A. A. Zaliznyak, "Russian Nominal Inflection" ("Русское именное словоизменение"), which analyzes all cases thoroughly and gives a specific, complete algorithm for analyzing and synthesizing word forms. Zaliznyak's algorithm accounts for all the subtle points (for example, most libraries cannot correctly handle both "car brakes" and "brakes of economic development", among many other cases).

Is there a similar work on verb conjugation?

  • The pymorphy2 library uses data from the opencorpora project, and you may also find what you need there. - insolor
