Generation of nominal word forms of the South Ludic dialect
English
journal number:
Journal’s Subject Headings:
Philology
About author:
A. P. Rodionova Institute of Linguistics, Literature and History, Karelian Research Center of the Russian Academy of Sciences, Petrozavodsk, Russian Federation, [email protected]
N. B. Krizhanovskaya Institute of Applied Mathematical Research, Karelian Research Centre of the Russian Academy of Sciences, Petrozavodsk, Russian Federation, [email protected]
ABSTRACT
Introduction: the Ludic dialect has practically found itself on the periphery of the revitalization processes. There are no academics dictionaries on this dialect in Russia (compared to the Livvi dialect), which affects the low rate of filling of the Ludic sub-corpus in the VepKar. The central part of our work is the developed rules for generating nominal word forms. The rationale for choosing the South Ludic dialect for creating rules is given. The prepared tables demonstrate examples of the Ludiс dialect’s nominal inflection paradigm for single-stem and double-stem words. Inflectional types of names of the South Ludiс dialect are formalized. Rules for identifying stems that are needed for the subsequent generation of nominal word forms of the dialect of Svyatozero are proposed. A program is described, which is part of the VepKar corpus, in which these rules are programmed. The program allows us to speed up the generation of word forms in dictionary entries for subsequent text markup.
Objective: to illustrate the rules for the automatic generation of word forms, prepared using a list of stems of nominal parts of speech of the Ludic dialect of the Karelian language.
Research materials: lemmas and word forms from the Open Corpus of the Vepsian and Karelian languages (VepKar).
Results and novelty of the research: based on the studied theoretical sources, the researchers were able to identify grammatical patterns. In the course of the experiments carried out in the study, a list of the stems and pseudo-stems of the nominal inflection of the Ludic dialect of the Karelian language (Southern Ludic dialect) was formed; a system of rules for generating word forms was developed; a corresponding program was prepared and tested. The scientific novelty of the research lies in the development of a system of unified rules for the automatic generation of word forms for the Ludic dialect
of the Karelian language, which is being implemented for the first time.
Key words: Karelian language, Ludic dialect, Southern Ludic dialect, corpus linguistics, morphology, nominal inflection, generation of word forms
Acknowledgments: the study is carried out under the state order of the Karelian Research Centre of the Russian Academy of Sciences. A. P. Rodionova conducted research within the framework of the research work topic “Baltic-Finnish languages of the North-West of Russia in the conditions of digitalization of scientific knowledge”. State registration number: 124022000089-4. N. B. Krizhanovskaya conducted research within the framework of the research work topic “Mathematical models and methods for research and application of information computing systems and networks”, state registration
number: FMEN-2021-0015.
For citation: Rodionova A. P., Krizhanovskaya N. B. Generation of nominal word forms of the South Ludic dialect // Vestnik ugrovedenia = Bulletin of Ugric Studies. 2024; 14 (3/58): 476–488.
N. B. Krizhanovskaya Institute of Applied Mathematical Research, Karelian Research Centre of the Russian Academy of Sciences, Petrozavodsk, Russian Federation, [email protected]
ABSTRACT
Introduction: the Ludic dialect has practically found itself on the periphery of the revitalization processes. There are no academics dictionaries on this dialect in Russia (compared to the Livvi dialect), which affects the low rate of filling of the Ludic sub-corpus in the VepKar. The central part of our work is the developed rules for generating nominal word forms. The rationale for choosing the South Ludic dialect for creating rules is given. The prepared tables demonstrate examples of the Ludiс dialect’s nominal inflection paradigm for single-stem and double-stem words. Inflectional types of names of the South Ludiс dialect are formalized. Rules for identifying stems that are needed for the subsequent generation of nominal word forms of the dialect of Svyatozero are proposed. A program is described, which is part of the VepKar corpus, in which these rules are programmed. The program allows us to speed up the generation of word forms in dictionary entries for subsequent text markup.
Objective: to illustrate the rules for the automatic generation of word forms, prepared using a list of stems of nominal parts of speech of the Ludic dialect of the Karelian language.
Research materials: lemmas and word forms from the Open Corpus of the Vepsian and Karelian languages (VepKar).
Results and novelty of the research: based on the studied theoretical sources, the researchers were able to identify grammatical patterns. In the course of the experiments carried out in the study, a list of the stems and pseudo-stems of the nominal inflection of the Ludic dialect of the Karelian language (Southern Ludic dialect) was formed; a system of rules for generating word forms was developed; a corresponding program was prepared and tested. The scientific novelty of the research lies in the development of a system of unified rules for the automatic generation of word forms for the Ludic dialect
of the Karelian language, which is being implemented for the first time.
Key words: Karelian language, Ludic dialect, Southern Ludic dialect, corpus linguistics, morphology, nominal inflection, generation of word forms
Acknowledgments: the study is carried out under the state order of the Karelian Research Centre of the Russian Academy of Sciences. A. P. Rodionova conducted research within the framework of the research work topic “Baltic-Finnish languages of the North-West of Russia in the conditions of digitalization of scientific knowledge”. State registration number: 124022000089-4. N. B. Krizhanovskaya conducted research within the framework of the research work topic “Mathematical models and methods for research and application of information computing systems and networks”, state registration
number: FMEN-2021-0015.
For citation: Rodionova A. P., Krizhanovskaya N. B. Generation of nominal word forms of the South Ludic dialect // Vestnik ugrovedenia = Bulletin of Ugric Studies. 2024; 14 (3/58): 476–488.