Original article
https://doi.org/10.15507/2076-2577.26182.170-182
https://elibrary.ru/jvwfmv
УДК / UDC 81’373:621.013:811.511.111:811.161.1
Making the Finnish-Russian Corpus of Phraseological Units: A Case Study of Idioms Meaning “to Die”
N. V. Belkina, D. Yu. Shareshik
Herzen State Pedagogical University of Russia, St. Petersburg, Russian Federation
Abstract
Introduction. A comparative linguistic analysis of the worldview of particular ethnic groups makes it possible to understand the differences between national cultures and the ways in which they complement one another. A Finnish-Russian corpus of phraseological units annotated according to specific criteria provides an opportunity to identify both universal and culture-specific features in the perception of the surrounding world. The creation and linguistic annotation of a specialized bilingual corpus of phraseological expressions constitutes a challenge; however, once resolved, it may yield an effective tool for comparative analysis and machine translation. The aim of the study is to determine the criteria for annotating a corpus of Finnish-Russian phraseological units and to test this annotation framework on a thematic group with the meaning “to die”.
Materials and Methods. The material for the study comprised Finnish and Russian phraseological units belonging to the thematic group “to die”, selected through continuous sampling from Finnish and Russian dictionaries of phraseology, slang, and colloquial speech. The relevance of this research stems from the absence of digital parallel phraseological corpora for the Finnish and Russian languages and, consequently, from the lack of annotation criteria for such a corpus. The pilot corpus, which currently includes 123 Finnish and 136 Russian phraseological units, is intended for comparative linguocultural analysis and for applied machine translation tasks. The methodology is based on a comparative approach that takes into account the thematic-semantic, stylistic, and etymological parameters of phraseological units.
Results and Discussion. A detailed tagging scheme is proposed, including semantic classification, stylistic labels, and indications of origin. In the process of tagging phraseological units meaning “to die,” a set of criteria is employed: a semantic-thematic criterion based on universal semantic metaphor; a semantic criterion taking into account connotative components of meaning; a stylistic criterion; a language pair criterion; and an etymological criterion. In addition to semantic, etymological, and stylistic parameters, the tagging process also considers the presence of derivatives and synonyms, as well as the grammatical features of phraseological units, which is necessary for the subsequent generation of contextually appropriate examples.
Conclusion. The developed corpus is valuable for identifying both universal and culture-specific features in the perception of the concept of death within Finnish and Russian linguistic cultures, and it also provides a foundation for further research in the fields of phraseology and cross-linguistic communication. The creation and tagging of the corpus according to the proposed criteria do not require specialized programming knowledge and can be carried out using open-source software. Such a corpus constitutes a step toward the development of a computational model of the linguistic worldview, which can be employed for processing large-scale data and training artificial intelligence systems.
Keywords: phraseology, linguocultural studies, parallel corpus, semantic annotation, Finnish language, Russian language, concept of death
Conflict of interest: The authors declare no conflict of interest.
For citation: Belkina N.V., Shareshik D.Yu. Making the Finnish-Russian Corpus of Phraseological Units: A Case Study of Idioms Meaning “to Die”. Finno–Ugric World. 2026;18(2):170‒182. https://doi.org/10.15507/2076-2577.26182.170-182
REFERENCES
- Belkina N.V., Lalym A.S., Osmak N.A. On Spatial Relationships in the Finnish Language. Perm University Herald. Russian and Foreign Philology. 2024;16(2):16‒27. (In Russ., abstract in Eng.) https://doi.org/10.17072/2073-6681-2024-2-16-27
- Bikmukhametova R.M. The Language Images of the Concept Death in the World View of Languages of Different Structure (on the Material of the Russian and Bashkir Languages). Modern Problems of Science and Education. 2014;(1). (In Russ., abstract in Eng.) Available at: https://science-education.ru/ru/article/view?id=12022 (accessed 17.10.2025).
- Dobrovolsky D.O. [National and Cultural Specificity in Phraseology]. Voprosy jazykoznanija. 1997;(6):37‒58. (In Russ.) https://elibrary.ru/pyiohb
- Kryukova N.F., Aleksandrova E.V., Isakova E.M. [Idiomatic Equivalence in Terms of Cognitive and Hermeneutic Analysis]. Theory of Language and Intercultural Communication. 2023;(3):150‒158. (In Russ.) https://elibrary.ru/fyiaws
- Pochueva N.N. [The Concept of Equivalence in Phraseology (Based on the Russian and English Languages)]. Nauchnye issledovanija. 2019;(2):19‒21. (In Russ.) https://elibrary.ru/yuraem
- Yusupova S.M. [Comparative Analysis of Idioms about Work (Based on English, German, Russian, and Chechen Languages)]. Moscow University Bulletin. Ser. 19: Linguistics and Intercultural Communication. 2011;(2):140‒146. (In Russ.) https://elibrary.ru/nxpdbj
- McEnery T., Brezina V. Fundamental Principles of Corpus Linguistics. Cambridge: Cambridge University Press; 2022. 313 p.
- Sinclair J. (ed.) How to Use Corpora in Language Teaching. Amsterdam: John Benjamins Publishing; 2004. 307 p.
- McEnery T., Hardie A. Corpus Linguistics: Method, Theory and Practice. Cambridge: Cambridge University Press; 2012. 294 p.
- Dobrovol’skij D.O., Zalizniak A.A. On a Special Type of Necessity Modality: The German Verb Sollen Viewed from Parallel Corpora. Voprosy Jazykoznanija. 2021;(6):22‒39. (In Russ., abstract in Eng.) https://doi.org/10.31857/0373-658X.2021.6.22-39
- Karimova S.G. [Features of the Translation of Adjectival Phraseological Units Using the Potential of Corpus Linguistics (Based on the Material of English and Russian Languages)]. Kazan Science. 2023;(9):214‒216. (In Russ.) https://elibrary.ru/uuwikq
- Panteleeva L.M., Khorobrykh S.V. Spatial Meanings of the Concept “Death” in Perm Phraseology. Philology. Theory & Practice. 2018;(10‒2):330‒337. (In Russ., abstract in Eng.) https://doi.org/10.30853/filnauki.2018-10-2.24
- Orlova O.S. The Topics of Birth and Death in Russian and English Phraseological UNITS. The Metaphor of Way. Journal of Psycholinguistics. 2020;(1):64‒74. (In Russ., abstract in Eng.) https://doi.org/10.30982/2077-5911-2020-41-1-64-74
- Dobrovol’skij D.O. Bilingual Corpus-Based Dictionary of Idioms: Organization of the Dictionary Entry Text. Critique and Semiotics. 2024;(2):124–146. (In Russ., abstract in Eng.) https://doi.org/10.25205/2307-1753-2024-2-124-146
- Belkina N.V., Lalym A.S., Osmak N.A. Valence Structure of Verbs of Motion in Finnish (Using the Example of Idiomatic Expressions with the Verb Mennä). Yearbook of Finno-Ugric Studies. 2025;19(1):6‒17. (In Russ., abstract in Eng.) https://doi.org/10.35634/2224-9443-2025-19-1-6-17
Information about the authors:
Natalia V. Belkina, Cand.Sci. (Ped.), Associate Professor of the Department of Languages of Northern Europe, Herzen State Pedagogical University of Russia (48 Moyka Embankment, St. Petersburg 191186, Russian Federation), ORCID: https://orcid.org/0009-0004-1080-1458, SPIN-code: 9094-6553, bel_nat@mail.ru
Denis Yu. Shareshik, Master’s Student of the Institute of Foreign Languages, Herzen State Pedagogical University of Russia (48 Moyka Embankment, St. Petersburg 191186, Russian Federation), ORCID: https://orcid.org/0009-0003-1641-5681, SPIN-code: 5207-3621, shareshik.denis@yandex.ru
Authors’ contribution:
N. V. Belkina – ideas; formulation or evolution of overarching research goals and aims; preparation and creation of the published work, specifically writing the initial draft (including substantive translation); oversight and leadership responsibility for the research activity planning and execution, including mentorship external to the core team.
D. Yu. Shareshik – conducting a research and investigation process, specifically performing the data; verification of the reproducibility of the research results within the framework of the objectives of the research.
All authors have read and approved the final manuscript.
Submitted 13.11.2025; revised 02.12.2025; accepted 13.01.2026.






















