Institute of Philology of the Siberian Branch of Russian Academy of Sciences
Siberian Journal of Philology
Article

Name: Genre markup of the Tomsk dialect corpus: from concept to implementation

Authors: Zemicheva S. S.

Tomsk State University, Tomsk, Russian Federation

In the section Linguistics

Issue 2, 2022
Pages 312-324
UDK: 81'42 + 811.161.1.28
DOI: 10.17223/18137083/79/22

Abstract:

The study is relevant because it lies at the intersection of two fields: corpus linguistics and communicative dialectology. The paper presents a comparative analysis of corpus practices for spoken-language material. It also describes the process and results of creating a discursively annotated corpus of dialect speech of more than 2 million tokens. Discursive markup labels three parameters of a text: topic, type, and genre. The project is novel in that, for the first time, a large array of dialectal texts has been marked up according to intentional orientation: not only folklore genres but also speech genres have been annotated. The value of the new source lies in its combination of archived data with current materials. A methodological advantage of the corpus is that it allows qualitative and quantitative analysis to be combined. The paper describes the principles of genre markup in the Tomsk dialect corpus and identifies the factors that influence the composition of genres in it. The results make it possible to determine the quantitative ratio of different speech genres in dialectal communication. The observations are supported by examples from the spoken language of villagers. The conclusions rest on quantitative data and a considerable amount of material (over 16 thousand genre fragments).
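
The three-parameter discursive markup and the genre ratio mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the corpus's actual data model or software: the `Fragment` class, its field names, the `genre_shares` function, and the sample labels are all hypothetical and serve only to show how a quantitative ratio of genres could be derived from annotated fragments.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Fragment:
    """One annotated text fragment (hypothetical structure)."""
    text: str
    topic: str      # what the fragment is about
    text_type: str  # type of the text
    genre: str      # folklore or speech genre label


def genre_shares(fragments):
    """Return the share of each genre among the given fragments."""
    counts = Counter(f.genre for f in fragments)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {genre: n / total for genre, n in counts.items()}


# Invented sample data; the labels are illustrative assumptions only.
SAMPLE = [
    Fragment("placeholder text 1", "family", "narrative", "autobiographical story"),
    Fragment("placeholder text 2", "work", "narrative", "autobiographical story"),
    Fragment("placeholder text 3", "household", "dialogue", "reproach"),
    Fragment("placeholder text 4", "visit", "dialogue", "farewell"),
]

if __name__ == "__main__":
    for genre, share in sorted(genre_shares(SAMPLE).items()):
        print(f"{genre}: {share:.0%}")
```

Keeping the three labels as separate fields lets the same fragments be grouped by topic or text type with the same counting logic.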

Keywords: dialectal corpus, genre markup, speech genre, Russian dialects of Siberia, dialectology, communicative linguistics, spoken language

Bibliography:

Bogdanova-Beglaryan N. V., Blinova O. V., Zaydes K. D., Popova T. I., Sherstinova T. Yu. Korpus estestvennoy rechi: problemy ruchnogo annotirovaniya pragmaticheskikh markerov i puti ikh resheniya [Natural speech corpus: problems and solutions of manual pragmatic markers annotation]. In: Analiz razgovornoy russkoy rechi (ARz-2019). Trudy vos’mogo mezhdistsiplinarnogo seminara [Analysis of colloquial Russian speech (ARz-2019). Proceedings of the eighth interdisciplinary seminar]. St. Petersburg, 2019, pp. 5–10.

Borisova I. N. Russkiy razgovornyy dialog: struktura i dinamika [Russian conversational dialogue: structure and dynamics]. Moscow, LIBROKOM, 2009, 320 p.

Dement’ev V. V., Stepanova N. B. Korpusnye metody v issledovanii rechevykh zhanrov: problema klyuchevykh fraz [Corpus genristics: a problem of key phrases]. Speech Genres. 2016, no. 3, pp. 24–41.

Ermolov O. B., Bogdanova-Beglaryan N. V. Yazykovoe oformlenie proshchaniya v sovremennoy razgovornoy rechi (na materiale zvukovogo korpusa “Odin rechevoy den’”) [Linguistic design of “goodbye” situation in modern colloquial speech (based on the material of the speech corpus “one speaker’s day”)]. Communication Studies. 2019, vol. 6, no. 2, pp. 307–331.

Gol’din V. E. Teoreticheskie problemy kommunikativnoy dialektologii [Theoretical problems of communicative dialectology]. Abstract of Dr. philol. sci. diss. Saratov, 1997, 52 p.

Grishina E. A. Mul’timediynyy russkiy korpus (MURCO): problemy annotatsii [Multimedia Russian Corpus (MURCO): problems of annotation]. In: Natsional’nyy korpus russkogo yazyka: 2006–2008. Novye rezul’taty i perspektivy [National corpus of the Russian language: 2006–2008. New results and prospects]. St. Petersburg, 2009, pp. 175–214.

Grishina E. A. Ustnaya rech’ v Natsional’nom korpuse russkogo yazyka [Spoken language in the National Corpus of the Russian language]. In: Natsional’nyy korpus russkogo yazyka: 2003–2005 [National corpus of the Russian language: 2003–2005]. Moscow, 2005, pp. 94–110.

Kachinskaya I. B., Malysheva A. V. Narodnaya rech’ v Natsional’nom korpuse russkogo yazyka [Folk speech in Russian National Corpus]. Russkaya Rech’ (Russian Speech). 2019, no. 4, pp. 103–118.

Kazakova O. A. Dialektnaya yazykovaya lichnost’ v zhanrovom aspekte [Dialectal linguistic personality in the genre aspect]. Tomsk, TSPU Publ., 200 p.

Kibrik A. A., Korotaev N. A., Fedorova O. V., Evdokimova A. A. Edinaya mul’tikanal’naya annotatsiya kak instrument analiza estestvennoy kommunikatsii [Unified multichannel annotation: a tool for analysing natural communication]. In: Komp’yuternaya lingvistika i intellektual’nye tekhnologii: Po materialam ezhegodnoy mezhdunarodnoy konferentsii “Dialog” (Moskva, 29 maya – 1 iyunya 2019 g.) [Computer linguistics and intellectual technologies: On the materials of the annual international conference “Dialogue” (Moscow, May 29 – June 1, 2019)]. Moscow, 2019, iss. 18 (25), pp. 265–280.

Kopotev M. V. Vvedenie v korpusnuyu lingvistiku: Ucheb. posobie dlya studentov filologicheskikh i lingvisticheskikh spetsial’nostey universitetov [Introduction to corpus linguistics]. Prague, Animedia Company, 2014, 195 p.

Kotov A. A., Budyanskaya E. M. Videokorpus obrashcheniy grazhdan po voprosam oplaty kommunal’nykh uslug [Video corpus of dialogues on issues of utility bills payments]. Vestnik Yaroslavskogo gosudarstvennogo universiteta im. P. G. Demidova. Seriya gumanitarnye nauki. 2016, no. 2 (36), pp. 93–99.

Kryuchkova O. Yu., Gol’din V. E. Korpus russkoy dialektnoy rechi: kontseptsiya i parametry otsenki [A corpus of Russian dialectal speech: the concept and parameters of evaluation]. In: Komp’yuternaya lingvistika i intellektual’nye tekhnologii: Po materialam ezhegodnoy mezhdunarodnoy konferentsii “Dialog” (Moskva, 25–29 maya 2011 g.) [Computer linguistics and intellectual technologies: On the materials of the annual international conference “Dialogue” (Moscow, May 25–29, 2011)]. Moscow, 2011, iss. 10 (17), pp. 359–367.

Kryuchkova O. Yu., Gol’din V. E. Saratovskiy dialektnyy korpus: novyy nauchnyy i obrazovatel’nyy resurs. Kontseptsiya, metodicheskie materialy [Saratov dialect corpus: a new scientific and educational resource. Concept, teaching materials]. Saratov, 2010, 39 p.

Plungyan V. A. Korpus kak instrument i kak ideologiya: o nekotorykh urokakh sovremennoy korpusnoy lingvistiki [Corpus as a tool and as an ideology: on some lessons of modern corpus linguistics]. Russian Language and Linguistic Theory. 2008, no. 16 (2), pp. 7–20.

Sherstinova T. Yu. Pragmaticheskoe annotirovanie kommunikativnykh edinits v korpuse ORD: mikroepizody i rechevye akty [Approaches to pragmatic annotation in the ORD corpus: micro episodes and speech acts]. In: Korpusnaya lingvistika – 2015. Tr. Mezhdunar. konf. [Corpus linguistics – 2015. Proceedings of the intern. conf.] V. P. Zakharov, O. A. Mitrofanova, M. V. Khokhlova (Eds). St. Petersburg, 2015, pp. 451–459.

Sherstinova T. Yu. Struktura povsednevnogo dialoga kak posledovatel’nost’ rechevykh aktov [The structure of everyday dialogue as the sequence of speech acts]. In: Komp’yuternaya lingvistika i intellektual’nye tekhnologii: Po materialam ezhegodnoy mezhdunarodnoy konferentsii “Dialog” (Moskva, 30 maya – 2 iyunya 2018 g.) [Computer linguistics and intellectual technologies: On the materials of the annual international conference “Dialogue” (Moscow, May 30 – June 2, 2018)]. Moscow, 2018, iss. 17 (24), pp. 637–651.

Shilikhina K. M. Ispol’zovanie korpusov v issledovaniyakh diskursa [Using corpora in discourse research]. Proceedings of Voronezh State University. Series: Linguistics and intercultural communication. 2014, no. 3, pp. 21–26.

Shmeleva T. V. Model’ rechevogo zhanra [The model of speech genre]. In: Zhanry rechi: Sb. nauch. st. [Genres of speech: Coll. of sci. papers]. Saratov, 1997, pp. 91–96.

Shmurak R. I. K utochneniyu ponyatiya upreka s pomoshch’yu korpusnykh instrumentov [Clarifying the concept of reproach using corpus tools]. The Bulletin of the Russian Academy of Sciences: Studies in Literature and Language. 2020, vol. 79, no. 3, pp. 24–48.

Voloshina S. V. Rechevoy zhanr avtobiograficheskogo rasskaza v dialektnoy kommunikatsii [Speech genre of autobiographical story in dialect communication]. In: Portrety rechevykh zhanrov: raznye diskursivnye praktiki [Portraits of speech genres: different discursive practices]. T. A. Demeshkina (Ed.). Tomsk, TSU Publ., 2016, pp. 37–96.

Wierzbicka A. Rechevye zhanry [Speech genres]. Speech Genres. 1997, no. 1, pp. 99–112.

Yurina E. A. Tomskiy dialektnyy korpus: v nachale puti [Tomsk dialectal corpus: the starting point]. Tomsk State University Journal of Philology. 2011, no. 2 (14), pp. 58–63.
