A STUDY OF LINGUISTIC CORPORA: THE MULTIMEDIA CORPUS
DOI:
https://doi.org/10.66345/stj.v4i2.5020
Keywords:
multimedia corpus, multimodal annotation, spoken language corpus, ELAN, ANVIL, Praat, Turkic language corpora, linguistic annotation, prosodic annotation
Abstract
This article analyzes the theoretical foundations of corpus linguistics, international experience in the field, and the multimedia corpora created for Turkic languages. The study examines the stages of building a multimedia corpus, the principles of multimodal annotation, and the role of such corpora in linguistic research. It analyzes the methodology for annotating audio and video materials with specialized software tools such as ELAN, ANVIL, and Praat. Multimedia corpora created for Turkic languages, including Turkish, Tatar, Kazakh, Kyrgyz, and Uzbek, are compared. As a result of the study, an optimal methodology for building multimedia corpora is developed, and recommendations are given for creating a multimedia corpus of the Uzbek language.