Corpus in uzbek linguistics

Download 14,63 Kb.

Sana	28.04.2022
Hajmi	14,63 Kb.
	#586456

Bog'liq
CORPUS IN UZBEK LINGUISTICS

CORPUS IN UZBEK LINGUISTICS
Nazarova Nodira Gulomjonovna
Fergana polytechnic institute
nodira.nazarova@ferpi.uz

Abstract: The article researched the corpus in Uzbek linguistics, its idiosyncrasies, sociological, lexical, and educational relevance, types of corpus.
Keywords: computer technology, national, data, linguistics, corpora, Uzbek, language.
Corpus linguistics is a subfield of computer linguistics that focuses on the establishment of basic rules for the generation and use of linguistic corpora (text corpora) with the aid of computer technology. A large-scale machine-readable, combined, tagged, and philologically sound bundle of language data designed to answer specific language problems is referred to as a linguistic corpus or text corpus.
Corpus linguistics was coined in 1977, making it a very popular scientific topic. However, in such a short time, corpus linguistics has emerged as one of the most important directions in modern linguistics as one of the computational linguistics specialties. The term was first heard in Russia in 1996, thanks to the talks of Sidney Greenbaum, one of the International Corps of English's founders.
Today, there are numerous meanings of corpus. A corpus, for example, is usually a series of machine-readable texts that contain information about the speaker, the author, the addressee, or the audience, according to E. Finegan's textbook.
The study of not only speech patterns, but also individual temporal or spatial settings, as well as the systematization of speech practice, are all part of modern language activity conditions. Speech practice is defined as a specific unit in a systematic approach to learning and generalization. Traditionally, empirical data could only be analyzed without the impact of constant, subjective elements. The goal of linguistics on a computer nowadays is to create, store, and apply a tertiary medium and communication mediation time that cannot be read in a "document."
Of fact, because of today's objective criteria, the melting of speech practice is considered realistic. Simultaneously, the new universal-statistical and special disciplines, using the new "lingvo-information" format, provide up new possibilities for language format selection. Corpus linguistics, which first appeared in the 1960s, established a new methodological foundation for speech practice and research agendas. As a metalinguistic instrument, corpus linguistics enables the creation of language methodology, its application, and linguistic interpretation of speech patterns - the solution of "old" and "new" language problems using computer technologies [2].
Fluent linguistics is defined by methodological diversity and efficiency in the modern linguistics paradigm. Corpus linguistics is now widely regarded in linguistics as a novel technique to learning to use language in everyday situations with the use of a computer and an electronic corpus. At least two traits distinguish corpus linguistics as a distinct disciplinary field: 1) the type of the oral content used; 2) the tools' special characteristics. While the syntactic, semantic, and sociolinguistic portions of linguistics are concerned with both describing and evaluating linguistic usage of language structure, corpus linguistics is a larger notion that may be used to a wide range of linguistic studies. Corpus linguistics is frequently referred to as "a set of approaches used in diverse domains of linguistic research"[6].
Sh.Khamroeva researched the corpus in Uzbek linguistics, its idiosyncrasies, sociological, lexical, and educational relevance, types of corpus, linguistic worth of the author's corpus, and linguistic bases of authorship corpus development. Corpus linguistics has become a rapidly expanding branch of world linguistics in the information age, according to the professor, and the corpus itself has become an excellent instrument for linguists:
1) samples of oral, written monuments, national and cultural heritage, created in the corporatized language, appeared in electronic form on the Internet;
2) since the corpus is a collection of texts of the natural (real) language in electronic form, placed in the search program, the perfectly marked corpus serves as a stable linguistic base in ensuring the effectiveness of linguistic research;
3) The case is the basis for the creation of an electronic library, dictionary, grammar. Because it is comprehensive, it ensures the uniqueness of the information and provides a complete representation of all aspects of language phenomena;
4) Various information is provided in the body of language in its natural context, which serves as the foundation for a full, in-depth, and objective investigation;
5) The corpus is the most contemporary, broad-ranging programmed system in the study of language (word change, historism, neologism, expansion of meaning, narrowing; observation of the creation of new phraseology), language learning, and dictionary [12].
The National Linguistic Corps is a fun-to-learn language of social opportunity that comes in a range of genres, styles, regional languages, and written materials. The National Linguistics Corps is a reading service. Because the fundamental purpose of the national corps is to demonstrate live language, texts of many styles and genres are employed as a corpus unit, although representativeness (text genre variety) reigns supreme.
The corpus serves as a mirror, allowing each language's essence to be reflected, and the construction and practical use of the Uzbek language's national and educational corpus is a pressing problem today.

References
1. Barkovich A.A. Corpus linguistics: the specifics of modern language meta-descriptions. // Bulletin of Tomsk State University. 2016. № 406. p. 5-13.
2. Khamroeva Sh. Linguistic bases of creation of the Uzbek language author's corps. Doctoral dissertation on philological sciences (PhD) - Bukhara, 2018. p- 250.
3. Mengliev B. and etc. National corpus of Uzbek language. /Marifat. - 26/04/2018.
4. McEnery T., Wilson A. Corpus Linguistics. – Edinburgh: Edinburgh University Press, 2001. – 154 p.
5. Sabirova E. Corpus linguistics is a novelty in the field of linguistics.// https://uza.uz/

Download 14,63 Kb.

Do'stlaringiz bilan baham: