Dictionary of noun words of the Russian language. Frequency Dictionary of the Russian Language

20.09.2019

noun e is an independent significant part of speech that combines words that

1) have a generalized meaning of objectivity and answer the questions who? or what?;

2) are proper or common nouns, animate or inanimate, have a permanent gender and non-permanent (for most nouns) signs of number and case;

3) in the proposal most often act as subjects or additions, but can be any other members of the proposal.

Noun- this is a part of speech, in the selection of which the grammatical features of words come to the fore. As for the meaning of nouns, this is the only part of speech that can mean anything: an object (table), a person (boy), an animal (cow), a sign (depth), an abstract concept (conscience), an action (singing) , relation (equality). In terms of meaning, these words are united by the fact that you can ask them the question who? or what?; this, in fact, is their objectivity.

Common nouns designate objects without distinguishing them from the class of the same type (city, river, girl, newspaper).

Proper nouns designate objects, distinguishing them from the class of homogeneous objects, individualizing them (Moscow, Volga, Masha, Izvestia). Proper names must be distinguished from proper names - ambiguous names of individualized objects ("Evening Moscow"). Proper names do not necessarily include a proper name (Moscow State University).

Animate and inanimate nouns

Nouns have a permanent morphological sign of animation.

The sign of animateness of nouns is closely connected with the concept of living / inanimate. Nevertheless, animation is not a rank in meaning, but a proper morphological feature.

Animation as a morphological feature also has formal means of expression. First, animateness / inanimateness is expressed by the endings of the noun itself:

1) animate nouns have the same endings. numbers V. p. and R. p., and for nouns husband. genus, this also applies to units. number;

2) inanimate nouns have the same endings. numbers V. p. and I. p., and for nouns husband. genus, this also applies to units. number.

The animacy of most nouns reflects a certain state of affairs in extralinguistic reality: animate nouns are mainly called living beings, and inanimate - inanimate objects, however, there are cases of violation of this pattern:


fluctuation by animation

An object cannot be both alive and non-living at the same time:
alive but inanimate

1) aggregates of living beings:

(see)armies, crowds, peoples ;

2) plants, mushrooms:

(gather)chanterelles ;

inanimate but animated

1) human toys:

(see)dolls, nesting dolls, tumblers ;

2) figures of some games:

(play out)kings, queens ;

3) deceased:

(see)dead, drowned , Butdead body (inanimate);

4) fictional creatures:

(see)mermaids, goblin, brownies.

Nouns have a constant morphological gender and refer to male, female or neuter.

The masculine, feminine and neuter gender includes words with the following compatibility:

Some nouns with the ending -a, denoting signs, properties of persons, in I. p. have a double characterization by gender, depending on the gender of the designated person:

your ignoramus has come

your-I'm ignorant came-a.

Such nouns belong to the common gender.

Nouns only plural (cream, scissors) do not belong to any of the genders, since in the plural formal differences between nouns of different genders are not expressed (cf .: desks - tables).

Nouns change by numbers and cases. Most nouns have singular and plural forms ( city ​​- cities, village - villages). However, some nouns have either only the singular form (for example, peasantry, asphalt, burning), or only the plural form (for example, scissors, railings, weekdays, Luzhniki).

Case as a morphological feature of nouns

Nouns change in cases, that is, they have a non-permanent morphological sign of number.

There are 6 cases in Russian: nominative (I. p.), genitive (R. p.), dative (D. p.), accusative (V. p.), instrumental (T. p.), prepositional (P. p.). P.). These case forms are diagnosed in the following contexts:

I. p.who is this? What?

R. p. no one? what?

D. p.glad to whom? what?

V. p. see who? What?

T. p.proud of who? how?

P. p. thinking about whom? how?

The endings of different cases are different depending on which declension the noun belongs to.

Noun declension

Changing nouns in cases is called declension.

TO I declension include nouns husband. and wives. genus with the ending I. p. units. numbers -а (-я), including words ending in -iya: mom-a, dad-a, earth-i, lecture-i (lectij-a). Words with a stem ending in a hard consonant (hard variant), a soft consonant (soft variant) and with a stem in - and j have some differences in endings, for example:

caseSingular
hard option
soft option
On - and I
Im.p. countries - A Earth -I Army -I
R.p. countries - s
Earth -And Army -And
D.p. countries - e Earth -e
Army -And
V.p. countries - at Earth -Yu Army -Yu
etc. countries -Ouch (-oy )
Earth -to her (-yoyu ) Army -to her (-her )
P.p. countries -e Earth -e Army -And

Co. II declension include nouns husband. gender with a zero ending I. p., including words in -y, and nouns m. and cf. kind with the ending -o (-e), including words in -e: table-, genius-, small town-o, window-o, floor-e, peni-e (penij-e).

TO III declension include nouns of women. genus with zero ending in I. p .: dust-, night-.

In addition to nouns that have endings in only one of these declensions, there are words that have some endings from one declension, and some from another. They are called dissimilar. These are 10 words for -mya (burden, time, stirrup, tribe, seed, name, flame, banner, udder, crown) and path.

In Russian there are so-called indeclinable nouns. These include many common nouns and own borrowings (coat, Tokyo), Russian surnames in -y, -ih, -vo (Petrovykh, Dolgikh, Durnovo). They are usually described as words without endings.

Morphological analysis of a noun

The noun is parsed according to the following plan:

I. Part of speech. General value. Initial form (nominative singular).

II. Morphological features:

1. Permanent signs: a) proper or common noun, b) animate or inanimate, c) gender (male, female, neuter, general), d) declension.
2. Variable signs: a) case, b) number.

III. syntactic role.

Sample morphological parsing of a noun

Two ladies ran up to Luzhin and helped him up; he began to knock the dust off his coat with his palm (according to V. Nabokov).

I. ladies- noun;

initial form - lady.

II. Constant signs: narits., odush., wives. genus, I class;

non-permanent signs: pl. number, I. p.

III. ran up(Who?) ladies (subject part).

I.(To) Luzhin- noun;

initial form - Luzhin;

II. Permanent signs: own, soul, husband. genus, I class;

non-permanent features: units. number, D. p.;

III.
ran up(to whom?) .underline ( border-bottom: 1px dashed blue; ) to Luzhin(addition).

I. palm- noun;

initial form - palm;

II.
Constant signs: narits., inanimate., wives. genus, I class;

non-permanent features: units. number, etc.;

III.
Began to shoot down(how?) palm(addition).

I. Dust- noun;

initial form - dust;

II.
Constant signs: narits., inanimate., wives. genus, III class;

non-permanent features: units. number, V. p.;

III. Began to shoot down(What?) dust(addition).

I. Coat- noun;

initial form - coat;

II.
Constant signs: nav., inanimate, cf. genus, uninclined;

non-permanent signs: the number is not determined by the context, R. p .;

III. Began to shoot down(why?) with a coat(addition).

The second version of the frequency list

On this page you can get lists of the most frequent words in the Russian language. To date, Frequency Dictionary of the Russian Language, ed. LN Zasorina (1977) was most often used as a source of information about the frequency of Russian words. However, the corpus, on the basis of which the frequency of words in this dictionary was calculated, is very small by modern standards (about a million words). In addition, the list is significantly outdated: it corresponds to the frequency of use of words in the period from the 20s to the 60s. As a result, the corpus includes a large number of ideological sources, for example, the works of Lenin and Kalinin, Materials of the 22nd and 23rd Congresses of the CPSU, and Soviet newspapers. Words Soviet And comrade are included in the first hundred Russian words, along with function words (they are more common than words where, here, your), words party, revolution, communist occur more often than back, around, better etc. Finally, the list of words from Zasorina's dictionary does not exist in electronic form.

The word list available from this page contains approximately 35,000 words with a frequency greater than 1 ipm (occurrences per million words, instances per million words). There is also a shorter list of the 5000 most frequent Russian words. The lists use the utf8 Cyrillic encoding and are packed with the WinZip utility (Linux or Mac users can use StuffIt to unpack).

The structure of the lists follows the format of the lemmatized lists from the British National Corpus (BNC) created by Adam Kilgarif, namely:
sequence number, frequency (ipm), lemma, part of speech (BNC classification).

Words with a frequency greater than 1 ipm

  • - word forms sorted by frequency

List of 5000 most frequent words

  • - lemmas sorted alphabetically
  • - lemmas sorted by frequency

Some statistics on the use of Russian words

  • The average word length is 5.28 characters.
  • The average sentence length is 10.38 words.
  • The 1000 most frequent lemmas cover 64.0708% of the text.
  • The 2000 most frequent lemmas cover 71.9521% of the text.
  • The 3000 most frequent lemmas cover 76.5104% of the text.
  • The 5000 most frequent lemmas cover 82.0604% of the text.

More complete information on the correspondence between word frequency and corpus coverage can be found at .

The list is built on the basis of the representative corpus of the modern Russian language. It includes a selection of contemporary prose, political memoirs, contemporary newspapers and non-fiction (about 40 million words, prose is about a little over half the volume). All texts of the corpus were written in Russian between 1970 and 2002; majority between 1980 and 1995, newspaper corpus 1997-1999 (corpus based on texts from the Moshkov Library and A.V. Baranov's contemporary journalism corpus).

It is well known that large texts pose a problem for compiling frequency lists, since a relatively long text may contain a large number of occurrences of some rare word, which will significantly increase its frequency in the final list. For example, the corpus used to compile this list contains a variation on Tolkien's The Lord of the Rings (by Nick Perumov). Although this novel is 250,000 words long, less than one percent of the entire corpus, the word's frequency of use hobbit in this novel puts it in the first thousand Russian words, if the frequency is counted for all texts without restrictions on their length. For this reason, the frequency lists were compiled under the assumption that the sample from large texts is limited to 10 thousand words, and the sample from texts of one author is less than 100 thousand words. As a result, the subset of the full corpus used in the frequency calculation is about 16 million words.

The distribution of words in texts is far from uniform. Some words (for example, prepositions) appear in many texts with quite predictable frequency. The frequency of others (for example, pronouns or mental verbs) depends significantly on the author or genre of the text, while many words are "infectious": if this word (for example, a proper name, a person's designation by rank or position, or a technical term) is encountered in the text once, it is very likely that it will be repeated there many more times, thus significantly increasing its frequency in the document. There are different ways to measure this variation (Church, K. and Gale, W. (1995) Poisson Mixtures, Journal of Natural Language Engineering, 1:2). The simplest way to evaluate the behavior of a word is to calculate the coefficient of variation, which is calculated as the standard deviation divided by the mean. The standard deviation gives the absolute value of the variation of the data set (it increases for words with a higher mean frequency), while the coefficient of variation compares the distribution of words with unequal mean frequency. The deviation values ​​for the 5000 most frequent words can be viewed. File structure:
lemma, average frequency (ipm), number of texts in which this word occurs, standard deviation of frequency across all texts, coefficient of variation, variance.

The corpus, tools for working with it, as well as the parallel English-Russian corpus (sentence-based alignment) are described, in particular, in the following publication by the author:

Sharoff, Serge, (2002). Meaning as use: exploitation of aligned corpora for the contrastive study of lexical semantics. Proc. of Language Resources and Evaluation Conference (LREC02). May, 2002, Las Palmas, Spain.

There are also separate frequency lists for the following classes of words:

The creation of the corpus, the development of the corresponding software tools and frequency lists were supported by a grant provided to the author by the Humboldt Foundation, Germany. Lemmatization for the analysis of word forms in the corpus was carried out using the Dialing morphological analyzer. Since many word forms are ambiguous (for example, dear, were, became, for, three, already), the frequency of some words is not quite reliable, for example, For treated as a verb only if it is not followed by a noun, adjective or pronoun, become always treated as a noun spouses always chosen spouse possible spouse And spouses(plural). The criteria for choosing a word form were:

  1. the frequency of the corresponding lemma ( took, I will, submit as a noun is extremely unlikely, so in these cases the verb is chosen);
  2. comparative frequency of a particular form (both lemmas for become quite frequent, but the noun, unlike the verb, is very often used in this form; form it's time one has to count in predicative usage, while the noun appears in all its other forms).
Similar to Zasorina's surname dictionary, first names and patronymics were filtered out of the lemmatized frequency lists, but place names were left in, since it is difficult to justify why they were left in Zasorina's dictionary. Moscow or American, but not Moscow And America. The frequency list of word forms was not filtered.

The meaning of the word NOUN in the Dictionary of the Russian Language Ozhegov

NOUN

noun == noun Declension of nouns. noun In grammar: a part of speech denoting an object and expressing the meaning of objectivity in the forms of gender, number and case. Concrete and abstract nouns. Proper nouns and common nouns. Animate and inanimate nouns.

Ozhegov. Dictionary of the Russian language Ozhegov. 2012

See also interpretations, synonyms, word meanings and what is NOUNS in Russian in dictionaries, encyclopedias and reference books:

  • NOUN in the Big Encyclopedic Dictionary:
  • NOUN
    part of speech, a class of meaningful words (lexemes), which includes the names of objects and animated beings and can appear in a sentence ...
  • NOUN in the Modern Encyclopedic Dictionary:
  • NOUN in the Encyclopedic Dictionary:
    part of speech denoting objects (things, substances, people, animals), properties abstracted from their carrier ("goodness"), actions and states in abstraction from ...
  • NOUN in the Encyclopedic Dictionary:
    , -oto, cf. or a noun - in grammar: a part of speech denoting an object and expressing the meaning of objectivity in gender forms, ...
  • NOUN in the Big Russian Encyclopedic Dictionary:
    NOUN, a class of full-valued words (part of speech), which includes the name. objects and animate beings and can act in a sentence ...
  • NOUN
    - a class of fully-meaning words (part of speech), which includes the names of objects and animate beings and can act in a sentence for ...
  • NOUN
    see noun…
  • NOUN in the dictionary of Synonyms of the Russian language.
  • NOUN in the New explanatory and derivational dictionary of the Russian language Efremova:
  • NOUN in the Complete Spelling Dictionary of the Russian Language:
    noun, …
  • NOUN in the Spelling Dictionary:
    noun, ...
  • NOUN in the Modern Explanatory Dictionary, TSB:
    a class of full-value words (part of speech), which includes the names of objects and animate beings and can appear in a sentence in ...
  • NOUN in the Explanatory Dictionary of Efremova:
    noun cf. Part of speech denoting an object and usually changing in cases and numbers; noun (in...
  • NOUN in the New Dictionary of the Russian Language Efremova:
    cf. Part of speech denoting an object and usually changing in cases and numbers; noun (in...
  • NOUN in the Big Modern Explanatory Dictionary of the Russian Language:
    cf. Part of speech denoting an object and usually changing in cases and numbers; noun (in linguistics) ...
  • ENGLISH LANGUAGE in the Literary Encyclopedia:
    lang. mixed. In its origin, it is associated with the western branch of the Germanic group of languages. (cm.). It is customary to share the history of A. Yaz. on the …
  • PARTS OF SPEECH in the Great Soviet Encyclopedia, TSB:
    speech, the main classes of words of the language, distinguished on the basis of the similarity of their syntactic (see Syntax), morphological (see Morphology) and logical-semantic (see ...
  • PARTS OF SPEECH in the Linguistic Encyclopedic Dictionary:
    - classes of words of a language, distinguished on the basis of the generality of their syntactic (see Syntax), morphological (see Morphology) and semantic (see Semantics) properties. …
  • WORD ORDER in the Linguistic Encyclopedic Dictionary:
    - a certain arrangement of words in a sentence or syntax. group. Structural types P. s. differ in the following. oppositions: progressive, or consistent ...
  • PERIPHRASE in the Linguistic Encyclopedic Dictionary:
    (periphrase) (from Greek, periphrasis - a descriptive expression, allegory) - a stylistic device that consists in an indirect, descriptive, designation of objects and phenomena of reality ...
  • NAMED CLASSES in the Linguistic Encyclopedic Dictionary:
    - a lexical-grammatical category of a noun, consisting in the distribution of names into groups (classes) in accordance with the century-ry semantic features with a mandatory formal ...
  • BALTIC LANGUAGES in the Linguistic Encyclopedic Dictionary:
    a group of Indo-European languages. B. i. more fully preserve the ancient Indo-European. language system than other modern. Indo-European groups families of languages. There is a point...
  • ADAMAUA-ORIGINAL LANGUAGES in the Linguistic Encyclopedic Dictionary.
  • PARTS OF SPEECH in the Dictionary of Linguistic Terms:
    The main lexico-grammatical categories, according to which the words of the language are distributed on the basis of signs: a) semantic (generalized meaning of an object, action or state, quality ...
  • AGREEMENT IN MEANING in the Dictionary of Linguistic Terms:
    The choice of the form of the number or gender of the predicate is based not on grammatical similarity to the form of expression of the subject, but on the semantic relationship between both ...
  • CASE in the Dictionary of Linguistic Terms:
    1 (case category). The grammatical category of a noun, expressing the relation of the object designated by it to other objects, actions, signs. Dying off in the Romanesque…
  • CHARACTER in the Popular Explanatory-Encyclopedic Dictionary of the Russian Language:
    -a, m. The protagonist in an artistic, dramatic work, in genre painting. Comic character. negative character. Chekhov characters. Characters of Russian folk ...


Similar articles