Persian literature: Persian literature comprises oral compositions and written texts in the Persian language and is one of the world's oldest literatures. It spans over two-and-a-half millennia. Its sources have been within Greater Iran, including present-day Iran, Iraq, Afghanistan, the Caucasus, and Turkey, regions of Central Asia (such as Tajikistan), South Asia and the Balkans where the Persian language has historically been either the native or official language.
Word-sense disambiguation: Word-sense disambiguation (WSD) is the process of identifying which sense of a word is meant in a sentence or other segment of context. In human language processing and cognition, it is usually subconscious and automatic, but it can come to conscious attention when ambiguity impairs clarity of communication, given the pervasive polysemy in natural language. In computational linguistics, it is an open problem that affects other computational tasks involving language, such as discourse processing, improving the relevance of search engines, anaphora resolution, coherence, and inference.
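As a rough illustration of the word-sense disambiguation entry above, the sketch below picks the sense whose dictionary gloss shares the most content words with the surrounding context. This is a simplified gloss-overlap heuristic in the spirit of the Lesk algorithm, not a method described in the entry itself. The sense labels, glosses, stopword list, and the function names `tokens` and `disambiguate` are all hypothetical and chosen only for the example.

```python
import re

# Hypothetical two-sense inventory for the word "bank".
# The glosses are illustrative and not taken from a real lexicon.
SENSES = {
    "bank/finance": "an institution that accepts deposits of money and makes loans",
    "bank/river": "sloping land beside a river or other body of water",
}

# Small, assumed stopword list so that function words do not count as overlap.
STOPWORDS = {"a", "an", "the", "of", "and", "or", "that", "to", "in", "on", "at", "by", "is", "was", "i", "my"}


def tokens(text):
    """Return the set of lowercase content words in a piece of text."""
    return {w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS}


def disambiguate(context, senses=SENSES):
    """Pick the sense whose gloss shares the most content words with the context."""
    ctx = tokens(context)
    scores = {name: len(ctx & tokens(gloss)) for name, gloss in senses.items()}
    # On a tie, max() returns the first sense in dictionary order.
    return max(scores, key=scores.get)


print(disambiguate("She sat on the bank of the river and watched the water"))
print(disambiguate("I went to the bank to deposit money into my account"))
```

A heuristic this small fails whenever the context and the gloss happen to share no words, which is one reason the task remains an open problem.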
Natural-language understanding: Natural-language understanding (NLU) or natural-language interpretation (NLI) is a subtopic of natural-language processing in artificial intelligence that deals with machine reading comprehension. Natural-language understanding is considered an AI-hard problem. There is considerable commercial interest in the field because of its application to automated reasoning, machine translation, question answering, news-gathering, text categorization, voice-activation, archiving, and large-scale content analysis.
Natural language generation: Natural language generation (NLG) is a software process that produces natural language output. A widely cited survey of NLG methods describes NLG as "the subfield of artificial intelligence and computational linguistics that is concerned with the construction of computer systems that can produce understandable texts in English or other human languages from some underlying non-linguistic representation of information". While it is widely agreed that the output of any NLG process is text, there is some disagreement about whether the inputs of an NLG system need to be non-linguistic.
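To make the idea of producing text from a non-linguistic representation concrete, here is a minimal template-based sketch. It assumes a hypothetical weather record with the keys `city`, `high_c`, and `prev_high_c`; the record layout and the function name `realize` are invented for this example, and practical NLG systems are usually far more elaborate.

```python
def realize(record):
    """Render a hypothetical weather record (non-linguistic input) as an English sentence."""
    delta = record["high_c"] - record["prev_high_c"]
    # Choose singular or plural for the unit of the temperature difference.
    unit = "degree" if abs(delta) == 1 else "degrees"
    if delta > 0:
        trend = f"{delta} {unit} warmer than"
    elif delta < 0:
        trend = f"{-delta} {unit} cooler than"
    else:
        trend = "the same as"
    return (
        f"{record['city']} will reach a high of {record['high_c']} "
        f"degrees Celsius, {trend} yesterday."
    )


print(realize({"city": "Tehran", "high_c": 31, "prev_high_c": 28}))
```

The input here is a plain data record rather than text, which matches the stricter reading of the definition quoted above.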
Persian language: Persian (ˈpɜrʒən or ˈpɜrʃən), also known by its endonym Farsi (فارسی, fɒːɾˈsiː), is a Western Iranian language belonging to the Iranian branch of the Indo-Iranian subdivision of the Indo-European languages. Persian is a pluricentric language predominantly spoken and used officially within Iran, Afghanistan, and Tajikistan in three mutually intelligible standard varieties, namely Iranian Persian (officially known as Persian), Dari Persian (officially known as Dari since 1964) and Tajiki Persian (officially known as Tajik since 1999).
Natural language processing: Natural language processing (NLP) is an interdisciplinary subfield of linguistics and computer science. It is primarily concerned with processing natural language datasets, such as text corpora or speech corpora, using either rule-based or probabilistic (i.e. statistical and, most recently, neural network-based) machine learning approaches. The goal is a computer capable of "understanding" the contents of documents, including the contextual nuances of the language within them.
Language death: In linguistics, language death occurs when a language loses its last native speaker. By extension, language extinction is when the language is no longer known, including by second-language speakers, at which point it becomes known as an extinct language. Related terms are linguicide, the death of a language from natural or political causes, and, rarely, glottophagy, the absorption or replacement of a minor language by a major language.
Language: Language is a structured system of communication that consists of grammar and vocabulary. It is the primary means by which humans convey meaning, both in spoken and written forms, and may also be conveyed through sign languages. The vast majority of human languages have developed writing systems that allow for the recording and preservation of the sounds or signs of language. Human language is characterized by its cultural and historical diversity, with significant variations observed between cultures and across time.
Tajik language: Tajik, also called Tajiki Persian or Tajiki, is the variety of Persian spoken in Tajikistan and Uzbekistan by Tajiks. It is closely related to neighbouring Dari of Afghanistan, with which it forms a continuum of mutually intelligible varieties of the Persian language. Several scholars consider Tajik a dialectal variety of Persian rather than a language in its own right.
Extinct language: An extinct language is a language that no longer has any speakers, especially if the language has no living descendants. In contrast, a dead language is one that is no longer the native language of any community, even if it is still in use, like Latin. A dormant language is a dead language that still serves as a symbol of ethnic identity to a particular group. These languages are often undergoing a process of revitalisation. Languages that currently have living native speakers are sometimes called modern languages to contrast them with dead languages, especially in educational contexts.
Middle Persian: Middle Persian or Pahlavi, also known by its endonym Pārsīk or Pārsīg (𐭯𐭠𐭫𐭮𐭩𐭪) in its later form, is a Western Middle Iranian language which became the literary language of the Sasanian Empire. For some time after the Sasanian collapse, Middle Persian continued to function as a prestige language. It descended from Old Persian, the language of the Achaemenid Empire, and is the linguistic ancestor of Modern Persian, an official language of Iran, Afghanistan (Dari) and Tajikistan (Tajik).
Second language: A second language (L2) is a language spoken in addition to one's first language (L1). A second language may be a neighbouring language, another language of the speaker's home country, or a foreign language. A speaker's dominant language, which is the language a speaker uses most or is most comfortable with, is not necessarily the speaker's first language. For example, the Canadian census defines first language for its purposes as "the first language learned in childhood and still spoken", recognizing that for some, the earliest language may be lost, a process known as language attrition.
Languages of India: Languages spoken in the Republic of India belong to several language families, the major ones being the Indo-Aryan languages spoken by 78.05% of Indians and the Dravidian languages spoken by 19.64% of Indians; both families together are sometimes known as Indic languages. Languages spoken by the remaining 2.31% of the population belong to the Austroasiatic, Sino–Tibetan, Tai–Kadai, and a few other minor language families and isolates. According to the People's Linguistic Survey of India, India has the second-highest number of languages (780), after Papua New Guinea (840).
Language revitalization: Language revitalization, also referred to as language revival or reversing language shift, is an attempt to halt or reverse the decline of a language or to revive an extinct one. Those involved can include linguists, cultural or community groups, or governments. Some argue for a distinction between language revival (the resurrection of an extinct language with no existing native speakers) and language revitalization (the rescue of a "dying" language).
Persians: The Persians (ˈpɜrʒənz or ˈpɜrʃənz) are an Iranian ethnic group who comprise over half of the population of Iran. They share a common cultural system and are native speakers of the Persian language as well as of the languages that are closely related to Persian. The ancient Persians were originally an ancient Iranian people who had migrated to the region of Persis (corresponding to the modern-day Iranian province of Fars) by the 9th century BCE.
Constructed language: A constructed language (shortened to a conlang) is a language whose phonology, grammar, and vocabulary, instead of having developed naturally, are consciously devised for some purpose, which may include being devised for a work of fiction. A constructed language may also be referred to as an artificial, planned or invented language, or (in some cases) a fictional language. Planned languages (or engineered languages/engelangs) are languages that have been purposefully designed; they are the result of deliberate, controlling intervention and are thus a form of language planning.
Speaker types: Within the linguistic study of endangered languages, sociolinguists distinguish between different speaker types based on the type of competence they have acquired in the endangered language. Often when a community is gradually shifting away from an endangered language to a majority language, not all speakers acquire full linguistic competence; instead, speakers have varying degrees and types of competence depending on their exposure to the minority language in their upbringing.
Sign language: Sign languages (also known as signed languages) are languages that use the visual-manual modality to convey meaning, instead of spoken words. Sign languages are expressed through manual articulation in combination with non-manual markers. Sign languages are full-fledged natural languages with their own grammar and lexicon. Sign languages are not universal and are usually not mutually intelligible, although there are also similarities among different sign languages.
Indigenous language: An indigenous language, or autochthonous language, is a language that is native to a region and spoken by indigenous peoples. This language is from a linguistically distinct community that originated in the area. Indigenous languages are not necessarily national languages but they can be; for example, Aymara is an official language of Bolivia. Also, national languages are not necessarily indigenous to the country.
First language: A first language (L1), native language, native tongue, or mother tongue is the first language or dialect that a person has been exposed to from birth or within the critical period. In some countries, the term native language or mother tongue refers to the language or dialect of one's ethnic group rather than the individual's actual first language. The first language of a child is part of that child's personal, social and cultural identity.