Vietnamese (tiếng Việt) is an Austroasiatic language spoken primarily in Vietnam where it is the national and official language. Vietnamese is spoken natively by around 85 million people,[1] several times as many as the rest of the Austroasiatic family combined.[5] It is the native language of the Vietnamese (Kinh) people, as well as a second or first language for other ethnic groups in Vietnam, and still used by Vietnamese diaspora in the world.
Like many languages in Southeast Asia and East Asia, Vietnamese is highly analytic and is tonal. It has head-initial directionality, with subject–verb–object order and modifiers following the words they modify. It also uses noun classifiers. Its vocabulary has had significant influence from Middle Chinese and loanwords from French.[6] Although it is often mistakenly thought as being an monosyllabic language, Vietnamese words typically consist of from one to many as eight individual morphemes or syllables; the majority of Vietnamese vocabulary are disyllabic and trisyllabic words.[7]
Vietnamese is written using the Vietnamese alphabet (chữ Quốc ngữ). The alphabet is based on the Latin script and was officially adopted in the early 20th century during French rule of Vietnam. It uses digraphs and diacritics to mark tones and some phonemes. Vietnamese was historically written using chữ Nôm, a logographic script using Chinese characters (chữ Hán) to represent Sino-Vietnamese vocabulary and some native Vietnamese words that correspond to the meaning of the Chinese character, together with many locally invented characters to represent other words.[8][9]
Early linguistic work in the late 19th and early 20th centuries (Logan 1852, Forbes 1881, Müller 1888, Kuhn 1889, Schmidt 1905, Przyluski 1924, and Benedict 1942)[10] classified Vietnamese as belonging to the Mon–Khmer branch of the Austroasiatic language family (which also includes the Khmer language spoken in Cambodia, as well as various smaller and/or regional languages, such as the Munda and Khasi languages spoken in eastern India, and others in Laos, southern China and parts of Thailand). In 1850, British lawyer James Richardson Logan detected striking similarities between the Korku language in Central India and Vietnamese. He suggested that Korku, Mon, and Vietnamese were part of what he termed "Mon–Annam languages" in a paper published in 1856. Later, in 1920, French-Polish linguist Jean Przyluski found that Mường is more closely related to Vietnamese than other Mon–Khmer languages, and a Viet–Muong subgrouping was established, also including Thavung, Chut, Cuoi, etc.[11] The term "Vietic" was proposed by Hayes (1992),[12] who proposed to redefine Viet–Muong as referring to a subbranch of Vietic containing only Vietnamese and Mường. The term "Vietic" is used, among others, by Gérard Diffloth, with a slightly different proposal on subclassification, within which the term "Viet–Muong" refers to a lower subgrouping (within an eastern Vietic branch) consisting of Vietnamese dialects, Mường dialects, and Nguồn (of Quảng Bình Province).[13]
Austroasiatic is believed to have dispersed around 2000 BC.[14]The arrival of the agricultural Phùng Nguyên culture in the Red River Delta at that time may correspond to the Vietic branch.[15]
This ancestral Vietic was typologically very different from later Vietnamese. It was polysyllabic, or rather sesquisyllabic, with roots consisting of a reduced syllable followed by a full syllable, and featured many consonant clusters. Both of these features are found elsewhere in Austroasiatic and in modern conservative Vietic languages south of the Red River area.[16]The language was non-tonal, but featured glottal stop and voiceless fricative codas.[17]
Borrowed vocabulary indicates early contact with speakers of Tai languages in the last millennium BC, which is consistent with genetic evidence from Dong Son culture sites.[15]Extensive contact with Chinese began from the Han dynasty (2nd century BC).[18]At this time, Vietic groups began to expand south from the Red River Delta and into the adjacent uplands, possibly to escape Chinese encroachment.[15]The oldest layer of loans from Chinese into northern Vietic (which would become the Viet–Muong subbranch) date from this period.[19]
The northern Vietic varieties thus became part of the Mainland Southeast Asia linguistic area, in which languages from genetically unrelated families converged toward characteristics such as isolating morphology and similar syllable structure.[20] Many languages in this area, including Viet–Muong, underwent a process of tonogenesis, in which distinctions formerly expressed by final consonants became phonemic tonal distinctions when those consonants disappeared. These characteristics have become part of many of the genetically unrelated languages of Southeast Asia; for example, Tsat (a member of the Malayo-Polynesian group within Austronesian), and Vietnamese each developed tones as a phonemic feature.
After the split from Muong around the end of the first millennium AD, the following stages of Vietnamese are commonly identified:[14]
After expelling the Chinese at the beginning of the 10th century, the Ngô dynasty adopted Classical Chinese as the formal medium of government, scholarship and literature. With the dominance of Chinese came wholesale importation of Chinese vocabulary. The resulting Sino-Vietnamese vocabulary makes up about a third of the Vietnamese lexicon in all realms, and may account for as much as 60% of the vocabulary used in formal texts.[26]
Vietic languages were confined to the northern third of modern Vietnam until the "southward advance" (Nam tiến) from the late 15th century.[27]The conquest of the ancient nation of Champa and the conquest of the Mekong Delta led to an expansion of the Vietnamese people and language, with distinctive local variations emerging.
After France invaded Vietnam in the late 19th century, French gradually replaced Literary Chinese as the official language in education and government. Vietnamese adopted many French terms, such as đầm ('dame', from madame), ga ('train station', from gare), sơ mi ('shirt', from chemise), and búp bê ('doll', from poupée), resulting in a language that was Austroasiatic but with major Sino-influences and some minor French influences from the French colonial era.
The following diagram shows the phonology of Proto–Viet–Muong (the nearest ancestor of Vietnamese and the closely related Mường language), along with the outcomes in the modern language:[28][29][30][31]
^1 According to Ferlus, */tʃ/ and */ʄ/ are not accepted by all researchers. Ferlus 1992[28] also had additional phonemes */dʒ/ and */ɕ/.
^2 The fricatives indicated above in parentheses developed as allophones of stop consonants occurring between vowels (i.e. when a minor syllable occurred). These fricatives were not present in Proto-Viet–Muong, as indicated by their absence in Mường, but were evidently present in the later Proto-Vietnamese stage. Subsequent loss of the minor-syllable prefixes phonemicized the fricatives. Ferlus 1992[28] proposes that originally there were both voiced and voiceless fricatives, corresponding to original voiced or voiceless stops, but Ferlus 2009[29] appears to have abandoned that hypothesis, suggesting that stops were softened and voiced at approximately the same time, according to the following pattern:
^3 In Middle Vietnamese, the outcome of these sounds was written with a hooked b (ꞗ), representing a /β/ that was still distinct from v (then pronounced /w/). See below.
^4 It is unclear what this sound was. According to Ferlus 1992,[28] in the Archaic Vietnamese period (c. 10th century AD, when Sino-Vietnamese vocabulary was borrowed) it was *r̝, distinct at that time from *r.
The following initial clusters occurred, with outcomes indicated:
A large number of words were borrowed from Middle Chinese, forming part of the Sino-Vietnamese vocabulary. These caused the original introduction of the retroflex sounds /ʂ/ and /ʈ/ (modern s, tr) into the language.
Proto-Viet–Muong did not have tones. Tones developed later in some of the daughter languages from distinctions in the initial and final consonants. Vietnamese tones developed as follows:[32]
Glottal-ending syllables ended with a glottal stop /ʔ/, while fricative-ending syllables ended with /s/ or /h/. Both types of syllables could co-occur with a resonant (e.g. /m/ or /n/).
At some point, a tone split occurred, as in many other mainland Southeast Asian languages. Essentially, an allophonic distinction developed in the tones, whereby the tones in syllables with voiced initials were pronounced differently from those with voiceless initials. (Approximately speaking, the voiced allotones were pronounced with additional breathy voice or creaky voice and with lowered pitch. The quality difference predominates in today's northern varieties, e.g. in Hanoi, while in the southern varieties the pitch difference predominates, as in Ho Chi Minh City.) Subsequent to this, the plain-voiced stops became voiceless and the allotones became new phonemic tones. The implosive stops were unaffected, and in fact developed tonally as if they were unvoiced. (This behavior is common to all East Asian languages with implosive stops.)
As noted above, Proto-Viet–Muong had sesquisyllabic words with an initial minor syllable (in addition to, and independent of, initial clusters in the main syllable). When a minor syllable occurred, the main syllable's initial consonant was intervocalic and as a result suffered lenition, becoming a voiced fricative. The minor syllables were eventually lost, but not until the tone split had occurred. As a result, words in modern Vietnamese with voiced fricatives occur in all six tones, and the tonal register reflects the voicing of the minor-syllable prefix and not the voicing of the main-syllable stop in Proto-Viet–Muong that produced the fricative. For similar reasons, words beginning with /l/ and /ŋ/ occur in both registers. (Thompson 1976[31] reconstructed voiceless resonants to account for outcomes where resonants occur with a first-register tone, but this is no longer considered necessary, at least by Ferlus.)
Old Vietnamese/Ancient Vietnamese was a Vietic language which was separated from Viet–Muong around the 9th century, and evolved into Middle Vietnamese by 16th century. The sources for the reconstruction of Old Vietnamese are Nom texts, such as the 12th-century/1486 Buddhist scripture Phật thuyết Đại báo phụ mẫu ân trọng kinh ("Sūtra explained by the Buddha on the Great Repayment of the Heavy Debt to Parents"),[34] old inscriptions, and a late 13th-century (possibly 1293) Annan Jishi glossary by Chinese diplomat Chen Fu (c. 1259 – 1309).[35] Old Vietnamese used Chinese characters phonetically where each word, monosyllabic in Modern Vietnamese, is written with two Chinese characters or in a composite character made of two different characters.[36] This conveys the transformation of the Vietnamese lexicon from sesquisyllabic to fully monosyllabic under the pressure of Chinese linguistic influence, characterized by linguistic phenomena such as the reduction of minor syllables; loss of affixal morphology drifting towards analytical grammar; simplification of major syllable segments, and the change of suprasegment instruments.[37]
For example, the modern Vietnamese word "trời" (heaven) was read as *plời in Old/Ancient Vietnamese and as blời in Middle Vietnamese.[38]
The writing system used for Vietnamese is based closely on the system developed by Alexandre de Rhodes for his 1651 Dictionarium Annamiticum Lusitanum et Latinum. It reflects the pronunciation of the Vietnamese of Hanoi at that time, a stage commonly termed Middle Vietnamese (tiếng Việt trung đại). The pronunciation of the "rime" of the syllable, i.e. all parts other than the initial consonant (optional /w/ glide, vowel nucleus, tone and final consonant), appears nearly identical between Middle Vietnamese and modern Hanoi pronunciation. On the other hand, the Middle Vietnamese pronunciation of the initial consonant differs greatly from all modern dialects, and in fact is significantly closer to the modern Saigon dialect than the modern Hanoi dialect.
The following diagram shows the orthography and pronunciation of Middle Vietnamese:
^1 [p] occurs only at the end of a syllable.
^2 This letter, ⟨ꞗ⟩, is no longer used.
^3 [j] does not occur at the beginning of a syllable, but can occur at the end of a syllable, where it is notated i or y (with the difference between the two often indicating differences in the quality or length of the preceding vowel), and after /ð/ and /β/, where it is notated ĕ. This ĕ, and the /j/ it notated, have disappeared from the modern language.
Note that b [ɓ] and p [p] never contrast in any position, suggesting that they are allophones.
The language also has three clusters at the beginning of syllables, which have since disappeared:
Most of the unusual correspondences between spelling and modern pronunciation are explained by Middle Vietnamese. Note in particular:
De Rhodes's orthography also made use of an apex diacritic, as in o᷄ and u᷄, to indicate a final labial-velar nasal /ŋ͡m/, an allophone of /ŋ/ that is peculiar to the Hanoi dialect to the present day. This diacritic is often mistaken for a tilde in modern reproductions of early Vietnamese writing.
As a result of emigration, Vietnamese speakers are also found in other parts of Southeast Asia, East Asia, North America, Europe, and Australia. Vietnamese has also been officially recognized as a minority language in the Czech Republic.[b]
As the national language, Vietnamese is the lingua franca in Vietnam. It is also spoken by the Jing people traditionally residing on three islands (now joined to the mainland) off Dongxing in southern Guangxi Province, China.[39] A large number of Vietnamese speakers also reside in neighboring countries of Cambodia and Laos.
In the United States, Vietnamese is the sixth most spoken language, with over 1.5 million speakers, who are concentrated in a handful of states. It is the third-most spoken language in Texas and Washington; fourth-most in Georgia, Louisiana, and Virginia; and fifth-most in Arkansas and California.[40] Vietnamese is the third most spoken language in Australia other than English, after Mandarin and Arabic.[41] In France, it is the most spoken Asian language and the eighth most spoken immigrant language at home.[42]
Vietnamese is the sole official and national language of Vietnam. It is the first language of the majority of the Vietnamese population, as well as a first or second language for the country's ethnic minority groups.[43]
In the Czech Republic, Vietnamese has been recognized as one of 14 minority languages, on the basis of communities that have resided in the country either traditionally or on a long-term basis. This status grants the Vietnamese community in the country a representative on the Government Council for Nationalities, an advisory body of the Czech Government for matters of policy towards national minorities and their members. It also grants the community the right to use Vietnamese with public authorities and in courts anywhere in the country.[44][45]
Vietnamese is taught in schools and institutions outside of Vietnam, a large part contributed by its diaspora. In countries with Vietnamese-speaking communities Vietnamese language education largely serves as a role to link descendants of Vietnamese immigrants to their ancestral culture. In neighboring countries and vicinities near Vietnam such as Southern China, Cambodia, Laos, and Thailand, Vietnamese as a foreign language is largely due to trade, as well as recovery and growth of the Vietnamese economy.[46][47]
Since the 1980s, Vietnamese language schools (trường Việt ngữ/ trường ngôn ngữ Tiếng Việt) have been established for youth in many Vietnamese-speaking communities around the world such as in the United States, Germany and France.[48][49][50][51][52]
Vietnamese has a large number of vowels. Below is a vowel diagram of Vietnamese from Hanoi (including centering diphthongs):
Front and central vowels (i, ê, e, ư, â, ơ, ă, a) are unrounded, whereas the back vowels (u, ô, o) are rounded. The vowels â [ə] and ă [a] are pronounced very short, much shorter than the other vowels. Thus, ơ and â are basically pronounced the same except that ơ [əː] is of normal length while â [ə] is short – the same applies to the vowels long a [aː] and short ă [a].[c]
The centering diphthongs are formed with only the three high vowels (i, ư, u). They are generally spelled as ia, ưa, ua when they end a word and are spelled iê, ươ, uô, respectively, when they are followed by a consonant.
In addition to single vowels (or monophthongs) and centering diphthongs, Vietnamese has closing diphthongs[d] and triphthongs. The closing diphthongs and triphthongs consist of a main vowel component followed by a shorter semivowel offglide /j/ or /w/.[e] There are restrictions on the high offglides: /j/ cannot occur after a front vowel (i, ê, e) nucleus and /w/ cannot occur after a back vowel (u, ô, o) nucleus.[f]
The correspondence between the orthography and pronunciation is complicated. For example, the offglide /j/ is usually written as i; however, it may also be represented with y. In addition, in the diphthongs [āj] and [āːj] the letters y and i also indicate the pronunciation of the main vowel: ay = ă + /j/, ai = a + /j/. Thus, tay "hand" is [tāj] while tai "ear" is [tāːj]. Similarly, u and o indicate different pronunciations of the main vowel: au = ă + /w/, ao = a + /w/. Thus, thau "brass" is [tʰāw] while thao "raw silk" is [tʰāːw].
The consonants that occur in Vietnamese are listed below in the Vietnamese orthography with the phonetic pronunciation to the right.
Some consonant sounds are written with only one letter (like "p"), other consonant sounds are written with a digraph (like "ph"), and others are written with more than one letter or digraph (the velar stop is written variously as "c", "k", or "q"). In some cases, they are based on their Middle Vietnamese pronunciation; since that period, ph and kh (but not th) have evolved from aspirated stops into fricatives (like Greek phi and chi), while d and gi have collapsed and converged together (into /z/ in the north and /j/ in the south).
Not all dialects of Vietnamese have the same consonant in a given word (although all dialects use the same spelling in the written language). See the language variation section for further elaboration.
Syllable-final orthographic ch and nh in Vietnamese has had different analyses. One analysis has final ch, nh as being phonemes /c/, /ɲ/ contrasting with syllable-final t, c /t/, /k/ and n, ng /n/, /ŋ/ and identifies final ch with the syllable-initial ch /c/. The other analysis has final ch and nh as predictable allophonic variants of the velar phonemes /k/ and /ŋ/ that occur after the upper front vowels i /i/ and ê /e/; although they also occur after a, but in such cases are believed to have resulted from an earlier e /ɛ/ which diphthongized to ai (cf. ach from aic, anh from aing). (See Vietnamese phonology: Analysis of final ch, nh for further details.)
Each Vietnamese syllable is pronounced with one of six inherent tones,[g] centered on the main vowel or group of vowels. Tones differ in:
Tone is indicated by diacritics written above or below the vowel (most of the tone diacritics appear above the vowel; except the nặng tone dot diacritic goes below the vowel).[h] The six tones in the northern varieties (including Hanoi), with their self-referential Vietnamese names, are:
Other dialects of Vietnamese may have fewer tones (typically only five).
In Vietnamese poetry, tones are classed into two groups: (tone pattern)
Words with tones belonging to a particular tone group must occur in certain positions within the poetic verse.
Vietnamese Catholics practice a distinctive style of prayer recitation called đọc kinh, in which each tone is assigned a specific note or sequence of notes.
Before Vietnamese switched from a Chinese-based script to a Latin-based script, Vietnamese had used the traditional Chinese system of classifying tones. Using this system, Vietnamese has 8 tones, but modern linguists only count 6 phonemic tones.
Vietnamese tones were classified into two main groups, bằng (平; 'level tones') and trắc (仄; 'sharp tones'). Tones such as ngang belong to the bằng group, while other tones such as ngã belong to the trắc group. Then, these tones were further divided in several other categorizes: bình (平; 'even'), thượng (上; 'rising'), khứ (去; 'departing'), and nhập (入; 'entering').
Sắc and nặng are counted twice in the system, once in khứ (去; 'departing') and again in nhập (入; 'entering'). The reason for the extra two tones is that syllables ending in the stops /p/, /t/, /c/ and /k/ are treated as having entering tones, but phonetically they are exactly the same.
The tones in the old classification were called Âm bình 陰平 (ngang), Dương bình 陽平 (huyền), Âm thượng 陰上 (hỏi), Dương thượng 陽上 (ngã), Âm khứ 陰去 (sắc; for words that do not end in /p/, /t/, /c/ and /k/), Dương khứ 陽去 (nặng; for words that do not end in /p/, /t/, /c/ and /k/), Âm nhập 陰入 (sắc; for words that do end in /p/, /t/, /c/ and /k/), and Dương nhập 陽入 (nặng; for words that do end in /p/, /t/, /c/ and /k/).
Vietnamese, like Thai and many languages in Southeast Asia, is an analytic language. Vietnamese does not use morphological marking of case, gender, number or tense (and, as a result, has no finite/nonfinite distinction).[i] Also like other languages in the region, Vietnamese syntax conforms to subject–verb–object word order, is head-initial (displaying modified-modifier ordering), and has a noun classifier system. Additionally, it is pro-drop, wh-in-situ, and allows verb serialization.
Some Vietnamese sentences with English word glosses and translations are provided below.
Minh
Minh
là
BE
giáo viên
teacher.
"Minh is a teacher."
Trí
Trí
13
13
tuổi
age
"Trí is 13 years old,"
Mai
Mai
có vẻ
seem
là
BE
sinh viên
student (college)
hoặc
or
học sinh.
student (under-college)
"Mai seems to be a college or high school student."
Tài
Tài
đang
PRES.CONT
nói.
talk
"Tài is talking."
Giáp
Giáp
rất
INT
cao.
tall
"Giáp is very tall."
Người
person
đó
that.DET
là
BE
anh
older brother
của
POSS
nó.
3.PRO
"That person is his/her brother."
Con
CL
chó
dog
này
DET
chẳng
NEG
bao giờ
ever
sủa
bark
cả.
all
"This dog never barks at all."
Nó
3.PRO
chỉ
just
ăn
eat
cơm
rice.FAM
Việt Nam
Vietnam
thôi.
only
"He/she/it only eats Vietnamese rice (or food, especially spoken by the elderly)."
Tôi
1.PRO
thích
like
con
CL
ngựa
horse
đen.
black
"I like the black horse."
Tôi
1.PRO
thích
like
cái
FOC
con
CL
ngựa
horse
đen
black
đó.
DET
"I like that black horse."
Hãy
HORT
ở lại
stay
đây
here
ít
few
phút
minute
cho tới
until
khi
when
tôi
1.PRO
quay
turn
lại.
again
"Please stay here for a few minutes until I return."
Many early studies hypothesized Vietnamese language-origins to have been either Kra-Dai, Sino-Tibetan, or Austroasiatic. Austroasiatic origins are so far the most tenable to date, with some of the oldest words in Vietnamese being Austroasiatic in origin.[32][54]
Although Vietnamese roots are classified as Austroasiatic, Vietic, and Viet-Muong, language contact with Chinese heavily influenced the Vietnamese language, causing it to diverge from Viet-Muong around the 10th to 11th century and become the Vietnamese we know today. For instance, the Vietnamese word quản lý, meaning "management" (noun) or "manage" (verb), likely descended from the same word as guǎnlǐ (管理) in Chinese (also kanri (管理, かんり) in Japanese and gwalli (gwan+ri; Korean: 관리; Hanja: 管理) in Korean). Instances of Chinese contact include the historical Nam Việt (aka Nanyue) as well as other periods of influence. Besides English and French, which have made some contributions to the Vietnamese language, Japanese loanwords into Vietnamese are also a more recently studied phenomenon.
Modern linguists describe modern Vietnamese having lost many Proto-Austroasiatic phonological and morphological features that original Vietnamese had.[55] The Chinese influence on Vietnamese corresponds to various periods when Vietnam was under Chinese rule and subsequent influence after Vietnam became independent. Early linguists thought that this meant the Vietnamese lexicon had only two influxes of Chinese words, one stemming from the period under actual Chinese rule and a second from afterwards. These words are grouped together as Sino-Vietnamese vocabulary.
However, according to linguist John Phan, “Annamese Middle Chinese” was already used and spoken in the Red River Valley by the 1st century CE, and its vocabulary significantly fused with the co-existing Proto-Viet-Muong language, the immediate ancestor of Vietnamese. He lists three major classes of Sino-Vietnamese borrowings:[56][57][58] Early Sino-Vietnamese (Han dynasty ca. 1st century CE and Jin dynasty ca. 4th century CE), Late Sino-Vietnamese (Tang dynasty), and Recent Sino-Vietnamese (Ming dynasty and afterwards)
Vietnam became a French protectorate/colonial territory in 1883 (until the Geneva Accords of 1954), which resulted in significant influence from French into the Indochina region (Laos, Cambodia and Vietnam). Examples include:
"Cà phê" in Vietnamese was derived from the French café (coffee). Yogurt in Vietnamese is "sữa chua" (lit. "sour milk"), but it is also calqued from French (yaourt) into Vietnamese (da ua - /j/a ua). "Phô mai" (cheese) is from the French fromage. Musical note was borrowed into Vietnamese as "nốt" or "nốt nhạc", from the French note de musique. The Vietnamese term for steering wheel is "vô lăng", a partial derivation from the French volant directionnel. A necktie (cravate in French) is rendered into Vietnamese as "cà vạt".
In addition, modern Vietnamese pronunciations of French names correspond directly to the original French pronunciations ("Pa-ri" for Paris, "Mác-xây" for Marseille, "Boóc-đô" for Bordeaux, etc.), whereas pronunciations of other foreign names (Chinese excluded) are generally derived from English.
Some English words were incorporated into Vietnamese as loan words - such as "TV", borrowed as "tivi" or just TV, but still officially called truyền hình. Some other borrowings are calques, translated into Vietnamese. For example, 'software' is translated into "phần mềm" (literally meaning "soft part"). Some scientific terms, such as "biological cell", were derived from chữ Hán. For example, the word tế bào is 細胞 in chữ Hán, whilst other scientific names such as "acetylcholine" are unaltered. Words like "peptide" may be seen as peptit.
Japanese loanwords are a more recently studied phenomenon, with a paper by Nguyễn & Lê (2020) classifying three waves of Japanese influence - with the first two waves being the principal influxes and the third wave coming from the Vietnamese who studied Japanese.[59] The first wave consisted of Kanji words created by Japanese to represent Western concepts that were not readily available in Chinese or Japanese, where by the end of the 19th century they were imported to other Asian languages.[60] This first influx is called Sino-Vietnamese words of Japanese origins. For example, the Vietnamese term for "association club", câu lạc bộ, which was borrowed from Chinese (俱乐部, pinyin: jùlèbù, jyutping: keoi1 lok6 bou6), and then in turn from Japanese (kanji: 倶楽部, katakana: クラブ, rōmaji: kurabu) which came from the English "club", resulting in indirect borrowing from Japanese.
The second wave was during the brief Japanese occupation of Vietnam from 1940 until 1945. However, Japanese cultural influence in Vietnam started significantly from the 1980s. This newer second wave of Japanese-origin loanwords is distinctive from the Sino-Vietnamese words of Japanese origin in that they were borrowed directly from Japanese. This vocabulary includes words representative of Japanese culture, such as kimono, sumo, samurai, and bonsai from modified Hepburn romanisation. These loanwords are coined as "new Japanese loanwords". A significant number of new Japanese loanwords were also of Chinese origin. Sometimes the same concept can be described using both Sino-Vietnamese words of Japanese origin (first wave) and new Japanese loanwords (second wave). For example, judo can be referred to as both judo and nhu đạo, the Vietnamese reading of 柔道.[59]
Some words, such as lạp xưởng from 臘腸 (Chinese sausage), primarily keep to the Cantonese pronunciations, having been brought over by southern Chinese migrants, whereas in Hán-Việt, which has been described as being close to Middle Chinese pronunciation, it is actually pronounced lạp trường. However, the Cantonese term is the better-known name for Chinese sausage in Vietnam. Meanwhile, any new terms calqued from Chinese would be based on the Mandarin pronunciation. Additionally, in the southern provinces of Vietnam, the term xí ngầu can be used to refer to dice, which may have derived from a Cantonese or Teochew idiom, "xập xí, xập ngầu" (十四, 十五, Sino-Vietnamese: thập tứ, thập ngũ), literally "fourteen, fifteen" to mean 'uncertain'.
Basic vocabulary in Vietnamese has Proto-Vietic origins. Vietnamese shares a large amount of vocabulary with the Mường languages, a close relative of the Vietnamese language.
Other compound words, such as nước non (chữ Nôm: 渃𡽫, "country/nation", lit. "water and mountains"), appear to be of purely Vietnamese origin and used to be inscribed in chữ Nôm characters (compounded, self-coined Chinese characters) but are now written in the Vietnamese alphabet.
Vietnamese slang (tiếng lóng) has changed over time. Vietnamese slang consists of pure Vietnamese words as well as words borrowed from other languages such as Mandarin or Indo-European languages.[61] It is estimated that Vietnamese slang originating from Mandarin accounts for a tiny proportion (4.6% of surveyed data in newspapers).[61] On the other hand, slang originating from Indo-European languages accounts for a more significant proportion (12%) and is much more common in today's usage.[61] Slang borrowed from these languages can be either transliteral or vernacular.[61] Some examples:
With the rise of the Internet, new slang is generated and popularized through social media. This more modern slang is commonly used in the younger generation's teenspeak in Vietnam. This recent slang is mostly pure Vietnamese, and almost all the words are homonyms or some form of wordplay. Some slang words may include profanity swear words (derogatory) or just a play on words.
Some examples with newer and older slang that originate from northern, central, or southern Vietnamese dialects include:
Whilst older slang has been used by previous generations, the prevalence of modern slang used by young people in Vietnam (as teenspeak) has made conversations more difficult for older generations to understand. This has become subject for debate. Some believe that incorporating teenspeak or internet slang in daily conversation among teenagers will affect the formality and cadence of their general speech.[68] Others argue that it is not slang that is the problem, but rather the lack of communication techniques for the instant internet messaging era. They believe slang should not be dismissed, but instead, youth should be adequately informed to recognise when to use it and when it is inappropriate.
After ending a millennium of Chinese rule in 939, the Vietnamese state adopted Literary Chinese (called văn ngôn 文言 or Hán văn 漢文 in Vietnamese) for official purposes.[69]Up to the late 19th century (except for two brief interludes), all formal writing, including government business, scholarship and formal literature, was done in Literary Chinese, written with Chinese characters (chữ Hán).[70] Although the writing system is now mostly in chữ Quốc ngữ (Latin script), Chinese script known as chữ Hán in Vietnamese as well as chữ Nôm (together, Hán-Nôm) is still present in such activities such as Vietnamese calligraphy.
From around the 13th century, Vietnamese scholars used their knowledge of the Chinese script to develop the chữ Nôm (lit. 'Southern characters') script to record folk literature in Vietnamese. The script used Chinese characters to represent both borrowed Sino-Vietnamese vocabulary and native words with similar pronunciation or meaning. In addition, thousands of new compound characters were created to write Vietnamese words using a variety of methods, including phono-semantic compounds.[71]For example, in the opening lines of the classic poem The Tale of Kiều,
The oldest example of an early form of the Nôm is found in a list of names in the Tháp Miếu Temple Inscription, dating from early 13th century AD.[72][73]Nôm writing reached its zenith in the 18th century when many Vietnamese writers and poets composed their works in Nôm, most notably Nguyễn Du and Hồ Xuân Hương (dubbed "the Queen of Nôm poetry"). However, it was only used for official purposes during the brief Hồ and Tây Sơn dynasties (1400–1406 and 1778–1802 respectively).[74]
A Vietnamese Catholic, Nguyễn Trường Tộ, unsuccessfully petitioned the Court suggesting the adoption of a script for Vietnamese based on Chinese characters.[75][76]
A romanisation of Vietnamese was codified in the 17th century by the Avignonese Jesuit missionary Alexandre de Rhodes (1591–1660), based on works of earlier Portuguese missionaries, particularly Francisco de Pina, Gaspar do Amaral and Antonio Barbosa.[77][78]It reflects a "Middle Vietnamese" dialect close to the Hanoi variety as spoken in the 17th century. Its vowels and final consonants correspond most closely to northern dialects while its initial consonants are most similar to southern dialects. (This is not unlike how English orthography is based on the Chancery Standard of Late Middle English, with many spellings retained even after the Great Vowel Shift.)
The Vietnamese alphabet contains 29 letters, supplementing the Latin alphabet with an additional consonant letter (đ) and 6 additional vowel letters (ă, â/ê/ô, ơ, ư) formed with diacritics. The Latin letters f, j, w and z are not used.[79][80]The script also represents additional phonemes using ten digraphs (ch, gh, gi, kh, ng, nh, ph, qu, th, and tr) and a single trigraph (ngh). Further diacritics are used to indicate the tone of each syllable:
Thus, it is possible for diacritics to be stacked e.g. ể, combining letter with diacritic, ê, with diacritic for tone, ẻ, to make ể.
Despite the missionaries' creation of the alphabetic script, chữ Nôm remained the dominant script in Vietnamese Catholic literature for more than 200 years.[81] Starting from the late 19th century, the Vietnamese alphabet (chữ Quốc ngữ or 'national language script') gradually expanded from its initial usage in Christian writing to become more popular among the general public.
The romanised script became predominant over the course of the early 20th century, when education became widespread and a simpler writing system was found to be more expedient for teaching and communication with the general population. The French colonial administration sought to eliminate Chinese writing, Confucianism, and other Chinese influences from Vietnam.[76] French superseded Literary Chinese in administration. Vietnamese written with the alphabet became required for all public documents in 1910 by issue of a decree by the French Résident Supérieur of the protectorate of Tonkin. In turn, Vietnamese reformists and nationalists themselves encouraged and popularized the use of chữ Quốc ngữ. By the middle of the 20th century, most writing was done in chữ Quốc ngữ, which became the official script on independence.
Nevertheless, chữ Hán was still in use during the French colonial period and as late as World War II was still featured on banknotes,[82][83] but fell out of official and mainstream use shortly thereafter. The education reform by North Vietnam in 1950 eliminated the use of chữ Hán and chữ Nôm.[84] Today, only a few scholars and some extremely elderly people are able to read chữ Nôm or use it in Vietnamese calligraphy. Priests of the Jing minority in China (descendants of 16th-century migrants from Vietnam) use songbooks and scriptures written in chữ Nôm in their ceremonies.[85]
The Unicode character set contains all Vietnamese characters and the Vietnamese currency symbol. On systems that do not support Unicode, many 8-bit Vietnamese code pages are available such as Vietnamese Standard Code for Information Interchange (VSCII) or Windows-1258. Where ASCII must be used, Vietnamese letters are often typed using the VIQR convention, though this is largely unnecessary with the increasing ubiquity of Unicode. There are many software tools that help type Roman-script Vietnamese on English keyboards, such as WinVNKey and Unikey on Windows, or MacVNKey on Macintosh, with popular methods of encoding Vietnamese using Telex, VNI or VIQR input methods all included. Telex input method is often set as the default for many devices. Besides third-party software tools, operating systems such as Windows or macOS can also be installed with Vietnamese and Vietnamese keyboard, e.g. Vietnamese Telex in Microsoft Windows.
Vietnamese speak date in the format "day month year". Each month's name is just the ordinal of that month appended after the word tháng, which means "month". Traditional Vietnamese, however, assigns other names to some months; these names are mostly used in the lunar calendar and in poetry.
When written in the short form, "DD/MM/YYYY" is preferred.
Example:
The Vietnamese prefer writing numbers with a comma as the decimal separator in lieu of dots, and either spaces or dots to group the digits. An example is 1 629,15 (one thousand six hundred twenty-nine point fifteen). Because a comma is used as the decimal separator, a semicolon is used to separate two numbers instead.
The Tale of Kiều is an epic narrative poem by the celebrated poet Nguyễn Du, (阮攸), which is often considered the most significant work of Vietnamese literature. It was originally written in chữ Nôm (titled Đoạn Trường Tân Thanh 斷腸新聲) and is widely taught in Vietnam (in chữ Quốc ngữ transliteration).
Currently Nguồn language is considered by the Vietnamese government to be a dialect of Vietnamese, however it is also considered a separate Việt-Mường language or the southernmost dialect of Mường language. The Vietnamese language also has several mutually intelligible regional varieties:[j]
Vietnamese has traditionally been divided into three dialect regions: North (45%), Central (10%), and South (45%). Michel Ferlus and Nguyễn Tài Cẩn found that there was a separate North-Central dialect for Vietnamese as well. The term Haut-Annam refers to dialects spoken from the northern Nghệ An Province to the southern (former) Thừa Thiên Province that preserve archaic features (like consonant clusters and undiphthongized vowels) that have been lost in other modern dialects.
The dialect regions differ mostly in their sound systems (see below) but also in vocabulary (including basic and non-basic vocabulary) and grammar.[k] The North-Central and the Central regional varieties, which have a significant number of vocabulary differences, are generally less mutually intelligible to Northern and Southern speakers. There is less internal variation within the Southern region than the other regions because of its relatively late settlement by Vietnamese-speakers (around the end of the 15th century). The North-Central region is particularly conservative since its pronunciation has diverged less from Vietnamese orthography than the other varieties, which tend to merge certain sounds. Along the coastal areas, regional variation has been neutralized to a certain extent, but more mountainous regions preserve more variation. As for sociolinguistic attitudes, the North-Central varieties are often felt to be "peculiar" or "difficult to understand" by speakers of other dialects although their pronunciation fits the written language the most closely; that is typically because of various words in their vocabulary that are unfamiliar to other speakers (see the example vocabulary table below).
The large movements of people between North and South since the mid-20th century has resulted in a sizable number of Southern residents speaking in the Northern accent/dialect and, to a greater extent, Northern residents speaking in the Southern accent/dialect. After the Geneva Accords of 1954, which called for the temporary division of the country, about a million northerners (mainly from Hanoi, Haiphong, and the surrounding Red River Delta areas) moved south (mainly to Saigon and heavily to Biên Hòa and Vũng Tàu and the surrounding areas) as part of Operation Passage to Freedom. About 180,000 moved in the reverse direction (Tập kết ra Bắc, literally "go to the North".)
After the Fall of Saigon in 1975, Northern and North-Central speakers from the densely-populated Red River Delta and the traditionally-poorer provinces of Nghệ An, Hà Tĩnh, and Quảng Bình have continued to move south to look for better economic opportunities allowed by the new government's New Economic Zones, a program that lasted from 1975 to 1985.[86] The first half of the program (1975–1980) resulted in 1.3 million people sent to the New Economic Zones (NEZs), most of which were relocated to the southern half of the country in previously uninhabited areas, and 550,000 of them were Northerners.[86] The second half (1981–1985) saw almost 1 million Northerners relocated to the New Economic Zones.[86] Government and military personnel from Northern and North-Central Vietnam are also posted to various locations throughout the country that were often away from their home regions. More recently, the growth of the free market system has resulted in increased interregional movement and relations between distant parts of Vietnam through business and travel. The movements have also resulted in some blending of dialects and more significantly have made the Northern dialect more easily understood in the South and vice versa. Most Southerners, when singing modern/old popular Vietnamese songs or addressing the public, do so in the standardized accent if possible, which uses the Northern pronunciation. That is true in both Vietnam and overseas Vietnamese communities.
Modern Standard Vietnamese is based on the Hanoi dialect. Nevertheless, the major dialects are still predominant in their respective areas and have also evolved over time with influences from other areas. Historically, accents have been distinguished by how each region pronounces the letters d ([z] in the Northern dialect and [j] in the Central and Southern dialect) and r ([z] in the Northern dialect and [r] in the Central and Southern dialects). Thus, the Central and the Southern dialects can be said to have retained a pronunciation closer to Vietnamese orthography and resemble how Middle Vietnamese sounded, in contrast to the modern Northern (Hanoi) dialect, which has since undergone pronunciation shifts.
Although regional variations developed over time, most of those words can be used interchangeably and be understood well, albeit with more or less frequency then others or with slightly different but often discernible word choices and pronunciations. Some accents may mix, with words such dạ vâng combining dạ and vâng, being created.
The syllable-initial ch and tr digraphs are pronounced distinctly in the North-Central, Central, and Southern varieties but are merged in Northern varieties, which pronounce them the same way). Many North-Central varieties preserve three distinct pronunciations for d, gi, and r, but the Northern varieties have a three-way merger, and the Central and the Southern varieties have a merger of d and gi but keep r distinct. At the end of syllables, the palatals ch and nh have merged with the alveolars t and n, which, in turn, have also partially merged with velars c and ng in the Central and the Southern varieties.
In addition to the regional variation described above, there is a merger of l and n in certain rural varieties in the North:[88]
Variation between l and n can be found even in mainstream Vietnamese in certain words. For example, the numeral "five" appears as năm by itself and in compound numerals like năm mươi "fifty", but it appears as lăm in mười lăm "fifteen" (see Vietnamese grammar#Cardinal). In some northern varieties, the numeral appears with an initial nh instead of l: hai mươi nhăm "twenty-five", instead of the mainstream hai mươi lăm.[n]
There is also a merger of r and g in certain rural varieties in the South:
The consonant clusters that were originally present in Middle Vietnamese (in the 17th century) have been lost in almost all modern Vietnamese varieties although they have been retained in other closely related Vietic languages. However, some speech communities have preserved some of these archaic clusters: "sky" is blời with a cluster in Hảo Nho (Yên Mô, Ninh Bình Province) but trời in Southern Vietnamese and giời in Hanoi Vietnamese (initial single consonants /ʈ/, /z/, respectively).
There are six tones in Vietnamese, with phonetic differences between dialects, mostly in the pitch contour and phonation type.
The table above shows the pitch contour of each tone using Chao tone number notation in which 1 represents the lowest pitch, and 5 the highest; glottalization (creaky, stiff, harsh) is indicated with the ⟨◌̰⟩ symbol; murmured voice with ⟨◌̤⟩; glottal stop with ⟨ʔ⟩; sub-dialectal variants are separated with commas. (See also the tone section below.)
A basic form of word play in Vietnamese involves disyllabic words in which the last syllable forms the first syllable of the next word in the chain. This game involves two members versing each other until the opponent is unable to think of another word. For instance:
Another language game known as nói lái is used by Vietnamese speakers.[89] Nói lái involves switching, adding or removing the tones in a pair of words and may also involve switching the order of words or the first consonant and the rime of each word. Some examples:
The resulting transformed phrase often has a different meaning but sometimes may just be a nonsensical word pair. Nói lái can be used to obscure the original meaning and thus soften the discussion of a socially sensitive issue, as with dấm đài and hoảng chưa (above), or when implied (and not overtly spoken), to deliver a hidden subtextual message, as with bồi tây.[o] Naturally, nói lái can be used for a humorous effect.[91]
Another word game somewhat reminiscent of pig latin is played by children. Here a nonsense syllable (chosen by the child) is prefixed onto a target word's syllables, then their initial consonants and rimes are switched with the tone of the original word remaining on the new switched rime.
This language game is often used as a "secret" or "coded" language useful for obscuring messages from adult comprehension.
This description distinguishes four degrees of vowel height and a rounding contrast (rounded vs. unrounded) between back vowels. The relative shortness of ă and â would then be a secondary feature. Thompson describes the vowel ă [ɐ] as being slightly higher (upper low) than a [a].
Of the approximately 90 millions speakers of Austroasiatic languages, over 70 million speak Vietnamese, nearly ten million speak Khmer and roughly five million speak Santali.
Research projects and data resources