Chinese characters

Unless otherwise specified, Chinese text in this article is written in the format (Simplified Chinese / Traditional Chinese; Pinyin). If the Simplified and Traditional Chinese characters are identical, they are written only once.

For the species of moth known as "Chinese Character", see Cilix glaucata.

In Japan, common characters are written in post-WWII Japan-specific simplified forms (shinjitai), which are closer to traditional forms than Chinese simplifications, while uncommon characters are written in Japanese traditional forms (kyūjitai), which are virtually identical to Chinese traditional forms. In South Korea, when Chinese characters are used they are of the traditional variant and are almost identical to those used in places like Taiwan and Hong Kong. Teaching of Chinese characters in South Korea starts in the 7th grade and continues until the 12th grade; a total of 1,800 characters are taught, though these characters are used only in certain cases (on signs, academic papers, historical writings, etc.) and are slowly declining in use.

In Old Chinese (and Classical Chinese, which is based on it), most words were monosyllabic and there was a close correspondence between characters and words. In modern Chinese (esp. Mandarin Chinese), characters do not necessarily correspond to words; indeed the majority of Chinese words today consist of two or more characters because of the merging and loss of sounds in the Chinese language over time.[7] Rather, a character almost always corresponds to a single syllable that is also a morpheme.[8] However, there are a few exceptions to this general correspondence, including bisyllabic morphemes (written with two characters), bimorphemic syllables (written with two characters) and cases where a single character represents a polysyllabic word or phrase.[9]

Modern Chinese has many homophones; thus the same spoken syllable may be represented by many characters, depending on meaning. A single character may also have a range of meanings, or sometimes quite distinct meanings; occasionally these correspond to different pronunciations. Cognates in the several varieties of Chinese are generally written with the same character. They typically have similar meanings, but often quite different pronunciations; in other languages, most significantly today in Japanese and sometimes in Korean, characters are used to represent Chinese loanwords, to represent native words independently of the Chinese pronunciation (e.g., kunyomi in Japanese), and as purely phonetic elements based on their pronunciation in the historical variety of Chinese from which they were acquired. These foreign adaptations of Chinese pronunciation are known as Sino-Xenic pronunciations and have been useful in the reconstruction of Middle Chinese.

When the script was first used in the late 2nd millennium BC, words of Old Chinese were generally monosyllabic, and each character denoted a single word.[10] Increasing numbers of polysyllabic words have entered the language from the Western Zhou period to the present day, it is estimated that about 25–30% of the vocabulary of classic texts from the Warring States period was polysyllabic, though these words were used far less commonly than monosyllables, which accounted for 80–90% of occurrences in these texts.[11] The process has accelerated over the centuries as phonetic change has increased the number of homophones,[12] it has been estimated that over two thirds of the 3,000 most common words in modern Standard Chinese are polysyllables, the vast majority of those being disyllables.[13]

The most common process has been to form compounds of existing words, written with the characters of the constituent words. Words have also been created by adding affixes, reduplication and borrowing from other languages.[14] Polysyllabic words are generally written with one character per syllable;[15][b] in most cases the character denotes a morpheme descended from an Old Chinese word.[16]

Many characters have multiple readings, with instances denoting different morphemes, sometimes with different pronunciations; in modern Standard Chinese, one fifth of the 2,400 most common characters have multiple pronunciations. For the 500 most common characters, the proportion rises to 30%.[17] Often these readings are similar in sound and related in meaning; in the Old Chinese period, affixes could be added to a word to form a new word, which was often written with the same character. In many cases the pronunciations diverged due to subsequent sound change, for example, many additional readings have the Middle Chinese departing tone, the major source of the 4th tone in modern Standard Chinese. Scholars now believe that this tone is the reflex of an Old Chinese *-s suffix, with a range of semantic functions,[18] for example,

Another common alternation is between voiced and voiceless initials (though the voicing distinction has disappeared on most modern varieties), this is believed to reflect an ancient prefix, but scholars disagree on whether the voiced or voiceless form is the original root. For example,

Chinese characters represent words of the language using several strategies. A few characters, including some of the most commonly used, were originally pictograms, which depicted the objects denoted, or ideograms, in which meaning was expressed iconically, the vast majority were written using the rebus principle, in which a character for a similarly sounding word was either simply borrowed or (more commonly) extended with a disambiguating semantic marker to form a phono-semantic compound character.[24]

The traditional six-fold classification (liùshū六书 / 六書 "six writings") was first described by the scholar Xu Shen in the postface of his dictionary Shuowen Jiezi in 100 AD.[25] While this analysis is sometimes problematic and arguably fails to reflect the complete nature of the Chinese writing system, it has been perpetuated by its long history and pervasive use.

Pictograms make up only a small portion of Chinese characters. Characters in this class derive from pictures of the objects they denote, over time they have been standardized, simplified, and stylized to make them easier to write, and their derivation is therefore not always obvious. Examples include 日rì for "sun", 月yuè for "moon", 木mù for "tree" or "wood", and 麻má for "hemp".

There is no concrete number for the proportion of modern characters that are pictographic in nature; however, Xu Shen placed approximately 4% of characters in this category.

Also translated as logical aggregates or associative compounds, these characters have been interpreted as combining two or more pictographic or ideographic characters to suggest a third meaning. Commonly cited examples include 休 "rest" (composed of the pictograms 人 "person" and 木 "tree") and 好 "good" (composed of 女 "woman" and 子 "child").

Xu Shen placed approximately 13% of characters in this category. However, many of these characters are now believed to be phono-semantic compounds whose origin has been obscured by subsequent changes in their form,[26] some scholars reject the applicability of this category to any of the compound characters devised in ancient times, maintaining that now-lost "secondary readings" are responsible for the apparent absence of phonetic indicators.[27]

Also called borrowings or phonetic loan characters, the rebus category covers cases where an existing character is used to represent an unrelated word with similar or identical pronunciation; sometimes the old meaning is then lost completely, as with characters such as 自zì, which has lost its original meaning of "nose" completely and exclusively means "oneself", or 萬wàn, which originally meant "scorpion" but is now used only in the sense of "ten thousand".

Rebus was pivotal in the history of writing in China insofar as it represented the stage at which logographic writing could become purely phonetic (phonographic). Chinese characters used purely for their sound values are attested in the Chun Qiu春秋 and Zhan Guo戰國 manuscripts, in which zhi氏 was used to write shi是 and vice versa, just lines apart; the same happened with shao 勺 for Zhao趙, with the characters in question being homophonous or nearly homophonous at the time.[28]

Semantic-phonetic compounds or pictophonetic compounds are by far the most numerous characters. These characters are composed of two parts: one of a limited set of characters (the semantic indicator, often graphically simplified) which suggests the general meaning of the compound character, and another character (the phonetic indicator) whose pronunciation suggests the pronunciation of the compound character. In most cases the semantic indicator is also the radical under which the character is listed in dictionaries.

Examples are 河hé "river", 湖hú "lake", 流liú "stream", 沖chōng "surge", 滑huá "slippery". All these characters have on the left a radical of three short strokes (氵), which is a reduced form of the character 水 shuǐ meaning "water", indicating that the character has a semantic connection with water, the right-hand side in each case is a phonetic indicator. For example, in the case of 沖chōng (Old Chinese *ɡ-ljuŋ[29]) "surge", the phonetic indicator is 中zhōng (Old Chinese *k-ljuŋ[30]), which by itself means "middle". In this case it can be seen that the pronunciation of the character is slightly different from that of its phonetic indicator; the effect of historical sound change means that the composition of such characters can sometimes seem arbitrary today.

Xu Shen (c. 100 AD) placed approximately 82% of characters into this category, while in the Kangxi Dictionary (1716 AD) the number is closer to 90%, due to the extremely productive use of this technique to extend the Chinese vocabulary.[citation needed]

This method is still sometimes used to form new characters, for example 钚 / 鈈bù ("plutonium") is the metal radical 金jīn plus the phonetic component 不bù, described in Chinese as "不 gives sound, 金 gives meaning". Many Chinese names of elements in the periodic table and many other chemistry-related characters were formed this way. In fact, it is possible to tell from a Chinese periodic table at a glance which elements are metal (金), solid nonmetal (石, "stone"), liquid (氵), or gas (气).

Occasionally a bisyllabic word is written with two characters that contain the same radical, as in 蝴蝶húdié "butterfly", where both characters have the insect radical 虫. A notable example is pipa (a Chinese lute, also a fruit, the loquat, of similar shape) – originally written as 批把 with the hand radical, referring to the down and up strokes when playing this instrument, which was then changed to 枇杷 (tree radical), which is still used for the fruit, while the character was changed to 琵琶 when referring to the instrument.[31] In other cases a compound word may coincidentally share a radical without this being meaningful.

The smallest category of characters is also the least understood;[32] in the postface to the Shuowen Jiezi, Xu Shen gave as an example the characters 考kǎo "to verify" and 老lǎo "old", which had similar Old Chinese pronunciations (*khuʔ and *c-ruʔ respectively[33]) and may once have been the same word, meaning "elderly person", but became lexicalized into two separate words. The term does not appear in the body of the dictionary, and is often omitted from modern systems.[26]

According to legend, Chinese characters were invented by Cangjie, a bureaucrat under the legendary Yellow Emperor. Inspired by his study of the animals of the world, the landscape of the earth and the stars in the sky, Cangjie is said to have invented symbols called zì (字) – the first Chinese characters, the legend relates that on the day the characters were created, people heard ghosts wailing and saw crops falling like rain.[34]

In recent decades, a series of inscribed graphs and pictures have been found at Neolithic sites in China, including Jiahu (c. 6500 BC), Dadiwan and Damaidi from the 6th millennium BC, and Banpo (5th millennium BC). Often these finds are accompanied by media reports that push back the purported beginnings of Chinese writing by thousands of years.[35][36] However, because these marks occur singly, without any implied context, and are made crudely and simply, Qiu Xigui concluded that "we do not have any basis for stating that these constituted writing nor is there reason to conclude that they were ancestral to Shang dynasty Chinese characters."[37] They do however demonstrate a history of sign use in the Yellow River valley during the Neolithic through to the Shang period.[36]

The earliest confirmed evidence of the Chinese script yet discovered is the body of inscriptions carved on oracle bones from the late Shang dynasty (c. 1200–1050 BC).[38][39] In 1899, pieces of these bones were being sold as "dragon bones" for medicinal purposes, when scholars identified the symbols on them as Chinese writing. By 1928, the source of the bones had been traced to a village near Anyang in Henan Province, which was excavated by the Academia Sinica between 1928 and 1937. Over 150,000 fragments have been found.[38]

Oracle bone inscriptions are records of divinations performed in communication with royal ancestral spirits,[38] the shortest are only a few characters long, while the longest are thirty to forty characters in length. The Shang king would communicate with his ancestors on topics relating to the royal family, military success, weather forecasting, ritual sacrifices, and related topics by means of scapulimancy, and the answers would be recorded on the divination material itself.[38]

The oracle-bone script is a well-developed writing system,[40][41] suggesting that the Chinese script's origins may lie earlier than the late second millennium BC.[42] Although these divinatory inscriptions are the earliest surviving evidence of ancient Chinese writing, it is widely believed that writing was used for many other non-official purposes, but that the materials upon which non-divinatory writing was done – likely wood and bamboo – were less durable than bone and shell and have since decayed away.[42]

The traditional picture of an orderly series of scripts, each one invented suddenly and then completely displacing the previous one, has been conclusively demonstrated to be fiction by the archaeological finds and scholarly research of the later 20th and early 21st centuries.[43] Gradual evolution and the coexistence of two or more scripts was more often the case, as early as the Shang dynasty, oracle-bone script coexisted as a simplified form alongside the normal script of bamboo books (preserved in typical bronze inscriptions), as well as the extra-elaborate pictorial forms (often clan emblems) found on many bronzes.

Left: Bronze fāngzūn (方樽) ritual wine container dated about 1000 BC. The written inscription cast in bronze on the vessel commemorates a gift of cowrie shells (then used as currency in China) from someone of presumably elite status in Zhou dynasty society. Right: Bronze fāngyí (方彝) ritual container dated about 1000 BC. A written inscription of some 180 Chinese characters appears twice on the vessel, the written inscription comments on state rituals that accompanied court ceremony, recorded by an official scribe.

Based on studies of these bronze inscriptions, it is clear that, from the Shang dynasty writing to that of the Western Zhou and early Eastern Zhou, the mainstream script evolved in a slow, unbroken fashion, until assuming the form that is now known as seal script in the late Eastern Zhou in the state of Qin, without any clear line of division.[44][45] Meanwhile, other scripts had evolved, especially in the eastern and southern areas during the late Zhou dynasty, including regional forms, such as the gǔwén ("ancient forms") of the eastern Warring States preserved as variant forms in the Han dynasty character dictionary Shuowen Jiezi, as well as decorative forms such as bird and insect scripts.

Seal script, which had evolved slowly in the state of Qin during the Eastern Zhou dynasty, became standardized and adopted as the formal script for all of China in the Qin dynasty (leading to a popular misconception that it was invented at that time), and was still widely used for decorative engraving and seals (name chops, or signets) in the Han dynasty period. However, despite the Qin script standardization, more than one script remained in use at the time, for example, a little-known, rectilinear and roughly executed kind of common (vulgar) writing had for centuries coexisted with the more formal seal script in the Qin state, and the popularity of this vulgar writing grew as the use of writing itself became more widespread.[46] By the Warring States period, an immature form of clerical script called "early clerical" or "proto-clerical" had already developed in the state of Qin[47] based upon this vulgar writing, and with influence from seal script as well.[48] The coexistence of the three scripts – small seal, vulgar and proto-clerical, with the latter evolving gradually in the Qin to early Han dynasties into clerical script – runs counter to the traditional belief that the Qin dynasty had one script only, and that clerical script was suddenly invented in the early Han dynasty from the small seal script.

Proto-clerical script, which had emerged by the time of the Warring States period from vulgar Qin writing, matured gradually, and by the early Western Han period, it was little different from that of the Qin.[49] Recently discovered bamboo slips show the script becoming mature clerical script by the middle-to-late reign of Emperor Wu of the Western Han,[50] who ruled from 141 to 87 BC.

Contrary to the popular belief of there being only one script per period, there were in fact multiple scripts in use during the Han period,[51] although mature clerical script, also called 八分 (bāfēn)[52] script, was dominant at that time, an early type of cursive script was also in use by the Han by at least as early as 24 BC (during the very late Western Han period),[c] incorporating cursive forms popular at the time, well as many elements from the vulgar writing of the Warring State of Qin.[53] By around the time of the Eastern Jin dynasty, this Han cursive became known as 章草 zhāngcǎo (also known as 隶草 / 隸草 lìcǎo today), or in English sometimes clerical cursive, ancient cursive, or draft cursive. Some believe that the name, based on 章 zhāng meaning "orderly", arose because the script was a more orderly form[54] of cursive than the modern form, which emerged during the Eastern Jin dynasty and is still in use today, called 今草 jīncǎo or "modern cursive".[55]

Around the mid-Eastern Han period,[54] a simplified and easier-to-write form of clerical script appeared, which Qiu terms "neo-clerical" (新隶体 / 新隸體, xīnlìtǐ).[56] By the late Eastern Han, this had become the dominant daily script,[54] although the formal, mature bāfēn (八分) clerical script remained in use for formal works such as engraved stelae.[54] Qiu describes this neo-clerical script as a transition between clerical and regular script,[54] and it remained in use through the Cao Wei and Jin dynasties.[57]

By the late Eastern Han period, an early form of semi-cursive script appeared,[56] developing out of a cursively written form of neo-clerical script[d] and simple cursive.[58] This semi-cursive script was traditionally attributed to Liu Desheng c. 147–188 AD,[57][e] although such attributions refer to early masters of a script rather than to their actual inventors, since the scripts generally evolved into being over time. Qiu gives examples of early semi-cursive script, showing that it had popular origins rather than being purely Liu’s invention.[59]

Regular script has been attributed to Zhong Yao, of the Eastern Han to Cao Wei period (c. 151–230 AD), who has been called the "father of regular script". However, some scholars[60] postulate that one person alone could not have developed a new script which was universally adopted, but could only have been a contributor to its gradual formation, the earliest surviving pieces written in regular script are copies of Yao's works, including at least one copied by Wang Xizhi. This new script, which is the dominant modern Chinese script, developed out of a neatly written form of early semi-cursive, with addition of the pause (頓/顿 dùn) technique to end horizontal strokes, plus heavy tails on strokes which are written to the downward-right diagonal.[61] Thus, early regular script emerged from a neat, formal form of semi-cursive, which had itself emerged from neo-clerical (a simplified, convenient form of clerical script), it then matured further in the Eastern Jin dynasty in the hands of the "Sage of Calligraphy", Wang Xizhi, and his son Wang Xianzhi. It was not, however, in widespread use at that time, and most writers continued using neo-clerical, or a somewhat semi-cursive form of it, for daily writing,[61] while the conservative bafen clerical script remained in use on some stelae, alongside some semi-cursive, but primarily neo-clerical.[62]

Meanwhile, modern cursive script slowly emerged from the clerical cursive (zhāngcǎo) script during the Cao Wei to Jin period, under the influence of both semi-cursive and the newly emerged regular script.[63] Cursive was formalized in the hands of a few master calligraphers, the most famous and influential of whom was Wang Xizhi.[f]

It was not until the Northern and Southern dynasties that regular script rose to dominant status.[64] During that period, regular script continued evolving stylistically, reaching full maturity in the early Tang dynasty, some call the writing of the early Tang calligrapher Ouyang Xun (557–641) the first mature regular script. After this point, although developments in the art of calligraphy and in character simplification still lay ahead, there were no more major stages of evolution for the mainstream script.

Although most of the simplified Chinese characters in use today are the result of the works moderated by the government of the People's Republic of China in the 1950s and 60s, character simplification predates the republic's formation in 1949. One of the earliest proponents of character simplification was Lufei Kui, who proposed in 1909 that simplified characters should be used in education; in the years following the May Fourth Movement in 1919, many anti-imperialist Chinese intellectuals sought ways to modernise China. In the 1930s and 1940s, discussions on character simplification took place within the Kuomintang government, and many Chinese intellectuals and writers have long maintained that character simplification would help boost literacy in China; in many world languages, literacy has been promoted as a justification for spelling reforms. The People's Republic of China issued its first round of official character simplifications in two documents, the first in 1956 and the second in 1964; in the 1950s and 1960s, while confusion about simplified characters was still rampant, transitional characters that mixed simplified parts with yet-to-be simplified parts of characters together appeared briefly, then disappeared.

"Han unification" was an effort by the authors of Unicode and the Universal Character Set to map multiple character sets of the so-called CJK languages (Chinese/Japanese/Korean) into a single set of unified characters and was completed for the purposes of Unicode in 1991 (Unicode 1.0).

The Chinese script spread to Korea together with Buddhism from the 2nd century BC to 5th century AD (hanja).[65] The Japanese kanji were adopted for recording the Japanese language from the 5th century AD.[g]

Chinese characters were first used in Vietnam during the millennium of Chinese rule starting in 111 BC, they were used to write Classical Chinese and adapted around the 13th century to create the Nôm script to write Vietnamese.

Currently, the only non-Chinese language outside of China that uses Chinese characters is Japanese. Vietnamese abandoned their use in the early 20th century in favour of a Latin-based script, and Korean in the late 20th century in favour of its homegrown hangul script, although as Korean switched much more recently, many Koreans still learn them to read texts written before then, or in some cases to disambiguate homophones.

Chinese characters adapted to write Japanese words are known as Kanji. Chinese words borrowed into Japanese could be written with the Chinese character, while Japanese words could be written using the character for a Chinese word of similar meaning, because there have been multiple layers of borrowing into Japanese, a single character may have several readings in Japanese.[66]

Written Japanese also includes a pair of syllabaries known as kana, derived by simplifying Chinese characters selected to represent syllables of Japanese, the syllabaries differ because they sometimes selected different characters for a syllable, and because they used different strategies to reduce these characters for easy writing: the angular katakana were obtained by selecting a part of each character, while hiragana were derived from the cursive forms of whole characters.[67] Modern Japanese writing uses a composite system, using kanji for word stems, hiragana for inflexional endings and grammatical words, and katakana to transcribe non-Chinese loanwords as well as serve as a method to emphasize native words (similar to how italics are used in Romance languages).[68]

In times past, until the 15th century, in Korea, Literary Chinese was the dominant form of written communication, prior to the creation of hangul, the Korean alphabet. Much of the vocabulary, especially in the realms of science and sociology, comes directly from Chinese, comparable to Latin or Greek root words in European languages. However, due to the lack of tones in Korean,[69] as the words were imported from Chinese, many dissimilar characters took on identical sounds, and subsequently identical spelling in hangul.[citation needed] Chinese characters are sometimes used to this day for either clarification in a practical manner, or to give a distinguished appearance, as knowledge of Chinese characters is considered a high class attribute and an indispensable part of a classical education.[citation needed] It is also observed that the preference for Chinese characters is treated as being conservative and Confucian.

In Korea, hanja have become a politically contentious issue, with some Koreans urging a "purification" of the national language and culture by totally abandoning their use, these individuals encourage the exclusive use of the native hangul alphabet throughout Korean society and the end to character education in public schools.[citation needed]

In South Korea, educational policy on characters has swung back and forth, often swayed by education ministers' personal opinions, at present, middle and high school students (grades 7 to 12) are taught 1,800 characters,[70] albeit with the principal focus on recognition, with the aim of achieving newspaper literacy. Since there is little need to use hanja in everyday life, young adult Koreans are seldom able to read more than a few hundred characters.[citation needed]

There is a clear trend toward the exclusive use of hangul in day-to-day South Korean society. Hanja are still used to some extent, particularly in newspapers, weddings, place names and calligraphy (although it is nowhere near the extent of kanji use in day-to-day Japanese society). Hanja is also extensively used in situations where ambiguity must be avoided,[citation needed] such as academic papers, high-level corporate reports, government documents, and newspapers; this is due to the large number of homonyms that have resulted from extensive borrowing of Chinese words.

The issue of ambiguity is the main hurdle in any effort to "cleanse" the Korean language of Chinese characters.[citation needed] Characters convey meaning visually, while alphabets convey guidance to pronunciation, which in turn hints at meaning, as an example, in Korean dictionaries, the phonetic entry for 기사 gisa yields more than 30 different entries. In the past, this ambiguity had been efficiently resolved by parenthetically displaying the associated hanja.

When learning how to write hanja, students are taught to memorize the native Korean pronunciation for the hanja's meaning and the Sino-Korean pronunciations (the pronunciation based on the Chinese pronunciation of the characters) for each hanja respectively so that students know what the syllable and meaning is for a particular hanja, for example, the name for the hanja 水 is 물 수 (mul-su) in which 물 (mul) is the native Korean pronunciation for "water", while 수 (su) is the Sino-Korean pronunciation of the character. The naming of hanja is similar to if "water" were named "water-aqua", "horse-equus", or "gold-aurum" based on a hybridization of both the English and the Latin names. Other examples include 사람 인 (saram-in) for 人 "person/people", 큰 대 (keun-dae) for 大 "big/large//great", 작을 소 (jakeul-so) for 小 "small/little", 아래 하 (arae-ha) for 下 "underneath/below/low", 아비 부 (abi-bu) for 父 "father", and 나라이름 한 (naraireum-han) for 韓 "Han/Korea".[71]

In North Korea, the hanja system was once completely banned since June 1949 due to fears of collapsed containment of the country; during the 1950s, Kim Il Sung had condemned all sorts of foreign languages (even the newly proposed New Korean Orthography). The ban continued into the 21st century. However, a textbook for university history departments containing 3,323 distinct characters was published in 1971; in the 1990s, school children were still expected to learn 2,000 characters (more than in South Korea or Japan).[72]

After Kim Jong Il, the second ruler of North Korea, died in December 2011, Kim Jong Un stepped up and began mandating the use of Hanja as a source of definition for the Korean language. Currently, it is said that North Korea teaches around 3,000 Hanja characters to North Korean students, and in some cases, the characters appear within advertisements and newspapers. However, it is also said that the authorities implore students not to use the characters in public.[73] Due to North Korea's strict isolationism, accurate reports about hanja use in North Korea are hard to obtain.

Chinese characters are thought to have been first introduced to the Ryukyu Islands in 1265 by a Japanese Buddhist monk,[74] after the Okinawan kingdoms became tributaries of Ming China, especially the Ryukyu Kingdom, Classical Chinese was used in court documents, but hiragana was mostly used for popular writing and poetry. After Ryukyu became a vassal of Japan's Satsuma Domain, Chinese characters became more popular, as well as the use of Kanbun; in modern Okinawan, which is labeled as a Japanese dialect by the Japanese government, katakana and hiragana are mostly used to write Okinawan, but Chinese characters are still used.

"My mother eats vegetarian food at the pagoda every Sunday", written in the modern Vietnamese alphabet (blue) and Nom. Characters borrowed unchanged from Chinese are shown in green, while invented characters are brown.

Although Chinese characters in Vietnam are now limited to ceremonial uses, they were once in widespread use, until the early 20th century, Literary Chinese was used in Vietnam for all official and scholarly writing. Around the 13th century the Nôm script was developed to record folk literature in the Vietnamese language, the script used Chinese characters to represent both borrowed Sino-Vietnamese vocabulary and native words with similar pronunciation or meaning. In addition thousands of new compound characters were created to write Vietnamese words, this process resulted in a highly complex system that was never mastered by more than 5% of the population. Both Literary Chinese and Nôm were replaced in the early 20th century by Vietnamese written with the Latin-based Vietnamese alphabet.[75][76]

Several minority languages of south and southwest China were formerly written with scripts based on Chinese characters but also including many locally created characters, the most extensive is the sawndip script for the Zhuang language of Guangxi which is still used to this day. Other languages written with such scripts include Miao, Yao, Bouyei, Kam, Bai and Hani. All these languages are now written using Latin-based scripts.[77]

Along with Persian and Arabic, Chinese characters were also used as a foreign script to write the Mongolian language, where characters were used to phonetically transcribe Mongolian sounds. Most notably, the only surviving copies of The Secret History of the Mongols were written in such a manner; the Chinese characters 忙豁侖紐察 脫[卜]察安 (pinyin: mánghuōlúnniǔchá tuō[bo]chá'ān) is the rendering of Mongγol-un niγuca tobčiyan, the title in Mongolian.

According to the Rev. John Gulick: "The inhabitants of other Asiatic nations, who have had occasion to represent the words of their several languages by Chinese characters, have as a rule used unaspirated characters for the sounds, g, d, b, the Muslims from Arabia and Persia have followed this method … The Mongols, Manchu, and Japanese also constantly select unaspirated characters to represent the sounds g, d, b, and j of their languages. These surrounding Asiatic nations, in writing Chinese words in their own alphabets, have uniformly used g, d, b, etc., to represent the unaspirated sounds."[78]

The use of traditional Chinese characters versus simplified Chinese characters varies greatly, and can depend on both the local customs and the medium, before the official reform, character simplifications were not officially sanctioned and generally adopted vulgar variants and idiosyncratic substitutions. Orthodox variants were mandatory in printed works, while the (unofficial) simplified characters would be used in everyday writing or quick notes. Since the 1950s, and especially with the publication of the 1964 list, the People's Republic of China has officially adopted simplified Chinese characters for use in mainland China, while Hong Kong, Macau, and the Republic of China (Taiwan) were not affected by the reform. There is no absolute rule for using either system, and often it is determined by what the target audience understands, as well as the upbringing of the writer.

Although most often associated with the People's Republic of China, character simplification predates the 1949 communist victory. Caoshu, cursive written text, almost always includes character simplification, and simplified forms have always existed in print, albeit not for the most formal works. In the 1930s and 1940s, discussions on character simplification took place within the Kuomintang government, and a large number of Chinese intellectuals and writers have long maintained that character simplification would help boost literacy in China. Indeed, this desire by the Kuomintang to simplify the Chinese writing system (inherited and implemented by the Communist Party of China) also nursed aspirations of some for the adoption of a phonetic script based on the Latin script, and spawned such inventions as the Gwoyeu Romatzyh.

The People's Republic of China issued its first round of official character simplifications in two documents, the first in 1956 and the second in 1964. A second round of character simplifications (known as erjian, or "second round simplified characters") was promulgated in 1977. It was poorly received, and in 1986 the authorities rescinded the second round completely, while making six revisions to the 1964 list, including the restoration of three traditional characters that had been simplified: 叠 dié, 覆 fù, 像 xiàng.

The majority of simplified characters are drawn from conventional abbreviated forms, or ancient standard forms,[79] for example, the orthodox character 來 lái ("come") was written with the structure 来 in the clerical script (隶书 / 隸書, lìshū) of the Han dynasty. This clerical form uses one fewer stroke, and was thus adopted as a simplified form, the character 雲 yún ("cloud") was written with the structure 云 in the oracle bone script of the Shang dynasty, and had remained in use later as a phonetic loan in the meaning of "to say" while the 雨 radical was added to differentiate meanings. The simplified form adopts the original structure.

In the years after World War II, the Japanese government also instituted a series of orthographic reforms, some characters were given simplified forms called shinjitai(新字体, lit. "new character forms"); the older forms were then labelled the kyūjitai(旧字体, lit. "old character forms"). The number of characters in common use was restricted, and formal lists of characters to be learned during each grade of school were established, first the 1850-character tōyō kanji(当用漢字) list in 1945, the 1945-character jōyō kanji(常用漢字) list in 1981, and a 2136-character reformed version of the jōyō kanji in 2010. Many variant forms of characters and obscure alternatives for common characters were officially discouraged, this was done with the goal of facilitating learning for children and simplifying kanji use in literature and periodicals. These are simply guidelines, hence many characters outside these standards are still widely known and commonly used, especially those used for personal and place names (for the latter, see jinmeiyō kanji),[citation needed] as well as for some common words such as "dragon" (竜/龍,tatsu) in which both old and new forms of the kanji are both acceptable and widely known amongst native Japanese speakers.

Singapore underwent three successive rounds of character simplification. These resulted in some simplifications that differed from those used in mainland China, it ultimately adopted the reforms of the People's Republic of China in their entirety as official, and has implemented them in the educational system. However, unlike in China, personal names may still be registered in traditional characters.

Malaysia started teaching a set of simplified characters at schools in 1981, which were also completely identical to the Mainland China simplifications. Chinese newspapers in Malaysia are published in either set of characters, typically with the headlines in traditional Chinese while the body is in simplified Chinese.

Although in both countries the use of simplified characters is universal among the younger Chinese generation, a large majority of the older Chinese literate generation still use the traditional characters. Chinese shop signs are also generally written in traditional characters.

In the Philippines, most Chinese schools and businesses still use the traditional characters and bopomofo, owing from influence from the Republic of China (Taiwan) due to the shared Hokkien heritage. Recently, however, more Chinese schools now use both simplified characters and pinyin, since most readers of Chinese newspapers in the Philippines belong to the older generation, they are still published largely using traditional characters.

Public and private Chinese signage in the United States and Canada most often use Traditional Characters.[80] There is some effort to get municipal governments to implement more simplified character signage due to recent immigration from Mainland China.[81] Most community newspapers printed in North America are also printed in Traditional Characters.

There are numerous styles, or scripts, in which Chinese characters can be written, deriving from various calligraphic and historical models. Most of these originated in China and are now common, with minor variations, in all countries where Chinese characters are used.

The cursive script (草書(书), cǎoshū, literally "grass script") is used informally, the basic character shapes are suggested, rather than explicitly realized, and the abbreviations are sometimes extreme. Despite being cursive to the point where individual strokes are no longer differentiable and the characters often illegible to the untrained eye, this script (also known as draft) is highly revered for the beauty and freedom that it embodies, some of the simplified Chinese characters adopted by the People's Republic of China, and some simplified characters used in Japan, are derived from the cursive script. The Japanese hiragana script is also derived from this script.

There also exist scripts created outside China, such as the Japanese Edomoji styles; these have tended to remain restricted to their countries of origin, rather than spreading to other countries like the Chinese scripts.

Chinese calligraphy of mixed styles written by Song dynasty (1051–1108 AD) poet Mifu. For centuries, the Chinese literati were expected to master the art of calligraphy.

The art of writing Chinese characters is called Chinese calligraphy, it is usually done with ink brushes. In ancient China, Chinese calligraphy is one of the Four Arts of the Chinese Scholars. There is a minimalist set of rules of Chinese calligraphy, every character from the Chinese scripts is built into a uniform shape by means of assigning it a geometric area in which the character must occur. Each character has a set number of brushstrokes; none must be added or taken away from the character to enhance it visually, lest the meaning be lost. Finally, strict regularity is not required, meaning the strokes may be accentuated for dramatic effect of individual style. Calligraphy was the means by which scholars could mark their thoughts and teachings for immortality, and as such, represent some of the more precious treasures that can be found from ancient China.

Ming and sans-serif are the most popular in body text and are based on regular script for Chinese characters akin to Western serif and sans-serif typefaces, respectively. Regular script typefaces emulate regular script.

The Song typeface (宋体 / 宋體, sòngtǐ) is known as the Ming typeface (明朝, minchō) in Japan, and it is also somewhat more commonly known as the Ming typeface (明体 / 明體, míngtǐ) than the Song typeface in Taiwan and Hong Kong. The names of these styles come from the Song and Ming dynasties, when block printing flourished in China.

Sans-serif typefaces, called black typeface (黑体 / 黑體, hēitǐ) in Chinese and Gothic typeface (ゴシック体) in Japanese, are characterized by simple lines of even thickness for each stroke, akin to sans-serif styles such as Arial and Helvetica in Western typography.

Regular script typefaces are also commonly used, but not as common as Ming or sans-serif typefaces for body text. Regular script typefaces are often used to teach students Chinese characters, and often aim to match the standard forms of the region where they are meant to be used. Most typefaces in the Song dynasty were regular script typefaces which resembled a particular person's handwriting (e.g. the handwriting of Ouyang Xun, Yan Zhenqing, or Liu Gongquan), while most modern regular script typefaces tend toward anonymity and regularity.

Variants of the Chinese character for guī 'turtle', collected c. 1800 from printed sources. The one at left is the traditional form used today in Taiwan and Hong Kong, 龜, though 龜 may look slightly different, or even like the second variant from the left, depending on your font (see Wiktionary), the modern simplified forms used in China, 龟, and in Japan, 亀, are most similar to the variant in the middle of the bottom row, though neither is identical. A few more closely resemble the modern simplified form of the character for diàn 'lightning', 电.

Five of the 30 variant characters found in the preface of the Imperial (Kangxi) Dictionary which are not found in the dictionary itself. They are 為 (爲) wèi "due to", 此 cǐ "this", 所 suǒ "place", 能 néng "be able to", 兼 jiān "concurrently". (Although the form of 為 is not very different, and in fact is used today in Japan, the radical 爪 has been obliterated.) Another variant from the preface, 来 for 來 lái "to come", also not listed in the dictionary, has been adopted as the standard in Mainland China and Japan.

The character 次 in Simplified and Traditional Chinese, Japanese, and Korean. If you have an appropriate font installed, you can see the corresponding character in Vietnamese: 次.

Just as Roman letters have a characteristic shape (lower-case letters mostly occupying the x-height, with ascenders or descenders on some letters), Chinese characters occupy a more or less square area in which the components of every character are written to fit in order to maintain a uniform size and shape, especially with small printed characters in Ming and sans-serif styles. Because of this, beginners often practise writing on squared graph paper, and the Chinese sometimes use the term "Square-Block Characters" (方块字 / 方塊字, fāngkuàizì), sometimes translated as tetragraph,[82] in reference to Chinese characters.

Despite standardization, some nonstandard forms are commonly used, especially in handwriting; in older sources, even authoritative ones, variant characters are commonplace. For example, in the preface to the Imperial Dictionary, there are 30 variant characters which are not found in the dictionary itself.[83] A few of these are reproduced at right.

The nature of Chinese characters makes it very easy to produce allographs for many characters, and there have been many efforts at orthographical standardization throughout history; in recent times, the widespread usage of the characters in several nations has prevented any particular system becoming universally adopted and the standard form of many Chinese characters thus varies in different regions.

In addition to strictness in character size and shape, Chinese characters are written with very precise rules, the most important rules regard the strokes employed, stroke placement, and stroke order. Just as each region that uses Chinese characters has standardized character forms, each also has standardized stroke orders, with each standard being different. Most characters can be written with just one correct stroke order, though some words also have many valid stroke orders, which may occasionally result in different stroke counts, some characters are also written with different stroke orders due to character simplification.

Chinese characters are primarily morphosyllabic, meaning that most Chinese morphemes are monosyllabic and are written with a single character, though in modern Chinese most words are disyllabic and dimorphemic, consisting of two syllables, each of which is a morpheme. In modern Chinese 10% of morphemes only occur as part of a given compound. However, a few morphemes are disyllabic, some of them dating back to Classical Chinese.[84] Excluding foreign loan words, these are typically words for plants and small animals, they are usually written with a pair of phono-semantic compound characters sharing a common radical. Examples are 蝴蝶 húdié "butterfly" and 珊瑚 shānhú "coral". Note that the 蝴 hú of húdié and the 瑚 hú of shānhú have the same phonetic, 胡, but different radicals ("insect" and "jade", respectively). Neither exists as an independent morpheme except as a poetic abbreviation of the disyllabic word.

In certain cases compound words and set phrases may be contracted into single characters, some of these can be considered logograms, where characters represent whole words rather than syllable-morphemes, though these are generally instead considered ligatures or abbreviations (similar to scribal abbreviations, such as & for "et"), and as non-standard. These do see use, particularly in handwriting or decoration, but also in some cases in print; in Chinese, these ligatures are called héwén (合文), héshū (合書) or hétǐzì (合体字), and in the special case of combining two characters, these are known as "two-syllable Chinese characters" (双音节汉字, 雙音節漢字).

A commonly seen example is the double happiness symbol 囍, formed as a ligature of 喜喜 and referred to by its disyllabic name (simplified Chinese: 双喜; traditional Chinese: 雙喜; pinyin: shuāngxǐ). In handwriting, numbers are very frequently squeezed into one space or combined – common ligatures include 廿 niàn, "twenty", normally read as 二十 èrshí, 卅 sà, "thirty", normally read as 三十 sānshí, and 卌 xì "forty", normally read as 四十 "sìshí". Calendars often use numeral ligatures in order to save space; for example, the "21st of March" can be read as 三月廿一. In some cases counters are also merged into one character, such as 七十人 qīshí rén "seventy people". Another common abbreviation is 门 with a "T" written inside it, for 問題, 问题, wèntí ("question; problem"), where the "T" is from pinyin for the second syllable tí 题.[9] Since polysyllabic characters are often non-standard, they are often excluded in character dictionaries.

Modern examples particularly include Chinese characters for SI units; in Chinese these units are disyllabic and standardly written with two characters, as 厘米 límǐ "centimeter" (厘 centi-, 米 meter) or 千瓦 qiānwǎ "kilowatt". However, in the 19th century these were often written via compound characters, pronounced disyllabically, such as 瓩 for 千瓦 or 糎 for 厘米 – some of these characters were also used in Japan, where they were pronounced with borrowed European readings instead, these have now fallen out of general use, but are occasionally seen. Less systematic examples include 圕 túshūguǎn "library", a contraction of 圖書館,[85][86] A four-morpheme word, 社会主义 shèhuì zhǔyì "socialism", is commonly[weasel words] written with a single character formed by combining the last character, 义, with the radical of the first, 社, yielding roughly 礻义.[citation needed]

The use of such contractions is as old as Chinese characters themselves, and they have frequently been found in religious or ritual use; in the Oracle Bone script, personal names, ritual items, and even phrases such as 受又(祐) shòu yòu "receive blessings" are commonly contracted into single characters. A dramatic example is that in medieval manuscripts 菩薩 púsà "bodhisattva" (simplified: 菩萨) is sometimes written with a single character formed of a 2×2 grid of four 十 (derived from the grass radical over two 十).[9] However, for the sake of consistency and standardization, the CPC seeks to limit the use of such polysyllabic characters in public writing to ensure that every character only has one syllable.[87]

Conversely, with the fusion of the diminutive -er suffix in Mandarin, some monosyllabic words may even be written with two characters, as in 花儿 huār "flower", which was formerly disyllabic.

In most other languages that use the Chinese family of scripts, notably Korean, Vietnamese, and Zhuang, Chinese characters are typically monosyllabic, but in Japanese a single character is generally used to represent a borrowed monosyllabic Chinese morpheme (the on'yomi), an polysyllabic native Japanese morpheme (the kun'yomi), or even (in rare cases) a foreign loanword. These uses are completely standard and unexceptional.

Often a character not commonly used (a "rare" or "variant" character) will appear in a personal or place name in Chinese, Japanese, Korean, and Vietnamese (see Chinese name, Japanese name, Korean name, and Vietnamese name, respectively). This has caused problems as many computer encoding systems include only the most common characters and exclude the less often used characters, this is especially a problem for personal names which often contain rare or classical, antiquated characters.

One man who has encountered this problem is Taiwanese politician Yu Shyi-kun, due to the rarity of the last character in his name. Newspapers have dealt with this problem in varying ways, including using software to combine two existing, similar characters, including a picture of the personality, or, especially as is the case with Yu Shyi-kun, simply substituting a homophone for the rare character in the hope that the reader would be able to make the correct inference. Taiwanese political posters, movie posters etc. will often add the bopomofo phonetic symbols next to such a character. Japanese newspapers may render such names and words in katakana instead of kanji, and it is accepted practice for people to write names for which they are unsure of the correct kanji in katakana instead.

There are also some extremely complex characters which have understandably become rather rare. According to Joël Bellassen (1989), the most complex Chinese character is /𪚥 (U+2A6A5) zhélisten(help·info), meaning "verbose" and containing sixty-four strokes; this character fell from use around the 5th century. It might be argued, however, that while containing the most strokes, it is not necessarily the most complex character (in terms of difficulty), as it simply requires writing the same sixteen-stroke character 龍 lóng (lit. "dragon") four times in the space for one. Another 64-stroke character is /𠔻 (U+2053B) zhèng composed of 興 xīng/xìng (lit. "flourish") four times.

One of the most complex characters found in modern Chinese dictionaries[h] is 齉 (U+9F49) (nàng, listen(help·info), pictured below, middle image), meaning "snuffle" (that is, a pronunciation marred by a blocked nose), with "just" thirty-six strokes. However, this is not in common use, the most complex character that can be input using the Microsoft New Phonetic IME 2002a for traditional Chinese is 龘 (dá, "the appearance of a dragon flying"). It is composed of the dragon radical represented three times, for a total of 16 × 3 = 48 strokes, among the most complex characters in modern dictionaries and also in frequent modern use are 籲 (yù, "to implore"), with 32 strokes; 鬱 (yù, "luxuriant, lush; gloomy"), with 29 strokes, as in 憂鬱 (yōuyù, "depressed"); 豔 (yàn, "colorful"), with 28 strokes; and 釁 (xìn, "quarrel"), with 25 strokes, as in 挑釁 (tiǎoxìn, "to pick a fight"). Also in occasional modern use is 鱻 (xiān "fresh"; variant of 鮮 xiān) with 33 strokes.

In Japanese, an 84-stroke kokuji exists: , normally read taito. It is composed of three "cloud" (雲) characters on top of the abovementioned triple "dragon" character (龘). Also meaning "the appearance of a dragon in flight", it has been pronounced おとど otodo, たいとtaito, and だいと daito.[88] The most elaborate character in the jōyō kanji list is the 29-stroke 鬱, meaning "depression" or "melancholy".

The most complex Chinese character still in use may be[according to whom?]biáng (pictured right, bottom), with 58 strokes, which refers to Biang biang noodles, a type of noodle from China's Shaanxi province. This character along with the syllable biáng cannot be found in dictionaries, the fact that it represents a syllable that does not exist in any Standard Chinese word means that it could be classified as a dialectal character.

The total number of Chinese characters from past to present remains unknowable because new ones are being developed all the time – for instance, brands may create new characters when none of the existing ones allow for the intended meaning – or they have been invented by whoever wrote them and have never been adopted as official characters. Chinese characters are theoretically an open set and anyone can create new characters, though such inventions are rarely included in official character sets,[89] the number of entries in major Chinese dictionaries is the best means of estimating the historical growth of character inventory.

Even the Zhonghua Zihai does not include characters in the Chinese family of scripts created to represent non-Chinese languages. Characters formed by Chinese principles in other languages include the roughly 1,500 Japanese-made kokuji given in the Kokuji no Jiten,[103] the Korean-made gukja, the over 10,000 Sawndip characters still in use in Guangxi, and the almost 20,000 Nôm characters formerly used in Vietnam.[citation needed] More divergent descendents of Chinese script include Tangut script, which created over 5,000 characters with similar strokes but different formation principles to Chinese characters.

Modified radicals and new variants are two common reasons for the ever-increasing number of characters. There are about 300 radicals and 100 are in common use. Creating a new character by modifying the radical is an easy way to disambiguate homographs among xíngshēngzì pictophonetic compounds, this practice began long before the standardization of Chinese script by Qin Shi Huang and continues to the present day. The traditional 3rd-person pronoun tā (他 "he, she, it"), which is written with the "person radical", illustrates modifying significs to form new characters; in modern usage, there is a graphic distinction between tā (她 "she") with the "woman radical", tā (牠 "it") with the "animal radical", tā (它 "it") with the "roof radical", and tā (祂 "He") with the "deity radical", One consequence of modifying radicals is the fossilization of rare and obscure variant logographs, some of which are not even used in Classical Chinese. For instance, he 和 "harmony, peace", which combines the "grain radical" with the "mouth radical", has infrequent variants 咊 with the radicals reversed and 龢 with the "flute radical".

Cumulative frequency of simplified Chinese characters in Modern Chinese text[104]

Chinese characters should not be confused with Chinese words, as the majority of modern Chinese words, unlike their Old Chinese and Middle Chinese counterparts, are written with two or more characters, each character representing one syllable and/or morpheme. Knowing the meanings of the individual characters of a word will often allow the general meaning of the word to be inferred, but this is not always the case.

Studies in China have shown that literate individuals know and use between 3,000 and 4,000 characters. Specialists in classical literature or history, who would often encounter characters no longer in use, are estimated to have a working vocabulary of between 5,000 and 6,000 characters.[6]

In China, which uses simplified Chinese characters, the Xiàndài Hànyǔ Chángyòng Zìbiǎo (现代汉语常用字表, Chart of Common Characters of Modern Chinese) lists 2,500 common characters and 1,000 less-than-common characters, while the Xiàndài Hànyǔ Tōngyòng Zìbiǎo (现代汉语通用字表, Chart of Generally Utilized Characters of Modern Chinese) lists 7,000 characters, including the 3,500 characters already listed above. GB2312, an early version of the national encoding standard used in the People's Republic of China, has 6,763 code points. GB18030, the modern, mandatory standard, has a much higher number. The New Hànyǔ Shuǐpíng Kǎoshì (汉语水平考试, Chinese Proficiency Test) covers approximately 2,600 characters at its highest level (level six).[105]

In the Republic of China (Taiwan), which uses traditional Chinese characters, the Ministry of Education's Chángyòng Guózì Biāozhǔn Zìtǐ Biǎo (常用國字標準字體表, Chart of Standard Forms of Common National Characters) lists 4,808 characters; the Cì Chángyòng Guózì Biāozhǔn Zìtǐ Biǎo (次常用國字標準字體表, Chart of Standard Forms of Less-Than-Common National Characters) lists another 6,341 characters. The Chinese Standard Interchange Code (CNS11643)—the official national encoding standard—supports 48,027 characters, while the most widely used encoding scheme, BIG-5, supports only 13,053.

In Hong Kong, which uses traditional Chinese characters, the Education and Manpower Bureau's Soengjung Zi Zijing Biu (常用字字形表), intended for use in elementary and junior secondary education, lists a total of 4,759 characters.

In addition, there are a number of dialect characters (方言字) that are not used in formal written Chinese but represent colloquial terms in non-Mandarin varieties of Chinese. One such variety is Written Cantonese, in widespread use in Hong Kong even for certain formal documents, due to the former British colonial administration's recognition of Cantonese for use for official purposes. In Taiwan, there is also a body of characters used to represent Taiwanese Hokkien. Many varieties have specific characters for words exclusive to them, for example, the vernacular character 㓾, pronounced cii11 in Hakka, means "to kill".[106] Furthermore, Shanghainese and Sichuanese also have their own series of characters, but these are not widely used in actual texts, Mandarin being the preference for all mainland regions.

In Japanese there are 2,136 jōyō kanji (常用漢字, lit. "frequently used kanji") designated by the Japanese Ministry of Education; these are taught during primary and secondary school. The list is a recommendation, not a restriction, and many characters missing from it are still in common use.[107]

One area where character usage is officially restricted is in names, which may contain only government-approved characters, since the jōyō kanji list excludes many characters that have been used in personal and place names for generations, an additional list, referred to as the jinmeiyō kanji (人名用漢字, lit. "kanji for use in personal names"), is published.[108] It currently contains 983 characters.[citation needed]

Today, a well-educated Japanese person may know upwards of 3,500 kanji.[citation needed] The kanji kentei (日本漢字能力検定試験, Nihon Kanji Nōryoku Kentei Shiken or Test of Japanese Kanji Aptitude) tests a speaker's ability to read and write kanji. The highest level of the kanji kentei tests on approximately 6,000 kanji,[109][110] though in practice few people attain (or need to attain) this level.[citation needed]

New characters can in principle be coined at any time, just as new words can be, but they may not be adopted. Significant historically recent coinages date to scientific terms of the 19th century. Specifically, Chinese coined new characters for chemical elements – see chemical elements in East Asian languages – which continue to be used and taught in schools in China and Taiwan. In Japan, in the Meiji era (specifically, late 19th century), new characters were coined for some (but not all) SI units, such as 粁 (米 "meter" + 千 "thousand, kilo-") for kilometer, these kokuji (Japanese-coinages) have found use in China as well – see Chinese characters for SI units for details.

While new characters can be easily coined by writing on paper, they are difficult to represent on a computer – they must generally be represented as a picture, rather than as text – which presents a significant barrier to their use or widespread adoption. Compare this with the use of symbols as names in 20th century musical albums such as Led Zeppelin IV (1971) and Love Symbol Album (1993); an album cover may potentially contain any graphics, but in writing and other computation these symbols are difficult to use.

Dozens of indexing schemes have been created for arranging Chinese characters in Chinese dictionaries, the great majority of these schemes have appeared in only a single dictionary; only one such system has achieved truly widespread use. This is the system of radicals.

Chinese character dictionaries often allow users to locate entries in several ways. Many Chinese, Japanese, and Korean dictionaries of Chinese characters list characters in radical order: characters are grouped together by radical, and radicals containing fewer strokes come before radicals containing more strokes (radical-and-stroke sorting). Under each radical, characters are listed by their total number of strokes, it is often also possible to search for characters by sound, using pinyin (in Chinese dictionaries), zhuyin (in Taiwanese dictionaries), kana (in Japanese dictionaries) or hangul (in Korean dictionaries). Most dictionaries also allow searches by total number of strokes, and individual dictionaries often allow other search methods as well.

For instance, to look up the character where the sound is not known, e.g., 松 (pine tree), the user first determines which part of the character is the radical (here 木), then counts the number of strokes in the radical (four), and turns to the radical index (usually located on the inside front or back cover of the dictionary). Under the number "4" for radical stroke count, the user locates 木, then turns to the page number listed, which is the start of the listing of all the characters containing this radical, this page will have a sub-index giving remainder stroke numbers (for the non-radical portions of characters) and page numbers. The right half of the character also contains four strokes, so the user locates the number 4, and turns to the page number given, from there, the user must scan the entries to locate the character he or she is seeking. Some dictionaries have a sub-index which lists every character containing each radical, and if the user knows the number of strokes in the non-radical portion of the character, he or she can locate the correct page directly.

Another dictionary system is the four corner method, where characters are classified according to the shape of each of the four corners.

Most modern Chinese dictionaries and Chinese dictionaries sold to English speakers use the traditional radical-based character index in a section at the front, while the main body of the dictionary arranges the main character entries alphabetically according to their pinyin spelling. To find a character with unknown sound using one of these dictionaries, the reader finds the radical and stroke number of the character, as before, and locates the character in the radical index, the character's entry will have the character's pronunciation in pinyin written down; the reader then turns to the main dictionary section and looks up the pinyin spelling alphabetically.

^The character for saw, ⿰書史, is supposed to be one character, with a 書 radical on the left, and 史 radical on the right. Similarly, ndip (⿰立生) is one character, made up of 立 and 生 radicals, as of present, there are limitations in displaying Zhuang logograms in Unicode, as they are unsupported.

Simplified Chinese
–
Simplified Chinese characters are standardized Chinese characters prescribed in the Table of General Standard Chinese Characters for use in mainland China. Along with traditional Chinese characters, it is one of the two character sets of the contemporary Chinese written language. The government of the Peoples Republic of China in mainland China has

Traditional Chinese
–
Traditional Chinese characters are Chinese characters in any character set that does not contain newly created characters or character substitutions performed after 1946. They are most commonly the characters in the character sets of Taiwan, of Hong Kong. Currently, a number of overseas Chinese online newspapers allow users to switch between both s

4.
A Series of Reading workbook in Traditional Chinese used in some Elementary schools in the Philippines.

Pinyin
–
Pinyin, or Hànyǔ Pīnyīn, is the official romanization system for Standard Chinese in mainland China, Malaysia, Singapore, and Taiwan. It is often used to teach Standard Chinese, which is written using Chinese characters. The system includes four diacritics denoting tones, Pinyin without tone marks is used to spell Chinese names and words in languag

1.
A school slogan asking elementary students to speak Putonghua is annotated with pinyin, but without tonal marks.

2.
In Yiling, Yichang, Hubei, text on road signs appears both in Chinese characters and in Hanyu Pinyin

Cilix glaucata
–
The Chinese Character is a moth of the family Drepanidae. It is found in Europe, Asia Minor and North Africa, the moth flies from April to August depending on the location. Occasionally there are silvery scales. The wingtips are rounded and not curved, if they are sitting in their resting position, they imitate bird droppings. The antennae are slig

1.
Cilix glaucata

2.
wing pattern

Logographic
–
In written language, a logogram or logograph is a written character that represents a word or phrase. Chinese characters and Japanese kanji are logograms, some Egyptian hieroglyphs, the use of logograms in writing is called logography. A writing system that is based on logograms is called a logographic system, in alphabets and syllabaries, individu

1.
Egyptian hieroglyphs, which have their origins as logograms.

Chinese language
–
Chinese is a group of related, but in many cases mutually unintelligible, language varieties, forming a branch of the Sino-Tibetan language family. Chinese is spoken by the Han majority and many ethnic groups in China. Nearly 1.2 billion people speak some form of Chinese as their first language, the varieties of Chinese are usually described by nat

Japanese language
–
Japanese is an East Asian language spoken by about 125 million speakers, primarily in Japan, where it is the national language. It is a member of the Japonic language family, whose relation to language groups, particularly to Korean. Little is known of the prehistory, or when it first appeared in Japan. Chinese documents from the 3rd century record

Korean language
–
It is also one of the two official languages in the Yanbian Korean Autonomous Prefecture and Changbai Korean Autonomous County of the Peoples Republic of China. Approximately 80 million people worldwide speak Korean and this implies that Korean is not an isolate, but a member of a small family. There is still debate on whether Korean and Japanese a

Okinawan language
–
Central Okinawan distinguishes itself from the speech of Northern Okinawa, which is classified independently as the Kunigami language. Both languages have been designated as endangered by the UNESCO Atlas of the Worlds Languages in Danger since its launch in February 2009. Within Japan, Okinawan is often not seen as a language itself but is referre

1.
The Tamaoton no Hinomon (玉陵の碑文), referred to as the Tamaudun no Hinomon in modern Japanese, is the oldest known inscription of Okinawan using both hiragana and kanji.

2.
(South–Central) Okinawan, AKA Shuri–Naha

Vietnamese language
–
Vietnamese /ˌviɛtnəˈmiːz/ is an Austroasiatic language that originated in the north of modern-day Vietnam, where it is the national and official language. It is the language of the Vietnamese people, as well as a first or second language for the many ethnic minorities of Vietnam. As the result of Vietnamese emigration and cultural influence, Vietna

4.
A sign at the Hỏa Lò Prison museum in Hanoi lists rules for visitors in both Vietnamese and English.

Bronze Age China
–
The Bronze Age is a historical period characterized by the use of bronze, proto-writing, and other early features of urban civilization. The Bronze Age is the principal period of the three-age Stone-Bronze-Iron system, as proposed in modern times by Christian Jürgensen Thomsen. An ancient civilization is defined to be in the Bronze Age either by sm

Oracle Bone Script
–
The vast majority were found at the Yinxu site. They record pyromantic divinations of the last nine kings of the Shang dynasty, beginning with Wu Ding, whose accession is dated by different scholars at 1250 BCE or 1200 BCE. After the Shang were overthrown by the Zhou dynasty in c.1046 BCE, divining with milfoil became more common and it is also the

1.
Oracle bone script

2.
Wang Yirong, Chinese politician and scholar, was the first to recognize the oracle bone inscriptions as ancient writing.

3.
An oracle bone (which is incomplete) with a diviner asking the Shang king if there would be misfortune over the next ten days

4.
Oracle script from a divining

International Phonetic Alphabet
–
The International Phonetic Alphabet is an alphabetic system of phonetic notation based primarily on the Latin alphabet. It was devised by the International Phonetic Association as a representation of the sounds of spoken language. The IPA is used by lexicographers, foreign students and teachers, linguists, speech-language pathologists, singers, act

Mojibake
–
Mojibake, from the Japanese 文字 character + 化け transform, is the garbled text that is the result of text being decoded using an unintended character encoding. The result is a replacement of symbols with completely unrelated ones. This display may include the replacement character � in places where the binary representation is considered invalid. A r

1.
Mojibake caused by a song title in Cyrillic (Моя Страна) on a car audio system

2.
The UTF-8 -encoded Japanese Wikipedia article for mojibake, as displayed in the Windows-1252 encoding.

Unicode
–
Unicode is a computing industry standard for the consistent encoding, representation, and handling of text expressed in most of the worlds writing systems. As of June 2016, the most recent version is Unicode 9.0, the standard is maintained by the Unicode Consortium. Unicodes success at unifying character sets has led to its widespread, the standard

Traditional Chinese characters
–
Traditional Chinese characters are Chinese characters in any character set that does not contain newly created characters or character substitutions performed after 1946. They are most commonly the characters in the character sets of Taiwan, of Hong Kong. Currently, a number of overseas Chinese online newspapers allow users to switch between both s

3.
A Series of Reading workbook in Traditional Chinese used in some Elementary schools in the Philippines.

Simplified Chinese characters
–
Simplified Chinese characters are standardized Chinese characters prescribed in the Table of General Standard Chinese Characters for use in mainland China. Along with traditional Chinese characters, it is one of the two character sets of the contemporary Chinese written language. The government of the Peoples Republic of China in mainland China has

3.
The first batch of Simplified Characters introduced in 1935 consisted of 324 characters.

Han Chinese
–
The Han Chinese, Han people or simply Han are an ethnic group native to East Asia. They constitute approximately 92% of the population of China, 95% of Taiwan, 76% of Singapore, 23% of Malaysia, Han Chinese are the worlds largest ethnic group with over 1.3 billion people. Similarly, the Chinese language also came to be named the Han language ever s

Standard Chinese
–
Its pronunciation is based on the Beijing dialect, its vocabulary on the Mandarin dialects, and its grammar is based on written vernacular Chinese. Like other varieties of Chinese, Standard Chinese is a language with topic-prominent organization. It has more initial consonants but fewer vowels, final consonants, Standard Chinese is an analytic lang

1.
A poster outside of high school in Yangzhou urges people to speak Putonghua

2.
Zhongguo Guanhua (中国官话/中國官話), or Medii Regni Communis Loquela ("Middle Kingdom's Common Speech"), used on the frontispiece of an early Chinese grammar published by Étienne Fourmont (with Arcadio Huang) in 1742

Hanyu Pinyin
–
Pinyin, or Hànyǔ Pīnyīn, is the official romanization system for Standard Chinese in mainland China, Malaysia, Singapore, and Taiwan. It is often used to teach Standard Chinese, which is written using Chinese characters. The system includes four diacritics denoting tones, Pinyin without tone marks is used to spell Chinese names and words in languag

1.
A school slogan asking elementary students to speak Putonghua is annotated with pinyin, but without tonal marks.

2.
In Yiling, Yichang, Hubei, text on road signs appears both in Chinese characters and in Hanyu Pinyin

Bopomofo
–
Zhuyin fuhao, Zhuyin or Bopomofo is a system of phonetic notation for the transcription of spoken Chinese, particularly the Mandarin dialect. The first two are traditional terms, whereas Bopomofo is the term, also used by the ISO. Consisting of 37 characters and four marks, it transcribes all possible sounds in Mandarin. Zhuyin was introduced in Ch

Wu Chinese
–
Wu is a group of linguistically similar and historically related varieties of Chinese primarily spoken in the whole city of Shanghai, Zhejiang province, southern Jiangsu province and bordering areas. Major Wu varieties include those of Shanghai, Suzhou, Ningbo, Wuxi, Wenzhou/Oujiang, Hangzhou, Shaoxing, Jinhua, Wu speakers, such as Chiang Kai-shek,

Shanghainese
–
The Shanghainese language, also known as the Shanghai dialect, Hu language or Hu dialect, is a variety of Wu Chinese spoken in the central districts of the City of Shanghai and its surrounding areas. It is classified as part of the Sino-Tibetan language family, in English, Shanghainese sometimes refers to all Wu languages, variants and dialects, al

Gan Chinese
–
Gan is a member of the Sinitic languages of the Sino-Tibetan language family, and Hakka is the closest Chinese variety to Gan in terms of phonetics. Different dialects of Gan exist, the Nanchang dialect is taken as representative. Like all other varieties of Chinese, there is an amount of mutual unintelligibility between Gan Chinese and other varie

Hakka Chinese
–
Hakka is not mutually intelligible with Yue, Wu, Southern Min, Mandarin or other branches of Chinese, and itself contains a few mutually unintellegible varieties. It is most closely related to Gan and is classified as a variety of Gan. There is also a possibility that the similarities are just a result of shared areal features, Taiwan, where Hakka

Cantonese
–
Cantonese, or Standard Cantonese, is a variety of Chinese spoken in the city of Guangzhou in southeastern China. It is the prestige variety of Yue, one of the major subdivisions of Chinese. In mainland China, it is the lingua franca of the province of Guangdong and some neighbouring areas such as Guangxi. In Hong Kong and Macau, Cantonese serves as

1.
Street in Chinatown, San Francisco. Cantonese has traditionally been the dominant Chinese variant among Chinese populations in the Western world.

3.
Chinese dictionary from Tang dynasty. Modern Cantonese pronunciation is more similar to Middle Chinese from this era than other Chinese varieties.

Jyutping
–
Jyutping is a romanisation system for Cantonese developed by the Linguistic Society of Hong Kong, an academic group, in 1993. Its formal name is The Linguistic Society of Hong Kong Cantonese Romanisation Scheme, the LSHK promotes the use of this romanisation system. The name Jyutping is a contraction consisting of the first Chinese characters of th

1.
Jyutping Romanization.

Southern Min
–
Southern Min, or Minnan, is a branch of Min Chinese spoken in certain parts of China including southern Fujian, eastern Guangdong, Hainan, and southern Zhejiang, and in Taiwan. The Min Nan dialects are spoken by descendants of emigrants from these areas in diaspora, most notably the Philippines, Indonesia, Malaysia. In common parlance, Southern Min

1.
Koa-a books, Min Nan written in Chinese characters

2.
Distribution of Southern Min.

Hokkien
–
Hokkien /hɒˈkiɛn/ is a group of Southern Min dialects spoken throughout Southeastern China, Taiwan, Southeast Asia and by other overseas Chinese. Hokkien originated in southern Fujian, the Min-speaking province and it is closely related to Teochew, though there is limited mutual intelligibility, and is somewhat more distantly related to Hainanese a

1.
Distribution of Min Nan dialects. Hokkien is dark green.

Teochew dialect
–
Teochew is a variety of Southern Min spoken mainly by the Teochew people in the Chaoshan region of eastern Guangdong and by their respective diaspora around the world. It is sometimes referred to as Chiuchow, its Cantonese name, Teochew preserves many Old Chinese pronunciations and vocabulary that have been lost in some of the other modern varietie

Eastern Min
–
Eastern Min, or Min Dong, is a branch of the Min group of varieties of Chinese. The prestige form and most-cited representative form is the Fuzhou dialect, Eastern Min varieties are mainly spoken in the eastern part of Fujian Province in Peoples Republic of China, in and near the cities of Fuzhou and Ningde. They are also encountered as the mother

1.
Min Dong (violet)

Fuzhou dialect
–
The Fuzhou dialect, also Fuzhounese, Foochow or Hok-chiu, is the prestige variety of the Eastern Min branch of Min Chinese spoken mainly in eastern Fujian province. Like many other varieties of Chinese, the Fuzhou dialect is dominated by monosyllabic morphemes which carry lexical tones, while the Eastern Min branch that it belongs to is closer to S

Foochow Romanized
–
Foochow Romanized, also known as Bàng-uâ-cê or Hók-ciŭ-uâ Lò̤-mā-cê, is a Latin alphabet for the Fuzhou dialect of Eastern Min adopted in the middle of 19th century by Western missionaries. It had varied at different times, and became standardized in the 1890s, Foochow Romanized was mainly used inside of Church circles, and was taught in some Missi

1.
An English-Chinese Dictionary of the Foochow Dialect, 2nd Edition, published in 1905

2.
Bible in Foochow Romanized (Exodus), published by British and Foreign Bible Society in 1908

3.
Dictionary of the Foochow dialect, 3rd Edition, published in 1929

Middle Chinese
–
The fanqie method used to indicate pronunciation in these dictionaries, though an improvement on earlier methods, proved awkward in practice. The mid 12th-century Yunjing and other rime tables incorporate a more sophisticated, the rime tables attest to a number of sound changes that had occurred over the centuries following the publication of the Q

Zhuang language
–
The Zhuang languages are any of more than a dozen Tai languages spoken by the Zhuang people of southern China in the province of Guangxi and adjacent parts of Yunnan and Guangdong. The Zhuang languages do not form a linguistic unit, as northern and southern Zhuang languages are more closely related to other Tai languages than to each other. Standar

Hangul
–
The Korean alphabet, known as Hangul in South Korea and as Chosŏngŭl/Chosŏn Muntcha in North Korea is the alphabet that has been used to write the Korean language since the 15th century. It was created during the Joseon Dynasty in 1443 by King Sejong the Great, in South Korea, Hangul is used primarily to write the Korean language as using Hanja in

Hanja
–
Hanja is the Korean name for Chinese characters. Borrowed from Chinese and incorporated into the Korean language with Korean pronunciation, hanja-mal or hanja-eo refers to words that can be written with hanja, and hanmun refers to Classical Chinese writing, although hanja is sometimes used loosely to encompass these other concepts. Because hanja ne

Revised Romanization of Korean
–
The Revised Romanization of Korean is the official Korean language romanization system in South Korea proclaimed by Ministry of Culture and Tourism to replace the older McCune–Reischauer system. The new system eliminates diacritics in favor of digraphs and adheres closely to Korean phonology than to a suggestive rendition of Korean phonetics for no

Kanji
–
Kanji, or kanji, are the adopted logographic Chinese characters that are used in the modern Japanese writing system along with hiragana and katakana. The Japanese term kanji for the Chinese characters literally means Han characters and is using the same characters as the Chinese word hànzì. Chinese characters first came to Japan on official seals,

Hiragana
–
Hiragana is a Japanese syllabary, one component of the Japanese writing system, along with katakana, kanji, and in some cases rōmaji. It is a lettering system. The word hiragana literally means ordinary or simple kana, Hiragana and katakana are both kana systems. With one or two exceptions, each sound in the Japanese language is represented by one

1.
Hiragana characters' shapes were derived from the Chinese cursive script (sōsho). Shown here is a sample of the cursive script by Chinese Tang Dynasty calligrapher Sun Guoting, from the late 7th century.

Hepburn romanization
–
The system was originally proposed by the Romanization Club in 1885. The revised edition by Romaji-Hirome-kai in 1908 is called standard style romanization, Hepburn is based on English phonology and has competed with the alternative Nihon-shiki romanization, which was developed in Japan as a replacement of Japanese script. In 1930, a Special Romani

Kunrei-shiki romanization
–
Kunrei-shiki rōmaji is a Cabinet-ordered, Japanese romanization system, i. e. a system for transcribing the Japanese language into the Latin alphabet. Its name is rendered Kunreisiki using Kunrei-shiki itself, Kunrei-shiki is sometimes known as the Monbushō system in English, because it is taught in the Monbushō-approved elementary school curriculu

Chinese script styles
–
In Chinese calligraphy, Chinese characters can be written according to five major styles. These styles are intrinsically linked to the history of Chinese script, the Japanese syllabaries of katakana and hiragana are used in calligraphy, the katakana were derived from the shapes of regular script characters and hiragana from those of cursive script.

Neolithic signs in China
–
However, only isolated instances of these symbols have been found, and they show no indication of representing speech or of the non-pictorial processes that a writing system requires. These sites include those pertaining to the cultures of Yangshao, Liangzhu, Majiayao, proponents of the view that they are early Chinese writing tend to see evidence

Oracle bone script
–
The vast majority were found at the Yinxu site. They record pyromantic divinations of the last nine kings of the Shang dynasty, beginning with Wu Ding, whose accession is dated by different scholars at 1250 BCE or 1200 BCE. After the Shang were overthrown by the Zhou dynasty in c.1046 BCE, divining with milfoil became more common and it is also the

1.
Oracle bone script

2.
Wang Yirong, Chinese politician and scholar, was the first to recognize the oracle bone inscriptions as ancient writing.

3.
An oracle bone (which is incomplete) with a diviner asking the Shang king if there would be misfortune over the next ten days

4.
Oracle script from a divining

Chinese bronze inscriptions
–
Early bronze inscriptions were almost always cast, while later inscriptions were often engraved after the bronze was cast. The bronze inscriptions are one of the earliest scripts in the Chinese family of scripts, for the early Western Zhou to early Warring States period, the bulk of writing which has been unearthed has been in the form of bronze in

4.
Bronze biānzhōng bells from the early Warring States Tomb of Marquis Yĭ of Zēng

Seal script
–
Seal script is an ancient style of writing Chinese characters that was common throughout the latter half of the 1st millennium BC. It evolved organically out of the Zhou dynasty script, arising in the Warring State of Qin, there are two uses of the word seal script, the Large or Great Seal script and the lesser or Small Seal Script, the latter is a

Bird-worm seal script
–
Bird-worm seal script is a type of ancient seal script originated in China. The Chinese character 鸟 means bird, the Chinese character 虫 means any creature that looks like a worm, including invertebrate worms and reptiles such as snakes and lizards. The character 篆 means seal script, other names for this kind of seal script, Niao-Chong Script. The C

1.
Bird seal script on the Sword of Goujian and its equivalents in modern Chinese

Large Seal Script
–
The term is in contrast to the name of the official script of the Qin dynasty, which is often called Small or Lesser Seal Script. However, due to the lack of precision in the term, scholars often avoid it and instead refer more specifically to the provenance of particular examples of writing. At that time, there was knowledge of even older, often m

Small Seal Script
–
Small Seal Script, formerly romanized as Hsiao-chuan and also known as Seal Script, Lesser Seal Script and Qin Script, is an archaic form of Chinese calligraphy. It was standardized and promulgated as a standard by Li Si, prime minister under Shi Huangdi. Hence coaches, roads, currency, laws, weights, measures, characters which were different from

1.
A passage from the Thousand Character Classic in sans-serif typeface. The rightmost line is the original Chinese. The middle and the left lines are transliterations in Japanese kana and Korean Hangul, respectively.

2.
In the Chinese character 媽 mā "mother", the left part is the radical 女 nǚ "female". In this case the radical is the semantic component of a phono-semantic compound, while the right part, 馬 mǎ "horse", is the phonetic component.

2.
From right to left: Kangxi Dictionary forms, Mainland China standard, Hong Kong standard, Taiwan standard, Japanese standard. Areas in the rightmost column where there are significant differences among different standards are highlighted in yellow. (玄 is not written completely in the Kangxi Dictionary because 玄 is a character in the Kangxi Emperor's given name, 玄燁. It was taboo to write a character in the emperor's given name.)

1.
The writing on this wall is meant to say "production output will increase multiple times" (产量翻几番), but uses non-standard characters. During the Cultural Revolution, such sights were common as citizens were encouraged to innovate and participate in the character simplification process.

3.
Jikji, the first known book printed with movable metal type in 1377. Bibliothèque Nationale de Paris

LIST OF IMAGES

1.
Simplified Chinese
–
Simplified Chinese characters are standardized Chinese characters prescribed in the Table of General Standard Chinese Characters for use in mainland China. Along with traditional Chinese characters, it is one of the two character sets of the contemporary Chinese written language. The government of the Peoples Republic of China in mainland China has promoted them for use in printing since the 1950s and 1960s in an attempt to increase literacy and they are officially used in the Peoples Republic of China and Singapore. Traditional Chinese characters are used in Hong Kong, Macau. Overseas Chinese communities generally tend to use traditional characters, Simplified Chinese characters may be referred to by their official name above or colloquially. Strictly, the latter refers to simplifications of character structure or body, character forms that have existed for thousands of years alongside regular, Simplified character forms were created by decreasing the number of strokes and simplifying the forms of a sizable proportion of traditional Chinese characters. Some simplifications were based on popular cursive forms embodying graphic or phonetic simplifications of the traditional forms, some characters were simplified by applying regular rules, for example, by replacing all occurrences of a certain component with a simplified version of the component. Variant characters with the pronunciation and identical meaning were reduced to a single standardized character. Finally, many characters were left untouched by simplification, and are identical between the traditional and simplified Chinese orthographies. Some simplified characters are very dissimilar to and unpredictably different from traditional characters and this often leads opponents not well-versed in the method of simplification to conclude that the overall process of character simplification is also arbitrary. In reality, the methods and rules of simplification are few, on the other hand, proponents of simplification often flaunt a few choice simplified characters as ingenious inventions, when in fact these have existed for hundreds of years as ancient variants. However, the Chinese government never officially dropped its goal of further simplification in the future, in August 2009, the PRC began collecting public comments for a modified list of simplified characters. The new Table of General Standard Chinese Characters consisting of 8,105 characters was promulgated by the State Council of the Peoples Republic of China on June 5,2013, cursive written text almost always includes character simplification. Simplified forms used in print have always existed, they date back to as early as the Qin dynasty, One of the earliest proponents of character simplification was Lubi Kui, who proposed in 1909 that simplified characters should be used in education. In the years following the May Fourth Movement in 1919, many anti-imperialist Chinese intellectuals sought ways to modernise China, Traditional culture and values such as Confucianism were challenged. Soon, people in the Movement started to cite the traditional Chinese writing system as an obstacle in modernising China and it was suggested that the Chinese writing system should be either simplified or completely abolished. Fu Sinian, a leader of the May Fourth Movement, called Chinese characters the writing of ox-demons, lu Xun, a renowned Chinese author in the 20th century, stated that, If Chinese characters are not destroyed, then China will die. Recent commentators have claimed that Chinese characters were blamed for the problems in China during that time

2.
Traditional Chinese
–
Traditional Chinese characters are Chinese characters in any character set that does not contain newly created characters or character substitutions performed after 1946. They are most commonly the characters in the character sets of Taiwan, of Hong Kong. Currently, a number of overseas Chinese online newspapers allow users to switch between both sets. In contrast, simplified Chinese characters are used in mainland China, Singapore, the debate on traditional and simplified Chinese characters has been a long-running issue among Chinese communities. Although simplified characters are taught and endorsed by the government of Mainland China, Traditional characters are used informally in regions in China primarily in handwriting and also used for inscriptions and religious text. They are often retained in logos or graphics to evoke yesteryear, nonetheless, the vast majority of media and communications in China is dominated by simplified characters. Taiwan has never adopted Simplified Chinese characters since it is ruled by the Republic of China, the use of simplified characters in official documents is even prohibited by the government in Taiwan. Simplified characters are not well understood in general, although some stroke simplifications that have incorporated into Simplified Chinese are in common use in handwriting. For example, while the name of Taiwan is written as 臺灣, similarly, in Hong Kong and Macau, Traditional Chinese has been the legal written form since colonial times. In recent years, because of the influx of mainland Chinese tourists, today, even government websites use simplified Chinese, as they answer to the Beijing government. This has led to concerns by residents to protect their local heritage. In Southeast Asia, the Chinese Filipino community continues to be one of the most conservative regarding simplification, while major public universities are teaching simplified characters, many well-established Chinese schools still use traditional characters. Publications like the Chinese Commercial News, World News, and United Daily News still use traditional characters, on the other hand, the Philippine Chinese Daily uses simplified. Aside from local newspapers, magazines from Hong Kong, such as the Yazhou Zhoukan, are found in some bookstores. In case of film or television subtitles on DVD, the Chinese dub that is used in Philippines is the same as the one used in Taiwan and this is because the DVDs belongs to DVD Region Code 3. Hence, most of the subtitles are in Traditional Characters, overseas Chinese in the United States have long used traditional characters. A major influx of Chinese immigrants to the United States occurred during the half of the 19th century. Therefore, the majority of Chinese language signage in the United States, including street signs, Traditional Chinese characters are called several different names within the Chinese-speaking world

3.
Pinyin
–
Pinyin, or Hànyǔ Pīnyīn, is the official romanization system for Standard Chinese in mainland China, Malaysia, Singapore, and Taiwan. It is often used to teach Standard Chinese, which is written using Chinese characters. The system includes four diacritics denoting tones, Pinyin without tone marks is used to spell Chinese names and words in languages written with the Latin alphabet, and also in certain computer input methods to enter Chinese characters. The pinyin system was developed in the 1950s by many linguists, including Zhou Youguang and it was published by the Chinese government in 1958 and revised several times. The International Organization for Standardization adopted pinyin as a standard in 1982. The system was adopted as the standard in Taiwan in 2009. The word Hànyǔ means the language of the Han people. In 1605, the Jesuit missionary Matteo Ricci published Xizi Qiji in Beijing and this was the first book to use the Roman alphabet to write the Chinese language. Twenty years later, another Jesuit in China, Nicolas Trigault, neither book had much immediate impact on the way in which Chinese thought about their writing system, and the romanizations they described were intended more for Westerners than for the Chinese. One of the earliest Chinese thinkers to relate Western alphabets to Chinese was late Ming to early Qing Dynasty scholar-official, the first late Qing reformer to propose that China adopt a system of spelling was Song Shu. A student of the great scholars Yu Yue and Zhang Taiyan, Song had been to Japan and observed the effect of the kana syllabaries. This galvanized him into activity on a number of fronts, one of the most important being reform of the script, while Song did not himself actually create a system for spelling Sinitic languages, his discussion proved fertile and led to a proliferation of schemes for phonetic scripts. The Wade–Giles system was produced by Thomas Wade in 1859, and it was popular and used in English-language publications outside China until 1979. This Sin Wenz or New Writing was much more sophisticated than earlier alphabets. In 1940, several members attended a Border Region Sin Wenz Society convention. Mao Zedong and Zhu De, head of the army, both contributed their calligraphy for the masthead of the Sin Wenz Societys new journal. Outside the CCP, other prominent supporters included Sun Yat-sens son, Sun Fo, Cai Yuanpei, the countrys most prestigious educator, Tao Xingzhi, an educational reformer. Over thirty journals soon appeared written in Sin Wenz, plus large numbers of translations, biographies, some contemporary Chinese literature, and a spectrum of textbooks

Pinyin
–
A school slogan asking elementary students to speak Putonghua is annotated with pinyin, but without tonal marks.
Pinyin
–
In Yiling, Yichang, Hubei, text on road signs appears both in Chinese characters and in Hanyu Pinyin

4.
Cilix glaucata
–
The Chinese Character is a moth of the family Drepanidae. It is found in Europe, Asia Minor and North Africa, the moth flies from April to August depending on the location. Occasionally there are silvery scales. The wingtips are rounded and not curved, if they are sitting in their resting position, they imitate bird droppings. The antennae are slightly combed. The larvae feed on Rubus, Crataegus and Prunus species

Cilix glaucata
–
Cilix glaucata
Cilix glaucata
–
wing pattern

5.
Logographic
–
In written language, a logogram or logograph is a written character that represents a word or phrase. Chinese characters and Japanese kanji are logograms, some Egyptian hieroglyphs, the use of logograms in writing is called logography. A writing system that is based on logograms is called a logographic system, in alphabets and syllabaries, individual written characters represent sounds rather than concepts. Unlike logograms, phonograms do not necessarily have meaning by themselves, Writing language in this way is called phonemic orthography. Logographic systems include the earliest writing systems, the first historical civilizations of the Near East, Africa, China, a more recent attempt is Zlango, intended for use in text messaging, currently including around 300 icons. The term logosyllabary is used to emphasize the partially phonetic nature of these scripts when the domain is the syllable. For example, Egyptian was used to write both sȝ duck and sȝ son, though it is likely that these words were not pronounced the same apart from their consonants and this can be illustrated with Chinese. Not all Chinese characters represent morphemes, some morphemes are composed of more than one character, for example, the Chinese word for spider, 蜘蛛 zhīzhū, was created by fusing the rebus 知朱 zhīzhū with the bug determinative 虫. Neither *蜘 zhī nor *蛛 zhū can be used separately, in Archaic Chinese, one can find the reverse, a single character representing more than one morpheme. An example is Archaic Chinese 王 hjwangs, a combination of a morpheme hjwang meaning king, in modern Mandarin, bimorphemic syllables are always written with two characters, for example 花儿 huār flower. These logograms, called hozwārishn, were dispensed with altogether after the Arab conquest of Persia, logograms are used in modern shorthand to represent common words. In addition, the numerals and mathematical symbols used in systems are logograms—1 one,2 two, + plus, = equals, and so on. In English, the ampersand & is used for and and et, % for percent, # for number, § for section, $ for dollar, € for euro, £ for pound, ° for degree, @ for at, etc. All historical logographic systems include a dimension, as it is impractical to have a separate basic character for every word or morpheme in a language. In some cases, such as cuneiform as it was used for Akkadian, many logographic systems also have a semantic/ideographic component, called determinatives in the case of Egyptian and radicals in the case of Chinese. In the case of Chinese, the vast majority of characters are a combination of a radical that indicates its nominal category. The Mayan system used logograms with phonetic complements like the Egyptian, Chinese scholars have traditionally classified the Chinese characters hanzi into six types by etymology. The first two types are single-body, meaning that the character was created independently of other characters, single-body pictograms and ideograms make up only a small proportion of Chinese logograms

Logographic
–
Egyptian hieroglyphs, which have their origins as logograms.

6.
Chinese language
–
Chinese is a group of related, but in many cases mutually unintelligible, language varieties, forming a branch of the Sino-Tibetan language family. Chinese is spoken by the Han majority and many ethnic groups in China. Nearly 1.2 billion people speak some form of Chinese as their first language, the varieties of Chinese are usually described by native speakers as dialects of a single Chinese language, but linguists note that they are as diverse as a language family. The internal diversity of Chinese has been likened to that of the Romance languages, There are between 7 and 13 main regional groups of Chinese, of which the most spoken by far is Mandarin, followed by Wu, Min, and Yue. Most of these groups are mutually unintelligible, although some, like Xiang and certain Southwest Mandarin dialects, may share common terms, all varieties of Chinese are tonal and analytic. Standard Chinese is a form of spoken Chinese based on the Beijing dialect of Mandarin. It is the language of China and Taiwan, as well as one of four official languages of Singapore. It is one of the six languages of the United Nations. The written form of the language, based on the logograms known as Chinese characters, is shared by literate speakers of otherwise unintelligible dialects. Of the other varieties of Chinese, Cantonese is the spoken language and official in Hong Kong and Macau. It is also influential in Guangdong province and much of Guangxi, dialects of Southern Min, part of the Min group, are widely spoken in southern Fujian, with notable variants also spoken in neighboring Taiwan and in Southeast Asia. Hakka also has a diaspora in Taiwan and southeast Asia. Shanghainese and other Wu varieties are prominent in the lower Yangtze region of eastern China, Chinese can be traced back to a hypothetical Sino-Tibetan proto-language. The first written records appeared over 3,000 years ago during the Shang dynasty, as the language evolved over this period, the various local varieties became mutually unintelligible. In reaction, central governments have sought to promulgate a unified standard. Difficulties have included the great diversity of the languages, the lack of inflection in many of them, in addition, many of the smaller languages are spoken in mountainous areas that are difficult to reach, and are often also sensitive border zones. Without a secure reconstruction of proto-Sino-Tibetan, the structure of the family remains unclear. A top-level branching into Chinese and Tibeto-Burman languages is often assumed, the earliest examples of Chinese are divinatory inscriptions on oracle bones from around 1250 BCE in the late Shang dynasty

7.
Japanese language
–
Japanese is an East Asian language spoken by about 125 million speakers, primarily in Japan, where it is the national language. It is a member of the Japonic language family, whose relation to language groups, particularly to Korean. Little is known of the prehistory, or when it first appeared in Japan. Chinese documents from the 3rd century recorded a few Japanese words, during the Heian period, Chinese had considerable influence on the vocabulary and phonology of Old Japanese. Late Middle Japanese saw changes in features that brought it closer to the modern language, the standard dialect moved from the Kansai region to the Edo region in the Early Modern Japanese period. Following the end in 1853 of Japans self-imposed isolation, the flow of loanwords from European languages increased significantly, English loanwords in particular have become frequent, and Japanese words from English roots have proliferated. Japanese is an agglutinative, mora-timed language with simple phonotactics, a vowel system, phonemic vowel and consonant length. Word order is normally subject–object–verb with particles marking the grammatical function of words, sentence-final particles are used to add emotional or emphatic impact, or make questions. Nouns have no number or gender, and there are no articles. Verbs are conjugated, primarily for tense and voice, but not person, Japanese equivalents of adjectives are also conjugated. Japanese has a system of honorifics with verb forms and vocabulary to indicate the relative status of the speaker, the listener. Japanese has no relationship with Chinese, but it makes extensive use of Chinese characters, or kanji, in its writing system. Along with kanji, the Japanese writing system uses two syllabic scripts, hiragana and katakana. Latin script is used in a fashion, such as for imported acronyms. Very little is known about the Japanese of this period, Old Japanese is the oldest attested stage of the Japanese language. Through the spread of Buddhism, the Chinese writing system was imported to Japan, the earliest texts found in Japan are written in Classical Chinese, but they may have been meant to be read as Japanese by the kanbun method. Some of these Chinese texts show the influences of Japanese grammar, in these hybrid texts, Chinese characters are also occasionally used phonetically to represent Japanese particles. The earliest text, the Kojiki, dates to the early 8th century, the end of Old Japanese coincides with the end of the Nara period in 794

Japanese language
–
A page from Nihon Shoki (The Chronicles of Japan), the second oldest book of classical Japanese history.
Japanese language
–
Map of Japanese dialects and Japonic languages
Japanese language
–
Two pages from a 12th-century emaki scroll of The Tale of Genji from the 11th century.
Japanese language
–
Calligraphy

8.
Korean language
–
It is also one of the two official languages in the Yanbian Korean Autonomous Prefecture and Changbai Korean Autonomous County of the Peoples Republic of China. Approximately 80 million people worldwide speak Korean and this implies that Korean is not an isolate, but a member of a small family. There is still debate on whether Korean and Japanese are related with each other, the Korean language is agglutinative in its morphology and SOV in its syntax. A relation of Korean with Japonic languages has been proposed by linguists like William George Aston, Chinese characters arrived in Korea together with Buddhism during the pre-Three Kingdoms period. Mainly privileged elites were educated to read and write in hanja, however, today, the hanja are largely unused in everyday life, but in South Korea they experience revivals on artistic works and are important in historic and/or linguistic studies of Korean. Since the Korean War, through 70 years of separation, North–South differences have developed in standard Korean, including variations in pronunciation, verb inflection, the Korean names for the language are based on the names for Korea used in North Korea and South Korea. In South Korea, the Korean language is referred to by names including hanguk-eo Korean language, hanguk-mal, Korean speech and uri-mal. In hanguk-eo and hanguk-mal, the first part of the word, hanguk, refers to the Korean nation while -eo and -mal mean language and speech, Korean is also simply referred to as guk-eo, literally national language. This name is based on the same Chinese characters meaning nation + language that are used in Taiwan and Japan to refer to their respective national languages. In North Korea and China, the language is most often called Chosŏn-mal, or more formally, the English word Korean is derived from Goryeo, which is thought to be the first dynasty known to Western countries. Korean people in the former USSR refer to themselves as Koryo-saram and Goryeo In, the majority of historical and modern linguists classify Korean as a language isolate. Such factors of typological divergence as Middle Mongolians exhibition of gender agreement can be used to argue that a relationship with Altaic is unlikely. Sergei Anatolyevich Starostin found about 25% of potential cognates in the Japanese–Korean 100-word Swadesh list, a good example might be Middle Korean sàm and Japanese asa, meaning hemp. Also, the doublet wo meaning hemp is attested in Western Old Japanese and it is thus plausible to assume a borrowed term. Among ancient languages, various relatives of Korean have been proposed. Some classify the language of Jeju Island as a distinct modern Koreanic language, Other famous theories are the Dravido-Korean languages theory and the mostly unknown southern-theory which suggest an Austronesian relation. Korean is spoken by the Korean people in North Korea and South Korea and by the Korean diaspora in countries including the Peoples Republic of China, the United States, Japan. Korean-speaking minorities exist in these states, but because of cultural assimilation into host countries, Korean is the official language of South Korea and North Korea

9.
Okinawan language
–
Central Okinawan distinguishes itself from the speech of Northern Okinawa, which is classified independently as the Kunigami language. Both languages have been designated as endangered by the UNESCO Atlas of the Worlds Languages in Danger since its launch in February 2009. Within Japan, Okinawan is often not seen as a language itself but is referred to as the Okinawan dialect or more specifically the Central. Okinawan speakers are undergoing language shift as they switch to Japanese, Okinawan is still spoken by many older people. It is also alive in popular music, tourist shows, and in theaters featuring a local drama called uchinaa shibai. Pre-Ryukyu Kingdom Okinawan is a Japonic language, derived from Old Japanese, the split between Old Japanese and the Ryukyuan languages has been estimated to have occurred as early as the first century AD to as late as the twelfth century AD. Chinese and Japanese characters were first introduced by a Japanese missionary in 1265, Ryukyu Kingdom Era Pre-Satsuma Hiragana was much more popular than kanji, poems were commonly written solely in hiragana or with little kanji. Post-Satsuma to Annexation After Ryukyu became a vassal of Satsuma Domain, kanji gained more prominence in poetry, Japanese Annexation to End of World War II When Ryukyu was annexed by Japan in 1879, the majority of people on Okinawa Island spoke Okinawan. Within ten years, the Japanese government began a policy of Japanization. The education system was the heart of Japanization, where Okinawan children were taught Japanese and punished for speaking their native language, by 1945, many Okinawans spoke Japanese, and many were bilingual. During the Battle of Okinawa, some Okinawans were killed by Japanese soldiers for speaking Okinawan, American Occupation Under American administration, there was an attempt to revive and standardize Okinawan, however this proved difficult and was shelved in favor of Japanese. There have been attempts to revive Okinawan by notable people such as Byron Fija and Seijin Noborikawa, Okinawan is sometimes grouped with Kunigami as the Okinawan languages, however some linguists dont use this grouping or claim that Kunigami is a dialect of Okinawan. Okinawan is also grouped with Amami as the Northern Ryukyuan languages, dialect of the Japanese language Since the creation of Okinawa Prefecture, Okinawan was labeled a dialect of Japanese as part of a policy of assimilation. Later, Japanese linguists, such as Tōjō Misao, who studied the Ryukyuan languages argued that they are indeed dialects and this is due to the misconception that Japan is a homogeneous state, and classifying the Ryukyuan languages as such would discredit this belief. The policy of assimilation, coupled with increased interaction between Japan and Okinawa through media and economics, has led to the development of Okinawan Japanese, dialect of the Ryukyuan language Okinawan linguist Seizen Nakasone states that the Ryukyuan languages are in fact groupings of similar dialects. As each community has its own dialect, there is no one language. Nakasone attributes this diversity to the isolation caused by immobility, citing the story of his mother who wanted to visit the town of Nago and its own distinct language Outside Japan, Okinawan is considered a separate language from Japanese. This was first proposed by Basil Hall Chamberlain, who compared the relationship between Okinawan and Japanese to that of the Romance languages, UNESCO has marked it as an endangered language

Okinawan language
–
The Tamaoton no Hinomon (玉陵の碑文), referred to as the Tamaudun no Hinomon in modern Japanese, is the oldest known inscription of Okinawan using both hiragana and kanji.
Okinawan language
–
(South–Central) Okinawan, AKA Shuri–Naha

10.
Vietnamese language
–
Vietnamese /ˌviɛtnəˈmiːz/ is an Austroasiatic language that originated in the north of modern-day Vietnam, where it is the national and official language. It is the language of the Vietnamese people, as well as a first or second language for the many ethnic minorities of Vietnam. As the result of Vietnamese emigration and cultural influence, Vietnamese speakers are found throughout the world, notably in East and Southeast Asia, North America, Australia, Vietnamese has also been officially recognized as a minority language in the Czech Republic. It is part of the Austroasiatic language family of which it has by far the most speakers, Vietnamese vocabulary has borrowings from Chinese, and it formerly used a modified set of Chinese characters called chữ nôm given vernacular pronunciation. The Vietnamese alphabet in use today is a Latin alphabet with diacritics for tones. As the national language, Vietnamese is spoken throughout Vietnam by ethnic Vietnamese, Vietnamese is also the native language of the Gin minority group in southern Guangxi Province in China. A significant number of speakers also reside in neighboring Cambodia. In the United States, Vietnamese is the sixth most spoken language, with over 1.5 million speakers and it is the third most spoken language in Texas, fourth in Arkansas and Louisiana, and fifth in California. Vietnamese is the seventh most spoken language in Australia, in France, it is the most spoken Asian language and the eighth most spoken immigrant language at home. Vietnamese is the official and national language of Vietnam. It is the first language of the majority of the Vietnamese population, in the Czech Republic, Vietnamese has been recognized as one of 14 minority languages, on the basis of communities that have either traditionally or on a long-term basis resided in the country. This status grants Czech citizens from the Vietnamese community the right to use Vietnamese with public authorities, Vietnamese is increasingly being taught in schools and institutions outside of Vietnam. Since the 1980s, Vietnamese language schools have been established for youth in many Vietnamese-speaking communities around the world, furthermore, there has also been a number of Germans studying Vietnamese due to increased economic investment in Vietnam. Vietnamese is taught in schools in the form of immersion to a varying degree in Cambodia, Laos. Classes teach students subjects in Vietnamese and another language, furthermore, in Thailand, Vietnamese is one of the most popular foreign languages in schools and colleges. Vietnamese was identified more than 150 years ago as part of the Mon–Khmer branch of the Austroasiatic language family. Later, Muong was found to be closely related to Vietnamese than other Mon–Khmer languages. The term Vietic was proposed by Hayes, who proposed to redefine Viet–Muong as referring to a subbranch of Vietic containing only Vietnamese and Muong

11.
Bronze Age China
–
The Bronze Age is a historical period characterized by the use of bronze, proto-writing, and other early features of urban civilization. The Bronze Age is the principal period of the three-age Stone-Bronze-Iron system, as proposed in modern times by Christian Jürgensen Thomsen. An ancient civilization is defined to be in the Bronze Age either by smelting its own copper and alloying with tin, arsenic, or other metals, or by trading for bronze from production areas elsewhere. Copper-tin ores are rare, as reflected in the fact there were no tin bronzes in Western Asia before trading in bronze began in the third millennium BC. Worldwide, the Bronze Age generally followed the Neolithic period, with the Chalcolithic serving as a transition, although the Iron Age generally followed the Bronze Age, in some areas, the Iron Age intruded directly on the Neolithic. Bronze Age cultures differed in their development of the first writing, according to archaeological evidence, cultures in Mesopotamia and Egypt developed the earliest viable writing systems. The overall period is characterized by use of bronze, though the place and time of the introduction. Human-made tin bronze technology requires set production techniques, tin must be mined and smelted separately, then added to molten copper to make bronze alloy. The Bronze Age was a time of use of metals. The dating of the foil has been disputed, the Bronze Age in the ancient Near East began with the rise of Sumer in the 4th millennium BC. Societies in the region laid the foundations for astronomy and mathematics, the usual tripartite division into an Early, Middle and Late Bronze Age is not used. Instead, a division based on art-historical and historical characteristics is more common. The cities of the Ancient Near East housed several tens of thousands of people, ur in the Middle Bronze Age and Babylon in the Late Bronze Age similarly had large populations. The earliest mention of Babylonia appears on a tablet from the reign of Sargon of Akkad in the 23rd century BC, the Amorite dynasty established the city-state of Babylon in the 19th century BC. Over 100 years later, it took over the other city-states. Babylonia adopted the written Semitic Akkadian language for official use, by that time, the Sumerian language was no longer spoken, but was still in religious use. Elam was an ancient civilization located to the east of Mesopotamia, in the Old Elamite period, Elam consisted of kingdoms on the Iranian plateau, centered in Anshan, and from the mid-2nd millennium BC, it was centered in Susa in the Khuzestan lowlands. Its culture played a role in the Gutian Empire and especially during the Achaemenid dynasty that succeeded it

12.
Oracle Bone Script
–
The vast majority were found at the Yinxu site. They record pyromantic divinations of the last nine kings of the Shang dynasty, beginning with Wu Ding, whose accession is dated by different scholars at 1250 BCE or 1200 BCE. After the Shang were overthrown by the Zhou dynasty in c.1046 BCE, divining with milfoil became more common and it is also the oldest known member and ancestor of the Chinese family of scripts, preceding the bronzeware script. The term oracle bone was first coined by the American missionary Frank H. Chalfant in his 1906 book Early Chinese Writing, because turtle shells as well as bones were used, the oracle bone script is also sometimes called shell and bone script. As the majority of oracle bones bearing writing date to the late Shang dynasty and it is certain that Shang-lineage writing underwent a period of development before the Anyang oracle bone script, because of its mature nature. However, no significant quantity of clearly identifiable writing from before or during the early to middle Shang cultural period has been discovered, the oracle bone script of the late Shang appears pictographic, as does its contemporary, the Shang writing on bronzes. The earliest oracle bone script appears even more so than examples from late in the period, additionally, the writing of characters in vertical columns, from top to bottom, is for the most part carried over from the bamboo books to oracle bone inscriptions. The vertical columns of text in Chinese writing are traditionally ordered from right to left and this level of maturity clearly implies an earlier period of development of at least several hundred years. Compare, for instance, the third and fourth graphs in the row below, without careful research to compare these to later forms, one would probably not know that these represented 豕 shĭ swine and 犬 quǎn dog respectively. By the late Shang oracle bone script, the graphs had already evolved into a variety of mostly non-pictographic functions, phonetic loan graphs, semantic-phonetic compounds, and associative compounds were already common. Although it was a fully functional writing system, the oracle bone script was not fully standardized, a graph when inverted horizontally generally refers to the same word, and additional components are sometimes present without changing the meaning. These irregularities persisted until the standardization of the script in the Qin dynasty. Of the thousands of characters found from all the fragments so far. One reason for this is that components of certain oracle bone script characters may differ in later script forms, such differences may be accounted for by character simplification and/or by later generations misunderstanding the original graph, which had evolved beyond recognition. Another reason is that some characters exist only in oracle bone script, in such cases, context - when available - may be used to determine the possible meaning of the character. One good example is shown in the fragment below, labeled oracle bone script for Spring, the top left character in this image has no known modern Chinese counterpart. One of the better known characters however is shown directly beneath it looking like an isosceles triangle with a line cutting through the upper portion. This is the oracle bone script character for 王 wáng, the numbers of oracle bones with inscriptions contemporaneous with the end of Shang and the beginning of Zhou is relatively few in number compared with the entire corpus of Shang inscriptions

Oracle Bone Script
–
Oracle bone script
Oracle Bone Script
–
Wang Yirong, Chinese politician and scholar, was the first to recognize the oracle bone inscriptions as ancient writing.
Oracle Bone Script
–
An oracle bone (which is incomplete) with a diviner asking the Shang king if there would be misfortune over the next ten days
Oracle Bone Script
–
Oracle script from a divining

13.
International Phonetic Alphabet
–
The International Phonetic Alphabet is an alphabetic system of phonetic notation based primarily on the Latin alphabet. It was devised by the International Phonetic Association as a representation of the sounds of spoken language. The IPA is used by lexicographers, foreign students and teachers, linguists, speech-language pathologists, singers, actors, constructed language creators. The IPA is designed to represent only those qualities of speech that are part of language, phones, phonemes, intonation. IPA symbols are composed of one or more elements of two types, letters and diacritics. For example, the sound of the English letter ⟨t⟩ may be transcribed in IPA with a letter, or with a letter plus diacritics. Often, slashes are used to signal broad or phonemic transcription, thus, /t/ is less specific than, occasionally letters or diacritics are added, removed, or modified by the International Phonetic Association. As of the most recent change in 2005, there are 107 letters,52 diacritics and these are shown in the current IPA chart, posted below in this article and at the website of the IPA. In 1886, a group of French and British language teachers, led by the French linguist Paul Passy, for example, the sound was originally represented with the letter ⟨c⟩ in English, but with the digraph ⟨ch⟩ in French. However, in 1888, the alphabet was revised so as to be uniform across languages, the idea of making the IPA was first suggested by Otto Jespersen in a letter to Paul Passy. It was developed by Alexander John Ellis, Henry Sweet, Daniel Jones, since its creation, the IPA has undergone a number of revisions. After major revisions and expansions in 1900 and 1932, the IPA remained unchanged until the International Phonetic Association Kiel Convention in 1989, a minor revision took place in 1993 with the addition of four letters for mid central vowels and the removal of letters for voiceless implosives. The alphabet was last revised in May 2005 with the addition of a letter for a labiodental flap, apart from the addition and removal of symbols, changes to the IPA have consisted largely in renaming symbols and categories and in modifying typefaces. Extensions to the International Phonetic Alphabet for speech pathology were created in 1990, the general principle of the IPA is to provide one letter for each distinctive sound, although this practice is not followed if the sound itself is complex. There are no letters that have context-dependent sound values, as do hard, finally, the IPA does not usually have separate letters for two sounds if no known language makes a distinction between them, a property known as selectiveness. These are organized into a chart, the chart displayed here is the chart as posted at the website of the IPA. The letters chosen for the IPA are meant to harmonize with the Latin alphabet, for this reason, most letters are either Latin or Greek, or modifications thereof. Some letters are neither, for example, the letter denoting the glottal stop, ⟨ʔ⟩, has the form of a question mark

International Phonetic Alphabet
–
The official chart of the IPA as of 2005. Click to enlarge. The proper symbols for upstep and downstep are not available in this font, and have been substituted.
International Phonetic Alphabet
–
X-ray photos show the sounds [i, u, a, ɑ]

14.
Mojibake
–
Mojibake, from the Japanese 文字 character + 化け transform, is the garbled text that is the result of text being decoded using an unintended character encoding. The result is a replacement of symbols with completely unrelated ones. This display may include the replacement character � in places where the binary representation is considered invalid. A replacement can also involve multiple consecutive symbols, as viewed in one encoding and this is either because of differing constant length encoding, or the use of variable length encodings. Failed rendering of glyphs due to either missing fonts or missing glyphs in a font is a different issue that is not to be confused with mojibake, symptoms of this failed rendering include blocks with the codepoint displayed in hexadecimal or using the generic replacement character �. Importantly, these replacements are valid and are the result of error handling by the software. To correctly reproduce the text that was encoded, the correspondence between the encoded data and the notion of its encoding must be preserved. As mojibake is the instance of incompliance between these, it can be achieved by manipulating the data itself, or just relabeling it. Mojibake is often seen with text data that have been tagged with an encoding, it may not even be tagged at all. A major source of trouble are communication protocols that rely on settings on each computer rather than sending or storing metadata together with the data. Whereas Linux distributions mostly switched to UTF-8 for all uses of text, Microsoft Windows still uses codepages for text files, for some writing systems, an example being Japanese, several encodings have historically been employed, causing users to see mojibake relatively often. If the encoding is not specified, it is up to the software to decide it by other means, depending on type of software, the typical solution is either configuration or charset detection heuristics. Both are prone to mispredict in not-so-uncommon scenarios, the encoding of text files is usually governed by the OS-level setting, which depends on brand of operating system and possibly the users language. Therefore, the encoding is systematically wrong for files that come from a computer with a different setting. One solution is to use a byte order mark, but for source code and other machine readable text, another is storing the encoding as metadata in the filesystem. Filesystems that support extended file attributes can store this as user. charset and this also requires support in software that wants to take advantage of it, but does not disturb other software. While a few encodings are easy to detect, in particular UTF-8, Mojibake also occurs when the encoding is wrongly specified. This often happens between encodings that are similar, for example, the Eudora email client for Windows was known to send emails labelled as ISO-8859-1 that were in reality Windows-1252

Mojibake
–
Mojibake caused by a song title in Cyrillic (Моя Страна) on a car audio system
Mojibake
–
The UTF-8 -encoded Japanese Wikipedia article for mojibake, as displayed in the Windows-1252 encoding.

15.
Unicode
–
Unicode is a computing industry standard for the consistent encoding, representation, and handling of text expressed in most of the worlds writing systems. As of June 2016, the most recent version is Unicode 9.0, the standard is maintained by the Unicode Consortium. Unicodes success at unifying character sets has led to its widespread, the standard has been implemented in many recent technologies, including modern operating systems, XML, Java, and the. NET Framework. Unicode can be implemented by different character encodings, the most commonly used encodings are UTF-8, UTF-16 and the now-obsolete UCS-2. UTF-8 uses one byte for any ASCII character, all of which have the same values in both UTF-8 and ASCII encoding, and up to four bytes for other characters. UCS-2 uses a 16-bit code unit for each character but cannot encode every character in the current Unicode standard, UTF-16 extends UCS-2, using one 16-bit unit for the characters that were representable in UCS-2 and two 16-bit units to handle each of the additional characters. Many traditional character encodings share a common problem in that they allow bilingual computer processing, Unicode, in intent, encodes the underlying characters—graphemes and grapheme-like units—rather than the variant glyphs for such characters. In the case of Chinese characters, this leads to controversies over distinguishing the underlying character from its variant glyphs. In text processing, Unicode takes the role of providing a unique code point—a number, in other words, Unicode represents a character in an abstract way and leaves the visual rendering to other software, such as a web browser or word processor. This simple aim becomes complicated, however, because of concessions made by Unicodes designers in the hope of encouraging a more rapid adoption of Unicode, the first 256 code points were made identical to the content of ISO-8859-1 so as to make it trivial to convert existing western text. For other examples, see duplicate characters in Unicode and he explained that he name Unicode is intended to suggest a unique, unified, universal encoding. In this document, entitled Unicode 88, Becker outlined a 16-bit character model, Unicode could be roughly described as wide-body ASCII that has been stretched to 16 bits to encompass the characters of all the worlds living languages. In a properly engineered design,16 bits per character are more than sufficient for this purpose, Unicode aims in the first instance at the characters published in modern text, whose number is undoubtedly far below 214 =16,384. By the end of 1990, most of the work on mapping existing character encoding standards had been completed, the Unicode Consortium was incorporated in California on January 3,1991, and in October 1991, the first volume of the Unicode standard was published. The second volume, covering Han ideographs, was published in June 1992, in 1996, a surrogate character mechanism was implemented in Unicode 2.0, so that Unicode was no longer restricted to 16 bits. The Microsoft TrueType specification version 1.0 from 1992 used the name Apple Unicode instead of Unicode for the Platform ID in the naming table, Unicode defines a codespace of 1,114,112 code points in the range 0hex to 10FFFFhex. Normally a Unicode code point is referred to by writing U+ followed by its hexadecimal number, for code points in the Basic Multilingual Plane, four digits are used, for code points outside the BMP, five or six digits are used, as required. Code points in Planes 1 through 16 are accessed as surrogate pairs in UTF-16, within each plane, characters are allocated within named blocks of related characters

16.
Traditional Chinese characters
–
Traditional Chinese characters are Chinese characters in any character set that does not contain newly created characters or character substitutions performed after 1946. They are most commonly the characters in the character sets of Taiwan, of Hong Kong. Currently, a number of overseas Chinese online newspapers allow users to switch between both sets. In contrast, simplified Chinese characters are used in mainland China, Singapore, the debate on traditional and simplified Chinese characters has been a long-running issue among Chinese communities. Although simplified characters are taught and endorsed by the government of Mainland China, Traditional characters are used informally in regions in China primarily in handwriting and also used for inscriptions and religious text. They are often retained in logos or graphics to evoke yesteryear, nonetheless, the vast majority of media and communications in China is dominated by simplified characters. Taiwan has never adopted Simplified Chinese characters since it is ruled by the Republic of China, the use of simplified characters in official documents is even prohibited by the government in Taiwan. Simplified characters are not well understood in general, although some stroke simplifications that have incorporated into Simplified Chinese are in common use in handwriting. For example, while the name of Taiwan is written as 臺灣, similarly, in Hong Kong and Macau, Traditional Chinese has been the legal written form since colonial times. In recent years, because of the influx of mainland Chinese tourists, today, even government websites use simplified Chinese, as they answer to the Beijing government. This has led to concerns by residents to protect their local heritage. In Southeast Asia, the Chinese Filipino community continues to be one of the most conservative regarding simplification, while major public universities are teaching simplified characters, many well-established Chinese schools still use traditional characters. Publications like the Chinese Commercial News, World News, and United Daily News still use traditional characters, on the other hand, the Philippine Chinese Daily uses simplified. Aside from local newspapers, magazines from Hong Kong, such as the Yazhou Zhoukan, are found in some bookstores. In case of film or television subtitles on DVD, the Chinese dub that is used in Philippines is the same as the one used in Taiwan and this is because the DVDs belongs to DVD Region Code 3. Hence, most of the subtitles are in Traditional Characters, overseas Chinese in the United States have long used traditional characters. A major influx of Chinese immigrants to the United States occurred during the half of the 19th century. Therefore, the majority of Chinese language signage in the United States, including street signs, Traditional Chinese characters are called several different names within the Chinese-speaking world

Traditional Chinese characters
–
Chinese characters
Traditional Chinese characters
–
Job announcement in a Filipino Chinese daily newspaper written in Traditional Chinese characters.
Traditional Chinese characters
–
A Series of Reading workbook in Traditional Chinese used in some Elementary schools in the Philippines.

17.
Simplified Chinese characters
–
Simplified Chinese characters are standardized Chinese characters prescribed in the Table of General Standard Chinese Characters for use in mainland China. Along with traditional Chinese characters, it is one of the two character sets of the contemporary Chinese written language. The government of the Peoples Republic of China in mainland China has promoted them for use in printing since the 1950s and 1960s in an attempt to increase literacy and they are officially used in the Peoples Republic of China and Singapore. Traditional Chinese characters are used in Hong Kong, Macau. Overseas Chinese communities generally tend to use traditional characters, Simplified Chinese characters may be referred to by their official name above or colloquially. Strictly, the latter refers to simplifications of character structure or body, character forms that have existed for thousands of years alongside regular, Simplified character forms were created by decreasing the number of strokes and simplifying the forms of a sizable proportion of traditional Chinese characters. Some simplifications were based on popular cursive forms embodying graphic or phonetic simplifications of the traditional forms, some characters were simplified by applying regular rules, for example, by replacing all occurrences of a certain component with a simplified version of the component. Variant characters with the pronunciation and identical meaning were reduced to a single standardized character. Finally, many characters were left untouched by simplification, and are identical between the traditional and simplified Chinese orthographies. Some simplified characters are very dissimilar to and unpredictably different from traditional characters and this often leads opponents not well-versed in the method of simplification to conclude that the overall process of character simplification is also arbitrary. In reality, the methods and rules of simplification are few, on the other hand, proponents of simplification often flaunt a few choice simplified characters as ingenious inventions, when in fact these have existed for hundreds of years as ancient variants. However, the Chinese government never officially dropped its goal of further simplification in the future, in August 2009, the PRC began collecting public comments for a modified list of simplified characters. The new Table of General Standard Chinese Characters consisting of 8,105 characters was promulgated by the State Council of the Peoples Republic of China on June 5,2013, cursive written text almost always includes character simplification. Simplified forms used in print have always existed, they date back to as early as the Qin dynasty, One of the earliest proponents of character simplification was Lubi Kui, who proposed in 1909 that simplified characters should be used in education. In the years following the May Fourth Movement in 1919, many anti-imperialist Chinese intellectuals sought ways to modernise China, Traditional culture and values such as Confucianism were challenged. Soon, people in the Movement started to cite the traditional Chinese writing system as an obstacle in modernising China and it was suggested that the Chinese writing system should be either simplified or completely abolished. Fu Sinian, a leader of the May Fourth Movement, called Chinese characters the writing of ox-demons, lu Xun, a renowned Chinese author in the 20th century, stated that, If Chinese characters are not destroyed, then China will die. Recent commentators have claimed that Chinese characters were blamed for the problems in China during that time

18.
Han Chinese
–
The Han Chinese, Han people or simply Han are an ethnic group native to East Asia. They constitute approximately 92% of the population of China, 95% of Taiwan, 76% of Singapore, 23% of Malaysia, Han Chinese are the worlds largest ethnic group with over 1.3 billion people. Similarly, the Chinese language also came to be named the Han language ever since, in the Oxford Dictionary, the Han are defined as The dominant ethnic group in China. In the Encyclopedia of the Peoples of Asia and Oceania, the Han are called the dominant population in China, as well as in Taiwan, according to the Merriam-Webster Dictionary, the Han are the Chinese peoples especially as distinguished from non-Chinese elements in the population. The name Hanzhong, in turn, was derived from the Han River, which flows through the regions plains. The river, in turn, derives its name from such as Tianhan, Yinhan, Xinghan or Yunhan, all ancient Chinese poetic nicknames for the Milky Way. This gave rise to a commonly used nowadays by overseas Chinese for ethnic identity – Huaren. The term is used in conversation and is also an element in the Cantonese word for Chinatown. The vast majority of Han Chinese – over 1.2 billion of them – live in areas under the jurisdiction of the Peoples Republic of China, where they constitute about 92% of its population. Han Chinese also constitute the majority in both of the administrative regions of the PRC—about 95% and 96% of the population of Hong Kong and Macau. There are over 22 million Han Chinese in Taiwan, they began migrating from the coastal provinces of mainland China to Taiwan during the 13th to 17th century. At first, these migrants chose to settle in locations that bore a resemblance to the areas they had left behind in mainland China, hoklo immigrants from Quanzhou settled in coastal regions, and those from Zhangzhou tended to gather on inland plains, while the Hakka inhabited hilly areas. Clashes between these groups over land, water, and cultural differences led to the relocation of some communities, of about 40 million overseas Chinese worldwide, nearly 30 million live in Southeast Asia. They are collectively called Nanyang Chinese, according to a population genetic study, Singapore is the country with the biggest proportion of Hans in Southeast Asia. Up until the past few decades, overseas Han communities originated predominantly from areas in southern China, christmas Island has a Chinese majority at 70%, large Chinese populations also live in Malaysia and Thailand. Prior to the 1965 split, Malaysia and Singapore used to have the largest overseas Chinese population in the world and this position has since been taken by Thailand. The prehistory of the Han ethnic group is closely intertwined with both records and mythology. Han Chinese trace their ancestry from a confederation of late neolithic/early bronze-age agricultural tribes that lived along the Guanzhong, the Yellow Emperor is traditionally credited to have united with the neighbouring Shennong tribes after defeating their leader, Flame Emperor, at the Battle of Banquan

Han Chinese
–
1983 Map of ethnolinguistic groups in mainland China and Taiwan (Han is in olive green)
Han Chinese
–
A female servant and male advisor dressed in silk robes, ceramic figurines from the Western Han era
Han Chinese
–
Han Chinese man wears a queue in compliance with Manchu custom during the Qing dynasty
Han Chinese
–
Zhang Zeduan 's painting Along the River During the Qingming Festival captures the daily life of people from the Song period at the capital, Bianjing, today's Kaifeng.

19.
Standard Chinese
–
Its pronunciation is based on the Beijing dialect, its vocabulary on the Mandarin dialects, and its grammar is based on written vernacular Chinese. Like other varieties of Chinese, Standard Chinese is a language with topic-prominent organization. It has more initial consonants but fewer vowels, final consonants, Standard Chinese is an analytic language, though with many compound words. There exist two standardised forms of the language, namely Putonghua in Mainland China and Guoyu in Taiwan, aside from a number of differences in pronunciation and vocabulary, Putonghua is written using simplified Chinese characters, while Guoyu is written using traditional Chinese characters. There are many characters that are identical between the two systems, in English, the governments of China and Hong Kong use Putonghua, Putonghua Chinese, Mandarin Chinese, and Mandarin, while those of Taiwan, Singapore, and Malaysia, use Mandarin. The name Putonghua also has a long, albeit unofficial, history and it was used as early as 1906 in writings by Zhu Wenxiong to differentiate a modern, standard Chinese from classical Chinese and other varieties of Chinese. For some linguists of the early 20th century, the Putonghua, or common tongue/speech, was different from the Guoyu. The former was a prestige variety, while the latter was the legal standard. Based on common understandings of the time, the two were, in fact, different, Guoyu was understood as formal vernacular Chinese, which is close to classical Chinese. By contrast, Putonghua was called the speech of the modern man. The use of the term Putonghua by left-leaning intellectuals such as Qu Qiubai, prior to this, the government used both terms interchangeably. In Taiwan, Guoyu continues to be the term for Standard Chinese. The term Putonghua, on the contrary, implies nothing more than the notion of a lingua franca, Huayu, or language of the Chinese nation, originally simply meant Chinese language, and was used in overseas communities to contrast Chinese with foreign languages. Over time, the desire to standardise the variety of Chinese spoken in these communities led to the adoption of the name Huayu to refer to Mandarin and it also incorporates the notion that Mandarin is usually not the national or common language of the areas in which overseas Chinese live. The term Mandarin is a translation of Guānhuà, which referred to the lingua franca of the late Chinese empire, in English, Mandarin may refer to the standard language, the dialect group as a whole, or to historic forms such as the late Imperial lingua franca. The name Modern Standard Mandarin is sometimes used by linguists who wish to distinguish the current state of the language from other northern. Chinese has long had considerable variation, hence prestige dialects have always existed. Confucius, for example, used yǎyán rather than colloquial regional dialects, rime books, which were written since the Northern and Southern dynasties, may also have reflected one or more systems of standard pronunciation during those times

Standard Chinese
–
A poster outside of high school in Yangzhou urges people to speak Putonghua
Standard Chinese
–
Zhongguo Guanhua (中国官话/中國官話), or Medii Regni Communis Loquela ("Middle Kingdom's Common Speech"), used on the frontispiece of an early Chinese grammar published by Étienne Fourmont (with Arcadio Huang) in 1742

20.
Hanyu Pinyin
–
Pinyin, or Hànyǔ Pīnyīn, is the official romanization system for Standard Chinese in mainland China, Malaysia, Singapore, and Taiwan. It is often used to teach Standard Chinese, which is written using Chinese characters. The system includes four diacritics denoting tones, Pinyin without tone marks is used to spell Chinese names and words in languages written with the Latin alphabet, and also in certain computer input methods to enter Chinese characters. The pinyin system was developed in the 1950s by many linguists, including Zhou Youguang and it was published by the Chinese government in 1958 and revised several times. The International Organization for Standardization adopted pinyin as a standard in 1982. The system was adopted as the standard in Taiwan in 2009. The word Hànyǔ means the language of the Han people. In 1605, the Jesuit missionary Matteo Ricci published Xizi Qiji in Beijing and this was the first book to use the Roman alphabet to write the Chinese language. Twenty years later, another Jesuit in China, Nicolas Trigault, neither book had much immediate impact on the way in which Chinese thought about their writing system, and the romanizations they described were intended more for Westerners than for the Chinese. One of the earliest Chinese thinkers to relate Western alphabets to Chinese was late Ming to early Qing Dynasty scholar-official, the first late Qing reformer to propose that China adopt a system of spelling was Song Shu. A student of the great scholars Yu Yue and Zhang Taiyan, Song had been to Japan and observed the effect of the kana syllabaries. This galvanized him into activity on a number of fronts, one of the most important being reform of the script, while Song did not himself actually create a system for spelling Sinitic languages, his discussion proved fertile and led to a proliferation of schemes for phonetic scripts. The Wade–Giles system was produced by Thomas Wade in 1859, and it was popular and used in English-language publications outside China until 1979. This Sin Wenz or New Writing was much more sophisticated than earlier alphabets. In 1940, several members attended a Border Region Sin Wenz Society convention. Mao Zedong and Zhu De, head of the army, both contributed their calligraphy for the masthead of the Sin Wenz Societys new journal. Outside the CCP, other prominent supporters included Sun Yat-sens son, Sun Fo, Cai Yuanpei, the countrys most prestigious educator, Tao Xingzhi, an educational reformer. Over thirty journals soon appeared written in Sin Wenz, plus large numbers of translations, biographies, some contemporary Chinese literature, and a spectrum of textbooks

Hanyu Pinyin
–
A school slogan asking elementary students to speak Putonghua is annotated with pinyin, but without tonal marks.
Hanyu Pinyin
–
In Yiling, Yichang, Hubei, text on road signs appears both in Chinese characters and in Hanyu Pinyin

21.
Bopomofo
–
Zhuyin fuhao, Zhuyin or Bopomofo is a system of phonetic notation for the transcription of spoken Chinese, particularly the Mandarin dialect. The first two are traditional terms, whereas Bopomofo is the term, also used by the ISO. Consisting of 37 characters and four marks, it transcribes all possible sounds in Mandarin. Zhuyin was introduced in China by the Republican Government in the 1910s and used alongside the Wade-Giles system, the Wade system was replaced by Hanyu Pinyin in 1958 by the Government of the Peoples Republic of China, and at the International Organization for Standardization in 1982. The informal name Bopomofo is derived from the first four syllables in the ordering of available syllables in Mandarin Chinese. The four Bopomofo characters that correspond to these syllables are placed first in a list of these characters. The same sequence is used by other speakers of Chinese to refer to other phonetic systems. The original formal name of the system was Guóyīn Zìmǔ and Zhùyīn Zìmǔ and it was later renamed Zhùyīn Fúhào, meaning phonetic symbols. In official documents, Zhuyin is occasionally called Mandarin Phonetic Symbols I, in English translations, the system is often also called either Chu-yin or the Mandarin Phonetic Symbols. A romanized phonetic system was released in 1984 as Mandarin Phonetic Symbols II, the Commission on the Unification of Pronunciation, led by Wu Zhihui from 1912 to 1913, created a system called Zhuyin Zimu, which was based on Zhang Binglins shorthand. A draft was released on July 11,1913, by the Republic of China National Ministry of Education and it was later renamed first Guoyin Zimu and then, in April 1930, Zhuyin Fuhao. The last renaming addressed fears that the system might independently replace Chinese characters. Zhuyin remains the predominant phonetic system in teaching reading and writing in school in Taiwan. It is also one of the most popular ways to enter Chinese characters into computers and smartphones, in elementary school, particularly in the lower years, Chinese characters in textbooks are often annotated with Zhuyin as ruby characters as an aid to learning. Additionally, one newspaper in Taiwan, the Mandarin Daily News. In teaching Mandarin, Taiwan institutions and some communities use Zhuyin as a learning tool. The Zhuyin characters were created by Zhang Binglin, and taken mainly from regularised forms of ancient Chinese characters, the modern readings of which contain the sound that each letter represents. It is to be noted that the first consonants are articulated from the front of the mouth to the back, /b/, /p/, /m/, /f/, /d/, /t/, /n/, Zhuyin is written in the same stroke order rule as Chinese characters

22.
Wu Chinese
–
Wu is a group of linguistically similar and historically related varieties of Chinese primarily spoken in the whole city of Shanghai, Zhejiang province, southern Jiangsu province and bordering areas. Major Wu varieties include those of Shanghai, Suzhou, Ningbo, Wuxi, Wenzhou/Oujiang, Hangzhou, Shaoxing, Jinhua, Wu speakers, such as Chiang Kai-shek, Lu Xun and Cai Yuanpei, occupied positions of great importance in modern Chinese culture and politics. Wu can also be found being used in Shaoxing opera, which is only in national popularity to Peking opera, as well as in the performances of the popular entertainer. Wu is also spoken in a number of diaspora communities, with significant centers of immigration originating from Shanghai, Qingtian. Suzhou has traditionally been the center of Wu and was likely the first place the distinct variety of Sinitic known as Wu developed. Suzhou dialect is considered to be the most linguistically representative of the family. Due to the influence of Shanghainese, Wu as a whole is incorrectly labelled in English as simply, Shanghainese, among speakers of other Sinitic languages, Wu is often subjectively judged to be soft, light, and flowing. There is an idiom in Mandarin that specifically describes these qualities of Wu speech, Ngu nung nioe ngiu, Wu varieties have the largest vowel quality inventories in the world. The Jinhui dialect spoken in Shanghais Fengxian District has 20 vowel qualities, Wu Chinese, along with Min, is also of great significance to historical linguists due to their retention of many ancient features. These two languages have proven pivotal in determining the history of the Chinese languages. More pressing concerns of the present are those of language preservation, however, many analysts believe that a stable state of diglossia will endure for at least several generations if not indefinitely. Saying one speaks Wu is akin to saying one speaks a Romance language and it is not a particularly defined entity like Standard Mandarin or Hochdeutsch. They do this by affixing 話 Wo to their locations endonym, for example, 溫州話 Wēnzhōuhuà is used for Wenzhounese. Affixing 閒話 xiánhuà is also common and more typical of the Taihu division, Wu, the formal name and standard reference in dialectology literature. Northern Wu, Wu typically spoken in the north of Zhejiang and it by default includes the Xuanzhou division in Anhui as well, however this division is often neglected in Northern Wu discussions. Southern Wu, Wu spoken in southern Zhejiang and periphery, comprising the Oujiang, Wuzhou, Western Wu, A term gaining in usage as a synonym for the Xuanzhou division and modeled after the previous two terms since the Xuanzhou division is less representative of Northern Wu. Shanghainese, is also a common name, used because Shanghai is the most well-known city in the Wu-speaking region. The term Shanghainese is never used by linguists to refer to anything

23.
Shanghainese
–
The Shanghainese language, also known as the Shanghai dialect, Hu language or Hu dialect, is a variety of Wu Chinese spoken in the central districts of the City of Shanghai and its surrounding areas. It is classified as part of the Sino-Tibetan language family, in English, Shanghainese sometimes refers to all Wu languages, variants and dialects, although they are only partially intelligible with one another. Shanghainese proper is a language of Taihu Wu, it contains vocabulary and expressions from the entire Taihu Wu area of southern Jiangsu. With nearly 14 million speakers, Shanghainese is also the largest single form of Wu Chinese and it serves as the lingua franca of the entire Yangtze River Delta region. Shanghainese is rich in vowels and in consonants, like other Taihu Wu dialects, Shanghainese has voiced initials. Neither Cantonese nor Mandarin has voiced initial stops or affricates, the Shanghainese tonal system is also significantly different from other Chinese varieties, sharing more similarities with the Japanese pitch accent. Shanghainese has two level tonal contrasts, while Cantonese and Mandarin are typical of contour tonal languages, Shanghai did not become a regional center of commerce until it was opened to foreign investment during the late Qing dynasty. Consequently, languages and dialects spoken around Shanghai had long been subordinate to those spoken around Jiaxing, in the late 19th century, most vocabulary of the Shanghai region had been a hybrid between Southern Jiangsu and Ningbonese. It underwent sustained growth that reached a hiatus in the 1930s during the Republican era, after 1949, the government imposed Mandarin as the official language of the whole nation of China. The dominance and influence of Shanghainese began to wane slightly, especially since Chinese economic reform began in 1978, Shanghai became home to a great number of migrants from all over the country. Due to the prominence of Mandarin, learning Shanghainese was no longer necessary for migrants. However, Shanghainese remained a vital part of the citys culture. In the 1990s, it was common for local radio. In 1995, the TV series Sinful Debt featured extensive Shanghainese dialogues, the Shanghainese TV series Lao Niang Jiu was broadcast from 1995 to 2007 and received tremendous popularity among Shanghainese residents. Shanghainese programming has since declined, over concerns of regionalist/localist accusations. From 1992 onward, Shanghainese use was discouraged in schools, in addition, Shanghais emergence as a cosmopolitan global city consolidated the status of Mandarin as the standard language of business and services, at the expense of the local language. Since 2005, new movements have emerged to protect Shanghainese from fading away, Shanghais former party boss Chen Liangyu, a native Shanghainese himself, reportedly supported her proposal. There have been talks of re-integrating Shanghainese into pre-kindergarten education, because children are unable to speak any Shanghainese

24.
Gan Chinese
–
Gan is a member of the Sinitic languages of the Sino-Tibetan language family, and Hakka is the closest Chinese variety to Gan in terms of phonetics. Different dialects of Gan exist, the Nanchang dialect is taken as representative. Like all other varieties of Chinese, there is an amount of mutual unintelligibility between Gan Chinese and other varieties. Within the variation of Chinese dialects, Gan has more similarities with Mandarin than with Yue or Min, scholars in mainland China use Gan or Gan dialect. Jiāngxī huà is commonly used in Chinese, but since the borders of the language do not follow the borders of the province, this name is not geographically exact. Xi, an ancient name, now used, arising from the fact that most Gan speakers live south of the Yangtze River. Most Gan speakers live in the middle and lower reaches of the Gan River, the area of the Fu River. There are also many Gan speakers living in eastern Hunan, eastern Hubei, southern Anhui, northwest Fujian, in the early years of the Han Dynasty, Nanchang was established as the capital of the Yuzhang Commandery, along with the 18 counties of Jiangxi Province. The population of the Yuzhang Commandery increased to 1,670,000 from 350,000, the Yuzhang Commandery ranked fourth in population among the more than 100 contemporary commanderies of China. As the largest commandery of Yangzhou, Yuzhang accounted for two fifths of the population and Gan gradually took shape during this period, as a result of continuous warfare in the region of central China, the first large-scale emigration in the history of China took place. Large numbers of people in central China relocated to southern China in order to escape the bloodshed and at this time, also, during this period, ancient Gan began to be exposed to the northern Mandarin dialects. After centuries of rule by the Southern Dynasties, Gan still retained many original characteristics despite having absorbed some elements of Mandarin, up until the Tang Dynasty, there was little difference between old Gan and the contemporary Gan of that era. During this period, following hundreds of years of migration, Gan spread to its current areas of distribution, Guanhua evolved into a standard language based on Beijing Mandarin, owing largely to political factors. At the same time, the differences between Gan and Guan-hua continued to more pronounced. However, because Jiangxi borders on Jianghuai, a Guanhua, Xiang, after 1949, as a dialect in Mainland China, Gan faced a critical period. The impact of Mandarin is quite evident today as a result of governmental language campaigns. Currently, many youths are unable to master Gan expressions, there are differences within the Gan speaking region. For example, in Anfu county, which was categorized as Ji-Cha, people from one region cannot even understand people from the other region if they were not well educated or exposed to the other dialects

25.
Hakka Chinese
–
Hakka is not mutually intelligible with Yue, Wu, Southern Min, Mandarin or other branches of Chinese, and itself contains a few mutually unintellegible varieties. It is most closely related to Gan and is classified as a variety of Gan. There is also a possibility that the similarities are just a result of shared areal features, Taiwan, where Hakka is the native language of a significant minority of the islands residents, is a center for the study and preservation of the language. Pronunciation differences exist between the Taiwanese Hakka dialects and Mainland Chinas Hakka dialects, even in Taiwan, two local varieties of Hakka exist. The Meixian dialect of northeast Guangdong in China has been taken as the dialect by the Peoples Republic of China. The Guangdong Provincial Education Department created a romanization of Moiyen in 1960. The name of the Hakka people who are the predominant original native speakers of the variety literally means guest families or guest people, Hak 客 means guest, and ka 家 means family. Among themselves, Hakka people variously called their language Hak-ka-fa 客家話, Hak-fa, 客話, Tu-gong-dung-fa 土廣東話, literally Native Guangdong language, and Ngai-fa 我話, My/our language. The forebears of the Hakka came from present-day Central Plains provinces of Henan and Shaanxi, the presence of many archaic features occur in modern Hakka, including final consonants -p -t -k, as are found in other modern southern Chinese varieties, but which have been lost in Mandarin. Due to the migration of its speakers, Hakka may have influenced by other language areas through which the Hakka-speaking forebears migrated. For instance, common vocabulary is found in Hakka, Min, in recent times, many She people have become Hakka speakers. A regular pattern of change can generally be detected in Hakka, as in most Chinese varieties. Some examples, Characters such as 武 or 屋, are pronounced roughly mwio and uk in Early Middle Chinese, have an initial v phoneme in Hakka, being vu and vuk in Hakka respectively. Like in Mandarin, labiodentalisation process also changed mj- to a sound in Hakka before grave vowels. Middle Chinese initial phonemes /ɲ/ of the characters 人 and 日, among others, for comparison, in Mandarin, /ɲ/ became r-, while in Cantonese, it merged with initial /j/. The initial consonant phoneme exhibited by the character 話 is pronounced f or v in Hakka, the initial consonant of 學 hɔk usually corresponds with an h approximant in Hakka and a voiceless alveo-palatal fricative in Mandarin. Hakka has as many regional dialects as there are counties with Hakka speakers as the majority, some of these Hakka dialects are not mutually intelligible with each other. Surrounding Meixian are the counties of Pingyuan, Dabu, Jiaoling, Xingning, Wuhua, each is said to have its own special phonological points of interest

26.
Cantonese
–
Cantonese, or Standard Cantonese, is a variety of Chinese spoken in the city of Guangzhou in southeastern China. It is the prestige variety of Yue, one of the major subdivisions of Chinese. In mainland China, it is the lingua franca of the province of Guangdong and some neighbouring areas such as Guangxi. In Hong Kong and Macau, Cantonese serves as one of their official languages and it is also spoken amongst overseas Chinese in Southeast Asia and throughout the Western World. When Cantonese and the closely related Yuehai dialects are classified together, Cantonese is viewed as vital part of the cultural identity for its native speakers across large swathes of southeastern China, Hong Kong and Macau. Although Cantonese shares some vocabulary with Mandarin, the two varieties are mutually unintelligible because of differences in pronunciation, grammar and lexicon, sentence structure, in particular the placement of verbs, sometimes differs between the two varieties. This results in the situation in which a Cantonese and a Mandarin text may look similar, in English, the term Cantonese is ambiguous. Cantonese proper is the variety native to the city of Canton and this narrow sense may be specified as Canton language or Guangzhou language in English. However, Cantonese may also refer to the branch of Cantonese that contains Cantonese proper as well as Taishanese and Gaoyang. In this article, Cantonese is used for Cantonese proper, historically, speakers called this variety Canton speech or Guangzhou speech, although this term is now seldom used outside mainland China. In Guangdong province, people call it provincial capital speech or plain speech. In Hong Kong and Macau, as well as among overseas Chinese communities, in mainland China, the term Guangdong speech is also increasingly being used among both native and non-native speakers. Due to its status as a prestige dialect among all the dialects of the Cantonese or Yue branch of Chinese varieties, the official languages of Hong Kong are Chinese and English, as defined in the Hong Kong Basic Law. The Chinese language has different varieties, of which Cantonese is one. Given the traditional predominance of Cantonese within Hong Kong, it is the de facto official spoken form of the Chinese language used in the Hong Kong Government and all courts and it is also used as the medium of instruction in schools, alongside English. A similar situation exists in neighboring Macau, where Chinese is an official language along with Portuguese. As in Hong Kong, Cantonese is the predominant spoken variety of Chinese used in life and is thus the official form of Chinese used in the government. The variant spoken in Hong Kong and Macau is known as Hong Kong Cantonese, Cantonese first developed around the port city of Guangzhou in the Pearl River Delta region of southeastern China

Cantonese
–
Street in Chinatown, San Francisco. Cantonese has traditionally been the dominant Chinese variant among Chinese populations in the Western world.
Cantonese
–
Gwóngdūng Wah / gwong 2 dung 1 waa 6 (Cantonese) written in traditional Chinese (left) and simplified Chinese (right) characters
Cantonese
–
Chinese dictionary from Tang dynasty. Modern Cantonese pronunciation is more similar to Middle Chinese from this era than other Chinese varieties.

27.
Jyutping
–
Jyutping is a romanisation system for Cantonese developed by the Linguistic Society of Hong Kong, an academic group, in 1993. Its formal name is The Linguistic Society of Hong Kong Cantonese Romanisation Scheme, the LSHK promotes the use of this romanisation system. The name Jyutping is a contraction consisting of the first Chinese characters of the terms Jyut6jyu5, only the finals m and ng can be used as standalone nasal syllables. ^ ^ ^ Referring to the pronunciation of these words. There are nine tones in six distinct tone contours in Cantonese, however, as three of the nine are entering tones, which only appear in syllables ending with p, t, and k, they do not have separate tone numbers in Jyutping. Jyutping and the Yale Romanisation of Cantonese represent Cantonese pronunciations with the letters in, The initials, b, p, m, f, d, t, n, l, g, k, ng, h, s, gw, kw. The vowel, aa, a, e, i, o, u, the coda, i, u, m, n, ng, p, t, k. But they differ in the following, The vowels eo and oe represent /ɵ/ and /œː/ respectively in Jyutping, the initial j represents /j/ in Jyutping whereas y is used instead in Yale. The initial z represents /ts/ in Jyutping whereas j is used instead in Yale, the initial c represents /tsʰ/ in Jyutping whereas ch is used instead in Yale. In Jyutping, if no consonant precedes the vowel yu, then the initial j is appended before the vowel, in Yale, the corresponding initial y is never appended before yu under any circumstances. Jyutping defines three finals not in Yale, eu /ɛːu/, em /ɛːm/, and ep /ɛːp/ and these three finals are used in colloquial Cantonese words, such as deu6, lem2, and gep6. To represent tones, only tone numbers are used in Jyutping whereas Yale traditionally uses tone marks together with the letter h. Jyutping and Cantonese Pinyin represent Cantonese pronunciations with the letters in, The initials, b, p, m, f, d, t, n, l, g, k, ng, h, s, gw, kw. The vowel, aa, a, e, i, o, u, the coda, i, u, m, n, ng, p, t, k. But they have differences, The vowel oe represents both /ɵ/ and /œː/ in Cantonese Pinyin whereas eo and oe represent /ɵ/ and /œː/ respectively in Jyutping. The vowel y represents /y/ in Cantonese Pinyin whereas both yu and i are used in Jyutping, the initial dz represents /ts/ in Cantonese Pinyin whereas z is used instead in Jyutping. The initial ts represents /tsʰ/ in Cantonese Pinyin whereas c is used instead in Jyutping. To represent tones, the numbers 1 to 9 are usually used in Cantonese Pinyin, however, only the numbers 1 to 6 are used in Jyutping

Jyutping
–
Jyutping Romanization.

28.
Southern Min
–
Southern Min, or Minnan, is a branch of Min Chinese spoken in certain parts of China including southern Fujian, eastern Guangdong, Hainan, and southern Zhejiang, and in Taiwan. The Min Nan dialects are spoken by descendants of emigrants from these areas in diaspora, most notably the Philippines, Indonesia, Malaysia. In common parlance, Southern Min usually refers to Hokkien, including Amoy and Taiwanese Hokkien, the Southern Min dialect group also includes Teochew, though Teochew has limited mutual intelligibility with Hokkien. Hainanese is not mutually intellgible with other Southern Min and is considered a separate branch of Min. Southern Min is not mutually intelligible with Eastern Min, Pu-Xian Min, any other Min branch, Hakka, Cantonese, Shanghainese or Mandarin. Southern Min dialects are spoken in the part of Fujian. The variant spoken in Leizhou, Guangdong as well as Hainan is Hainanese and is not mutually intelligible with other Southern Min or Teochew, Hainanese is classified in some schemes as part of Southern Min and in other schemes as separate. Puxian Min was originally based on the Quanzhou dialect, but over time became heavily influenced by Eastern Min, eventually losing intellegility with Minnan. A forms of Southern Min spoken in Taiwan, collectively known as Taiwanese, Southern Min is a first language for most of the Hoklo people, the main ethnicity of Taiwan. The correspondence between language and ethnicity is not absolute, as some Hoklo have very limited proficiency in Southern Min while some non-Hoklo speak Southern Min fluently, there are many Southern Min speakers also among Overseas Chinese in Southeast Asia. Many ethnic Chinese immigrants to the region were Hoklo from southern Fujian and brought the language to what is now Burma, Indonesia and present-day Malaysia and Singapore. In general, Southern Min from southern Fujian is known as Hokkien, Hokkienese, many Southeast Asian ethnic Chinese also originated in the Chaoshan region of Guangdong and speak Teochew language, the variant of Southern Min from that region. Southern Min-speakers form the majority of Chinese in Singapore, with the largest group being Hokkien, despite the similarities the two groups are rarely seen as part of the same Minnan Chinese subgroups. The variants of Southern Min spoken in Zhejiang province are most akin to that spoken in Quanzhou, the variants spoken in Taiwan are similar to the three Fujian variants and are collectively known as Taiwanese. Those Southern Min variants that are known as Hokkien in Southeast Asia also originate from these variants. The variants of Southern Min in the Chaoshan region of eastern Guangdong province are known as Teochew or Chaozhou. Teochew is of importance in the Southeast Asian Chinese diaspora, particularly in Malaysia, Thailand, Cambodia, Vietnam, Sumatra. The Philippines variant is mostly from the Quanzhou area as most of their forefathers are from the aforementioned area, the Southern Min language variant spoken around Shanwei and Haifeng differs markedly from Teochew and may represent a later migration from Zhangzhou

29.
Hokkien
–
Hokkien /hɒˈkiɛn/ is a group of Southern Min dialects spoken throughout Southeastern China, Taiwan, Southeast Asia and by other overseas Chinese. Hokkien originated in southern Fujian, the Min-speaking province and it is closely related to Teochew, though there is limited mutual intelligibility, and is somewhat more distantly related to Hainanese and Leizhou dialect. Besides Hokkien, there are also other Min and Hakka dialects in Fujian province, the term Hokkien is etymologically derived from the Southern Min pronunciation for Fujian, the province from which the language hails. The variety is known by other terms such as the more general Min Nan or Southern Min. Fujianese and Fukienese are also used, although they are somewhat imprecise, the term Hokkien is not usually used in Mainland China or Taiwan. Conversely Hokkien is the name in Southeast Asia in both English, Chinese or other languages. Speakers of Hokkien, particularly those in Southeast Asia, typically refer to Hokkien as a dialect, people in Taiwan most often refer to Hokkien as the Taiwanese language, with Minnan and Holo also being used and 福建話 is not as common. Hokkien originated in the area of Fujian province, an important center for trade and migration. The major pole of Hokkien varieties outside of Fujian is Taiwan, the Taiwanese version mostly have origins with the Quanzhou and Zhangzhou variants, but since then, the Amoy dialect is becoming the modern prestige standard for the language. There are many Hokkien speakers among overseas Chinese in Southeast Asia as well as in the United States, many ethnic Han Chinese emigrants to the region were Hoklo from southern Fujian, and brought the language to what is now Burma, Indonesia and present day Malaysia and Singapore. Many of the Hokkien dialects of this region are similar to Taiwanese and Amoynese. Hokkien is reportedly the native language of up to 80% of the Chinese people in the Philippines, Hokkien speakers form the largest group of overseas Chinese in Singapore, Malaysia, Indonesia and Philippines. Southern and part of western Fujian is home to four principal Hokkien dialects, Chinchew, Amoy, Chiangchew and Longyan, originating from the cities of Quanzhou, Xiamen, Zhangzhou and Longyan. As Xiamen is the city of southern Fujian, Amoy is considered the most important, or even the prestige dialect. It is a hybrid of the Quanzhou and Zhangzhou dialects, same as Amoy dialect, the varieties of Hokkien spoken in Taiwan are hybrids of the Quanzhou and Zhangzhou dialects, and are collectively known as Taiwanese Hokkien or just Taiwanese. Used by a majority of the population, it bears much importance from a socio-political perspective, the varieties of Hokkien in Southeast Asia originate from these dialects. The Singaporeans, Southern Malaysians and people in Indonesias Riau and surrounding islands variant is from the Quanzhou area and they speak a distinct form of Quanzhou Hokkien called Southern Peninsular Malaysian Hokkien. Among ethnic Chinese inhabitants of Penang, and other states in Northern Malaysia and Medan, with areas in North Sumatra, Indonesia

Hokkien
–
Distribution of Min Nan dialects. Hokkien is dark green.

30.
Teochew dialect
–
Teochew is a variety of Southern Min spoken mainly by the Teochew people in the Chaoshan region of eastern Guangdong and by their respective diaspora around the world. It is sometimes referred to as Chiuchow, its Cantonese name, Teochew preserves many Old Chinese pronunciations and vocabulary that have been lost in some of the other modern varieties of Chinese. As such, many linguists consider Teochew one of the most conservative Chinese dialects, Teochew is a member of the Southern Min subgroup, which in turn constitutes a part of Min Chinese, one of the seven major language groups of Chinese. Even within the Teochew dialects, there is variation in phonology between different regions of Chaoshan and between different Teochew communities overseas. Parts of the Hakka-speaking regions of Jiexi County, Dabu County and Fengshun, as Chaoshan was one of the major sources of Chinese emigration to Southeast Asia during the 18th to 20th centuries, a considerable Overseas Chinese community in that region is Teochew-speaking. In particular, the Teochew people settled in significant numbers in Cambodia, Thailand and Laos, teochew-speakers form a minority among Chinese communities in Vietnam, Singapore, Malaysia and Indonesia. Teochew was never popular in Chinese communities in Japan and South Korea, since most of the Teochew people who migrated to these countries are secondary immigrants from Hong Kong and Taiwan. Most of them are people from Hong Kong and Taiwan who speak Cantonese, Hokkien and Mandarin, as well as Korean and Japanese. This refers to Chaozhou, the variety of Teochew spoken in China, Chaozhou children are introduced to Standard Chinese as early as in kindergarten, however, Chaozhou remains the primary medium of instruction. In the early years of education, Mandarin becomes the sole language of instruction. Mandarin is widely understood, however minimally, by most younger Chaozhou speakers, native Chaozhou speakers find the neutral tone in Mandarin hardest to master. Chaozhou has lost the alveolar nasal ending and so the people often replace the sound in Mandarin with the velar nasal, none of the southern Min dialects have a front rounded vowel, therefore a typical Chaozhou accent supplants the unrounded counterpart for. Chaozhou, like its ancient ancestor, lacks labio-dentals, people therefore substitute or for when they speak Mandarin, Chaozhou does not have any of the retroflex consonants in the northern dialects, so they pronounce, and instead of, and. Since Chaoan, Raoping and Jieyang border the Hakka-speaking region in the north, some people in these regions speak Hakka, Chaozhou people have historically had a great deal of contact with the Hakka people, but Hakka has had little, if any, influence on Chaozhou. Similarly, in Dabu and Fengshun, where the Chaozhou- and Hakka-speaking regions meet, in the mountainous area of Fenghuang, the She language, an endangered Hmong–Mien language, is spoken by the She people, who are an officially-recognised non-Han ethnic minority. They predominantly speak Hakka and Teochew, only about 1,000 She still speak their eponymous language, the voiced stops and and also are voicelessly prenasalised, respectively. They are in distribution with the tenuis stops, occurring before nasal vowels and nasal codas, whereas the tenuis stops occur before oral vowels. The voiced affricate dz, initial in such words as 字, 二, 然, 若 loses its affricate property with some younger speakers abroad, and is relaxed to

31.
Eastern Min
–
Eastern Min, or Min Dong, is a branch of the Min group of varieties of Chinese. The prestige form and most-cited representative form is the Fuzhou dialect, Eastern Min varieties are mainly spoken in the eastern part of Fujian Province in Peoples Republic of China, in and near the cities of Fuzhou and Ningde. They are also encountered as the mother tongue on the Matsu Islands. Additionally, the inhabitants of Taishun and Cangnan to the north of Fujian in Zhejiang also speak Eastern Min varieties, Eastern Min generally coexists with the official standard Chinese in all these areas. Cities with high concentrations of immigrants include New York City, especially Little Fuzhou, Manhattan, Sunset Park, Brooklyn. They are also found in various Chinatown communities in Europe, including London, Paris, Chinese communities within Ikebukuro, Tokyo, Japan as well as Sibu, Sarawak, Malaysia have significant populations of Eastern Min speakers. Fuzhou communities can also be found in Sitiawan, Perak and Yong Peng, the Eastern Min group is conventionally divided into three branches, Houguan dialect subgroup, including the Fuzhou dialect, Fuqing dialect, Lianjiang dialect and the dialect of the Matsu Islands. Funing dialect subgroup, including the Ningde dialect and the Fuan dialect, manjiang dialect, spoken in parts of Taishun and Cangnan, Wenzhou, Zhejiang. Besides these three branches, some islands in the southern Chinese province of Guangdong were classified as Eastern Min. Zhongshan Min is a group of Min varieties spoken in the Zhongshan county of Guangdong, according to Nicholas Bodman, only the Longdu dialect and Nanlang dialect belong to the Eastern Min group, while the Sanxiang dialect belongs to Southern Min. A preliminary report on the dialects of Mintung, the Gutian dialect of Min Dong District

Eastern Min
–
Min Dong (violet)

32.
Fuzhou dialect
–
The Fuzhou dialect, also Fuzhounese, Foochow or Hok-chiu, is the prestige variety of the Eastern Min branch of Min Chinese spoken mainly in eastern Fujian province. Like many other varieties of Chinese, the Fuzhou dialect is dominated by monosyllabic morphemes which carry lexical tones, while the Eastern Min branch that it belongs to is closer to Southern Min than to other Sinitic branches such as Mandarin or Hakka, they are still not mutually intelligible. Centered in Fuzhou City, the Fuzhou dialect covers eleven cities and counties, Fuzhou City Proper, Pingnan, Gutian, Luoyuan, Minqing, Lianjiang, Minhou, Changle, Yongtai, Fuqing and Pingtan. It is also the local language in many northern and middle Fujian cities and counties such as Nanping, Shaowu, Shunchang, Sanming. Fuzhou dialect is widely spoken in some regions abroad, especially in Southeastern Asian countries like Malaysia and Indonesia. The city of Sibu in Malaysia is called New Fuzhou due to the influx of immigrants there in the late 19th century and early 1900s. Similarly, quite a significant number of Fuzhounese have emigrated to Singapore, Taiwan, United States, Canada, United Kingdom, Australia, in older works, the variety is called Foochow dialect, based on the Chinese postal romanization of Fuzhou. In Chinese, it is sometimes called 福州語, native speakers also refer to it as Bàng-uâ, meaning the everyday language. After the Qin Dynasty conquered the Minyue kingdom of southeast China in 110 BC and it once served to standardize the language and is still widely quoted as an authoritative reference book in modern academic research in Min Chinese phonology. In 1842, Fuzhou was open to Westerners as a treaty port after the signing of the Treaty of Nanjing, but due to the language barrier, however, the first Christian missionary base in this city did not take place without difficulties. In order to convert Fuzhou people, those missionaries found it necessary to make a careful study of the Fuzhou dialect. Their most notable works are listed below,1856, M. C, White, The Chinese language spoken at Fuh Chau 1870, R. S. Maclay & C. C. Baldwin, An alphabetic dictionary of the Chinese language in the Foochow dialect 1871, baldwin, Manual of the Foochow dialect 1891, T. B. Adam, An English-Chinese Dictionary of the Foochow Dialect 1893, Charles Hartwell, Three Character Classic of Gospel in the Foochow Colloquial 1898, baldwin, An Alphabetic Dictionary of the Chinese Language of the Foochow Dialect, 2nd edition 1905, T. B. Adam, An English-Chinese Dictionary of the Foochow Dialect, 2nd edition]1906, The Foochow translation of the complete Bible 1923, T. B. Adam & L. P. Peet, An English-Chinese dictionary of the Foochow dialect, 2nd edition 1929, one of their most famous works was the Japanese-Chinese Translation, Fuzhou Dialect published in 1940 in Taipei, in which katakana was used to represent Fuzhou pronunciation. By the end of the Qing Dynasty, Fuzhou society had been largely monolingual, but for decades the Chinese government has discouraged the use of the vernacular in school education and in media, so the number of Mandarin speakers has been greatly boosted. Recent reports indicate that less than 50% of young people in Fuzhou are able to speak Fuzhou dialect, in Mainland China, the Fuzhou dialect has been officially listed as an Intangible Cultural Heritage and promotion work is being systematically carried out to preserve its use

33.
Foochow Romanized
–
Foochow Romanized, also known as Bàng-uâ-cê or Hók-ciŭ-uâ Lò̤-mā-cê, is a Latin alphabet for the Fuzhou dialect of Eastern Min adopted in the middle of 19th century by Western missionaries. It had varied at different times, and became standardized in the 1890s, Foochow Romanized was mainly used inside of Church circles, and was taught in some Mission Schools in Fuzhou. But unlike its counterpart Pe̍h-ōe-jī for Hokkien, even in its prime days Foochow Romanized was by no means universally understood by Christians. After Fuzhou became one of the five Chinese treaty ports opened by the Treaty of Nanjing at the end of First Opium War, faced with widespread illiteracy, they developed Latin alphabets for Fuzhou dialect. The first attempt in romanizing Fuzhou dialect was made by the American Methodist M. C, white, who borrowed a system of orthography known as the System of Sir William Jones. In this system,14 initials were designed according to their voicing. ⟨p⟩, ⟨t⟩, ⟨k⟩ and ⟨ch⟩ stand for, and, besides the default five vowels of Latin alphabet, four diacritic-marked letters ⟨è⟩, ⟨ë⟩, ⟨ò⟩ and ⟨ü⟩ were also introduced, representing, and, respectively. This system is described at length in Whites linguistic work The Chinese Language Spoken at Fuh Chau, the most significant change was made for the plosive consonants, where the spiritus lenis ⟨᾿⟩ of the aspirated initials was removed and the letters ⟨b⟩, ⟨d⟩ and ⟨g⟩ substituted for and. In the aspect of vowels, ⟨è⟩, ⟨ë⟩, ⟨ò⟩ and ⟨ü⟩ were replaced by ⟨a̤⟩, ⟨e̤⟩, ⟨o̤⟩, since the diacritical marks were all shifted to underneath the vowels, this left room above the vowels which was occupied by the newly introduced tonal marks. Thus Foochow Romanized avoids the potentially awkward diacritic stacking seen for instance in the Vietnamese script, the sample characters are taken from the Qi Lin Bayin, a renowned phonology book about the Fuzhou dialect written in the Qing Dynasty. The pronunciations are recorded in standard IPA symbols, note that Foochow Romanized uses the breve, not the caron, to indicate Yīnpíng and Yángrù tones of Fuzhou dialect. Everything You Want To Know About Foochow Romanized Gô Iók Cŭ, The Old Testament, sĭng Iók Cŭ, The New Testament, in Foochow Romanized. An English-Chinese Dictionary of the Foochow Dialect, by T. B, adam,1905 Learning material of Foochow Romanized at the Wayback Machine

Foochow Romanized
–
An English-Chinese Dictionary of the Foochow Dialect, 2nd Edition, published in 1905
Foochow Romanized
–
Bible in Foochow Romanized (Exodus), published by British and Foreign Bible Society in 1908
Foochow Romanized
–
Dictionary of the Foochow dialect, 3rd Edition, published in 1929
Foochow Romanized

34.
Middle Chinese
–
The fanqie method used to indicate pronunciation in these dictionaries, though an improvement on earlier methods, proved awkward in practice. The mid 12th-century Yunjing and other rime tables incorporate a more sophisticated, the rime tables attest to a number of sound changes that had occurred over the centuries following the publication of the Qieyun. Linguists sometimes refer to the system of the Qieyun as Early Middle Chinese, the dictionaries and tables describe pronunciations in relative terms, but do not give their actual sounds. The Swedish linguist Bernard Karlgren believed that the recorded a speech standard of the capital Changan of the Sui and Tang dynasties. This composite system contains important information for the reconstruction of the system of Old Chinese phonology. The Middle Chinese system is used as a framework for the study. Branches of the Chinese family such as Mandarin, Yue and Wu can be treated as divergent developments from the Qieyun system. The reconstruction of Middle Chinese phonology is largely dependent upon detailed descriptions in a few original sources, the most important of these is the Qieyun rime dictionary and its revisions. The Qieyun is often used together with interpretations in Song dynasty rime tables such as the Yunjing, Qiyinlue, Chinese scholars of the Northern and Southern dynasties period were concerned with the correct recitation of the classics. Various schools produced dictionaries to codify reading pronunciations and the associated rhyme conventions of regulated verse, the Qieyun was an attempt to merge the distinctions in six earlier dictionaries, which were eclipsed by its success and are no longer extant. It was accepted as the standard reading pronunciation during the Tang dynasty, the Qieyun is thus the oldest surviving rime dictionary and the main source for the pronunciation of characters in Early Middle Chinese. The rime dictionaries organize Chinese characters by their pronunciation, according to a hierarchy of tone, rhyme, the fanqie system uses multiple equivalent characters to represent each particular initial, and likewise for finals. The categories of initials and finals actually represented were first identified by the Cantonese scholar Chen Li in an analysis published in his Qièyùn kǎo. The Qieyun classified homonyms under 193 rhyme classes, each of which is placed one of the four tones. A single rhyme class may contain multiple finals, generally differing only in the medial or in so-called chongniu doublets, the Yunjing is the oldest of the so-called rime tables, which provide a more detailed phonological analysis of the system contained in the Qieyun. However, the analysis shows some influence from LMC, which needs to be taken into account when interpreting difficult aspects of the system. The Yunjing is organized into 43 tables, each covering several Qieyun rhyme classes, and classified as, One of 16 broad rhyme classes, each described as either inner or outer. The meaning of this is debated but it has suggested that it refers to the height of the main vowel, with outer finals having an open vowel

Middle Chinese
–
The start of the first rhyme class of the Guangyun (東 dōng "east")
Middle Chinese
–
The first table of the Yunjing, covering the Guangyun rhyme classes 東 dōng, 董 dǒng, 送 sòng and 屋 wū (-k in Middle Chinese)

35.
Zhuang language
–
The Zhuang languages are any of more than a dozen Tai languages spoken by the Zhuang people of southern China in the province of Guangxi and adjacent parts of Yunnan and Guangdong. The Zhuang languages do not form a linguistic unit, as northern and southern Zhuang languages are more closely related to other Tai languages than to each other. Standard Zhuang is based on the northern Zhuang dialect of Wuming, the Tai languages are believed to have been originally spoken in what is now southern China, with speakers of the Southwestern Tai languages having emigrated in the face of Chinese expansion. He also argues that the departure of the Thai from southern China must predate the 5th century AD, zhāng Jūnrús Zhuàngyǔ Fāngyán Yánjiù is the most detailed study of Zhuang dialectology published to date. It reports survey work carried out in the 1950s, and includes a 1465-word list covering 36 varieties of Zhuang, for the list of the 36 Zhuang variants below from Zhang, the name of the region is given first, followed by the specific village. The phylogenetic position of each variant follows that of Pittayaporn, the Zhuang language has been divided by Chinese linguists into northern and southern dialects, each of which has been divided into a number of vernacular varieties by Chinese linguists. Although Southern Zhuang varieties have aspirated stops, Northern Zhuang varieties lack them, there are over 60 distinct tonal systems with 5–11 tones depending on the variety. Castro and Hansen distinguished three mutually unintelligible varieties, Central Hongshuihe, Eastern Hongshuihe and Liuqian, min Zhuang is a recently discovered variety that has never been described previous to Johnson. Pyang Zhuang and Myang Zhuang are recently described Central Tai languages spoken in Debao County, Guangxi, the Zhuang languages have been written in the Old Zhuang script, Sawndip, for over a thousand years, and possibly Sawgoek previous to that. It is used for writing songs about every aspect of life, there has also been the occasional use of pictographic proto-writing, such as in the example at right. In addition, Standard Zhuang and Bouyei are written in Latin script, chongzuo Tai languages Languages of China Ningming Mingjiang language Northern Tai languages Yongnan languages Zhuang studies Zhuàng-Hàn cíhuì 壮汉词汇. Edmondson, Jerold A. and David B, solnit, ed. Comparative Kadai, The Tai Branch. Dallas, TX, Summer Institute of Linguistics, University of Texas at Arlington,1997, a sociolinguistic introduction to the Central Taic languages of Wenshan Prefecture, China. SIL Electronic Survey Reports 2010-027,114 p. http, //www. sil. org/silesr/abstract. asp. ref=2010-027, luo Liming, Qin Yaowu, Lu Zhenyu, Chen Fulong. Zhuang–Chinese–English Dictionary / Cuengh Gun Yingh Swzdenj, nationality Press,1882 pp. ISBN 7-105-07001-3. Tán Xiǎoháng 覃晓航, Xiàndài Zhuàngyǔ 现代壮语, Tán Guóshēng 覃国生, Zhuàngyǔ fāngyán gàilùn 壮语方言概论. The Nationalities Publishing House of Yunnan, wéi Qìngwěn 韦庆稳, Tán Guóshēng 覃国生, Zhuàngyǔ jiǎnzhì 壮语简志. Zhang Junru 张均如, et al. 壮语方言研究 / Zhuang yu fang yan yan jiu, chengdu, 四川民族出版社 / Sichuan min zu chu ban she,1999

36.
Hangul
–
The Korean alphabet, known as Hangul in South Korea and as Chosŏngŭl/Chosŏn Muntcha in North Korea is the alphabet that has been used to write the Korean language since the 15th century. It was created during the Joseon Dynasty in 1443 by King Sejong the Great, in South Korea, Hangul is used primarily to write the Korean language as using Hanja in typical Korean writing had fallen out of common usage during the late 1990s. In its classical and modern forms, the alphabet has 19 consonant and 21 vowel letters, however, instead of being written sequentially like the letters of the Latin alphabet, Hangul letters are grouped into blocks, such as 한 han, each of which transcribes a syllable. That is, although the syllable 한 han may look like a single character, each syllabic block consists of two to six letters, including at least one consonant and one vowel. These blocks are arranged horizontally from left to right or vertically from top to bottom. Each Korean word consists of one or more syllables, hence one or more blocks, of the 11,172 possible Hangul syllables, the most frequent 256 have a cumulative frequency of 88. 2%, with the top 512, it reaches 99. 9%. The modern name Hangul was coined by Ju Sigyeong in 1912, han meant great in archaic Korean, and geul is the native Korean word for script. Taken together, then, the meaning is great script, as the word han had also become one way of indicating Korea as a whole the name could also potentially be interpreted as Korean script. Korean 한글 is pronounced, and in English as /ˈhɑːn. ɡʊl/ or /ˈhɑːŋɡʊl/, when used as an English word, it is often rendered without the diacritics, hangul, and it is often capitalized as Hangul, as it appears in many English dictionaries. Hankul in the Yale romanization, a system recommended for technical linguistic studies, North Koreans call it Chosŏngŭl, after Chosŏn, the North Korean name for Korea. Because of objections to the names Hangeul, Chosŏngŭl, and urigeul by Koreans in China, until the early 20th century, Hangul was denigrated as vulgar by the literate elite, who preferred the traditional hanja writing system. They gave it such names as these, Achimgeul, in the original Hanja, it is rendered as 故智者不終朝而會，愚者可浹旬而學。 Gugmun Eonmun Amgeul. Am is a prefix that signifies a noun is feminine Ahaetgeul or Ahaegeul Hangul was promulgated by Sejong the Great, the Hall of Worthies, a group of scholars who worked with Sejong to develop and refine the new alphabet, is often credited for the work. The project was completed in late December 1443 or January 1444, the publication date of the Hunmin Jeong-eum, October 9, became Hangul Day in South Korea. Its North Korean equivalent, Chosongul Day, is on January 15, various speculations about the creation process were put to rest by the discovery in 1940 of the 1446 Hunmin Jeong-eum Haerye. This document explains the design of the consonant letters according to articulatory phonetics, to assuage this problem, King Sejong created the unique alphabet known as Hangul to promote literacy among the common people. However, it entered popular culture as Sejong had intended, being used especially by women, the late 16th century, however, saw a revival of Hangul, with gasa literature and later sijo flourishing. In the 17th century, Hangul novels became a major genre, by this point spelling had become quite irregular

Hangul
–
Korean writing systems
Hangul
–
Chosŏn'gŭl (top), and Hangul (bottom)
Hangul
–
A page from the Hunmin Jeong-eum Eonhae. The Hangul-only column, third from the left (나랏말 ᄊᆞ 미), has pitch-accent diacritics to the left of the syllable blocks.
Hangul
–
Calligraphy

37.
Hanja
–
Hanja is the Korean name for Chinese characters. Borrowed from Chinese and incorporated into the Korean language with Korean pronunciation, hanja-mal or hanja-eo refers to words that can be written with hanja, and hanmun refers to Classical Chinese writing, although hanja is sometimes used loosely to encompass these other concepts. Because hanja never underwent major reform, they are almost entirely identical to traditional Chinese, only a small number of hanja characters are modified or unique to Korean. By contrast, many of the Chinese characters currently in use in Japan and Mainland China have been simplified, and contain fewer strokes than the corresponding hanja characters. Today, a working knowledge of Chinese characters is still important for anyone who wishes to study older texts. Learning a certain number of hanja is very helpful for understanding the etymology of Sino-Korean words, hanja are not used to write native Korean words, which are always rendered in hangul, and even words of Chinese origin—hanja-eo —are written with the hangul alphabet most of the time. A major motivation for the introduction of Chinese characters into Korea was the spread of Buddhism, the major Chinese text that introduced hanja to Koreans, however, was not a religious text but the Chinese text, Cheonjamun. One way of adapting hanja to write Korean in such systems was to represent native Korean grammatical particles, for example, Gugyeol uses the characters 爲尼 to transcribe the Korean word hăni, in modern Korean, that means does, and so. However, in Chinese, the characters are read as the expression wéi ní. This is an example of Gugyeol words where the radical is read in Korean for its meaning. Hanja was the means of writing Korean until King Sejong the Great promoted the invention of hangul in the 15th century. However, even after the invention of hangul, most Korean scholars continued to write in hanmun and it was not until the 20th century that hangul truly replaced hanja. Officially, hanja has not been used in North Korea since June 1949, additionally, many words borrowed from Chinese have been replaced in the North with native Korean words. However, there are a number of Chinese-borrowed words in widespread usage in the North. The replacement has been less total in South Korea where, although usage has declined over time, some remains in common usage in some contexts. Each hanja is composed of one of 214 radicals plus in most cases one or more additional elements, the vast majority of hanja use the additional elements to indicate the sound of the character, but a few hanja are purely pictographic, and some were formed in other ways. This dual meaning-sound reading of a character is called eumhun, the word or words used to denote the meaning are often—though hardly always—words of native Korean origin, and are sometimes archaic words no longer commonly used. South Korean primary schools abandoned the teaching of hanja in 1971 and it is taught in separate courses in South Korean high schools, separately from the normal Korean-language curriculum

38.
Revised Romanization of Korean
–
The Revised Romanization of Korean is the official Korean language romanization system in South Korea proclaimed by Ministry of Culture and Tourism to replace the older McCune–Reischauer system. The new system eliminates diacritics in favor of digraphs and adheres closely to Korean phonology than to a suggestive rendition of Korean phonetics for non-native speakers. The Revised Romanization limits itself to the ISO basic Latin alphabet, apart from limited and it was developed by the National Academy of the Korean Language from 1995 and was released to the public on 7 July 2000 by South Koreas Ministry of Culture and Tourism in Proclamation No. 2000-8, which cites these reasons for the new system, It promotes consistent romanization by native Korean speakers by the transcription of important language characteristics. It reduces the confusion caused by the frequent omission of apostrophes and it rationalizes the Korean language with the plain ASCII text of internet domain names. Basic principles of romanization are, Romanization is based on standard Korean pronunciation, symbols other than Roman letters are avoided to the greatest extent possible. These are notable features of the Revised Romanization system, Vowels ㅓ/ʌ/ and ㅡ/ɯ/ are written as digraphs, however, ㅝ/wʌ/ is written as wo, and ㅢ/ɰi/ is written as ui. Unlike McCune–Reischauer, aspirated consonants have no apostrophe, k, t, p and their unaspirated counterparts are written with letters that are voiced in English, g, d, b, j. ㅅ/s/ is written as s regardless of the vowels and semivowels, there is no sh, 사 → sa. When followed by another consonant or when in position, it is written as t, 옷 → ot. ㄹ/l/ is r before a vowel or a semivowel and l everywhere else, 리을 → rieul, 철원 → Cheorwon, 울릉도 → Ulleungdo, other rules and recommendations include the following, A hyphen optionally disambiguates syllables, 가을 → ga-eul versus 개울 → gae-ul. However, few official publications make use of this provision since actual instances of ambiguity among names are rare, however, names for geographic features and artificial structures are not hyphenated, 설악산 → Seoraksan, 해인사 → Haeinsa Proper nouns are capitalized. Given names and commercial names are encouraged to change, but it is not required, all Korean textbooks were required to comply with the new system by February 28,2002. English-language newspapers in South Korea initially resisted the new system by citing its flaws, the Korea Times was the last major English-language newspaper to do so and switched only in May 2006. North Korea continues to use a version of the McCune–Reischauer system of Romanization, textbooks and dictionaries intended for students of the Korean language tend to include this Romanization. However, some publishers have acknowledged the difficulties or confusion it can cause for non-native Korean speakers who are unused to the conventions of style of Romanization. ㄱ, ㄷ, ㅂ, and ㄹ are usually transcribed as g, d, b, and r when appearing before a vowel, and as k, t, p, and l when followed by another consonant or when appearing at the end of a word. The revised romanization transcribes certain phonetic changes that occur with combinations of the consonant of a character

39.
Kanji
–
Kanji, or kanji, are the adopted logographic Chinese characters that are used in the modern Japanese writing system along with hiragana and katakana. The Japanese term kanji for the Chinese characters literally means Han characters and is using the same characters as the Chinese word hànzì. Chinese characters first came to Japan on official seals, letters, swords, coins, mirrors, the earliest known instance of such an import was the King of Na gold seal given by Emperor Guangwu of Han to a Yamato emissary in 57 AD. Chinese coins from the first century AD have been found in Yayoi-period archaeological sites, however, the Japanese of that era probably had no comprehension of the script, and would remain illiterate until the fifth century AD. The earliest Japanese documents were written by bilingual Chinese or Korean officials employed at the Yamato court. For example, the correspondence from King Bu of Wa to Emperor Shun of Liu Song in 478 has been praised for its skillful use of allusion. Later, groups of people called fuhito were organized under the monarch to read, during the reign of Empress Suiko, the Yamato court began sending full-scale diplomatic missions to China, which resulted in a large increase in Chinese literacy at the Japanese court. The Japanese language had no form at the time Chinese characters were introduced. Chinese characters also came to be used to write Japanese words, around 650 CE, a writing system called manyōgana evolved that used a number of Chinese characters for their sound, rather than for their meaning. Manyōgana written in cursive style evolved into hiragana, or onna-de, that is, ladies hand, major works of Heian-era literature by women were written in hiragana. Katakana emerged via a path, monastery students simplified manyōgana to a single constituent element. Thus the two writing systems, hiragana and katakana, referred to collectively as kana, are descended from kanji. Katakana are mostly used for representing onomatopoeia, non-Japanese loanwords, the names of plants and animals, and for emphasis on certain words. In 1946, following World War II and under the Allied Occupation of Japan and this was done with the goal of facilitating learning for children and simplifying kanji use in literature and periodicals. The number of characters in circulation was reduced, and formal lists of characters to be learned during each grade of school were established, some characters were given simplified glyphs, called shinjitai. Many variant forms of characters and obscure alternatives for common characters were officially discouraged and these are simply guidelines, so many characters outside these standards are still widely known and commonly used, these are known as hyōgaiji. The kyōiku kanji are 1,006 characters that Japanese children learn in elementary school, originally the list only contained 881 characters. This was expanded to 996 characters in 1977 and it was not until 1982 the list was expanded to its current size

Kanji
–
Chinese characters
Kanji
–
A young woman practicing kanji. Ukiyo-e woodblock print by Yōshū Chikanobu, 1897
Kanji
–
Nihon Shoki (720 AD), considered by historians and archaeologists as the most complete extant historical record of ancient Japan, was written entirely in kanji.
Kanji
–
A jūbako (重箱 ?), which has a mixed on-kun reading.

40.
Hiragana
–
Hiragana is a Japanese syllabary, one component of the Japanese writing system, along with katakana, kanji, and in some cases rōmaji. It is a lettering system. The word hiragana literally means ordinary or simple kana, Hiragana and katakana are both kana systems. With one or two exceptions, each sound in the Japanese language is represented by one character in each system. Because the characters of the kana do not represent single consonants, words that do have common kanji renditions may also sometimes be written instead in hiragana, according to an individual authors preference, for example to impart an informal feel. Hiragana is also used to write furigana, a reading aid that shows the pronunciation of kanji characters, There are two main systems of ordering hiragana, the old-fashioned iroha ordering and the more prevalent gojūon ordering. Of the 50 theoretically possible combinations, yi and wu do not exist in the language, wo is usually pronounced as a vowel in modern Japanese, and is preserved in only one use, as a particle. Romanization of the kana does not always follow the consonant-vowel scheme laid out in the table. For example, ち, nominally ti, is very often romanised as chi in an attempt to represent the actual sound in Japanese. These basic characters can be modified in various ways, by adding a dakuten marker, a voiceless consonant is turned into a voiced consonant, k→g, ts/s→z, t→d, h→b and ch/sh→j. Hiragana beginning with an h can also add a handakuten marker changing the h to a p, a small version of the hiragana for ya, yu or yo may be added to hiragana ending in i. This changes the i vowel sound to a glide to a, u or o, for example, き plus ゃ becomes きゃ. Addition of the small y kana is called yōon, a small tsu っ, called a sokuon, indicates that the following consonant is geminated. In Japanese this is an important distinction in pronunciation, for example, the sokuon also sometimes appears at the end of utterances, where it denotes a glottal stop, as in いてっ. However, it cannot be used to double the na, ni, nu, ne, no syllables consonants – to double these, Hiragana usually spells long vowels with the addition of a second vowel kana, for example, おかあさん. In informal writing, small versions of the five vowel kana are used to represent trailing off sounds. Standard and voiced iteration marks are written in hiragana as ゝ and ゞ respectively, the following table shows the complete hiragana together with the Hepburn romanization and IPA transcription in the gojūon order. Hiragana with dakuten or handakuten follow the gojūon kana without them, obsolete and normally unused kana are shown in gray

Hiragana
–
Hiragana characters' shapes were derived from the Chinese cursive script (sōsho). Shown here is a sample of the cursive script by Chinese Tang Dynasty calligrapher Sun Guoting, from the late 7th century.

41.
Hepburn romanization
–
The system was originally proposed by the Romanization Club in 1885. The revised edition by Romaji-Hirome-kai in 1908 is called standard style romanization, Hepburn is based on English phonology and has competed with the alternative Nihon-shiki romanization, which was developed in Japan as a replacement of Japanese script. In 1930, a Special Romanization Study Commission was appointed to compare the two, the ordinance was temporarily overturned by the Supreme Commander for the Allied Powers during the Occupation of Japan, but was reissued in 1954. In 1972, a version of Hepburn was codified as ANSI standard Z39. 11-1972. It was proposed in 1989 as a draft for ISO3602, the ANSI Z39. 11-1972 standard was consequently deprecated on October 6,1994. As of 1978, the Ministry of Foreign Affairs, the Ministry of International Trade and Industry, in addition The Japan Times, the Japan Travel Bureau, and many other private organizations used Hepburn instead of Kunrei-shiki. The National Diet Library used Kunrei-shiki, although Hepburn is not a government standard, some government agencies mandate it. In many other areas where it de jure status, Hepburn remains the de facto standard. Signs and notices in city offices and police stations, at shrines, temples, English-language newspapers and media use the simplified form of Hepburn. Cities and prefectures use it in information for English-speaking residents and visitors, official tourism information put out by the government uses it, as do guidebooks, local and foreign, on Japan. Many students of Japanese as a foreign language learn Hepburn, there are many variants of Hepburn romanization. The two most common styles are, Traditional Hepburn, as defined in various editions of Hepburns dictionary and this variant is characterized by the rendering of syllabic n as m before the consonants b, m and p, e. g. Shimbashi for 新橋. Modified Hepburn, also known as Revised Hepburn, in which the rendering of syllabic n as m before certain consonants is no longer used, in Japan itself, there are some variants officially mandated for various uses, Railway Standard, which follows the Hyōjun-shiki Rōmaji. All JR railways and other major railways use this type for station names, Ministry of Land, Infrastructure, Transport and Tourism Standard, which follows the modified Hepburn style. This is used for road signs, Ministry of Foreign Affairs Passport Standard, a permissive standard which explicitly allows the use of non-Hepburn romaji in personal names, notably for passports. In particular, rendering the syllabic n as m before b, m, p, details of these variants can be found below. The romanizations set out in the first and second versions of Hepburns dictionary are primarily of historical interest, supporters argue that Hepburn is not intended as a linguistic tool. The long vowels are indicated by macrons

42.
Kunrei-shiki romanization
–
Kunrei-shiki rōmaji is a Cabinet-ordered, Japanese romanization system, i. e. a system for transcribing the Japanese language into the Latin alphabet. Its name is rendered Kunreisiki using Kunrei-shiki itself, Kunrei-shiki is sometimes known as the Monbushō system in English, because it is taught in the Monbushō-approved elementary school curriculum. The ISO has standardized Kunrei-shiki under ISO3602, Kunrei-shiki is based on the older Nihon-shiki system, modified for modern standard Japanese. For example, the word かなづかい, romanized kanadukai in Nihon-shiki, is pronounced kanazukai in common modern Japanese, Kunrei-shiki competes with the older Hepburn romanization system, which was promoted by the authorities during the occupation of Japan. In the period before World War II there was a conflict between supporters of Hepburn romanization and supporters of Nihon-shiki romanization. In 1930 a board of inquiry under the aegis of the Minister of Education was established to determine the proper romanization system, the Japanese government, by cabinet order, announced on September 21,1937 that a modified form of Nihon-shiki would be officially adopted as Kunrei-shiki. The form at the time differs slightly from the modern day form, originally the system was called the Kokutei system. The Japanese government gradually introduced Kunrei-shiki and it appeared in secondary education, on railway station signboards, on nautical charts, and on the 1,1,000,000 scale International Map of the World. In the period when the government had strong control, from 1937 to 1945. In Japan, some use of Nihon-shiki and Modified Hepburn remained because some individuals supported the use of those systems, some editorials printed in Japanese newspapers advocated for only using Hepburn. Supporters of Hepburn denounced pro-Kunrei-shiki and pro-Nihon-shiki advocates to the SCAP offices, accusing them of being inactive militarists, unger said that the nature of Kunrei-shiki led to pent-up anger among the Hepburn supporters. During the post-war period, several educators and scholars tried to introduce romanized letters as a teaching device, however Kunrei-shiki had associations with Japanese militarism and the U. S. occupying government was reluctant to promote it. On December 9,1954, the Japanese government re-confirmed Kunrei-shiki as its official system, eleanor Jorden, an American linguist, made textbooks that used a modified version of Kunrei-shiki. They were used in the 1960s in courses given to U. S. diplomats, the use of her books did not change the U. S. governments hesitation to use Kunrei-shiki. As of 1974, according to the Geographical Survey Institute, Kunrei-shiki was used for topographical maps while Modified Hepburn was used for geological maps, as of 1978, the National Diet Library used Kunrei-shiki. The Ministry of Foreign Affairs, the Ministry of International Trade and Industry, in addition The Japan Times, the Japan Travel Bureau, and many other private organizations used Hepburn instead of Kunrei-shiki. The system was promulgated as Japanese Cabinet Order No.3 as of September 21,1937. But since this had been overturned by the SCAP during the Occupation of Japan, Kunrei-shiki has been recognized, along with Nihon-shiki, in ISO3602,1989

43.
Chinese script styles
–
In Chinese calligraphy, Chinese characters can be written according to five major styles. These styles are intrinsically linked to the history of Chinese script, the Japanese syllabaries of katakana and hiragana are used in calligraphy, the katakana were derived from the shapes of regular script characters and hiragana from those of cursive script. In Korea, the post-Korean War period saw the use of hangul. The seal script is the script of the Qín system of writing. Today, this style of Chinese writing is used predominantly in seals, most people today cannot read the seal script, so it is considered an ‘ancient’ script, generally not used outside the fields of calligraphy and carved seals. Compared with the seal script, forms are strikingly rectilinear, however, some curvature. Seal script tended towards uniformity of stroke width, but clerical script gave the brush freer rein, most noticeable is the dramatically flared tail of one dominant horizontal or downward-diagonal stroke, especially that to the lower right. This characteristic stroke has famously been called silkworm head and wild goose tail due to its distinctive shape, the clerical script remains common as a typeface used for decorative purposes, but other than in artistic calligraphy, adverts and signage, it is not commonly written. The semi-cursive script approximates normal handwriting in which strokes and, more rarely, in writing in the semi-cursive script, the brush leaves the paper less often than in the regular script. Characters appear less angular and rounder, in general, an educated person in China or Japan can read characters written in the semi-cursive script with relative ease, but may have occasional difficulties with certain idiosyncratic shapes. Entire characters may be written without lifting the brush from the paper at all, strokes are modified or eliminated completely to facilitate smooth writing and to create a beautiful, abstract appearance. Characters are highly rounded and soft in appearance, with a lack of angular lines. The cursive script is the source of Japanese hiragana, as well as many modern simplified forms in Simplified Chinese characters and it emerged from a neatly written, early period semi-cursive form of clerical script. As the name suggests, the script is regular, with each of the strokes placed slowly and carefully. The regular script is also the most easily and widely recognized style, as it is the script to which children in East Asian countries and beginners of East Asian languages are first introduced. In the regular script samples to the right, the characters in the column are in Traditional Chinese while those to the right are in Simplified Chinese. These styles are not taught in Japanese calligraphy schools. Chinese and Korean people can read edomoji, but the style has a distinct Japanese feel to it and it is therefore commonly used in China and Korea to advertise Japanese restaurants

44.
Neolithic signs in China
–
However, only isolated instances of these symbols have been found, and they show no indication of representing speech or of the non-pictorial processes that a writing system requires. These sites include those pertaining to the cultures of Yangshao, Liangzhu, Majiayao, proponents of the view that they are early Chinese writing tend to see evidence in comparisons of individual signs with individual oracle bone script characters. Others believe that Neolithic signs are part of an incipient semiotic system that led to the development of mature Chinese writing. As Qiu explains, Only when symbols. are consciously used to record words used to form sentences is there a true sign that the development of script has begun. It is still safe to conclude that the earliest known undisputed examples of writing in China are the oracle bones of the late Shang dynasty. The earliest of Chinas Neolithic signs come from Jiahu, Dadiwan, Jiahu is a Neolithic site in Wuyang County, Henan Province, in the basin of the Yellow River, dated to 6600–6200 BCE. This site has yielded turtle plastrons that were pitted and inscribed with markings known as the Jiahu symbols, what did persist through these long periods was the idea of sign use. We should not assume there was a single path or pace for the development of a script. It seems astonishing that they would be connected, We cant call it writing until we have more evidence, Dadiwan is a Neolithic site discovered in Qinan County, in the province of Gansu. Its earliest phase has yielded symbols painted on the surfaces of pottery basins. More recent excavations there have also uncovered a handful of Neolithic symbols and these are reputed to be similar to some of the oracle bone characters, which is to be expected given that the oracle bones, which are true writing, still retain a significant pictorial flavor. As the Banpo symbols were discovered early and are relatively numerous. Still others feel they may be ownership mark or potters marks, finally, some scholars sound a note of caution, calling such conclusions unwarranted or premature. This is because all the Banpo-type symbols occur singly, on pottery and pottery fragments, unlike written words, thus, there is no context from which to conclude that the symbols are actually being used to represent language. Furthermore, there is no evidence of the phonetic loan usage, thus, leading scholar Qiu Xigui argues that, What these symbols represent definitely cannot be a fully formed system of writing, this much is quite clear. Is there any possibility that they are primitive writing and we simply possess no basis for saying that they were already being used to record language. Nor viewed from the standpoint of the symbols of this type that continued to be used following the creation of Chinese script do they even resemble script. Quite a number of people, basing themselves on the Banpo-type symbols, have said that the history of Chinese writing goes back more than 6,000 years

45.
Oracle bone script
–
The vast majority were found at the Yinxu site. They record pyromantic divinations of the last nine kings of the Shang dynasty, beginning with Wu Ding, whose accession is dated by different scholars at 1250 BCE or 1200 BCE. After the Shang were overthrown by the Zhou dynasty in c.1046 BCE, divining with milfoil became more common and it is also the oldest known member and ancestor of the Chinese family of scripts, preceding the bronzeware script. The term oracle bone was first coined by the American missionary Frank H. Chalfant in his 1906 book Early Chinese Writing, because turtle shells as well as bones were used, the oracle bone script is also sometimes called shell and bone script. As the majority of oracle bones bearing writing date to the late Shang dynasty and it is certain that Shang-lineage writing underwent a period of development before the Anyang oracle bone script, because of its mature nature. However, no significant quantity of clearly identifiable writing from before or during the early to middle Shang cultural period has been discovered, the oracle bone script of the late Shang appears pictographic, as does its contemporary, the Shang writing on bronzes. The earliest oracle bone script appears even more so than examples from late in the period, additionally, the writing of characters in vertical columns, from top to bottom, is for the most part carried over from the bamboo books to oracle bone inscriptions. The vertical columns of text in Chinese writing are traditionally ordered from right to left and this level of maturity clearly implies an earlier period of development of at least several hundred years. Compare, for instance, the third and fourth graphs in the row below, without careful research to compare these to later forms, one would probably not know that these represented 豕 shĭ swine and 犬 quǎn dog respectively. By the late Shang oracle bone script, the graphs had already evolved into a variety of mostly non-pictographic functions, phonetic loan graphs, semantic-phonetic compounds, and associative compounds were already common. Although it was a fully functional writing system, the oracle bone script was not fully standardized, a graph when inverted horizontally generally refers to the same word, and additional components are sometimes present without changing the meaning. These irregularities persisted until the standardization of the script in the Qin dynasty. Of the thousands of characters found from all the fragments so far. One reason for this is that components of certain oracle bone script characters may differ in later script forms, such differences may be accounted for by character simplification and/or by later generations misunderstanding the original graph, which had evolved beyond recognition. Another reason is that some characters exist only in oracle bone script, in such cases, context - when available - may be used to determine the possible meaning of the character. One good example is shown in the fragment below, labeled oracle bone script for Spring, the top left character in this image has no known modern Chinese counterpart. One of the better known characters however is shown directly beneath it looking like an isosceles triangle with a line cutting through the upper portion. This is the oracle bone script character for 王 wáng, the numbers of oracle bones with inscriptions contemporaneous with the end of Shang and the beginning of Zhou is relatively few in number compared with the entire corpus of Shang inscriptions

Oracle bone script
–
Oracle bone script
Oracle bone script
–
Wang Yirong, Chinese politician and scholar, was the first to recognize the oracle bone inscriptions as ancient writing.
Oracle bone script
–
An oracle bone (which is incomplete) with a diviner asking the Shang king if there would be misfortune over the next ten days
Oracle bone script
–
Oracle script from a divining

46.
Chinese bronze inscriptions
–
Early bronze inscriptions were almost always cast, while later inscriptions were often engraved after the bronze was cast. The bronze inscriptions are one of the earliest scripts in the Chinese family of scripts, for the early Western Zhou to early Warring States period, the bulk of writing which has been unearthed has been in the form of bronze inscriptions. As a result, it is common to refer to the variety of scripts of this period as bronze script, the term usually includes bronze inscriptions of the preceding Shang dynasty as well. In addition, artistic scripts also emerged in the late Spring and Autumn to early Warring States, such as Bird Script, also called Bird Seal Script, and Worm Script. 寅 Yín in Four Different Scripts on Shang–Zhou bronzes Of the abundant Chinese ritual bronze artifacts extant today and these have been periodically unearthed ever since their creation, and have been systematically collected and studied since at least the Song dynasty. Examples, Of the 12,000 inscribed bronzes extant today, roughly 3,000 date from the Shang dynasty,6,000 from the Zhou dynasty, and the final 3,000 from the Qin and Han dynasties. As in the oracle bone script, characters could be written facing left or right, turned 90 degrees, for instance, and both represent the modern character xū 戌, while and are both hóu 侯 marquis. This was true of normal as well as extra complex identificational graphs, such as the hǔ 虎 tiger clan emblem at right and these inscriptions are almost all cast, and are relatively short and simple. Western Zhou dynasty characters basically continue from the Shang writing system, although most are not pictographs in function, the early Western Zhou bronze inscriptions have been described as more pictographic in flavor than those of subsequent periods. During the Western Zhou, many begin to show signs of simplification and linearization, with the result being a decrease in pictographic quality. Some flexibility in orientation of graphs continues in the Western Zhou, the graphs start to become slightly more uniform in structure, size and arrangement by the time of the third Zhou sovereign, King Kāng, and after the ninth, King Yì, this trend becomes more obvious. Some have used the term large seal to refer to the script of this period. This term dates back to the Han dynasty, when seal script, in the Eastern Zhou, the various states initially continued using the same forms as in the late Western Zhou. However, regional forms then began to diverge stylistically as early as the Spring and Autumn period, in the same areas, in the late Spring and Autumn to early Warring States, scripts which embellished basic structures with decorative forms such as birds or worms also appeared. Some bronzes of the period were incised in a rough, casual manner and it is thought that these reflected the popular writing of the time which coexisted with the formal script. Seals have been found from the Warring States period, mostly cast in bronze and these form an additional, valuable resource for the study of Chinese bronze inscriptions. It is also from this period that the first surviving bamboo, in the early Warring States period, typical bronze inscriptions were similar in content and length to those in the late Western Zhou to Spring and Autumn period. One of the most famous sets of bronzes ever discovered dates to the early Warring States, the total length of the inscriptions on this set was almost 2,800 characters

47.
Seal script
–
Seal script is an ancient style of writing Chinese characters that was common throughout the latter half of the 1st millennium BC. It evolved organically out of the Zhou dynasty script, arising in the Warring State of Qin, there are two uses of the word seal script, the Large or Great Seal script and the lesser or Small Seal Script, the latter is also called simply seal script. The Large Seal script was originally a later, vague Han dynasty reference to writing of the Qin system similar to and it has also been used to refer to Western Zhou forms or even oracle bones as well. There were several different variants of seal script which developed in each kingdom independently during the warring state period and spring, the birds and worms script was used in the Kingdoms of Wu, Chu, and Yue. It was found on several artifacts including the Spear of Fuchai, on one side of the blade of Goujian, eight characters were written in an ancient script. The script was found to be the one called 鳥蟲文 birds and worms characters, initial analysis of the text deciphered six of the characters, 越王 and 自作用劍. As a southern state, Chu was close to the Wu-Yue influences, Chu produced broad bronze swords that were similar to Wuyue swords, but not as intricate. Chu also used the Birds and Worms style, which was borrowed by the Wu, the script of the Qin system had evolved organically from the Zhou script starting in the Spring and Autumn period. Beginning around the Warring States period, it became vertically elongated with a regular appearance and this was the period of maturation of Small Seal script, also called simply seal script. Through Chinese commentaries, it is known that Li Si compiled Cangjiepian and their form is characterized by being less rectangular and more squarish. The first known character dictionary was the 3rd century BC Erya, collated and bibliographed by Liu Xiang and his son Liu Xin, not long after, however, the Shuowen Jiezi, the lifework of Xu Shen, was written. Its 9,353 entries reproduce the standardized small-seal script variant for each entry, entries are categorized under 540 section headers. It has been anticipated that the Small Seal script will some day be encoded in Unicode, codepoints U+34000 to U+368FF have been tentatively allocated. Old Texts Phags-pa Chén Zhāoróng Research on the Qín Lineage of Writing, academia Sinica, Institute of History and Philology Monograph. Qin Yun Song, Big Seal Script Banner Script translation Richard Sears on seal script

48.
Bird-worm seal script
–
Bird-worm seal script is a type of ancient seal script originated in China. The Chinese character 鸟 means bird, the Chinese character 虫 means any creature that looks like a worm, including invertebrate worms and reptiles such as snakes and lizards. The character 篆 means seal script, other names for this kind of seal script, Niao-Chong Script. The Chinese character 书 here means script, the Chinese character 文 here means character. There are two subcategories, Bird seal script In this style, some parts of characters have a bird-like head, the bird style sign is a combination of two parts, a complete seal script character and one bird shape. Worm seal script In this style, some or all the strokes are winding, thus producing a worm-like character and this kind of seal script first appeared in the middle era of the Spring and Autumn period. It then became popular during the late Spring and Autumn period and it was often seen in southern kingdoms, such as the Wu Kingdom, Yue Kingdom, Chu Kingdom, Cai Kingdom, Xu Kingdom, and the Song Kingdom. Each state in China during the Warring States Period had its own variety of script, the bird seal script is often seen on bronze and iron antiques of the Yue Kingdom. The script was used on bronze and iron weapons, like swords, the characters engraved on the famous Sword of Goujian provide a fine example. A few examples of the seal script can be seen in or on containers. The bird seal script was used occasionally in the Han Dynasty seals, as well as a few eaves tiles. The worm seal script is more common, and probably originated from the Wu Kingdom or Chu Kingdom, examples can be seen on antique bronze weapons, containers, jades, and seals, and constructional or decorative parts like tiles, etc. The characters on the famous Spear of Fuchai would be an example of this category of seal script. Bronze inscriptions Seal script Large Seal Script Small Seal Script Seal Shuowen Jiezi, by Xu Shen. 《鸟虫书通考》, by CAO Jinyan, 《鸟虫篆大鉴》, by Xu Gupu, ISBN 7-80569-368-4, Shanghai Bookstore Press

Bird-worm seal script
–
Bird seal script on the Sword of Goujian and its equivalents in modern Chinese

49.
Large Seal Script
–
The term is in contrast to the name of the official script of the Qin dynasty, which is often called Small or Lesser Seal Script. However, due to the lack of precision in the term, scholars often avoid it and instead refer more specifically to the provenance of particular examples of writing. At that time, there was knowledge of even older, often more complex graphs which differed from the Qin seal script forms. As a result, two terms emerged to describe them, greater seal script for the complex, earlier forms. It is only recently that the term greater seal script has been extended to refer to Western Zhou forms or even oracle bone script. Xu Shen, the author of Shuowen, included these when they differed from the structures of the Qin seal script and this name comes from the name of the book and not the name of a script. Thus, it is not correct to refer to the ca.800 BCE Zhoū dynasty script as Zhòuwén, similarly, the Zhòu graphs are merely examples of large seal script when that term is used in a broad sense. Seal Script Small Seal Script Chén Zhāoróng Research on the Qín Lineage of Writing, academia Sinica, Institute of History and Philology Monograph. Translation of 文字學概要 by Gilbert L. Mattos and Jerry Norman, Early China Special Monograph Series No.4. Berkeley, The Society for the Study of Early China and the Institute of East Asian Studies, University of California, Berkeley

50.
Small Seal Script
–
Small Seal Script, formerly romanized as Hsiao-chuan and also known as Seal Script, Lesser Seal Script and Qin Script, is an archaic form of Chinese calligraphy. It was standardized and promulgated as a standard by Li Si, prime minister under Shi Huangdi. Hence coaches, roads, currency, laws, weights, measures, characters which were different from those found in Qin were discarded and Li Sis small seal characters became the standard for all regions within the empire. This policy came in about 220 BC, the year after Qins unification of the Chinese states, the small cursive form clerical script came after the small script. Li Sis compilation is only through Chinese commentaries through the centuries. It is stated to contain 3,300 characters, small Seal Script has been proposed for inclusion in Unicode