c
| Name | C |
| Letter | C c |
| Script | Latin script |
| Type | Alphabet |
| Typedesc | ic |
| Language | Latin language |
| Unicode | U+0043, U+0063 |
| Alphanumber | 3 |
| Number | 100 |
| Fam1 | |
| Fam2 | |
| Fam3 | |
| Fam4 | |
| Fam5 | |
| Fam6 | Γ γ |
| Fam7 | 𐌂 |
| Usageperiod | present |
| Image | |
| Imageclass | skin-invert-image |
C, or c, is the third letter of the Latin alphabet, used in the modern English alphabet, the alphabets of other western European languages and others worldwide. Its name in English is cee (pronounced 'siː), plural cees.
History
| Egyptian | Phoenician gaml | Western Greek Gamma | Etruscan C | Old Latin C (G) | Latin C |
|---|---|---|---|---|---|
"C" comes from the same letter as "G". The Semites named it gimel. The sign is possibly adapted from an Egyptian hieroglyph for a staff sling, which may have been the meaning of the name gimel. Another possibility is that it depicted a camel, the Semitic name for which was gamal. Barry B. Powell, a specialist in the history of writing, states "It is hard to imagine how gimel = "camel" can be derived from the picture of a camel (it may show his hump, or his head and neck!)".
In the Etruscan language, plosive consonants had no contrastive voicing, so the Greek 'Γ' (Gamma) was adopted into the Etruscan alphabet to represent /k/. Already in the Western Greek alphabet, Gamma first took a '

Other alphabets have letters homoglyphic to 'c' but not analogous in use and derivation, like the Cyrillic letter Es (С, с) which derives from the lunate sigma.
Later use
When the Roman alphabet was introduced into Britain, represented only /k/, and this value of the letter has been retained in loanwords to all the insular Celtic languages: in Welsh, Irish, and Gaelic, represents only /k/. The Old English Latin-based writing system was learned from the Celts, apparently of Ireland; hence, in Old English also originally represented /k/; the Modern English words kin, break, broken, thick, and seek all come from Old English words written with : «cyn, brecan, brocen, þicc», and «séoc». However, during the course of the Old English period, /k/ before front vowels (/e/ and /i/) was palatalized, having changed by the tenth century to tʃ, though was still used, as in «cir(i)ce, wrecc(e)a». On the continent, meanwhile, a similar phonetic change before the same two vowels had also been going on in almost all of the Romance languages (for example, in Italian).
In Vulgar Latin, /k/ became palatalized to tʃ in Italy and Dalmatia; in France and the Iberian Peninsula, it became ts. Yet for these new sounds, was still used before the letters and . The letter thus represented two distinct values. Subsequently, the Latin phoneme /kw/ (spelled ) de-labialized to /k/, meaning that the various Romance languages had /k/ before front vowels. In addition, Norman used the letter so that the sound /k/ could be represented by either or , the latter of which could represent either /k/ or /ts/ depending on whether it preceded a front vowel letter or not. The convention of using both and was applied to the writing of English after the Norman Conquest, causing a considerable re-spelling of the Old English words. Thus, while Old English «candel, clif, corn, crop», and «cú», remained unchanged, «cent, cǣᵹ (cēᵹ), cyng, brece», and «sēoce», were now (without any change of sound) spelled «Kent, keȝ, kyng, breke», and «seoke»; even «cniht» () was subsequently changed to «kniht», and «þic» () was changed to «thik» or «thikk». The Old English was also at length displaced by the French so that the Old English «cwēn» () and «cwic» () became Middle English «quen» and «quik», respectively.
The sound tʃ, to which Old English palatalized /k/ had advanced, also occurred in French, chiefly from Latin /k/ before . In French, it was represented by the digraph , as in «champ» (from Latin «camp-um»), and this spelling was introduced into English: the Hatton Gospels, written , have in Matt. i-iii, «child, chyld, riche», and «mychel», for the «cild, rice», and «mycel» of the Old English version whence they were copied. In these cases, the Old English gave way to , and ; on the other hand, in its new value of /ts/ appeared largely in French words like «processiun, emperice», and «grace» and was also substituted for in a few Old English words, as «miltse, bletsien», in early Middle English «milce, blecien». By the end of the thirteenth century, both in France and England, this sound /ts/ was de-affricated to /s/; and from that time, has represented /s/ before front vowels either for etymological reasons, as in lance, cent, or to avoid the ambiguity due to the "etymological" use of for /z/, as in ace, mice, once, pence, defence.
Thus, to show etymology, English spelling has advise, devise (instead of *advize, *devize), while advice, device, dice, ice, mice, twice, etc., do not reflect etymology; example has extended this to hence, pence, defence, etc., where there is no etymological reason for using . Former generations also wrote sence for sense. Hence, today, the Romance languages and English have a common feature inherited from Vulgar Latin spelling conventions where takes on either a "hard" or "soft" value depending on the following letter.
Use in writing systems
| Afar | ||
| Albanian | ||
| Cypriot Arabic | ||
| Azeri | ||
| Berber | ||
| Bukawa | ||
| Catalan | Except before e, i | |
| Before e, i | ||
| (Pinyin) | ||
| Crimean Tatar | ||
| Cornish | ||
| Czech | ||
| Danish | Except before e, i, y, æ, ø | |
| Before e, i, y, æ, ø | ||
| Dutch | Except before e, i, y | |
| Before e, i, y | ||
| Before e, i in loanwords from Italian | ||
| English | Except before e, i, y | |
| Before e, i, y | ||
| Before ea, ia, ie, io, iu | ||
| Esperanto | ||
| Fijian | ||
| Filipino | Except before e, i | |
| Before e, i | ||
| French | Except before e, i, y | |
| Before e, i, y | ||
| Fula | ||
| Gagauz | ||
| Galician | Except before e, i | |
| or | Before e, i | |
| German | Except before ä, e, i, ö, ü, y in loanwords and names | |
| Before ä, e, i, ö, ü, y in loanwords and names | ||
| Hausa | ||
| Hungarian | ||
| Indonesian | ||
| Irish | Except before e, i; or after i | |
| Before e, i; or after i | ||
| Italian | Except before e, i | |
| Before e, i | ||
| Khmer (ALA-LC) | ||
| Kurmanji (Hawar) | ||
| Latin | (and in early Latin) | |
| Latvian | ||
| Malay | ||
| Manding | ||
| Norwegian | Except before e, i, y, æ, ø in loanwords and names | |
| Before e, i, y, æ, ø in loanwords and names | ||
| Polish | Except before i | |
| Before i | ||
| Portuguese | Except before e, i, y | |
| Before e, i, y | ||
| Romanian | Except before e, i | |
| Before e, i | ||
| Romansh | Except before e, i | |
| Before e, i | ||
| Scottish Gaelic | Except before e, i; or after i | |
| Before e, i; or after i | ||
| Serbo-Croatian | ||
| Slovak | ||
| Slovene | ||
| Somali | ||
| Spanish | Except before e, i, y | |
| or | Before e, i, y | |
| Swedish | Except before e, i, y, ä, ö | |
| Before e, i, y, ä, ö | ||
| Tajik | ||
| Tatar | ||
| Turkish | ||
| Valencian | Except before e, i | |
| Before e, i | ||
| Vietnamese | Except word-finally | |
| Word-finally | ||
| Welsh | ||
| Xhosa | ||
| Yabem | ||
| Yup'ik | ||
| Zulu | (a click) |
English
In English orthography, generally represents the "soft" value of s before the letters (including the Latin-derived digraphs and , or the corresponding ligatures and ), , and , and a "hard" value of k before any other letters or at the end of a word. However, there are a number of exceptions in English: "soccer", "celt" and "sceptic" are words that have k where s would be expected. The "soft" may represent the ʃ sound in the digraph when this precedes a vowel, as in the words 'delicious' and 'appreciate', and also in the word "ocean" and its derivatives.
The digraph most commonly represents tʃ, but can also represent k (mainly in words of Greek origin) or ʃ (mainly in words of French origin). For some dialects of English, it may also represent x in words like loch, while other speakers pronounce the final sound as k. The trigraph always represents tʃ. The digraph is often used to represent the sound k after short vowels, like in "wicket".
C is the twelfth most frequently used letter in the English language (after E, T, A, O, I, N, S, H, R, D, and L), with a frequency of about 2.8% in words.
Other languages
In the Romance languages French, Spanish, Italian, Romanian, and Portuguese, generally has a "hard" value of /k/ and a "soft" value whose pronunciation varies by language. In French, Portuguese, Catalan, and Spanish from Latin America and some places in Spain, the soft value is /s/ as it is in English. In the Spanish spoken in most of Spain, the soft is a voiceless dental fricative /θ/. In Italian and Romanian, the soft is t͡ʃ.
Germanic languages usually use for Romance loans or digraphs, such as and , but the rules vary across languages. Of all the Germanic languages, only English uses the initial in native Germanic words like come. Other than English, Dutch uses the most, for most Romance loans and the digraph . German uses in the digraphs and , and the trigraph , but by itself only in unassimilated loanwords and proper names. Danish keeps soft in Romance words but changes hard to . Swedish has the same rules for soft and hard as Danish, and also uses in the digraph and the very common word och, "and". Norwegian, Afrikaans, and Icelandic are the most restrictive, replacing all cases of with or , and reserving for unassimilated loanwords and names.
All Balto-Slavic languages that use the Latin alphabet, as well as Albanian, Hungarian, Pashto, several Sami languages, Esperanto, Ido, Interlingua, and Americanist phonetic notation (and those aboriginal languages of North America whose practical orthography derives from it), use to represent /t͡s/, the voiceless alveolar or voiceless dental sibilant affricate. In Hanyu Pinyin, the standard romanization of Mandarin Chinese, the letter represents an aspirated version of this sound, /t͡sh/.
Among non-European languages that have adopted the Latin alphabet, represents a variety of sounds. Yup'ik, Indonesian, Malay, and a number of African languages such as Hausa, Fula, and Manding share the soft Italian value of /t͡ʃ/. In Azeri, Crimean Tatar, Kurmanji Kurdish, and Turkish, stands for the voiced counterpart of this sound, the voiced postalveolar affricate /d͡ʒ/. In Yabem and similar languages, such as Bukawa, stands for a glottal stop /ʔ/. Xhosa and Zulu use this letter to represent the click /ǀ/. In some other African languages, such as Berber languages, is used for /ʃ/. In Fijian, stands for a voiced dental fricative /ð/, while in Somali it has the value of /ʕ/.
The letter is also used as a transliteration of Cyrillic in the Latin forms of Serbian, Macedonian, and sometimes Ukrainian, along with the digraph .
Other systems
As a phonetic symbol, lowercase is the International Phonetic Alphabet (IPA) and X-SAMPA symbol for the voiceless palatal plosive, and capital is the X-SAMPA symbol for the voiceless palatal fricative.
Digraphs
There are several common digraphs with , the most common being , which in some languages (such as German) is far more common than alone. takes various values in other languages.
As in English, , with the value /k/, is often used after short vowels in other Germanic languages such as German and Swedish (other Germanic languages, such as Dutch and Norwegian, use instead). The digraph is found in Polish and in Hungarian, representing /t͡ʂ/ and /t͡ʃ/ respectively. The digraph represents /ʃ/ in Old English, Italian, and a few languages related to Italian (where this only happens before front vowels, while otherwise it represents /sk/). The trigraph represents /ʃ/ in German.
Other uses
- In the hexadecimal (base 16) numbering system, C is a number that corresponds to the number 12 in decimal (base 10) counting.
- In the Roman numeral system, C represents 100.
- Unit prefix c, meaning one hundredth.
Related characters
Ancestors, descendants and siblings
- ⟨𐤂⟩ : Semitic letter Gimel, from which the following symbols originally derive:
- Phonetic alphabet symbols related to ⟨C⟩:
- ⟨⟩ : Small ⟨c⟩ with curl
- ⟨ʗ⟩ : Stretched ⟨c⟩
- ⟨𝼏⟩ : Stretched ⟨c⟩ with curl – Used by Douglas Beach for a nasal click in his phonetic description of Khoekhoe.
- ⟨𝼝⟩ : Small letter ⟨c⟩ with retroflex hook – Para-IPA version of the IPA retroflex tʂ.
- ⟨ꟲ⟩ : Modifier letter capital ⟨c⟩ – Used to mark tone for the Chatino orthography in Oaxaca, Mexico; used as a generic transcription for a falling tone; also used in para-IPA notation.
- ⟨⟩ : Modifier letter small c
- ⟨ᶝ⟩ : Modifier letter small ⟨c⟩ with curl
- ⟨ᴄ⟩ : Small capital ⟨c⟩ is used in the Uralic Phonetic Alphabet.
- ⟨Ꞔ⟩ ⟨ꞔ⟩ : ⟨C⟩ with palatal hook, used for writing Mandarin Chinese using the early draft version of pinyin romanization during the mid-1950s.
- ⟨C⟩ with diacritics: Ć ć Ĉ ĉ Č č Ċ ċ Ḉ ḉ Ƈ ƈ C̈ c̈ Ȼ ȼ Ç ç Ꞔ ꞔ Ꞓ ꞓ
- ⟨Ↄ⟩ ⟨ↄ⟩ : Claudian letters
Derived ligatures, abbreviations, signs and symbols
- © : copyright symbol
- °C : degree Celsius
- ¢ : cent
- ₡ : colón (currency)
- ₢ : Brazilian cruzeiro (currency)
- ₵ : Ghana cedi (currency)
- ₠ : European Currency Unit CE
- : blackboard bold C, denoting the complex numbers
- ℭ : blackletter C
- Ꜿ ꜿ : Medieval abbreviation for Latin syllables con- and com-, and Portuguese -us and -os.
Other representations
===Computing ===
The Latin letters and have Unicode encodings and . These are the same code points as those used in ASCII and ISO 8859. There are also precomposed character encodings for and with diacritics, for most of those listed above; the remainder are produced using combining diacritics.
Variant forms of the letter have unique code points for specialist use: the alphanumeric symbols set in mathematics and science, voiceless palatal sounds in linguistics, and halfwidth and fullwidth forms for legacy CJK font compatibility. The Cyrillic homoglyph of the Latin has a separate encoding: .
Other
See also
References
- Powell, Barry B., Writing: Theory and History of the Technology of Civilization, 27 Mar 2009, Wiley Blackwell
- New Comparative Grammar of Greek and Latin, Andrew L., Sihler, illustrated, Oxford University Press, 1995, New York
- C, Murray, James A. H., C. World English Historical Dictionary, 1893, van Leeuwen, Steven H., A New English Dictionary on Historical Principles, 18 August 2025, WEHD.com, MacMillan, New York, Online published 2025
- Reading Middle Welsh -- 29 Medieval Spelling, 2019-11-19, www.mit.edu
- L2/20-115R: Unicode request for additional phonetic click letters, 2020-07-10, Kirk, Miller, Bonny, Sands
- L2/21-041: Unicode request for additional para-IPA letters, 2021-01-11, Kirk, Miller
- L2/20-251: Unicode request for modifier Latin capital letters, 2020-09-25, Kirk, Miller, Craig, Cornelius
- L2/04-132 Proposal to add additional phonetic characters to the UCS, 2004-04-19, Peter, Constable
- L2/02-141: Uralic Phonetic Alphabet characters for the UCS, 2002-03-20, Michael, Everson, Michael Everson, etal
- L2/17-013: Proposal to encode three uppercase Latin letters used in early Pinyin, 2017-01-16, Andrew, West, Andrew West (linguist), Eiso, Chan, Michael, Everson, Michael Everson
- L2/05-193R2: Proposal to add Claudian Latin letters to the UCS, 2005-08-12, Michael, Everson
- L2/06-027: Proposal to add Medievalist characters to the UCS, 2006-01-30, Michael, Everson, Peter, Baker, António, Emiliano, Florian, Grammel, Odd Einar, Haugen, Diana, Luft, Susana, Pedro, Gerd, Schumacher, Andreas, Stötzner
External links
Category:ISO basic Latin letters