pronounced as /notice/
The Catalan and Valencian orthographies encompass the spelling and punctuation of standard Catalan (set by the IEC) and Valencian (set by the AVL). There are also several adapted variants to the peculiarities of local dialects of Insular Catalan (Alguerese and the Balearic subdialects).
The history of the Catalan and Valencian orthographies shows a singularity in regard to the other Romance languages. These have been mostly developed from Latin, adapting them to their own phonetic particularities. It had been a gradual and slow process through centuries until the creation of the Academies in the 18th century that fixed the orthography from their language dominant variety.[1]
In the case of Catalan and Valencian, the mediaeval orthography had a noticeable homogeneity. The Royal Chancellery set a unitary written model in several fields. Thus, Ramon Muntaner expressed in his Chronicle (1325–1328) that the Catalans are the largest group with a single language, since all the Romance-speaking regions had very divided languages like the difference that exists between Catalans and Aragonese.[2]
In the 16th century, just after the Golden Age, the split of Catalans started. With the isolation of the Royal Court and several political events, the unitary linguistic consciousness and the shared cultural tradition broke off. The production became more dialectal.
In the 19th century, the recovery of the unity emerged, beginning with the orthography. Institutions like the Acadèmia de Bones Lletres or the Floral Games were in the middle of several orthographic dilemmas.
The orthographic norms of Catalan were first defined officially in the First International Congress of the Catalan Language, held in Barcelona in October 1906. Subsequently, the Philological Section of the Institut d'Estudis Catalans (IEC, founded in 1911) published the Normes ortogràfiques in 1913 under the direction of Antoni Maria Alcover and Pompeu Fabra. Despite some opposition, the spelling system was adopted immediately and became widespread enough that, in 1932, Valencian writers and intellectuals gathered in Castelló to make a formal adoption of the so-called Normes de Castelló, a set of guidelines following Pompeu Fabra's Catalan language norms.[3]
In 1917, Fabra published an Orthographic Dictionary following the orthographic norms of the IEC. In 1931–1932 the Diccionari General de la Llengua Catalana (General Dictionary of the Catalan language) appeared. In 1995, a new normative dictionary, the Dictionary of the Catalan Language of the Institute of Catalan Studies (DIEC), marked a new milestone in the orthographic fixation of the language, in addition to the incorporation of neologisms and modern uses of the language.
On the 24th October 2016, the IEC published a new orthography for Catalan, the Catalan; Valencian: Ortografia catalana, which outlined several modifications, including a reduced number of monosyllabic words that take an acute or grave diacritic for reasons of disambiguation.[4] Thus, the disyllabic word is now generally spelled ; the monosyllabic words ("dry", pronounced pronounced as //sɛk// in Central Catalan) and ("fold, wrinkle", pronounced pronounced as //sek//) are both written after the reform. Discretionary use of a diacritic is possible if the context is not sufficient for disambiguation.
Like those of many other Romance languages, the Catalan and Valencian alphabet derives from the Latin alphabet and is largely based on the respective language's phonology.
The Catalan and Valencian alphabet consists of the 26 letters of the ISO basic Latin alphabet:
Upper case | Z | ||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Lower case | a | b | c | d | e | f | g | h | i | j | k | l | m | n | o | p | q | r | s | t | u | v | w | x | y | z |
The following letter-diacritic combinations are used, but they do not constitute distinct letters in the alphabet: À à, É é, È è, Í í, Ï ï, Ó ó, Ò ò, Ú ú, Ü ü and Ç ç (though the Catalan keyboard includes the letter Ç as a separate key).[5] K k and W w are used only in loanwords. Outside loanwords, the letters Q q and Y y appear only in the digraphs qu, qü and ny. However, Y was used until the official orthography was established in 1913, when it was replaced with I, except in the digraph ny and loanwords.[6] Some Catalan surnames conserve the letter y and the word-final digraph ch (pronounced pronounced as //k//), e. g. Layret, Aymerich.
The following table shows the letters and their names in Standard Catalan (IEC) and Standard Valencian (AVL):
Letter | Catalan | Valencian | |||
---|---|---|---|---|---|
Name (IEC) | Pronunciation | Name (AVL) | Pronunciation | ||
Aa | a | pronounced as //ˈa// | a | pronounced as //ˈa// | |
Bb | be, be alta | pronounced as //ˈbe//, pronounced as //ˈbe ˈaltə// | be, be alta | pronounced as //ˈbe//, pronounced as //ˈbe ˈalta// | |
Cc | ce | pronounced as //ˈse// | ce | pronounced as //ˈse// | |
Dd | de | pronounced as //ˈde// | de | pronounced as //ˈde// | |
Ee | e | pronounced as //ˈe// | e | pronounced as //ˈe// | |
Ff | efa | pronounced as //ˈefə// | efe, ef | pronounced as //ˈefe//, pronounced as //ˈef// | |
Gg | ge | pronounced as //ˈʒe// | ge | pronounced as //ˈdʒe// | |
Hh | hac | pronounced as //ˈak// | hac | pronounced as //ˈak// | |
Ii | i, i llatina | pronounced as //ˈi//, pronounced as //ˈi ʎəˈtinə// | i, i llatina | pronounced as //ˈi//, pronounced as //ˈi ʎaˈtina// | |
Jj | jota | pronounced as //ˈʒɔtə// | jota | pronounced as //ˈdʒota// | |
Kk | ca | pronounced as //ˈka// | ca | pronounced as //ˈka// | |
Ll | ela | pronounced as //ˈelə// | ele, el | pronounced as //ˈele//, pronounced as //ˈel// | |
Mm | ema | pronounced as //ˈemə// | eme, em | pronounced as //ˈeme//, pronounced as //ˈem// | |
Nn | ena | pronounced as //ˈenə// | ene, en | pronounced as //ˈene//, pronounced as //ˈen// | |
Oo | o | pronounced as //ˈo// | o | pronounced as //ˈo// | |
Pp | pe | pronounced as //ˈpe// | pe | pronounced as //ˈpe// | |
cu | pronounced as //ˈku// | cu | pronounced as //ˈku// | ||
Rr | erra | pronounced as //ˈɛrə// | erre, er | pronounced as //ˈere//, pronounced as //ˈeɾ// | |
Ss | essa | pronounced as //ˈesə// | esse, es | pronounced as //ˈese//, pronounced as //ˈes// | |
Tt | te | pronounced as //ˈte// | te | pronounced as //ˈte// | |
Uu | u | pronounced as //ˈu// | u | pronounced as //ˈu// | |
Vv | ve, ve baixa | pronounced as //ˈve//, pronounced as //ˈbe ˈbaʃə// | ve, ve baixa | pronounced as //ˈve//, pronounced as //ˈbe ˈbajʃa// | |
Ww | ve doble | pronounced as //ˈve ˈdobːlə//, pronounced as //ˈbe ˈdobːlə// | ve doble | pronounced as //ˈve ˈdoble//, pronounced as //ˈbe ˈdoble// | |
Xx | ics, xeix | pronounced as //ˈiks//, pronounced as //ˈʃeʃ// | ics, xeix | pronounced as //ˈiks//, pronounced as //ˈʃejʃ// | |
Yy | i grega | pronounced as //ˈi ˈɡɾeɡə// | i grega | pronounced as //ˈi ˈɡɾeɡa// | |
Zz | zeta | pronounced as //ˈzetə// | zeta | pronounced as //ˈzeta// |
The names efa (pronounced as //ˈefa//), ela (pronounced as //ˈela//), ema (pronounced as //ˈema//), ena (pronounced as //ˈena//), erra (pronounced as //ˈera//), and essa (pronounced as //ˈesa//) are also used in certain speeches of Valencian.
The names be alta ("high b") and ve baixa ("low v") are used by speakers who do not distinguish the phonemes pronounced as //b// and pronounced as //v//. Speakers that do distinguish them use the simple names be and ve.[7]
See main article: Lists of spelling-to-sound correspondences in Catalan. Catalan is a pluricentric language; the pronunciation of some of the letters is different in Eastern Catalan (IEC) and Valencian (AVL). Apart from those variations, the pronunciation of most consonants is fairly straightforward and is similar to French, Occitan or Portuguese pronunciation. (The following list includes a quick pronunciation of letters in standard Catalan and Valencian, for an in-depth view see attached main article on top of this section).
|
|
Catalan and Valencian also use the acute and grave accents to mark stress or vowel quality. An acute on (é ó) indicates that the vowel is stressed and close-mid (pronounced as //e o//), while grave on (è ò) indicates that the vowel is stressed and open-mid (pronounced as //ɛ ɔ//). Grave on (à) and acute on (í ú) simply indicate that the vowels are stressed. Thus, the acute is used on close or close-mid vowels, and the grave on open or open-mid vowels.[8]
The circumflex is rarely used in modern Catalan and Valencian, nonetheless it has been used in the beginning of the 19th century by Antoni Febrer i Cardona to represent schwa in the Balearic subdialects. According to the Diccionari català-valencià-balear, in modern times there are some cases where the circumflex can be used to indicate silent etymological sounds (similar to French)[10] or a contraction.[11] Contrary to the restrictions of the acute and grave accent, the circumflex can be used with all vowels (â ê î ô û), the most common, especially in Valencian, being (â) (i.e. due to the elision of pronounced as //d//), e.g. mascletâes (instead of mascletades 'pyrotechnic festivals'), anâ (instead of anar 'to go'), témê (instead of témer 'to fear'), sortî (instead of sortir 'to exit'), pâ ('to', preposition in colloquial Valencian).
The diaeresis has two different uses: to mark hiatus over (ï, ü), and to mark that (u) is not silent in the groups (gü, qü).
If a diaeresis appears over an (i) or (u) that follows another vowel, it denotes a hiatus, examples:
This diaeresis is not used over a stressed vowel that already should have an accent. Examples: suís pronounced as //suˈis// ('Swiss' masculine), but suïssa pronounced as //suˈisə// or pronounced as //suˈisa// ('Swiss' feminine), suïs pronounced as //ˈsuis// ('that you sweat' subjunctive) (without the diaeresis, this last example would be pronounced pronounced as //ˈsui̯s//, i.e. as only one syllable, like reis pronounced as //ˈrei̯s// 'kings').
Certain verb forms of verbs ending in -uir do not receive a diaeresis, although they are pronounced with separate syllables. This concerns the infinitive, gerund, future and conditional forms (for example traduir, traduint, traduiré and traduiria, all with bisyllabic pronounced as //u.i//). All other forms of such verbs do receive a diaeresis on the ï according to the normal rules (e.g. traduïm, traduïa).
In addition to this, (ü) represents pronounced as //w// between a velar consonant pronounced as //ɡ// or pronounced as //k// and a front vowel ((gu) and (qu) are used to represent a hard (i.e. velar) pronunciation before (i) or (e)).
Forms of the verb argüir represents a rare case of the sequence pronounced as //ɡu.i//, and the rules for pronounced as //gu// and pronounced as //ui// clash in this case. The ambiguity is resolved by an additional rule, which states that in cases where diaereses would appear on two consecutive letters, only the second receives one. This thus gives arguïm /arguˈim/, i.e. and arguïa /arguˈia/, but argüir /arˈgwir/, argüint /arˈgwint/ and argüiré /argwiˈre/ as these forms don't receive a diaeresis on the i normally, according to the exception above.
Catalan and Valencian ce trencada (Ç ç), literally in English 'broken cee', is a modified (c) with a cedilla mark (¸). It is only used before (a u o) to indicate a soft c pronounced as //s//, much like in Portuguese, Occitan or French (e.g. compare coça pronounced as //ˈkosə// or pronounced as //ˈkosa// 'kick', coca pronounced as //ˈkokə// or pronounced as //ˈkoka// 'cake' and cosa pronounced as //ˈkɔzə// or pronounced as //ˈkɔza// 'thing'). In Catalan and Valencian, ce trencada also appears as last letter of a word (e.g. feliç pronounced as //fəˈlis// or pronounced as //feˈlis// 'happy', falç pronounced as //ˈfals// 'sickle'), but then (ç) may be voiced to pronounced as /[z]/ before vowels and voiced consonants, e.g. feliçment pronounced as //fəˌlizˈmen(t)// or pronounced as //feˌlizˈmen(t)// ('happily') and braç esquerre pronounced as //ˈbɾaz əsˈkɛrə// or pronounced as //ˈbɾaz esˈkɛre// ('left arm').
The so-called punt volat or middot is only used in the group (ŀl) (called ela or el(e) geminada, 'geminate el') to represent a geminated sound pronounced as //lː//, as (ll) is used to represent the palatal lateral pronounced as //ʎ//. This usage of the middot sign is a recent invention from the beginning of twentieth century (in medieval and modern Catalan, before Fabra's standardization, this symbol was sometimes used to note certain elisions, especially in poetry). The only (and improbable) case of ambiguity in the whole language that could arise is the pair ceŀla pronounced as //ˈsɛlːə// or pronounced as //ˈsɛlːa// ('cell') vs cella pronounced as //ˈsɛʎə// or pronounced as //ˈseʎa// ('eyebrow').
The hyphen (called a guionet) is used in Catalan and Valencian to separate a verb and the combination of pronouns that follow them (e.g. menjar-se-les), to separate certain compounds (e.g. vint-i-un and para-sol), and to split a word at the end of a line of text for the purpose of maintaining page margins.
Compounds are hyphenated in cases that involve numerals (e.g. trenta-sis, and trenta-sisè/é); cardinal points (e.g. sud-americà); repetitive and expressive compounds (xup-xup); those compounds in which the first element ends in a vowel and the second starts with (r), (s), or (x) (e.g. penya-segat); and those compounds in which the combination of the two elements can lead to wrong reading (e.g. pit-roig). There are also compound terms in which the first element carries a grave accent (mà-llarg), the construction no plus substantive (but not no plus adjective, no-violència but the nacions no violentes) and certain singular constructions like abans-d'ahir and adéu-siau.
Since 1996, the normative set that in the none mentioned cases in the previous paragraph do not carry hyphen. Thus, the general norm set that the prefixed forms, aside from the cited exceptions, are written without hyphen (the only normative option, then, is to write arxienemic and fisicoquímic).
In regard to numbers, hyphen is set according to the D-U-C rule (Desenes-Unitats-Centenes, 'Tens-Units-Hundreds'), thus, a hyphen is placed between tens and units (quaranta-dos) and between units and hundreds (tres-cents). For example, the number 35,422 is written trenta-cinc mil quatre-cents vint-i-dos.
In the case of the separation of a term at the end of line, syllable boundaries are maintained. Still, there are digraphs that can be separated and others that cannot. The digraphs that can be separated are those that, when splitting them, they result in two graphs the corresponding sound from which they share a phonetic trait with the sound of the digraph. (Thus, the digraph rr, for example it corresponds with the nearest sound of a rhotic alveolar trill. Cor-randes, calit-ja and as-sas-sí are words with digraphs that can be split). The digraphs that cannot be separated are those in which the two graphs correspond to sounds that they are not related with the sound of the digraph. (For example, the digraph ny cannot be separated.)
To orthographic effects, the syllabic separation of words follow the following norms:
ix (quei-xa), rr (car-rer), ss (pas-sar), sc (es-ce-na), l·l (vil-la), tj (jut-jat), tg (fet-ge), tx (pit-xer), tl (vet-la), tll (rot-llo), tm (rit-me), tn (cot-na), ts (pot-ser), tx (despat-xar), tz (set-ze), mm (im-mens), nn (in-no-cent)
gu (jo-guet), ny (pe-nya), qu (pa-quet), ig (ba-teig), ll (pe-lle-ter)
ad-herir, in-expert, ben-estar, mil-hòmens, des-encolar, vos-altres
d'a-mor, aber-rant, l'a-plicació, histò-ria
Catalan and Valencian follow some apostrophation rules that serve to determine whether it is necessary to use an apostrophe (') with an article, preposition or pronoun or not if the word that follows it or precedes it begins or finishes in a vowel, respectively.
In case of apostrophation, the specific forms al (dial. as), del (dial. des), pel (dial. pes), cal (dial. cas) and can are broken and become a l' (dial. a s'), de l' (dial. de s'), per l' (dial. per s'), ca l' (dial. ca s') and ca n' respectively.
The feminine singular article (la, na and dialectally sa) are apostrophated in the following cases: When the following word start with a vowel: l'emoció, l'ungla, l'aigua, n'Elena; when the word start with a silent h: l'heura, l'holografia, n'Hermínia, s'horabaixa. It is not apostrophated in the following cases: When it goes before word that starts with a consonantic i or u (with h or not): la hiena; when it goes before a word that begins with unstressed i or u (with h or not): la humitat, la universitat, la imatge; before some specific terms like la una (when referring to the time), la ira, la host, la Haia (toponym); before the name of the letters (la i, la hac, la essa); before a word that start with s followed by a consonant, la Scala de Milà.
Traditionally, to avoid ambiguities, words beginning with the negative prefix a- did not take an apostrophe. Nowadays, general apostrophation rules are followed in written text: l'anormalitat, l'amoralitat, l'atipicitat, l'asimetria, l'asèpsia, etc. The Diccionari de l'Institut d'Estudis Catalans (DIEC) of 1995 started to apply the new criteria; however, it was never formulated explicitly. In the same way, the introduction of DIEC writes about the abnormality of the situation, and the outline of the new normative grammar that prepares the IEC already does not collect that traditional exception.
Before a verb that starts with a vowel, using its elided form: m'agrada, n'abastava, s'estimaran, l'aconseguiria. At the end of a verb that finishes in a vowel, using the reduced form: menja'n, trenca'l, fondre's, compra'ns. When there are two, the second if the orthographic rules allow it: me'n, li'n, se'm, te'ls, la'n, n'hi; if it is possible, it takes the apostrophe with the following word, like me n'ha dut tres. The apostrophe always goes the further to the right possible: te l'emportes, not *te'l emportes.
Does not take the apostrophe:
The pronouns us, vos, hi, ho, li, les: us el dono or vos el done, se us esperava or se vos esperava. Like in the case of the article, the pronoun before words that start by unstressed i and u (with silent h or without): la ignora, la hi pren, la humitejarem, la usàvem. It also does not take the apostrophe the first weak pronoun in the forms la hi and se us.
Catalan and Valencian do not capitalize the days of the week, months, or national adjectives.
dilluns, setembre, anglès
'Monday', 'September', 'English'
Catalan and Valencian punctuation rules are similar to English, with some minor differences.
—Què proposes, doncs?
—El que hauriem de fer —s'atreví a suggerir— és anar a...
'What do you propose, then?'
'What we should do' she ventured to suggest 'is go to and ...'
The distribution of the two rhotics pronounced as //r// and pronounced as //ɾ// closely parallels that of Spanish. Between vowels, the two contrast but they are otherwise in complementary distribution: in the onset, an alveolar trill, pronounced as /[r]/, appears unless preceded by a consonant; different dialects vary in regards to rhotics in the coda with Western Catalan generally featuring an alveolar tap, pronounced as /[ɾ]/, and Central Catalan dialects like those of Barcelona or Girona featuring a weakly trilled pronounced as /[r]/ unless it precedes a vowel-initial word in the same prosodic unit, in which case pronounced as /[ɾ]/ appears.
In Eastern Catalan and North Western Catalan, most instances of word-final (r) are silent, but there are plenty of unpredictable exceptions (e.g. in Central Eastern Catalan pronounced as /[ˈpo]/ 'fear' but pronounced as /[ˈmɑɾ]/ 'sea'). In Central Eastern Catalan monosyllabic words with a pronounced final (r) get a reinforcement final consonant pronounced as /[t]/ when in absolute final position (e.g. final (r) of ('heart') in pronounced as //ˈrejnə dəl ˈmew ˈkɔrt// 'queen of my heart' vs pronounced as //əl ˈkɔɾ əz ˈmɔw// 'the heart is moving').
In Valencian, most instances of word-final (r) are pronounced.
Standard rules governing the presence of accents are based on word endings and the position of the stressed syllable. In particular, accents are expected for:
This does not occur in words like parleu pronounced as //pəɾˈlɛw// or pronounced as //paɾˈlɛw// ('you are speaking' plural), or parlem pronounced as //pəɾˈlɛm// or pronounced as //paɾˈlɛm// ('we are speaking').
This does not occur in words like parla pronounced as //ˈpaɾlə// or pronounced as //ˈpaɾla// ('he is speaking'), parles pronounced as //ˈpaɾləs// or pronounced as //ˈpaɾles// ('you are speaking' singular), or parlen pronounced as //ˈpaɾlən// or pronounced as //ˈpaɾlen// ('they are speaking').
Since there is no need to mark the stressed syllable of a monosyllabic word, most of them do not have an accent. Exceptions are those with a diacritical accent differentiating words that would otherwise be homographic. Example: es pronounced as //əs// or pronounced as //es// ('it' impersonal) vs és pronounced as //ˈes// ('is'), te pronounced as //tə// or pronounced as //te// ('you' clitic) vs té pronounced as //ˈte// ('s/he has'), mes pronounced as //ˈmɛs// or pronounced as //ˈmes// ('month') vs més pronounced as //ˈmes// ('more'), dona pronounced as //ˈdɔnə// or pronounced as //ˈdɔna// ('woman') vs dóna pronounced as //ˈdonə// or pronounced as //ˈdona// ('s/he gives'). In most cases, the word bearing no accent is either unstressed (as in the case of 'es' and 'te'), or the word without the accent is more common, usually a function word.
The different distribution of open e pronounced as //ɛ// vs closed e pronounced as //e// between Eastern Catalan and Western Catalan is reflected in some orthographic divergences between standard Catalan and Valencian norms, for example: pronounced as //əŋˈɡlɛs// (Catalan) vs pronounced as //aŋˈɡles// (Valencian) ('English'). In the Balearic Islands, open e pronounced as //ɛ// tends to be a centralised e (pronounced as //ə//) in the same cases where open e contrasts with closed e in Catalan and Valencian. The cases where the difference of pronunciation of e can have graphical repercussions are the followings:[8]