Swedish phonology

Swedish has a large vowel inventory, with nine vowels distinguished in quality and to some degree quantity, making 17 vowel phonemes in most dialects. Swedish pronunciation of most consonants is similar to that of other Germanic languages. Another notable feature is the pitch accent, which is unusual for European languages.

There are 18 consonant phonemes of which and  show considerable variation depending on both social and dialectal context.

Standard pronunciation
Contrary to the situation with Danish or Finnish, there is not a uniform nation-wide spoken Standard Swedish. Instead there are several regional standard varieties (acrolects or prestige dialects), i.e. the most intelligible or prestigious forms of spoken Swedish, each within its area.

The differences in the phonology of the various forms of prestigious Central Swedish can be considerable, although as a rule less marked than between localized dialects, including differences in prosody, vowel quality and assimilation. The differences between the various regional dialects may be compared with those of General American, Australian English, and British Received Pronunciation.

In Sweden, the Central Swedish varieties often go under the name of rikssvenska ('National Swedish')

Vowels

 * All pronunciations below are made by a male native speaker of Swedish, age 25, in a variety of Central Standard Swedish spoken in the greater Stockholm region.

Swedish has 9 vowels that, as with many other Germanic languages, come in long and short pairs. The length covaries with the quality of the vowels, as shown below, with short variants being more centered and lax. Traditionally, length has been viewed as the primary distinction, with quality being secondary. No short vowels appear in open stressed syllables. The front vowels appear in rounded-unrounded pairs.


 * Central Standard Swedish is near-close near-front . In other dialects it may be central.


 * are mid.
 * has been variously described as central and front.

Rounded vowels have two types of rounding:
 * ,, and  are compressed , ,  and
 * ,, and its pre- allophone ,  and its pre- allophone ,  and  are protruded , , , , , ,  and.

Type of rounding is the primary way of distinguishing from, especially in Central Standard Swedish.

, (in stressed syllables),  (with a few exceptions), and  are lowered to, ,  and , respectively, when preceding. The low allophones are becoming unmarked in younger speakers of Stockholm Swedish, so that läsa ('to read') and köpa ('to buy') are pronounced and  instead of standard  and. These speakers often also pronounce pre-rhotic and  even lower, i.e.  and. This is especially true for the long allophone. Also, the allophone is sometimes difficult to distinguish from the long.

In some Swedish varieties, traditionally spoken around Gothenburg and in Östergötland, but today more widely spoken e.g. in Stockholm and especially by younger speakers, and  merge into. Words like fördömande ('judging', pronounced in Standard Swedish) and fördummande ('dumbing', pronounced  in Standard Swedish) are then often pronounced similarly, if not identically.


 * ära → ; ('honor')
 * ärt: →  ('pea')
 * öra → ; ('ear')
 * dörr: →  ('door')

Unstressed is realized as, i.e. a basic schwa. This feature is common to most varieties of Swedish. (e.g. begå, 'to commit' → ). In many central and eastern areas (including Stockholm), the contrast between short and  is lost, especially the short variants, except before  when the small vowel distinction between the words herre 'master' and märr 'mare' is kept. The loss of this contrast has the effect that hetta ('heat') and hätta ('cap') are pronounced the same.

In Central Standard Swedish, long is weakly rounded. The rounding is stronger in Gothenburg and weaker in most North Swedish dialects.

One of the varieties of is made with a constriction that is more forward than usual. describe this vowel as being pronounced "by slightly lowering the body of the tongue while simultaneously raising the blade of the tongue (...) Acoustically this pronunciation is characterized by having a very high F3, and an F2 which is lower than that in ." They suggest that this may be the usual Stockholm pronunciation of.

There is some variation in the interpretations of vowel length's phonemicity. , for example, treats vowel quantity as its own separate phoneme (a "prosodeme") so that long and short vowels are allophones of a single vowel phoneme.

Patterns of diphthongs of long vowels occur in three major dialect groups. In Central Standard Swedish, the high vowels, , and  can be phonetically a short vowel followed by the corresponding fricative (also described as approximant) , , , and  or , , , and. The rounding of the fricative/approximant agrees with the rounding of the vowel, so that /  is unrounded,  /  is protruded, more narrowly transcribed  /, and both  /  and  /  are compressed, more narrowly transcribed  /  and  /. , and  are often realized as centering diphthongs,  and.

In Southern Swedish dialects, particularly in Scania, the diphthongs are preceded by a rising of the tongue from a central position so that and  are realized as  and  respectively, i.e. rising diphthongs. A third type of distinctive diphthongs occur in the dialects of Gotland. The pattern of diphthongs is more complex than those of southern and eastern Sweden;, and  tend to rise while and  and  fall; , ,  and  are not diphthongized at all.

Consonants
The table below shows the Swedish consonant phonemes and the range of their realizations in spoken Standard Swedish.

are dental, but can be either dental  or alveolar. If is alveolar, then  is also alveolar. Dental realization of is the predominant one in Central Standard Swedish.

Stops
Initial fortis stops are aspirated in stressed position, but unaspirated when preceded by  within the same morpheme. Hence ko ('cow') is, but sko ('shoe') becomes. Compare English ('cool') vs  ('school'). Preaspiration of medial and final fortis stops, including the devoicing of preceding sonorants is common, though its length and normativity varies from dialect to dialect, being optional (and idiolectal ) in Central Standard Swedish but obligatory in, for example, the Swedish dialects of Gräsö, Vemdalen, and Arjeplog. In Gräsö, preaspiration is blocked in certain environments (such as an following the fortis consonant or a morpheme boundary between the vowel and the consonant ), while it is a general feature of fortis medial consonants in Central Standard Swedish. When not preaspirated, medial and final fortis stops are simply unaspirated. In clusters of fortis stops, the second "presonorant" stop is unaspirated and the former patterns with other medial final stops (that is, it is either unaspirated or is preaspirated.

The phonetic attributes of preaspiration also varies. In the Swedish of Stockholm, preaspiration is often realized as a fricative subject to the character of surrounding vowels or consonants so that it may be labial, velar, or dental; it may also surface as extra length of the preceding vowel. In the province of Härjedalen, though, it resembles or. The duration of preaspiration is highest in the dialects of Vemdalen and Arjeplog. Helgason notes that preaspiration is longer after short vowels, in lexically stressed syllables, as well as in pre-pausal position.

Fricatives
is dental in Central Standard Swedish,  but retracted alveolar  in Blekinge, Bohuslän, Halland and Scania.

The Swedish fricatives and  are often considered to be the most difficult aspects of Swedish pronunciation for foreign students. The combination of occasionally similar and rather unusual sounds as well as the large variety of partly overlapping allophones of often presents difficulties for non-natives in telling the two apart. The existence of a third sibilant in the form of tends to confuse matters even more, and in some cases realizations that are labiodental can also be confused with. In Finland Swedish, is an affricate:  or.

The Swedish phoneme (the "sje-sound" or voiceless postalveolar-velar fricative) and its alleged coarticulation is a difficult and complex issue debated amongst phoneticians. Though the acoustic properties of its allophones are fairly similar, the realizations can vary considerably according to geography, social status, age, gender as well as social context and are notoriously difficult to describe and transcribe accurately. Most common are various -like sounds, with occurring mainly in northern Sweden and  in Finland. A voiceless uvular fricative,, can sometimes be used in the varieties influenced by major immigrant languages like Arabic and Kurdish. The different realizations can be divided roughly into the following categories:


 * "Dark sounds" -, and , commonly used in the Southern Standard Swedish. Some of the varieties specific, but not exclusive, to areas with a larger immigrant population commonly realize the phoneme as a voiceless uvular fricative.
 * "Light sounds" -, used in the northern varieties and , and (or something in between) in Finland Swedish.
 * Combination of "light" and "dark" - darker sounds are used as morpheme initials preceding stressed vowels (sjuk, station; "sick", "station"), while the lighter sounds are used before unstressed vowels and at the end of morphemes (bagage, dusch; 'baggage', 'shower').

and are pronounced with weak friction and function phonotactically with the sonorants.

Sonorants
has distinct variations in Standard Swedish. The realization as an alveolar trill occurs among most speakers only in contexts where emphatic stress is used. In Central Swedish, it is often pronounced as a fricative (transcribed as ) or approximant (transcribed as ), which is especially frequent in weakly articulated positions such as word-finally and somewhat less frequent in stressed syllable onsets, in particular after other consonants. It may also be an apico-alveolar tap. One of the most distinct features of the southern varieties is the uvular realization of, which may be a trill , a fricative or an approximant. The last one is the standard realization of in Danish.

In most varieties of Swedish that use an alveolar (in particular, the central and northern forms), the combination of  with dental consonants  produces retroflex consonant realizations, a recursive sandhi process called "retroflexion". Thus, ("map") is realized as,  ("north") as ,  ('Vänern') as , and  ('fresh') as. The combination of and, does not uniformly cause retroflexion, so that it may also be pronounced with two separate consonants , and even, occasionally in a few words and expressions, as a mere. Thus sorl ('murmur') may be pronounced, but also.

As the table to the right shows, this process is not limited by word boundaries, though there is still some sensitivity to the type of boundary between the and the dental in that retroflexion is less likely with boundaries higher up in the prosodic hierarchy. In the southern varieties, which use a uvular, retroflex realizations don't occur. For example, ('map') is realized as, etc. A double sequence usually won't trigger retroflexion so that spärrnät ('anti-sub net') is pronounced. The process of retroflexion is not limited to just one dental, and e.g. först is pronounced.

Variations of are not as common, though some phonetic variation exists, such as a retroflex flap  that exists as an allophone in proximity to a labial or velar consonant (e.g. glad, 'glad') or after most long vowels.

In casual speech, the nasals tend to assimilate to the place of articulation of a following obstruent so that, for example, han kom ('he came') is pronounced.

Stress and pitch
As in English, there are many Swedish word pairs that are differentiated by stress:


 * formel &mdash; 'formula'
 * formell &mdash; 'formal'

Stressed syllables differentiate two tones, often described as pitch accents, or tonal word accents by Scandinavian linguists. They are called acute and grave accent, tone/accent 1 and tone/accent 2, or Single Tone and Double Tone. The actual realizations of these two tones varies from dialect to dialect. In Standard Central Swedish, for example, the acute accent has a low tone while the grave accent has a high one. Generally, the grave accent is characterized by a later timing of the intonational pitch rise as compared with the acute accent; the so-called two-peaked dialects (such as Central and Western Swedish) also have another, earlier pitch peak in the grave accent, hence the term "two-peaked".

The phonemicity of this tonal system is demonstrated in the nearly 300 pairs of two-syllable words differentiated only by their use of either grave or acute accent. Outside of these pairs, the main tendency for tone is that the acute accent appears in monosyllables (since the grave accent cannot appear in monosyllabic words) while the grave accent appears in polysyllabic words. Polysyllabic forms resulting from declension or derivation also tend to have a grave accent except when it is the definite article that is added. This tonal distinction has been present in Scandinavian dialects at least since Old Norse though a greater number of polysyllables now have an acute accent. These are mostly words that were monosyllabic in Old Norse, but have subsequently become disyllabic, as have many loanwords. For example, Old Norse kømr ('comes') has become kommer in Swedish (with an acute accent).

In Central Swedish, this is a high, slightly falling tone followed by a low tone; that is, a single drop from high to low pitch spread over two syllables. In Central Swedish, a mid falling tone followed by a high falling tone; that is, a double falling tone.
 * Acute accent: anden or  &mdash; 'the duck' (from and 'duck')
 * Grave accent: anden or  &mdash; 'the spirit' (from ande 'spirit')

The exact realization of the tones also depends on the syllable's position in an utterance. For instance, at the beginning of an utterance, the acute accent may have a rising rather than slightly falling pitch on the first syllable. Also, these are word tones that are spread across the syllables of the word. In the grave accent, trisyllabic words the second fall in pitch is distributed across the second and third syllables, with the result that the pitches are mid–low falling, high–mid falling, and low:


 * Grave-accent trisyllable: flickorna or  &mdash; 'the girls'

The position of the tone is dependent upon stress: The first stressed syllable has a high or falling tone, as does the following syllable(s) in grave-accented words.

Prosody in Swedish often varies substantially between different dialects including the spoken varieties of Standard Swedish. As in most languages, stress can be applied to emphasize certain words in a sentence. To some degree prosody may indicate questions, although less so than in English.

In most Finland-Swedish varieties, however, the distinction between grave and acute accent is missing.

Phonotactics
At a minimum, a word must consist of either a long vowel or a short vowel and a long consonant. Like many other Germanic languages, Swedish has a tendency for closed syllables with a relatively large amount of consonant clusters in initial as well as final position. Though not as complex as that of most Slavic languages, examples of up to 7 consecutive consonants can occur when adding Swedish inflections to some foreign loanwords or names, and especially when combined with the tendency of Swedish to make long compound nouns. The syllable structure of Swedish can therefore be described with the following formula:


 * (C)(C)(C)V(C)(C)(C)

This means that a Swedish one-syllable morpheme can have up to three consonants preceding the vowel that forms the nucleus of the syllable, and three consonants following it. Examples: skrämts (verb 'scare' past participle, passive voice) or sprängts  (verb 'explode' past participle, passive voice). All but one of the consonant phonemes,, can occur at the beginning of a morpheme, though there are only 6 possible three-consonant combinations, all of which begin with , and a total of 31 initial two-consonant combinations. All consonants except for and  can occur finally, and the total amount of possible final two-consonant clusters is 62.

In some cases this can result in near-unpronounceable combinations, such as in västkustskt, consisting of västkust ('west coast') with the adjective suffix -sk and the neuter suffix -t.

Central Standard Swedish and most other Swedish dialects feature a rare "complementary quantity" feature wherein a phonologically short consonant follows a long vowel and a long consonant follows a short vowel; this is true only for stressed syllables and all segments are short in unstressed syllables. This arose from the historical shift away from a system with a four-way contrast (that is, VːCː, VC, VːC, and VCː were all possible) inherited from Proto-Germanic to a three-way one (VC, VːC, and VCː), and finally the present two-way one; certain Swedish dialects have not undergone these shifts and exhibit one of the other two phonotactic systems instead. In literature on Swedish phonology, there are a number of ways to transcribe complementary relationship, including:
 * A length mark for either the vowel, the consonant , or both.
 * Gemination of the consonant ( vs. )
 * Diphthongization of the vowel ( vs. )
 * The position of the stress marker ( vs. )

With the conventional assumption that medial long consonants are ambisyllabic (that is, penna, 'pen', is syllabified as ), all stressed syllables are thus "heavy". In unstressed syllables, the distinction is lost between and  or between. With each successive post-stress syllable, the number of contrasting vowels decreases gradually with distance from the point of stress; at three syllables from stress, only and  occur.

Sample
The sample text is a reading of The North Wind and the Sun. The transcriptions are based on the section on Swedish found in The Handbook on the International Phonetic Association. The broad transcription is phonemic while the narrow is phonetic.

Orthographic version
Nordanvinden och solen tvistade en gång om vem av dem som var starkast. Just då kom en vandrare vägen fram insvept i en varm kappa. De kom då överens om att den som först kunde få vandraren att ta av sig kappan, han skulle anses vara starkare än den andra. Då blåste nordanvinden så hårt han nånsin kunde, men ju hårdare han blåste desto tätare svepte vandraren kappan om sig, och till slut gav nordanvinden upp försöket. Då lät solen sina strålar skina helt varmt och genast tog vandraren av sig kappan och så var nordanvinden tvungen att erkänna att solen var den starkaste av de två.