fbpx
Wikipedia

Two dots (diacritic)

Diacritical marks of two dots ¨, placed side-by-side over or under a letter, are used in a number of languages for several different purposes. The most familiar to English-language speakers are the diaeresis and the umlaut, though there are numerous others. For example, in Albanian, ë represents a schwa. Such diacritics are also sometimes used for stylistic reasons (as in the family name Brontë or the band name Mötley Crüe).

◌̈ ◌̤
Two dots
  • U+0308 ◌̈ COMBINING DIAERESIS[a]
  • U+0324 ◌̤ COMBINING DIAERESIS BELOW
  • U+07F3 ߳ NKO COMBINING DOUBLE DOT ABOVE

In modern computer systems using Unicode, the two-dot diacritics are almost always encoded identically, having the same code point.[1] For example, U+00E4 ä LATIN SMALL LETTER A WITH DIAERESIS represents both a-umlaut and a-diaeresis. Their appearance in print or on screen may vary between typefaces but rarely within the same typeface.

Uses edit

Diaeresis edit

The "diaeresis" diacritic is used to mark the separation of two distinct vowels in adjacent syllables when an instance of diaeresis (or hiatus) occurs, so as to distinguish from a digraph or diphthong. For example, in the spelling "coöperate", the diaeresis reminds the reader that the word has four syllables co-op-er-ate, not three. It is used in several languages of western and southern Europe, though rarely now in English.[2]

Umlaut edit

The "umlaut" diacritic indicates a sound shift phenomenon – also known as umlaut – in which a back vowel becomes a front vowel. It is a specific phenomenon in German and other Germanic languages, affecting the graphemes ⟨a⟩, ⟨o⟩, ⟨u⟩ and ⟨au⟩, which are modified to ä, ö, ü and ⟨äu⟩.

Stylistic use edit

The two dot diacritic is also sometimes used for purely stylistic reasons. For example, the Brontë family, whose surname was derived from gaelic and had been anglicised as "Prunty", or "Brunty": At some point, the father of the sisters, Patrick Brontë (born Brunty), decided on the alternative spelling with a diaeresis diacritic over the terminal ⟨e⟩ to indicate that the name had two syllables.

Similarly the "metal umlaut" is a diacritic that is sometimes used gratuitously or decoratively over letters in the names of hard rock or heavy metal bands – for example, those of Motörhead and Mötley Crüe, and of parody bands, such as Spın̈al Tap.

Other uses by language edit

A double dot is also used as a diacritic in cases where it functions as neither a diaeresis nor an umlaut. In the International Phonetic Alphabet (IPA), a double dot above a letter is used for a centralized vowel, a situation more similar to umlaut than to diaeresis. In other languages it is used for vowel length, nasalization, tone, and various other uses where diaeresis or umlaut was available typographically. The IPA uses a double dot below a letter to indicate breathy (murmured) voice.[3][b]

Vowels edit

  • In Albanian, Tagalog, and Kashubian, ⟨ë⟩ represents a schwa [ə].
  • In Aymara, a double dot is used on ⟨ä⟩ ⟨ï⟩ ⟨ü⟩ for vowel length.
  • In the Basque dialect of Soule, ⟨ü⟩ represents [y]
  • In the DMG romanization of Tunisian Arabic, ⟨ä⟩, ⟨ö⟩, ⟨ṏ⟩, ⟨ü⟩, and ⟨ṻ⟩ represent [æ], [œ], [œ̃], [y], and [y:].
  • In Ligurian official orthography, ⟨ö⟩ is used to represent the sound [oː].
  • In Māori, a diaeresis (e.g. Mäori) was often used on computers in the past instead of the macron to indicate long vowels, as the diaeresis was relatively easy to produce on many systems, and the macron difficult or impossible.[4][5]
  • In Seneca, ⟨ë⟩ ⟨ö⟩ are nasal vowels, though ⟨ä⟩ is [ɛ], as in German umlaut.
  • In Vurës (Vanuatu), ⟨ë⟩ and ⟨ö⟩ encode respectively [œ] and [ø].
  • In the Pahawh Hmong script, a double dot is used as one of several tone marks.
  • The double dot was used in the early Cyrillic alphabet, which was used to write Old Church Slavonic. The modern Cyrillic Belarusian and Russian alphabets include the letter ⟨ё⟩ (yo), although replacing it with the letter е without the diacritic is allowed in Russian.
  • Since the 1870s, ⟨Ї⟩, ⟨ї⟩ (Cyrillic letter yi) has been used in the Ukrainian alphabet for iotated [ji]; plain і is not iotated [i]. In Udmurt, ӥ is used for uniotated [i], with и for iotated [ji].
  • The form ⟨ÿ⟩ is common in Dutch handwriting and also occasionally used in printed text – but is a form of the digraph "ij" rather than a modification of the letter ⟨y⟩.
  • Komi and Udmurt use Ӧ (a Cyrillic O with two dots) for [ə].
  • The Swedish, Finnish and Estonian languages use Ä and Ö to represent [æ] and [ø]
  • In the languages of J.R.R. Tolkien's Middle-Earth novels, a diaeresis is used to separate vowels belonging to different syllables (e.g. in Eärendil) and on final e to mark it as not a schwa (e.g. in Manwë, Aulë, Oromë, etc.). (There is no schwa in these languages but Tolkien wanted to make sure that readers wouldn't mistakenly pronounce one when speaking the names aloud.)[citation needed]

Consonants edit

Jacaltec (a Mayan language) and Malagasy are among the very few languages with a double dot on the letter "n"; in both, is the velar nasal [ŋ].

In Udmurt, a double dot is also used with the consonant letters ӝ [dʒ] (from ж [ʒ]), ӟ [dʑ] (from з [z] ~ [ʑ]) and ӵ [tʃ] (from ч [tɕ]).

When distinction is important, and are used for representing [ħ] and [ɣ] in the Kurdish Kurmanji alphabet (which are otherwise represented by "h" and "x"). These sounds are borrowed from Arabic.

and ÿ: Ÿ is generally a vowel, but it is used as the (semi-vowel) consonant [ɰ] (a [w] without the use of the lips) in Tlingit. This sound is also found in Coast Tsimshian, where it is written .

A number of languages in Vanuatu use double dots on consonants, to represent linguolabial (or "apicolabial") phonemes in their orthography. Thus Araki contrasts bilabial p [p] with linguolabial [t̼]; bilabial m [m] with linguolabial [n̼]; and bilabial v [β] with linguolabial [ð̼].

Seneca uses ⟨s̈⟩ for [ʃ].

In Arabic the letter is used in the ISO 233 transliteration for the tāʾ marbūṭah [ة], used to mark feminine gender in nouns and adjectives.

Syriac uses a two dots above a letter, called Siyame, to indicate that the word should be understood as plural. For instance, ܒܝܬܐ (bayta) means "house", while ܒܝ̈ܬܐ (bayte) means "houses". The sign is used especially when no vowel marks are present, which could differentiate between the two forms. Although the origin of the Siyame is different from that of the diaeresis sign, in modern computer systems both are represented by the same Unicode character. This, however, often leads to wrong rendering of the Syriac text.

The N'Ko script, used to write the Mandé languages of West Africa uses a two-dot diacritic (among others) to represent non-native sounds. The dots are slightly larger than those used for diaeresis or umlaut.

Diacritic underneath edit

The IPA specifies a "subscript umlaut", for example Hindi [kʊm̤ar] "potter";[3]: 25  the ALA-LC romanization system provides for its use and is one of the main schemes to romanize Persian (for example, rendering ض as ⟨z̤⟩). The notation was used to write some Asian languages in Latin script, for example Red Karen.

Computer encodings edit

In Unicode edit

Character encoding generally treats the umlaut and the diaeresis as the same diacritic mark. Unicode refers to both as diaereses without making any distinction, although the term itself has a more precise literary meaning. For example, U+00E4 ä LATIN SMALL LETTER A WITH DIAERESIS represents both a-umlaut and a-diaeresis, while similar codes are used to represent all such cases.

Unicode encodes a number of cases of "letter with a two dots diacritic" as precomposed characters and these are displayed above. (Unicode uses the term "Diaeresis" for all two-dot diacritics, irrespective of the actual term used for the language in question.) In addition, many more symbols may be composed using the combining character facility, U+0308 ◌̈ COMBINING DIAERESIS, that may be used with any letter or other diacritic to create a customised symbol but this does not mean that the result has any real-world application.

Both the combining character U+0308 and the pre-composed codepoints may be regarded as an umlaut or a diaeresis according to context. Compound diacritics are possible, for example U+01DA ǚ LATIN SMALL LETTER U WITH DIAERESIS AND CARON, used as a tonal marks for Hanyu Pinyin, which uses both a two dots diacritic with a caron diacritic. Conversely, when the letter to be accented is an ⟨i⟩, the diacritic replaces the tittle, thus: ⟨ï⟩.

Sometimes, there's a need to distinguish between the umlaut sign and the diaeresis sign. For instance, either may appear in a German name. ISO/IEC JTC 1/SC 2/WG 2 recommends the following for these cases:[6]

  • To represent the umlaut use the Combining Diaeresis (U+0308)
  • To represent the diaeresis use Combining Grapheme Joiner (CGJ, U+034F) + Combining Diaeresis (U+0308)

The same advice can be found in the official Unicode FAQ.[7]

Since version 3.2.0, Unicode also provides U+0364 ◌ͤ COMBINING LATIN SMALL LETTER E which can produce the older umlaut typography.

Unicode provides a combining double dot below as U+0324 ◌̤ COMBINING DIAERESIS BELOW.

Finally, for use with the N'Ko script, there is U+07F3 ◌߳ NKO COMBINING DOUBLE DOT ABOVE.

In ASCII, ISO/IEC 646 and ISO 8859 edit

ASCII, a seven-bit code with just 95 "printable" characters, has no provision for any kind of dot diacritic. Subsequent standardisation treated ASCII as the US national variant of ISO/IEC 646: the French, German and other national variants reassigned a few code points to specific vowels with diacritics, as precomposed characters.

The subsequent (eight bit) ISO 8859-1 character encoding includes the letters ä, ë, ï, ö, ü, and their respective capital forms, as well as ÿ in lower case only, with Ÿ added in the revised edition ISO 8859-15 and Windows-1252.

These standards are technically obsolete, having been replaced by Unicode.

Computer usage edit

Character encoding generally treats the umlaut and the diaeresis as the same diacritic mark. Unicode refers to both as diaereses without making any distinction, although the term itself has a more precise literary meaning. For example, U+00E4 ä LATIN SMALL LETTER A WITH DIAERESIS represents both a-umlaut and a-diaeresis, while similar codes are used to represent all such cases.

Keyboard input edit

 
Letters with umlaut on a German computer keyboard.

If letters with double dots are not present on the keyboard (or if they are not recognized by the operating system), there are a number of ways to input them into a computer system.

Apple MacOS, iOS edit

iOS provides accented letters through press-and-hold on most European Latin-script keyboards, including English. Some keyboard layouts feature combining-accent keys that can add accents to any appropriate letter. A letter with double dots can be produced by pressing ⌥ Option+U, then the letter. This works on English and other keyboards and is documented further in the supplied manuals.[8]

Google ChromeOS edit

For ChromeOS with US-International keyboard setting, the combination is "+(letter).[9] For ChromeOS with UK extended setting, use AltGr⇧ Shift2, release, then the letter.[citation needed] Alternatively, the Unicode codepoint may be entered directly, using Ctrl+⇧ Shift+u, release, then the four-digit code, then ↵ Enter or Space.[10]

Linux edit

In some Linux desktop environments a letter with double dots can be produced by pressing AltGr⇧ Shift:, then the letter.[citation needed] When the system has a compose key, the same procedure as that described at X-Windows (below) may be used.

Microsoft Windows edit

AZERTY and QZERTY keyboards (as used in much of Europe) include precomposed characters (accented letters) as standard and these are fully supported by Microsoft Windows, typically accessed using the AltGr key.

For users with a US keyboard layout, Windows includes a setting "US International", which supports creation of accented letters by changing the function of some keys into dead keys. If the user enters ", nothing will appear on screen, until the user types another character, after which the characters will be merged if possible, or added independently at once if not. Otherwise, the desired character may be generated using the Alt table above.

For users in the United Kingdom and Ireland with QWERTY keyboards, Windows has an "Extended" setting such that an accented letter can be created using AltGr2 then the base letter.

When using Microsoft Word or Outlook, a letter with double dots can be produced by pressing Ctrl⇧ Shift: and then the letter.

On Microsoft Windows keyboard layouts that do not have double dotted characters, one may use Windows Alt keycodes. Double dots are then entered by pressing the left Alt key, and entering the full decimal value of the character's position in the Windows code page on the numeric keypad, provided that the compatible code page is used as a system code page. One can also use numbers from Code page 850; these are used without a leading 0.

Character Windows Code Page Code CP850 Code Unicode
ä Alt+0228 Alt+132 00E4
ë Alt+0235 Alt+137 00EB
ï Alt+0239 Alt+139 00EF
ö Alt+0246 Alt+148 00F6
ü Alt+0252 Alt+129 00FC
ÿ Alt+0255 Alt+152 00FF
Ä Alt+0196 Alt+142 00C4
Ë Alt+0203 Alt+211 00CB
Ï Alt+0207 Alt+216 00CF
Ö Alt+0214 Alt+153 00D6
Ü Alt+0220 Alt+154 00DC
Ÿ Alt+0159 N/A 0178

X Window System edit

X-based systems with a Compose key set in the system can usually insert characters with double dots by typing Compose, quotedbl (i.e. ") followed by the letter. Compose+⇧ Shift, letter may also work, depending on the system's set-up. However, most modern UNIX-like systems also accept the sequence Compose+⇧ Shift+U to initiate the direct input of a Unicode value. Thus, typing Compose+⇧ Shift+U, 00F6, finishing with Space or ↵ Enter, will insert ö into the document.

Dedicated keys edit

The German keyboard has dedicated keys for ü ö ä. Scandinavian and Turkish keyboards have dedicated keys for their respective language-specific letters, including ö for Swedish, Finnish, and Icelandic, and both ö and ü for Turkish. French and Belgian AZERTY keyboards have a dead key which adds a circumflex (if without Shift) or a diaeresis/umlaut (if with Shift) to the letter key immediately following (for instance Shift-^ followed by e gives ë).

Other scripts edit

For non-Latin scripts, Greek and Russian use press-and-hold for double-dot diacritics on only a few characters. The Greek keyboard has dialytica and dialytica–tonos variants for upsilon and iota (ϋ ΰ ϊ ΐ), but not for ε ο α η ω, following modern monotonic usage. Russian keyboards feature separate keys for е and ё.

On-screen keyboards edit

The early 21st century has seen noticeable growth in stylus- and touch-operated interfaces, making the use of on-screen keyboards operated by pointing devices (mouse, stylus, or finger) more important. These "soft" keyboards may replicate the modifier keys found on hardware keyboards, but they may also employ other means of selecting options from a base key, such as right-click or press-and-hold. Soft keyboards may also have multiple contexts, such as letter, numeric, and symbol.

HTML edit

In HTML, vowels with double dots can be entered with an entity reference of the form &?uml;, where ? can be any of a, e, i, o, u, y or their majuscule counterparts. With the exception of the uppercase Ÿ, these characters are also available in all of the ISO 8859 character sets and thus have the same codepoints in ISO-8859-1 (-2, -3, -4, -9, -10, -13, -14, -15, -16) and Unicode. The uppercase Ÿ is available in ISO 8859-15 and Unicode, and Unicode provides a number of other letters with double dots as well.

Umlauts
Character Replacement HTML Unicode
Ä
ä
A or Ae
a or ae
Ä
ä
U+00C4
U+00E4
Ö
ö
O or Oe
o or oe
Ö
ö
U+00D6
U+00F6
Ü
ü
U or Ue
u or ue
Ü
ü
U+00DC
U+00FC
Other double dots
Character HTML Unicode
Ë
ë
Ë
ë
U+00CB
U+00EB

U+1E26
U+1E27
Ï
ï
Ï
ï
U+00CF
U+00EF
U+1E97

U+1E72
U+1E73

U+1E84
U+1E85

U+1E8C
U+1E8D
Ÿ
ÿ
Ÿ
ÿ
U+0178
U+00FF

Note: when replacing umlaut characters with plain ASCII, use ae, oe, etc. for the German language, and the simple character replacements for all other languages.

TeX and LaTeX edit

TeX (and its derivatives, most notably LaTeX) also allows double dots to be placed over letters. The standard way is to use the control sequence \" followed by the relevant letter, e.g. \"u. It is good practice to set the sequence off with curly braces: {\"u} or \"{u}.

TeX's "German" package can be used: it adds the " control sequence (without the backslash) to produce the Umlaut. However, this can cause conflicts if the main language of the document is not German. Since the integration of Unicode through the development of XeTeX and XeLaTeX, it is also possible to input the Unicode character directly into the document, using one of the recognized methods such as Compose key or direct Unicode input.

TeX's traditional control sequences can still be used and will produce the same output (in very early versions of TeX these sequences would produce double dots that were too far above the letter's body).

All these methods can be used with all available font variations (italic, bold etc.).

See also edit

Notes edit

  1. ^ The diacritic is referred to in Unicode as a diaeresis, without distinction, although the term has a more precise literary meaning.
  2. ^ The IPA Handbook calls the mark "subscript umlaut", in contrast with the Unicode Consortium's choice of "diaeresis below".

References edit

  1. ^ The Unicode Standard v 5.0. San Francisco: Addison-Wesley. 2006. p. 228. ISBN 0-321-48091-0.
  2. ^ Baum, Dan (16 December 2010). . Dscriber. Trade Secrets. Archived from the original on 16 December 2010. Among the many mysteries of The New Yorker is that funny little umlaut over words like coöperate and reëlect. The New Yorker seems to be the only publication on the planet that uses it, and I always found it a little pretentious until I did some research. Turns out, it's not an umlaut. It's a diaeresis.
  3. ^ a b International Phonetic Association (2021). Handbook of the International Phonetic Association : a guide to the use of the International Phonetic Alphabet. Cambridge: Cambridge University Press. ISBN 9780521652360..
  4. ^ . Māori Language Commission. Archived from the original on 2009-09-06. Retrieved 11 June 2010.
  5. ^ "Māori language on the internet". Te Ara: The Encyclopedia of New Zealand.
  6. ^ Kaplan, Michael S (4 September 2006). "Every character has a story #24: U+0308 (COMBINING DIAERESIS)".
  7. ^ "Characters and Combining Marks | Q: Unicode doesn't seem to distinguish between tréma and umlaut, but I need to distinguish. What shall I do?". Unicode Consortium.
  8. ^ "Enter characters with accent marks on Mac". apple.com. Retrieved 2021-08-07.
  9. ^ Randall, Angela (February 18, 2014). "How to Write Foreign Character Accents Using Your Chromebook". Retrieved March 2, 2020.
  10. ^ Busch, Jack (April 20, 2018). "Type Special Characters with a Chromebook (Accents, Symbols, Em Dashes)". groovypost.com. Retrieved February 28, 2020.

External links edit

dots, diacritic, diacritical, marks, dots, placed, side, side, over, under, letter, used, number, languages, several, different, purposes, most, familiar, english, language, speakers, diaeresis, umlaut, though, there, numerous, others, example, albanian, repre. Diacritical marks of two dots placed side by side over or under a letter are used in a number of languages for several different purposes The most familiar to English language speakers are the diaeresis and the umlaut though there are numerous others For example in Albanian e represents a schwa Such diacritics are also sometimes used for stylistic reasons as in the family name Bronte or the band name Motley Crue Two dotsU 0308 COMBINING DIAERESIS a U 0324 COMBINING DIAERESIS BELOWU 07F3 NKO COMBINING DOUBLE DOT ABOVEThis page uses orthographic and related notations For the notations and used in this article see IPA Brackets and transcription delimiters In modern computer systems using Unicode the two dot diacritics are almost always encoded identically having the same code point 1 For example U 00E4 a LATIN SMALL LETTER A WITH DIAERESIS represents both a umlaut and a diaeresis Their appearance in print or on screen may vary between typefaces but rarely within the same typeface Contents 1 Uses 1 1 Diaeresis 1 2 Umlaut 1 3 Stylistic use 1 4 Other uses by language 1 4 1 Vowels 1 4 2 Consonants 1 4 3 Diacritic underneath 2 Computer encodings 2 1 In Unicode 2 2 In ASCII ISO IEC 646 and ISO 8859 3 Computer usage 3 1 Keyboard input 3 1 1 Apple MacOS iOS 3 1 2 Google ChromeOS 3 1 3 Linux 3 1 4 Microsoft Windows 3 1 5 X Window System 3 1 6 Dedicated keys 3 1 7 Other scripts 3 2 On screen keyboards 3 3 HTML 3 4 TeX and LaTeX 4 See also 5 Notes 6 References 7 External linksUses editDiaeresis edit Main article Diaeresis diacritic The diaeresis diacritic is used to mark the separation of two distinct vowels in adjacent syllables when an instance of diaeresis or hiatus occurs so as to distinguish from a digraph or diphthong For example in the spelling cooperate the diaeresis reminds the reader that the word has four syllables co op er ate not three It is used in several languages of western and southern Europe though rarely now in English 2 Umlaut edit Main article Umlaut diacritic The umlaut diacritic indicates a sound shift phenomenon also known as umlaut in which a back vowel becomes a front vowel It is a specific phenomenon in German and other Germanic languages affecting the graphemes a o u and au which are modified to a o u and au Stylistic use edit The two dot diacritic is also sometimes used for purely stylistic reasons For example the Bronte family whose surname was derived from gaelic and had been anglicised as Prunty or Brunty At some point the father of the sisters Patrick Bronte born Brunty decided on the alternative spelling with a diaeresis diacritic over the terminal e to indicate that the name had two syllables Similarly the metal umlaut is a diacritic that is sometimes used gratuitously or decoratively over letters in the names of hard rock or heavy metal bands for example those of Motorhead and Motley Crue and of parody bands such as Spin al Tap Other uses by language edit A double dot is also used as a diacritic in cases where it functions as neither a diaeresis nor an umlaut In the International Phonetic Alphabet IPA a double dot above a letter is used for a centralized vowel a situation more similar to umlaut than to diaeresis In other languages it is used for vowel length nasalization tone and various other uses where diaeresis or umlaut was available typographically The IPA uses a double dot below a letter to indicate breathy murmured voice 3 b Vowels edit In Albanian Tagalog and Kashubian e represents a schwa e In Aymara a double dot is used on a i u for vowel length In the Basque dialect of Soule u represents y In the DMG romanization of Tunisian Arabic a o ṏ u and ṻ represent ae œ œ y and y In Ligurian official orthography o is used to represent the sound oː In Maori a diaeresis e g Maori was often used on computers in the past instead of the macron to indicate long vowels as the diaeresis was relatively easy to produce on many systems and the macron difficult or impossible 4 5 In Seneca e o are nasal vowels though a is ɛ as in German umlaut In Vures Vanuatu e and o encode respectively œ and o In the Pahawh Hmong script a double dot is used as one of several tone marks The double dot was used in the early Cyrillic alphabet which was used to write Old Church Slavonic The modern Cyrillic Belarusian and Russian alphabets include the letter yo yo although replacing it with the letter e without the diacritic is allowed in Russian Since the 1870s Yi yi Cyrillic letter yi has been used in the Ukrainian alphabet for iotated ji plain i is not iotated i In Udmurt ӥ is used for uniotated i with i for iotated ji The form y is common in Dutch handwriting and also occasionally used in printed text but is a form of the digraph ij rather than a modification of the letter y Komi and Udmurt use Ӧ a Cyrillic O with two dots for e The Swedish Finnish and Estonian languages use A and O to represent ae and o In the languages of J R R Tolkien s Middle Earth novels a diaeresis is used to separate vowels belonging to different syllables e g in Earendil and on final e to mark it as not a schwa e g in Manwe Aule Orome etc There is no schwa in these languages but Tolkien wanted to make sure that readers wouldn t mistakenly pronounce one when speaking the names aloud citation needed Consonants edit Jacaltec a Mayan language and Malagasy are among the very few languages with a double dot on the letter n in both n is the velar nasal ŋ In Udmurt a double dot is also used with the consonant letters ӝ dʒ from zh ʒ ӟ dʑ from z z ʑ and ӵ tʃ from ch tɕ When distinction is important Ḧ and ẍ are used for representing ħ and ɣ in the Kurdish Kurmanji alphabet which are otherwise represented by h and x These sounds are borrowed from Arabic Ẅ and y Ÿ is generally a vowel but it is used as the semi vowel consonant ɰ a w without the use of the lips in Tlingit This sound is also found in Coast Tsimshian where it is written ẅ A number of languages in Vanuatu use double dots on consonants to represent linguolabial or apicolabial phonemes in their orthography Thus Araki contrasts bilabial p p with linguolabial p t bilabial m m with linguolabial m n and bilabial v b with linguolabial v d Seneca uses s for ʃ In Arabic the letter ẗ is used in the ISO 233 transliteration for the taʾ marbuṭah ة used to mark feminine gender in nouns and adjectives Syriac uses a two dots above a letter called Siyame to indicate that the word should be understood as plural For instance ܒܝܬܐ bayta means house while ܒܝ ܬܐ bayte means houses The sign is used especially when no vowel marks are present which could differentiate between the two forms Although the origin of the Siyame is different from that of the diaeresis sign in modern computer systems both are represented by the same Unicode character This however often leads to wrong rendering of the Syriac text The N Ko script used to write the Mande languages of West Africa uses a two dot diacritic among others to represent non native sounds The dots are slightly larger than those used for diaeresis or umlaut Diacritic underneath edit See also Romanization The IPA specifies a subscript umlaut for example Hindi kʊm ar potter 3 25 the ALA LC romanization system provides for its use and is one of the main schemes to romanize Persian for example rendering ض as z The notation was used to write some Asian languages in Latin script for example Red Karen Computer encodings editIn Unicode edit Character encoding generally treats the umlaut and the diaeresis as the same diacritic mark Unicode refers to both as diaereses without making any distinction although the term itself has a more precise literary meaning For example U 00E4 a LATIN SMALL LETTER A WITH DIAERESIS represents both a umlaut and a diaeresis while similar codes are used to represent all such cases Unicode encodes a number of cases of letter with a two dots diacritic as precomposed characters and these are displayed above Unicode uses the term Diaeresis for all two dot diacritics irrespective of the actual term used for the language in question In addition many more symbols may be composed using the combining character facility U 0308 COMBINING DIAERESIS that may be used with any letter or other diacritic to create a customised symbol but this does not mean that the result has any real world application Diaeresis Latin A aǞ ǟA a B b C c E eḦ ḧI iḮ ḯJ j K k L l M m N n O oȪ ȫǪ ǫ Ṏ ṏP p Q q Q q R r S s T ẗU uǕ ǖǗ ǘǙ ǚǛ ǜṲ ṳṺ ṻṲ ṳ ᴞV v Ẅ ẅẌ ẍŸ yZ z Greek I iY yῢ y ῧϔ Cyrillic Ӓ ӓYo yoӚ ӛӜ ӝӞ ӟӤ ӥYi yiӦ ӧӪ ӫӰ ӱӴ ӵӸ ӹӬ ӭ Both the combining character U 0308 and the pre composed codepoints may be regarded as an umlaut or a diaeresis according to context Compound diacritics are possible for example U 01DA ǚ LATIN SMALL LETTER U WITH DIAERESIS AND CARON used as a tonal marks for Hanyu Pinyin which uses both a two dots diacritic with a caron diacritic Conversely when the letter to be accented is an i the diacritic replaces the tittle thus i Sometimes there s a need to distinguish between the umlaut sign and the diaeresis sign For instance either may appear in a German name ISO IEC JTC 1 SC 2 WG 2 recommends the following for these cases 6 To represent the umlaut use the Combining Diaeresis U 0308 To represent the diaeresis use Combining Grapheme Joiner CGJ U 034F Combining Diaeresis U 0308 The same advice can be found in the official Unicode FAQ 7 Since version 3 2 0 Unicode also provides U 0364 COMBINING LATIN SMALL LETTER E which can produce the older umlaut typography Unicode provides a combining double dot below as U 0324 COMBINING DIAERESIS BELOW Finally for use with the N Ko script there is U 07F3 NKO COMBINING DOUBLE DOT ABOVE In ASCII ISO IEC 646 and ISO 8859 edit For broader coverage of this topic see Western Latin character sets computing ASCII a seven bit code with just 95 printable characters has no provision for any kind of dot diacritic Subsequent standardisation treated ASCII as the US national variant of ISO IEC 646 the French German and other national variants reassigned a few code points to specific vowels with diacritics as precomposed characters The subsequent eight bit ISO 8859 1 character encoding includes the letters a e i o u and their respective capital forms as well as y in lower case only with Ÿ added in the revised edition ISO 8859 15 and Windows 1252 These standards are technically obsolete having been replaced by Unicode Computer usage editCharacter encoding generally treats the umlaut and the diaeresis as the same diacritic mark Unicode refers to both as diaereses without making any distinction although the term itself has a more precise literary meaning For example U 00E4 a LATIN SMALL LETTER A WITH DIAERESIS represents both a umlaut and a diaeresis while similar codes are used to represent all such cases Keyboard input edit nbsp Letters with umlaut on a German computer keyboard If letters with double dots are not present on the keyboard or if they are not recognized by the operating system there are a number of ways to input them into a computer system Apple MacOS iOS edit iOS provides accented letters through press and hold on most European Latin script keyboards including English Some keyboard layouts feature combining accent keys that can add accents to any appropriate letter A letter with double dots can be produced by pressing Option U then the letter This works on English and other keyboards and is documented further in the supplied manuals 8 Google ChromeOS edit For ChromeOS with US International keyboard setting the combination is letter 9 For ChromeOS with UK extended setting use AltGr Shift2 release then the letter citation needed Alternatively the Unicode codepoint may be entered directly using Ctrl Shift u release then the four digit code then Enter or Space 10 Linux edit In some Linux desktop environments a letter with double dots can be produced by pressing AltGr Shift then the letter citation needed When the system has a compose key the same procedure as that described at X Windows below may be used Microsoft Windows edit AZERTY and QZERTY keyboards as used in much of Europe include precomposed characters accented letters as standard and these are fully supported by Microsoft Windows typically accessed using the AltGr key For users with a US keyboard layout Windows includes a setting US International which supports creation of accented letters by changing the function of some keys into dead keys If the user enters nothing will appear on screen until the user types another character after which the characters will be merged if possible or added independently at once if not Otherwise the desired character may be generated using the Alt table above For users in the United Kingdom and Ireland with QWERTY keyboards Windows has an Extended setting such that an accented letter can be created using AltGr2 then the base letter When using Microsoft Word or Outlook a letter with double dots can be produced by pressing Ctrl Shift and then the letter On Microsoft Windows keyboard layouts that do not have double dotted characters one may use Windows Alt keycodes Double dots are then entered by pressing the left Alt key and entering the full decimal value of the character s position in the Windows code page on the numeric keypad provided that the compatible code page is used as a system code page One can also use numbers from Code page 850 these are used without a leading 0 Character Windows Code Page Code CP850 Code Unicodea Alt 0228 Alt 132 00E4e Alt 0235 Alt 137 00EBi Alt 0239 Alt 139 00EFo Alt 0246 Alt 148 00F6u Alt 0252 Alt 129 00FCy Alt 0255 Alt 152 00FFA Alt 0196 Alt 142 00C4E Alt 0203 Alt 211 00CBI Alt 0207 Alt 216 00CFO Alt 0214 Alt 153 00D6U Alt 0220 Alt 154 00DCŸ Alt 0159 N A 0178X Window System edit X based systems with a Compose key set in the system can usually insert characters with double dots by typing Compose quotedbl i e followed by the letter Compose Shift letter may also work depending on the system s set up However most modern UNIX like systems also accept the sequence Compose Shift U to initiate the direct input of a Unicode value Thus typing Compose Shift U 00F6 finishing with Space or Enter will insert o into the document Dedicated keys edit The German keyboard has dedicated keys for u o a Scandinavian and Turkish keyboards have dedicated keys for their respective language specific letters including o for Swedish Finnish and Icelandic and both o and u for Turkish French and Belgian AZERTY keyboards have a dead key which adds a circumflex if without Shift or a diaeresis umlaut if with Shift to the letter key immediately following for instance Shift followed by e gives e Other scripts edit For non Latin scripts Greek and Russian use press and hold for double dot diacritics on only a few characters The Greek keyboard has dialytica and dialytica tonos variants for upsilon and iota y y i i but not for e o a h w following modern monotonic usage Russian keyboards feature separate keys for e and yo On screen keyboards edit The early 21st century has seen noticeable growth in stylus and touch operated interfaces making the use of on screen keyboards operated by pointing devices mouse stylus or finger more important These soft keyboards may replicate the modifier keys found on hardware keyboards but they may also employ other means of selecting options from a base key such as right click or press and hold Soft keyboards may also have multiple contexts such as letter numeric and symbol HTML edit In HTML vowels with double dots can be entered with an entity reference of the form amp uml where can be any of a e i o u y or their majuscule counterparts With the exception of the uppercase Ÿ these characters are also available in all of the ISO 8859 character sets and thus have the same codepoints in ISO 8859 1 2 3 4 9 10 13 14 15 16 and Unicode The uppercase Ÿ is available in ISO 8859 15 and Unicode and Unicode provides a number of other letters with double dots as well Umlauts Character Replacement HTML UnicodeAa A or Aea or ae amp Auml amp auml U 00C4U 00E4Oo O or Oeo or oe amp Ouml amp ouml U 00D6U 00F6Uu U or Ueu or ue amp Uuml amp uuml U 00DCU 00FC Other double dots Character HTML UnicodeEe amp Euml amp euml U 00CBU 00EBḦḧ U 1E26U 1E27Ii amp Iuml amp iuml U 00CFU 00EFẗ U 1E97Ṳṳ U 1E72U 1E73Ẅẅ U 1E84U 1E85Ẍẍ U 1E8CU 1E8DŸy amp Yuml amp yuml U 0178U 00FF Note when replacing umlaut characters with plain ASCII use ae oe etc for the German language and the simple character replacements for all other languages TeX and LaTeX edit TeX and its derivatives most notably LaTeX also allows double dots to be placed over letters The standard way is to use the control sequence followed by the relevant letter e g u It is good practice to set the sequence off with curly braces u or u TeX s German package can be used it adds the control sequence without the backslash to produce the Umlaut However this can cause conflicts if the main language of the document is not German Since the integration of Unicode through the development of XeTeX and XeLaTeX it is also possible to input the Unicode character directly into the document using one of the recognized methods such as Compose key or direct Unicode input TeX s traditional control sequences can still be used and will produce the same output in very early versions of TeX these sequences would produce double dots that were too far above the letter s body All these methods can be used with all available font variations italic bold etc See also editDot diacritic Two dots disambiguation Notes edit The diacritic is referred to in Unicode as a diaeresis without distinction although the term has a more precise literary meaning The IPA Handbook calls the mark subscript umlaut in contrast with the Unicode Consortium s choice of diaeresis below References edit The Unicode Standard v 5 0 San Francisco Addison Wesley 2006 p 228 ISBN 0 321 48091 0 Baum Dan 16 December 2010 The New Yorker s odd mark the diaeresis Dscriber Trade Secrets Archived from the original on 16 December 2010 Among the many mysteries of The New Yorker is that funny little umlaut over words like cooperate and reelect The New Yorker seems to be the only publication on the planet that uses it and I always found it a little pretentious until I did some research Turns out it s not an umlaut It s a diaeresis a b International Phonetic Association 2021 Handbook of the International Phonetic Association a guide to the use of the International Phonetic Alphabet Cambridge Cambridge University Press ISBN 9780521652360 Maori Orthographic Conventions Maori Language Commission Archived from the original on 2009 09 06 Retrieved 11 June 2010 Maori language on the internet Te Ara The Encyclopedia of New Zealand Kaplan Michael S 4 September 2006 Every character has a story 24 U 0308 COMBINING DIAERESIS Characters and Combining Marks Q Unicode doesn t seem to distinguish between trema and umlaut but I need to distinguish What shall I do Unicode Consortium Enter characters with accent marks on Mac apple com Retrieved 2021 08 07 Randall Angela February 18 2014 How to Write Foreign Character Accents Using Your Chromebook Retrieved March 2 2020 Busch Jack April 20 2018 Type Special Characters with a Chromebook Accents Symbols Em Dashes groovypost com Retrieved February 28 2020 External links edit nbsp Look up a E e or o in Wiktionary the free dictionary Retrieved from https en wikipedia org w index php title Two dots diacritic amp oldid 1185450490, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.