IJ digraph

IJ (lowercase ij; Dutch pronunciation: [ɛi]; also encountered as the Unicode compatibility characters Ĳ and ĳ) is a digraph of the letters i and j. Occurring in the Dutch language, it is sometimes considered a ligature, or a letter in itself. In most fonts that have a separate character for ij, the two composing parts are not connected but are separate glyphs, which are sometimes slightly kerned.

An ij in written Dutch usually represents the diphthong [ɛi], similar to the pronunciation of ⟨ay⟩ in "pay".[1] In standard Dutch and most Dutch dialects, there are two possible spellings for the diphthong [ɛi]: ij and ei. That causes confusion for school children, who need to learn which words to write with ei and which with ij. To distinguish between the two, the ij is referred to as the lange ij ("long ij"), the ei as korte ei ("short ei") or simply E – I.[2] In certain Dutch dialects (notably West Flemish and Zeelandic) and the Dutch Low Saxon dialects of Low German, a difference in the pronunciation of ei and ij is maintained. Whether it is pronounced identically to ei or not, the pronunciation of ij is often perceived as being difficult by people who do not have either sound in their native language.

The ij originally represented a 'long i'.[3] It used to be written as ii, as in Finnish and Estonian, but for orthographic purposes the second i was eventually elongated, which is one reason why it is called lange ij. The older sound can still be heard in the pronunciation of some words, such as bijzonder (bi.zɔn.dər), and seen in the Dutch forms of several foreign placenames: Berlin and Paris are spelled Berlijn and Parijs. Nowadays, the pronunciation mostly follows the spelling, and they are pronounced with [ɛi]. The ij is distinct from the letter y. In the past, Y was commonly written instead of IJ, particularly in capitals. That practice has been deprecated since 1804, and the standard Dutch alphabet places ij between "x" and "z". The letter "y" is not part of the Dutch alphabet, but it is commonly used in loanwords and archaic names.[4] In scientific disciplines such as mathematics and physics, the symbol y is usually pronounced ij.[5]

To distinguish the Y from IJ in common speech, however, Y is often called Griekse ij (meaning "Greek Y"), a literal translation of i-grec (from French, with the stress on grec: [iˈɡrɛk]) or alternatively called Ypsilon. In modern Dutch, the letter Y occurs only in loanwords,[6] proper nouns, or when deliberately spelled as Early Modern Dutch. The spelling of Afrikaans (a daughter language of early modern Dutch) has evolved in the exact opposite direction and IJ has been completely replaced by Y.

However, the old use of Y in Dutch has survived in some personal names, particularly those of Dutch immigrants in the United States, Canada, Australia and New Zealand, where, as a result of anglicization, the IJ became a Y. For example, the surname Spijker was often changed into Spyker and Snijder into Snyder.

The words ijsvrij and yoghurt in various forms. Depending on the form of handwriting or font used, the IJ and Y can look either nearly identical or very different.
Apt to confusion: (1) i + j, (2) ligature ij, (3) y with diaeresis, (4) y; all in Garamond typeface
Logo of the Rijksmuseum in Amsterdam.
Two signs at Rijssen railway station, each using a different format: on one, IJ is written as one letter; on the other, IJ is written as Y.

History

IJ probably developed out of ii, representing a long [iː] sound (which it still does in some cases, such as in the word bijzonder and in several Dutch dialects).[3] In the Middle Ages, the i was written without a dot in handwriting, and the combination ıı was often confused with u. Therefore, the second i was elongated: ıȷ. Later, the dots were added, albeit not in Afrikaans, a language that has its roots in Dutch. In this language y is used instead.

Alternatively, the letter J may have developed as a swash form of i. In other European languages it was first used for the final i in Roman numerals when there was more than one i in a row, such as iij for "three", to prevent the fraudulent addition of an extra i to change the number. In Dutch, which had a native ii, the "final i in a row elongated" rule was applied as well, leading to ij.

Another theory is that IJ might have arisen from the lowercase y being split into two strokes in handwriting. At some time in the 15th or 16th century, this combination began to be spelled as a ligature ij. An argument against this theory is that even in handwriting which does not join letters, ij is often written as a single sign.

Some time after the birth of this new letter, the sound which was now represented by ij, in most cases, began to be pronounced much like ei instead, but words containing it were still spelled the same. Nowadays, ij in most cases represents the diphthong [ɛi], except in the suffix -lijk, where it is usually pronounced as a schwa. In one special case, the Dutch word bijzonder, the (old) sound [iː] is correct standard pronunciation, although [i] is more common and [ɛi] is also allowed.

In proper names, ij often appears instead of i at the end of other diphthongs, where it does not affect the pronunciation: aaij, eij, oeij, ooij and uij are pronounced identically to aai [aːi], ei [ɛi], oei [ui], ooi [oːi] and ui [œy]. This derives from an old orthographic practice (also seen in older French and German) of writing y instead of i after another vowel; later, when y and ij came to be seen as interchangeable, the spellings with ij came to be used. Spelling reforms and standardization have removed the redundant js in common words, but proper names continue to use these archaic spellings.

Status

A poster showing the letters of the alphabet used for writing education in the Netherlands. The final three letter pairs read "Xx IJij Zz".
In this version the ij is a single glyph.

As the rules of usage for the IJ differ from those that apply to the many other digraphs in the Dutch language—in some situations behaving more as a single ligature or letter than a digraph—the IJ is not only confusing to foreigners, but also a source of discussion among native speakers of Dutch. Its actual usage in the Netherlands and in Flanders (Belgium) sometimes differs from the official recommendations.

Official status

Both the Dutch Language Union and the Genootschap Onze Taal consider the ij to be a digraph of the letters i and j.[4][5] The descriptive dictionary Van Dale Groot woordenboek van de Nederlandse taal states that ij is a "letter combination consisting of the signs i and j, used, in some words, to represent the diphthong ɛi."[7] The Winkler Prins encyclopedia states that ij is the 25th letter of the Dutch alphabet, placed between X and Y. However, this definition is not generally accepted.[citation needed]

In words where i and j are in different syllables, they do not form the digraph ij. In compound words, a hyphen is added, as in gummi-jas.[8]

Netherlands

In the Netherlands, IJ is often used as a ligature:

  • In Dutch primary schools, ij used to be taught as the 25th letter of the alphabet, and some primary school writing materials still list it as such. However, according to Onze Taal, ij is not part of the Dutch alphabet and is usually sorted under the i, as it is considered to consist of two letters.[5]
  • When a word starting with IJ is capitalised, the entire digraph is capitalised: IJsselmeer, IJmuiden.[5]
  • On mechanical Dutch typewriters, there is a key that produces 'ij' (in a single letterspace, located directly to the right of the L). However, this is not the case on modern computer keyboards.
  • In many word puzzles, such as Lingo, ij fills one square, but in others, such as Scrabble, ij fills two squares.

Flanders

In Flanders (Belgium), IJ is generally described in schools as a combination of two separate characters.

  • As in the Netherlands, words that begin with IJ (and that are the first word in a sentence) usually capitalise the entire pair: IJzer, IJzertoren.

Usage

Capitalisation

Road sign in Rotterdam area with capitalised digraph IJ in proper name IJsselmonde

When a Dutch word starting with IJ is capitalised, the entire digraph is capitalised, as in IJsselmeer or IJmuiden.[9]

Support for this property in software is limited. Poorly localised text editors with autocorrect functionality may incorrectly convert the second of two initial capital letters in a word to lowercase; such improper spelling can thus be found in informal writing. Support on the internet is similarly inconsistent: Web pages styled with the CSS property text-transform: capitalize are specified to be handled with Unicode language-specific case mapping rules (content language being indicated with HTML language attributes, such as lang="nl" for Dutch), but support for language-specific cases is limited to Mozilla Firefox (version 14 and above) as of January 2021.[10]
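The capitalisation rule can be illustrated with a minimal Python sketch. The function name `capitalize_dutch` is hypothetical, and the sketch assumes that every word-initial "ij" is the digraph. It also shows how a generic routine (Python's built-in `str.capitalize`) produces the improper spelling described above.

```python
def capitalize_dutch(word: str) -> str:
    """Capitalise a Dutch word, treating a word-initial 'ij' as one unit.

    Assumption: any word-initial 'ij' is the digraph; the sketch does not
    look at syllable boundaries.
    """
    if word[:2].lower() == "ij":
        # Capitalise both letters of the digraph and keep the rest as given.
        return "IJ" + word[2:]
    # Otherwise capitalise only the first character.
    return word[:1].upper() + word[1:]


print(capitalize_dutch("ijsselmeer"))   # language-aware result
print("ijsselmeer".capitalize())        # generic result, wrong for Dutch
```

The second call lowercases everything after the first character, which is the kind of behaviour that leads to spellings such as "Ijsselmeer" in poorly localised software.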

Collation

Dutch dictionaries since about 1850 invariably sort ij as an i followed by a j, i.e. between ih and ik. This is the preferred sorting by the Taalunie.[4] On the other hand, some encyclopedias, like the Winkler Prins, 7th edition, sort ij as a single letter positioned between x and y.

Telephone directories as well as the Yellow Pages in the Netherlands (but not those in Belgium) sort ij and y together, as if they were the same, between x and z. Thanks to this, surnames like Bruijn and Bruyn which sound the same (and even look similar), can be found in the same area. However, Bruin, though it sounds the same as well, is placed with "Brui-" and not with "Bruy-".
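The two sorting conventions can be compared with a small Python sketch. The key functions `dictionary_key` and `directory_key` are hypothetical names, and the directory key is a deliberate simplification: it rewrites every "ij" as "y", including cases where i and j belong to different syllables.

```python
def dictionary_key(name: str) -> str:
    # Dictionary order: ij is just i followed by j, so a plain
    # case-insensitive comparison is enough.
    return name.lower()


def directory_key(name: str) -> str:
    # Telephone-directory order: ij and y are filed together,
    # approximated here by rewriting 'ij' as 'y' before comparing.
    return name.lower().replace("ij", "y")


names = ["Bruyn", "Bruin", "Bruijn", "Bruis"]

print(sorted(names, key=dictionary_key))
print(sorted(names, key=directory_key))
```

With the directory key, Bruijn and Bruyn receive the same key and end up next to each other, while Bruin stays with the other "Brui-" names, matching the behaviour described above.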

Abbreviations

When words or (first) names are shortened to their initials, in the Netherlands a word or proper name starting with IJ is abbreviated to IJ. For example, IJsbrand Eises Ypma is shortened to IJ. E. Ypma.[11] The digraph "ei" in "Eises", like other digraphs in Dutch, is shortened to one letter.
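As a rough illustration of this convention, the following Python sketch abbreviates given names to initials. The function names are hypothetical, and the sketch assumes that the last whitespace-separated token is the surname, which does not hold for surnames with prefixes such as "van der".

```python
def dutch_initial(name: str) -> str:
    """Return the initial of a given name, keeping a leading IJ together."""
    if name[:2].upper() == "IJ":
        return "IJ."
    # Other digraphs, such as 'ei', are shortened to one letter.
    return name[:1].upper() + "."


def abbreviate(full_name: str) -> str:
    # Assumption: everything before the last token is a given name.
    *given, surname = full_name.split()
    return " ".join([dutch_initial(n) for n in given] + [surname])


print(abbreviate("IJsbrand Eises Ypma"))
```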

Stress

The Dutch word bijna (almost, nearly) with ad hoc stress on the first syllable indicated by two acute accents on the digraph ij

In Dutch orthography, ad hoc indication of stress can be marked by placing an acute accent on the vowel of the stressed syllable. In case of a diphthong or double vowel, both vowels should be marked with an acute accent; this also applies to the IJ (even though J by itself is not a vowel, the digraph IJ represents one distinct vowel sound). However, due to technical limitations the accent on the j is often omitted in electronic documents: "bíjna".[12] Nevertheless, in Unicode it is possible to combine characters into a j with an acute accent — "bíj́na" — though this might not be supported or rendered correctly by some fonts or systems. This is the combination of the regular (soft-dotted) j (U+006A) and the combining acute accent  ́ (U+0301).

Spelling

Vrijdag can be spelled out in two ways, depending on whether the speller considers ij to be one letter or not:

  • V – R – IJ – D – A – G
  • V – R – I – J – D – A – G

Wide inter-letter spacing

On this signboard of a liquor store (slijterij), IJ occupies the same space as single letters. The I is put over the lower end of the J to reinforce their unity, but this is optional and I and J can also be found separately on other signs

When words are written with wide inter-letter spacing, IJ is often, but not always, kept together: F r a n k r ij k or F r a n k r i j k.

When words are written from top to bottom, with non-rotated letters, IJ is usually, but not always, kept together. Keeping it together is the preferred way.[11]

F
r
a
n
k
r
ij
k

or

F
r
a
n
k
r
i
j
k
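Both layouts can be produced by splitting a word into letter units before joining them with a separator. The Python sketch below uses hypothetical helper names and assumes that every "ij" sequence is the digraph, which is not true for words such as bijectie.

```python
import re


def letter_units(word: str, keep_ij: bool = True) -> list:
    """Split a word into letters, optionally keeping 'ij' as one unit."""
    if keep_ij:
        # Try 'ij' (in any case) first, then fall back to single characters.
        return re.findall(r"ij|.", word, flags=re.IGNORECASE)
    return list(word)


def letterspace(word: str, sep: str = " ", keep_ij: bool = True) -> str:
    return sep.join(letter_units(word, keep_ij))


print(letterspace("Frankrijk"))                 # spaced, ij kept together
print(letterspace("Frankrijk", keep_ij=False))  # spaced, i and j separate
print(letterspace("Frankrijk", sep="\n"))       # top to bottom
```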

Spelling of proper names

In Dutch names, interchangeability of i, ij and y is frequent. Some names are changed unofficially for commercial reasons or out of indifference.

The Dutch football club Feyenoord changed its name from the original "Feijenoord" to "Feyenoord" after achieving international successes. This was done in reaction to foreigners often mispronouncing the name. The Feijenoord district in Rotterdam, the namesake of the football club, maintains its original spelling.

Phonetic radio alphabet

In the Dutch phonetic radio alphabet, the codeword IJmuiden represents the IJ. This is clearly different from the codeword Ypsilon, which is used to represent the Y. Dutch and Belgian armed forces use the official NATO phonetic alphabet, where "Y" is "Yankee" and "IJ" is spelled out "India Juliett".

Word games

In crossword puzzles (except for Scrabble – see next paragraph), and in the game Lingo, IJ is considered one letter, filling one square, but the IJ and the Y are considered distinct. In other word games, the rules may vary.

The Dutch version of Scrabble has a Y with a face value of eight. Some players used it to represent IJ or Y. The recent Dutch version comes with an example game, which clearly indicates that Y is only Y, and IJ should be composed of I and J. In previous editions of Scrabble there was a single IJ sign.

In word games that distinguish between vowels and consonants, IJ is considered a vowel if it is treated as one letter. Whether Y is a vowel or a consonant is another matter of discussion, since Y can represent either a vowel or a (half-)consonant.

Technical details

Print and handwriting

Lijnbus (road section reserved for public bus line, literally "line bus") showing IJ as a "broken U" glyph

In print, the letter ÿ (lowercase y with diaeresis) and ij look very different, but handwriting usually makes ÿ, ij and Y, IJ look identical. However, since y occurs only in loanwords, the letter ÿ is extremely rare (if not altogether nonexistent) in Dutch.

The long ij extends below the baseline and so is written with a long stroke. It is often written as a single sign, even in handwriting that does not join letters.

On some road signs in the Netherlands, IJ appears as a single glyph formed like a U with a break in the left-hand stroke.

Uppercase IJ glyph with the distinctive "broken U" ligature in a Helvetica font for Omega TeX
IJ as a "broken U" glyph

Braille

Dutch Braille, which is used in the Netherlands, has ⟨ij⟩ represented by ⠽, which represents ⟨y⟩ in other varieties of Braille. ⟨y⟩ is written with a different sign.[13]

In Belgium, French Braille is used, in which ⟨ij⟩ is written simply as ⟨i⟩ + ⟨j⟩: ⠊⠚.

Encoding

The Dutch ij is not present in the ASCII code, nor in any of the ISO 8859 character encodings. Therefore, the digraph is most often encoded as an i followed by a j. The ligature is present as a national-use character within the Dutch version of ISO 646, one implementation of which is DEC's National Replacement Character Set (NRCS),[14] also known as code page 1102,[15] and it also existed in the Atari ST character set[16][17][18][19][20][21] (but not in the GEM character set for PCs) as well as in the Lotus Multi-Byte Character Set (LMBCS).[22][23] It is also present in Unicode in the Latin Extended-A range as U+0132 Ĳ LATIN CAPITAL LIGATURE IJ and U+0133 ĳ LATIN SMALL LIGATURE IJ.[24][25] These characters are considered compatibility-decomposable.[25] They are included for compatibility and round-trip convertibility with legacy encodings, but their use is discouraged.[26] Therefore, even with Unicode available, it is recommended to encode ij as two separate letters.[11][27] Nonetheless, some fonts use these code points for the ligated form of IJ if it exists.
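Text that contains the compatibility characters can be converted to the recommended two-letter form. The Python sketch below shows two options, assuming the standard `unicodedata` module: a targeted translation table (the names `IJ_LIGATURES` and `expand_ij_ligatures` are hypothetical) and NFKC normalisation, which applies the compatibility decomposition but also rewrites other compatibility characters in the text.

```python
import unicodedata

# Map each ligature code point to its two-letter equivalent.
IJ_LIGATURES = {0x0132: "IJ", 0x0133: "ij"}


def expand_ij_ligatures(text: str) -> str:
    """Replace U+0132 and U+0133 with plain I+J and i+j; leave the rest alone."""
    return text.translate(IJ_LIGATURES)


legacy = "\u0132sselmeer en \u0133s"

print(expand_ij_ligatures(legacy))
# NFKC gives the same result here, but is less selective in general.
print(unicodedata.normalize("NFKC", legacy) == expand_ij_ligatures(legacy))
```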

Keyboards

While Dutch typewriters usually have a separate key for lowercase ij, Belgian typewriters do not.[28] In the Netherlands, a QWERTY computer keyboard layout is common. The standard US layout (often in "International Mode") is widely used, although a specific but rarely used Dutch variant (KBD143) does exist. In Belgium, a specific Belgian variant of AZERTY keyboard layout (KBD120) is widely used. None of these keyboards feature a key for ij or IJ.

Not as a digraph

This Dutch shopkeeper wrote 'byoux' instead of 'bijoux'.

If the i and the j belong to different syllables, they do not form a ligature or a single letter. Examples are the mathematical term bijectie (syllabified "bi‧jec‧tie"), words in the old spelling such as minijurk (syllabified "mi‧ni‧jurk") and skijas (syllabified "ski‧jas"), foreign placenames like Beijing, Dijon and Fiji, and personal names like Khadija, Elijah and Marija. In these cases, the earlier statements about sorting ij on a par with y, keeping ij together in the kerning of printed texts, the single square in crossword puzzles, etc., do not apply.

References

  1. ^ Booij, GE (1995), The Phonology of Dutch (Google Books), Oxford University Press, p. 4, ISBN 9780198238690.
  2. ^ Woordenlijst Nederlandse Taal (in Dutch), pp. 22–23.
  3. ^ a b "IJ: oorsprong van de lange ij". Genootschap Onze Taal (in Dutch).
  4. ^ a b c "IJ - alfabetiseren". Nederlandse Taalunie (in Dutch). Retrieved 3 January 2015.
  5. ^ a b c d "IJ: plaats in alfabet". Genootschap Onze Taal (in Dutch). Retrieved 3 January 2015.
  6. ^ "Y (klinker / medeklinker)". Genootschap Onze Taal (in Dutch). Retrieved 3 January 2015.
  7. ^ Van Dale Groot woordenboek van de Nederlandse taal: "lettercombinatie bestaande uit de tekens i en j, gebruikt om, in een aantal woorden, de tweeklank ɛi weer te geven"
  8. ^ "7.1 Welke klinkers botsen? | woordenlijst". woordenlijst.org. Retrieved 2022-11-14.
  9. ^ "Ijsland / IJsland". Nederlandse Taalunie (in Dutch). Retrieved 3 January 2015.
  10. ^ "text-transform - CSS: Cascading Style Sheets | MDN". MDN Web Docs. Retrieved 30 September 2019.
  11. ^ a b c Demchenko, Yuri. "European rules for the use of the IJ in public records". UA zone. Retrieved 2012-07-19.
  12. ^ "Klemtoonteken (algemeen) - Taaladvies.net". taaladvies.net. Retrieved 2022-11-14.
  13. ^ Bols, Kim. "Het brailleschrift". BE: Kimbols. Archived from the original on 2009-02-09. Retrieved 2007-06-10.
  14. ^ "VT220 Programmer Reference Manual" (2 ed.). Digital Equipment Corporation (DEC). 1984 [1983].
  15. ^ "SBCS code page information - CPGID: 01102 / Name: Dutch NRC Set". IBM Software: Globalization: Coded character sets and related resources: Code pages by CPGID: Code page identifiers. 1. IBM. 1992-10-01. Archived from the original on 2016-12-05. Retrieved 2016-12-05.
  16. ^ Bettencourt, Rebecca G. (2016-08-01) [1999]. "Character Encodings - Legacy Encodings - Atari ST". Kreative Korporation. Retrieved 2016-08-09.
  17. ^ Kostis, Kosta; Lehmann, Alexander. "Atari ST/TT Character Encoding". 1.56. Archived from the original on 2017-01-16. Retrieved 2017-01-16.
  18. ^ "Atari Wiki - The Atari character set". Archived from the original on 2017-01-16. Retrieved 2017-01-16.
  19. ^ "Codepages / Ascii Table Atari ST/TT Character Encoding". ASCII.ca. 2016 [2006]. Archived from the original on 2017-01-16. Retrieved 2017-01-16.
  20. ^ Verdy, Philippe; Haible, Bruno; Zibis, Ulf; Rinquin, Yves-Marie K. (2015-10-08) [1998]. "AtariST to Unicode". 1.3. Retrieved 2016-12-09.
  21. ^ Flohr, Guido (2016) [2006]. "Locale::RecodeData::ATARI_ST - Conversion routines for ATARI-ST". CPAN libintl-perl. 1.1. Archived from the original on 2017-01-14. Retrieved 2017-01-14.
  22. ^ "lmb-excp.ucm". megadaddeln / icu_chrome. 2010 [1995]. Archived from the original on 2016-12-06. Retrieved 2016-12-06.
  23. ^ "Anhang 2. Der Lotus Multibyte Zeichensatz (LMBCS)" [Appendix 2. The Lotus Multibyte Character Set (LMBCS)]. Lotus 1-2-3 Version 3.1 Referenzhandbuch [Lotus 1-2-3 Version 3.1 Reference Manual] (in German) (1 ed.). Cambridge, MA, USA: Lotus Development Corporation. 1989. pp. A2-1–A2-13. 302168.
  24. ^ Latin Extended-A.
  25. ^ a b "Range 0100–017F: Latin Extended-A", Code charts (PDF) (10.0 ed.), Unicode.
  26. ^ "3" (PDF), The Unicode standard (4.0 ed.), The Unicode consortium, 2003, pp. 71–72.
  27. ^ "Unicode two and three Latin letter combinations", Scripts, SIL international.
  28. ^ Most Belgian typewriters use the French AZERTY keyboard, though some may use the Belgian one instead; in both cases minus the peripheral keys to the left of the 1 and to the right of the ù, and of course all modifiers (including AltGr) other than Shift and CapsLock. The latter keyboard is used on Belgian computers. Neither of them knows ij or IJ except as i+j or I+J.

Further reading