fbpx
Wikipedia

Urdu alphabet

The Urdu alphabet (Urdu: اردو حروفِ تہجی, romanizedurdū ḥurūf-i tahajjī) is the right-to-left alphabet used for writing Urdu. It is a modification of the Persian alphabet, which itself is derived from the Arabic script. It has official status in the republics of Pakistan, India and South Africa. The Urdu alphabet has up to 39[4] or 40[5] distinct letters with no distinct letter cases and is typically written in the calligraphic Nastaʿlīq script, whereas Arabic is more commonly written in the Naskh style.

Urdu Alphabet
اُرْدُو حُرُوفِ تَہَجِّی‌
Urdū ḥurūf-i tahajjī
The word Urdū written in the Urdu alphabet
Script type
Official script
Languages
Related scripts
Parent systems
Unicode
U+0600 to U+06FF

U+0750 to U+077F
U+FB50 to U+FDFF

U+FE70 to U+FEFF
 This article contains phonetic transcriptions in the International Phonetic Alphabet (IPA). For an introductory guide on IPA symbols, see Help:IPA. For the distinction between [ ], / / and  , see IPA § Brackets and transcription delimiters.

Usually, bare transliterations of Urdu into the Latin alphabet (called Roman Urdu) omit many phonemic elements that have no equivalent in English or other languages commonly written in the Latin script.

History edit

The standard Urdu script is a modified version of the Perso-Arabic script and has its origins in the 13th century Iran. It is also related to Shahmukhi, used for the Punjabi language varieties in Punjab, Pakistan. It is closely related to the development of the Nastaʻliq style of Perso-Arabic script.

Despite the invention of the Urdu typewriter in 1911, Urdu newspapers continued to publish prints of handwritten scripts by calligraphers known as katibs or khush-navees until the late 1980s. The Pakistani national newspaper Daily Jang was the first Urdu newspaper to use Nastaʿlīq computer-based composition. There are efforts under way to develop more sophisticated and user-friendly Urdu support on computers and the internet. Nowadays, nearly all Urdu newspapers, magazines, journals, and periodicals are composed on computers with Urdu software programs.

Other than the Indian subcontinent, the Urdu script is also used by Pakistan's large diaspora, including in the United Kingdom, the United Arab Emirates, the United States, Canada, Saudi Arabia and other places.[5]

Nastaliq edit

 
Example showing Nastaliq's (Persian) proportion rules.[citation needed]

Urdu is written in the Nastaliq style (Persian: نستعلیق Nastaʿlīq). The Nastaliq calligraphic writing style began as a Persian mixture of the Naskh and Ta'liq scripts. After the Muslim conquest of the Indian subcontinent, Nastaʻliq became the preferred writing style for Urdu. It is the dominant style in Pakistan and many Urdu writers elsewhere in the world use it. Nastaʿlīq is more cursive and flowing than its Naskh counterpart.

In the Arabic alphabet, and many others derived from it, letters are regarded as having two or three general forms each, based on their position in the word (though Arabic calligraphy can add a great deal of complexity). But the Nastaliq style in which Urdu is written uses more than three general forms for many letters, even in simple non-decorative documents.[6]

Alphabet edit

The Urdu script is an abjad script derived from the modern Persian script, which is itself a derivative of the Arabic script. As an abjad, the Urdu script only shows consonants and long vowels; short vowels can only be inferred by the consonants' relation to each other. While this type of script is convenient in Semitic languages like Arabic and Hebrew, whose consonant roots are the key of the sentence, Urdu is an Indo-European language, which requires more precision in vowel sound pronunciation, hence necessitating more memorisation. The number of letters in the Urdu alphabet is somewhat ambiguous and debated.[7]

Letter names and phonemes edit

Name[8] Forms IPA Romanization Unicode Order
Urdu
Roman Urdu
Isolated Final Medial Initial ALA-LC[9] Hunterian[10] [A] [12] [B]
الف
alif
ا ـا /ɑː/, /ʔ/, silent[C] ā, – ā, – U+0627 1 1 1
بے
ب ـب ـبـ بـ /b/ b b U+0628 2 2 2
پے
پ ـپ ـپـ پـ /p/ p p U+067E 3 3 3
تے
ت ـت ـتـ تـ /t/ t t U+062A 4 4 4
ٹے
ṭē
ٹ ـٹ ـٹـ ٹـ /ʈ/ t U+0679 5 5 5
ثے
s̱ē
ث ـث ـثـ ثـ /s/ s U+062B 6 6 6
جيم
jīm
ج ـج ـجـ جـ /d͡ʒ/ j j U+062C 7 7 7
چے
چ ـچ ـچـ چـ /t͡ʃ/ c ch U+0686 8 8 8
بڑی حے
baṛī ḥē
ح ـح ـحـ حـ /ɦ/ h U+062D 9 9 9
ہٹی حے
hutti ḥē
خے
k͟hē
خ ـخ ـخـ خـ /x/ k͟h kh U+062E 10 10 10
دال
dāl
د ـد /d/ d d U+062F 11 11 11
ڈال
ḍāl
ڈ ـڈ /ɖ/ d U+0688 12 12 12
ذال
ẕāl
ذ ـذ /z/ z U+0630 13 13 13
رے
ر ـر /r/ r r U+0631 14 14 14
ڑے
ṛē
ڑ ـڑ /ɽ/
[D]
r U+0691 15 15 15
زے
ز ـز /z/ z z U+0632 16 16 16
ژے
zhē
ژ ـژ /ʒ/
[E]
zh zh U+0698 17 17 17
سین
sīn
س ـس ـسـ سـ /s/ s s U+0633 18 18 18
شین
shīn
ش ـش ـشـ شـ /ʃ/ sh sh U+0634 19 19 19
صاد
ṣwād
ص ـص ـصـ صـ /s/ s U+0635 20 20 20
ضاد
ẓwād
ض ـض ـضـ ضـ /z/ z U+0636 21 21 21
طوے
t̤oʼē
ط ـط ـطـ طـ /t/ t U+0637 22 22 22
ظوے
z̤oʼē
ظ ـظ ـظـ ظـ /z/ z U+0638 23 23 23
عین
ʻain
ع ـع ـعـ عـ /ɑː/, /oː/, /eː/,
/ʔ/, /ʕ/, silent
ʻ ʻ
[citation needed]
U+0639 24 24 24
غین
g͟hain
غ ـغ ـغـ غـ /ɣ/ g͟h gh U+063A 25 25 25
فے
ف ـف ـفـ فـ /f/ f f U+0641 26 26 26
قاف
qāf
ق ـق ـقـ قـ /q/ q q U+0642 27 27 27
کاف
kāf
ک ـک ـکـ کـ /k/ k k U+06A9 28 28 28
گاف
gāf
گ ـگ ـگـ گـ /ɡ/ g g U+06AF 29 29 29
لام
lām
ل ـل ـلـ لـ /l/ l l U+0644 30 30 30
میم
mīm
م ـم ـمـ مـ /m/ m m U+0645 31 31 31
نون
nūn
ن ـن ـنـ نـ /n/, /ɲ/,
/ɳ/, /ŋ/
n n U+0646 32 32 32
نون غنّہ
nūn g͟hunnā
ں
٘
ـں ـںـ ںـ / ◌̃ /
[D]
n U+06BA
U+0658
[F]
[G] 32a 33
واؤ
wāʼo
و ـو /ʋ /, /uː/, /ʊ /,
/o ː /, /ɔː /
v,
ū, u, o, au
w,
ū, u, o, au
U+0648 33 33 34
ہے
ہ ـہ ـہـ ہـ /ɦ/, /ɑː/, /eː/ h, ā, e h, ā, e U+06C1
[H]
34 34 35
چھوٹی ہے
choṭī hē
34a
دو چشمی ہے
do-cashmī hē
ھ ـھ ـھـ ھـ /ʰ/ or /ʱ/
[D]
h h U+06BE 35 34b 36
یے
ی ـی ـیـ یـ /j/, /iː/, /ɑː/ y, ī, á y, ī, á U+06CC 36 35 38
بڑی یے
baṛī yē
ے ـے /ɛː/, /eː/
[D]
ai, e ai, e U+06D2 37 35b 39
ہمزہ
hamzah
ئ ـئ ـئـ ئـ /ʔ/ or silent
[I]
ʼ, –, yi ʼ, –, yi U+0626 35a 37
[J]
ء U+0621 0
Footnotes:
  1. ^ dictionary order[11]
  2. ^ [citation needed]
  3. ^ At the beginning of a word it can represent another vowel, holding a vowel diacritic that would normally be held by the consonant preceding the vowel, for examble اُردو "Urdu". But the diacritic indicating which vowel is often omitted اردو like other short vowel diacritcs.
  4. ^ a b c d No Urdu word begins with ں, ھ, ڑ, or ے.[citation needed]
  5. ^ Used mainly for Persian loanwords.
  6. ^ The version shown on the left is U+06BA, which is used only at the end of words. When it is used in the middle of a word it is a diacritic U+0658, which is usually omitted (see below for further information on diacritic omission in Urdu).
  7. ^ Not present in dictionary order because it is not used at the beginning of words.
  8. ^ Sometimes choṭī hē is used to refer to hey but choṭī hē can also refer to the Arabic / Persian variant, a stylistic variation representing an equivalent letter, but Persian and Arabic usually use U+0647 whereas Urdu uses U+06C1 for gōl hey.[12] See also: Urdu in Unicode.
  9. ^ Hamzah: In Urdu, hamzah is silent in all its forms except for when it is used as hamzah-e-izafat. The main use of hamzah in Urdu is to indicate a vowel cluster.
  10. ^ [citation needed]

Additional characters and variations edit

Arabic Tāʼ marbūṭah edit

Tāʼ marbūṭah is also sometimes considered the 40th letter of the Urdu alphabet, though it is rarely used except for in certain loan words from Arabic. Tāʼ marbūṭah is regarded as a form of tā, the Arabic version of Urdu tē, but it is not pronounced as such, and when replaced with an Urdu letter in naturalised loan words it is usually replaced with Gol hē.

Table edit

Table of additional characters and variations
Group Letter [A] Name (see: Glossary of key words) Unicode [13][14]
Nastaliq
[B]
Naskh with
diacritics
Roman Urdu or English [4][12]
Alif آ آ الف مدہ الِف مَدّه
[12]
alif maddah
[12][C]
U+0622
alef with madda above [14]
Hamza [D] ء ء ہمزہ ہَمْزه
[12]
hamzah U+0621
hamza [14]
___ ___ hamza on the line
ٔ ــٔـ ___ ___ hamza diacritic
[C][E]
U+0654
Hamza Above
ئ ئ ہمزہ ہَمْزه
[12]
hamzah U+0626
yeh with hamza above [14]
___ ___ yē hamza / alif hamza
ۓ ۓ ___ ___ baṛī yē hamza U+06D3
yeh barree with hamza above [13]
ؤ ؤ واوِ مَہْمُوز واوِ مَہْمُوز
[12]
vāv-e mahmūz
[12]
U+0624
waw with hamza above [14]
ۂ  ‍ۂ ۂ ـۂ ___ ___ U+06C2
heh goal with hamza above [13]
or U+06C1 + U+0654
Arabic [F] ۃ  ‍ۃ ۃ ـۃ Arabic:
تاء مربوطة
Arabic:
تَاء مَرْبُوطَة
tāʼ marbūṭah
"bound ta"
U+06C3
teh marbuta goal [13]
ة  ‍ة ة ـة U+0629
teh marbuta [14]
ت ت Arabic:
تاء مفتوحة
Arabic:
تَاء مَفْتُوحَة
tāʼ maftūḥah
"open ta"
U+062A
Teh
Footnotes:

  1. ^ Left: Urdu Nastaliq. Right: Arabic Naskh or modern style.
  2. ^ The Nastaliq text will display in a different style if there is not an appropriate font installed on the machine.
  3. ^ a b Most vowel diacritics are omitted in most Urdu writing, but Urdu writing usually does distinguish alif mad, and include hamza over bari ye, gol he, and wow. For example, alif mad and bare alif in آزادی - "āzādī", ɑ:zɑ:d̪i, freedom
    [15] - are distinguished in most contexts.
  4. ^ See: Hamzah in Nastaliq.
  5. ^ See: Hamzah in Nastaliq.
  6. ^ see: Arabic Tāʼ marbūṭah above.

Hamza in Nastaliq edit

Hamza can be difficult to recognise in Urdu handwriting and fonts designed to replicate it, closely resembling two dots above as featured in ت Té and ق Qaf, whereas in Arabic and Geometric fonts it is more distinct and closely resembles the western form of the numeral 2 (two).

Digraphs edit

The digraphs of aspirated consonants are as follows.
No. Digraph[9] Transcription[9] IPA Example
1 بھ bh [bʱ] بھاری
2 پھ ph [pʰ] پھول
3 تھ th [tʰ] تھم
4 ٹھ ṭh [ʈʰ] ٹھنڈا
5 جھ jh [d͡ʒʱ] جھاڑی
6 چھ ch [t͡ʃʰ] چھتری
7 دھ dh [dʱ] دھوبی
8 ڈھ ḍh [ɖʱ] ڈھول
9 رھ rh [rʱ] تیرھواں
10 ڑھ ṛh [ɽʱ] اڑھائی
11 کھ kh [kʰ] کھانسی
12 گھ gh [ɡʱ] گھوڑا
13 لھ lh [lʱ] دولھا (alternative of دُلہا)
14 مھ mh [mʱ] تمھیں (alternative of تمہیں)
15 نھ nh [nʱ] ننھا (though arguably just a consonant cluster)
16 وھ wh [ʋʱ] وھاب (though arguably just a consonant cluster)
17 یھ yh [jʱ] [example needed]

Differences from Persian alphabet edit

Urdu has more letters added to the Perso-Arabic base to represent sounds not present in Persian, which already has additional letters added to the Arabic base itself to represent sounds not present in Arabic. The letters added are shown in the table below:

Letter IPA
ٹ /ʈ/
ڈ /ɖ/
ڑ /ɽ/
ں /◌̃/
ے /ɛ:/ or /e:/.

A separate do-chashmi-he letter, ھ, exists to denote a /ʰ/ or a /ʱ/. This letter is mainly used as part of the multitude of digraphs, detailed in above sub-section. In addition to these letters, Shahmukhi adds 2 consonants on top of Urdu alphabet to write Punjabi, and Saraiki further adds 4 more consonants.

Retroflex letters edit

 

Old Hindustani used four dots ٿ ڐ ڙ over three Arabic letters ت د ر to represent retroflex consonants.[16] In handwriting those dots were often written as a small vertical line attached to a small triangle. Subsequently, this shape became identical to a small letter ط t̤oʼē.[17] It is commonly and erroneously assumed that ṭāʾ itself was used to indicate retroflex consonants because of it being an emphatic alveolar consonant that Arabic scribes thought approximated the Hindustani retroflexes.[citation needed] In modern Urdu, called to'e is always pronounced as a dental, not a retroflex. [citation needed]

Vowels edit

The Urdu language has ten vowels and ten nasalized vowels. Each vowel has four forms depending on its position: initial, middle, final and isolated. Like in its parent Arabic alphabet, Urdu vowels are represented using a combination of digraphs and diacritics. Alif, Waw, Ye, He and their variants are used to represent vowels.

Vowel chart edit

Urdu does not have standalone vowel letters. Short vowels (a, i, u) are represented by optional diacritics (zabar, zer, pesh) upon the preceding consonant or a placeholder consonant (alif, ain, or hamzah) if the syllable begins with the vowel, and long vowels by consonants alif, ain, ye, and wa'o as matres lectionis, with disambiguating diacritics, some of which are optional (zabar, zer, pesh), whereas some are not (madd, hamzah). Urdu does not have short vowels at the end of words. This is a table of Urdu vowels:

Romanization Pronunciation Final Middle Initial
a /ə/ N/A ـَ اَ
ā /aː/ ـَا، ـَی، ـَہ ـَا آ
i /ɪ/ N/A ـِ اِ
ī /iː/ ـِى ـِیـ اِی
e /eː/ ـے‬ ـیـ اے
ai /ɛː/ ـَے‬ ـَیـ اَے
u /ʊ/ N/A ـُ اُ
ū /uː/ ـُو اُو
o /oː/ ـو او
au /ɔː/ ـَو اَو

Alif edit

Alif is the first letter of the Urdu alphabet, and it is used exclusively as a vowel. At the beginning of a word, alif can be used to represent any of the short vowels: اب ab, اسم ism, اردو Urdū. For long ā at the beginning of words alif-mad is used: آپ āp, but a plain alif in the middle and at the end: بھاگنا bhāgnā.

Wāʾo edit

Wāʾo is used to render the vowels "ū", "o", "u" and "au" ([uː], [oː], [ʊ] and [ɔː] respectively), and it is also used to render the labiodental approximant, [ʋ]. Only when preceded by the consonant k͟hē (خ), can wāʾo render the "u" ([ʊ]) sound (such as in خود, "k͟hud" - myself), or not pronounced at all (such as in خواب, "k͟haab" - dream). This is known as the silent wāʾo, and is only present in words loaned from Persian.[18]

Ye edit

Ye is divided into two variants: choṭī ye ("little ye") and baṛī ye ("big ye").

Choṭī ye (ی) is written in all forms exactly as in Persian. It is used for the long vowel "ī" and the consonant "y".

Baṛī ye (ے) is used to render the vowels "e" and "ai" (/eː/ and /ɛː/ respectively). Baṛī ye is distinguishable in writing from choṭī ye only when it comes at the end of a word/ligature. Additionally, Baṛī ye is never used to begin a word/ligature, unlike choṭī ye.

Letter's name Final Form Middle Form Initial Form Isolated Form
چھوٹی يے
Choṭī ye
ـی ـیـ یـ ی
بڑی يے
Baṛī ye
ـے ے

The 2 he's edit

He is divided into two variants: gol he ("round he") and do-cašmi he ("two-eyed he").

Gol he (ہ) is written round and zigzagged, and can impart the "h" (/ɦ/) sound anywhere in a word. Additionally, at the end of a word, it can be used to render the long "a" or the "e" vowels (/ɑː/ or /eː/), which also alters its form slightly (on modern digital writing systems, this final form is achieved by writing two he's consecutively).

Do-cašmi he (ھ) is written as in Arabic Naskh style (as a loop), in order to create the aspirate consonants and write Arabic words.

Letter's name Final Form Middle Form Initial Form Isolated Form
گول ہے
Gol he
ـہ ـہـ ہـ ہ
دو چشمی ہے
Do-cašmi he
ـھ ـھـ ھـ ھ

Ayn edit

Ayn in its initial and final position is silent in pronunciation and is replaced by the sound of its preceding or succeeding vowel.

Nun Ghunnah edit

Vowel nasalization is represented by nun ghunna written after their non-nasalized versions, for example: ہَے when nasalized would become ہَیں. In middle form nun ghunna is written just like nun and is differentiated by a diacritic called maghnoona or ulta jazm which is a superscript V symbol above the ن٘.

Examples:

Form Urdu Transcription
Orthography ں
End form میں maiṉ
Middle form کن٘ول kaṉwal

Diacritics edit

Urdu uses the same subset of diacritics used in Arabic based on Persian conventions. Urdu also uses Persian names of the diacritics instead of Arabic names. Commonly used diacritics are zabar (Arabic fatḥah), zer (Arabic kasrah), pesh (Arabic dammah) which are used to clarify the pronunciation of vowels, as shown above. Jazam (ـْـ, Arabic sukun) is used to indicate a consonant cluster and tashdid (ـّـ, Arabic shaddah) is used to indicate a gemination, although it is never used for verbs, which require double consonants to be spelled out separately. Other diacritics include khari zabar (Arabic dagger alif), do zabar (Arabic fathatan) which are found in some common Arabic loan words. Other Arabic diacritics are also sometimes used though very rarely in loan words from Arabic. Zer-e-izafat and hamzah-e-izafat are described in the next section.

Other than common diacritics, Urdu also has special diacritics, which are often found only in dictionaries for the clarification of irregular pronunciation. These diacritics include kasrah-e-majhool, fathah-e-majhool, dammah-e-majhool, maghnoona, ulta jazam, alif-e-wavi and some other very rare diacritics. Among these, only maghnoona is used commonly in dictionaries and has a Unicode representation at U+0658. Other diacritics are only rarely written in printed form, mainly in some advanced dictionaries.[19]

Iẓāfat edit

Iẓāfat is a syntactical construction of two nouns, where the first component is a determined noun, and the second is a determiner. This construction was borrowed from Persian. A short vowel "i" is used to connect these two words, and when pronouncing the newly formed word the short vowel is connected to the first word. If the first word ends in a consonant or an ʿain (ع), it may be written as zer (  ِ ) at the end of the first word, but usually is not written at all. If the first word ends in choṭī he (ہ) or ye (ی or ے) then hamzā (ء) is used above the last letter (ۂ or ئ or ۓ). If the first word ends in a long vowel (ا or و), then a different variation of baṛī ye (ے) with hamzā on top (ئے, obtained by adding ے to ئ) is added at the end of the first word.[20]

Forms Example Transliteration Meaning
ــِ شیرِ پنجاب sher-e-Panjāb the lion of Punjab
ۂ ملکۂ دنیا malikā-e-dunyā the queen of the world
ئ ولئ کامل walī-e-kāmil perfect saint
ۓ مۓ عشق mai-e-ishq the wine of love
ئے روئے زمین rū-'e-zamīn the surface of the Earth
صدائے بلند sadā-'e-buland a high voice

Computers and the Urdu alphabet edit

In the early days of computers, Urdu was not properly represented on any code page. One of the earliest code pages to represent Urdu was IBM Code Page 868 which dates back to 1990.[21] Other early code pages which represented Urdu alphabets were Windows-1256 and MacArabic encoding both of which date back to the mid-1990s. In Unicode, Urdu is represented inside the Arabic block. Another code page for Urdu, which is used in India, is Perso-Arabic Script Code for Information Interchange. In Pakistan, the 8-bit code page which is developed by National Language Authority is called Urdu Zabta Takhti (اردو ضابطہ تختی) (UZT)[22] which represents Urdu in its most complete form including some of its specialized diacritics, though UZT is not designed to coexist with the Latin alphabet.

Encoding Urdu in Unicode edit

Confusable glyphs in Urdu and Arabic script
Characters
in Urdu
Characters
in Arabic
ہ (U+06C1)
ھ (U+06BE)
ه (U+0647)
ی (U+06CC) ى (U+0649)
ي (U+064A)
ک (U+06A9) ك (U+0643)

Like other writing systems derived from the Arabic script, Urdu uses the 0600–06FF Unicode range.[23] Certain glyphs in this range appear visually similar (or identical when presented using particular fonts) even though the underlying encoding is different. This presents problems for information storage and retrieval. For example, the University of Chicago's electronic copy of John Shakespear's "A Dictionary, Hindustani, and English"[24] includes the word 'بهارت' (bhārat "India"). Searching for the string "بھارت" returns no results, whereas querying with the (identical-looking in many fonts) string "بهارت" returns the correct entry.[25] This is because the medial form of the Urdu letter do chashmi he (U+06BE)—used to form aspirate digraphs in Urdu—is visually identical in its medial form to the Arabic letter hāʾ (U+0647; phonetic value /h/). In Urdu, the /h/ phoneme is represented by the character U+06C1, called gol he (round he), or chhoti he (small he).

In 2003, the Center for Research in Urdu Language Processing (CRULP)[26]—a research organisation affiliated with Pakistan's National University of Computer and Emerging Sciences—produced a proposal for mapping from the 1-byte UZT encoding of Urdu characters to the Unicode standard.[27] This proposal suggests a preferred Unicode glyph for each character in the Urdu alphabet.

Software edit

The Daily Jang was the first Urdu newspaper to be typeset digitally in Nastaʻliq by computer. There are efforts underway to develop more sophisticated and user-friendly Urdu support on computers and on the Internet. Nowadays, nearly all Urdu newspapers, magazines, journals and periodicals are composed on computers via various Urdu software programmes, the most widespread of which is InPage Desktop Publishing package. Microsoft has included Urdu language support in all new versions of Windows and both Windows Vista and Microsoft Office 2007 are available in Urdu through Language Interface Pack[28] support. Most Linux Desktop distributions allow the easy installation of Urdu support and translations as well.[29] Apple implemented the Urdu language keyboard across Mobile devices in its iOS 8 update in September 2014.[30]

Romanization standards and systems edit

There are several romanization standards for writing Urdu with the Latin alphabet, though they are not very popular because most fall short of representing the Urdu language properly. Instead of standard romanization schemes, people on Internet, mobile phones and media often use a non-standard form of romanization which tries to mimic English orthography. The problem with this kind of romanization is that it can only be read by native speakers, and even for them with great difficulty. Among standardized romanization schemes, the most accurate is ALA-LC romanization, which is also supported by National Language Authority. Other romanization schemes are often rejected because either they are unable to represent sounds in Urdu properly, or they often do not take regard of Urdu orthography, and favor pronunciation over orthography.[31]

The National Language Authority of Pakistan has developed a number of systems with specific notations to signify non-English sounds, but these can only be properly read by someone already familiar with the loan letters.[citation needed]

Roman Urdu also holds significance among the Christians of Pakistan and North India. Urdu was the dominant native language among Christians of Karachi and Lahore in present-day Pakistan and Madhya Pradesh, Uttar Pradesh Rajasthan in India, during the early part of the 19th and 20th century, and is still used by Christians in these places. Pakistani and Indian Christians often used the Roman script for writing Urdu. Thus Roman Urdu was a common way of writing among Pakistani and Indian Christians in these areas up to the 1960s. The Bible Society of India publishes Roman Urdū Bibles that enjoyed sale late into the 1960s (though they are still published today). Church songbooks are also common in Roman Urdu. However, the usage of Roman Urdu is declining with the wider use of Hindi and English in these states.

Glossary of key words from letter names edit

Translations and other uses of key words from Urdu letter names
Letter name(s) Urdu word Examples of other uses
Isolated
form
Urdu
name
Roman Urdu Urdu IPA Roman Urdu
name
English Translation Urdu Roman Urdu or IPA Translation
ح بَڑی حے baṛī ħē بَڑی bəɽi[15] baṛī /
bari
big / elder[15] بڑی آنت Baṛi ant large intestine
ے بَڑی يـے baṛī yē آنت Ant intestine
ی چھوٹی یے čhōṭī yē چھوٹی tʃʰoːʈi[15] choti small / minor / junior[15]
ہ چھوٹی ہے čhōṭī hē چھوٹی آنت small intestine
گول ہـے gōl hē گول goːl[15] gōl round / spherical / vague / silly / obese[32] گول گپے gol gappay panipuri
ھ دوچَشْمی ہے dō-čašmī hē دوچَشْمی do-cashmī two-eyed
[citation needed]
دو چشمی دوربین do-cashmi

dorabīn

binoculars
دوربین dorabīn telescope
دو do 2 / two دو ایوانیت do ayvanīt bicameralism
چشم /tʃəʃm/[15] chashm the eye / hope / expectation[32] چشم cashm eye
ں نُونِ غُنّہ nūn-e ğunnah غُنّہ ɣʊnnɑ[15] ğunnah / g͟hunnah nasal sound or twang[15] [example needed]
آ الِف مَدّه alif maddah مَدّه maddah Arabic: [example needed]
ؤ واوِ مَہْمُوز vāv-e mahmūz مَہْمُوز mæhmuːz[15] mahmūz defective / improper[15] [example needed]
ء ا آ ب پ ت ٹ ث ب ج چ خ ح د ڈ ذ ر ڑ ز ژ س ش ص ض ط ظ ع غ ف ق ک گ ل م ن ں و ہ ھ ی ے حروف تہجی
[33]
harūf tahajī (alphabet) تہجی tahajī sequence
[citation needed]
[example needed]
حُرُوف /hʊruːf/[15] harūf letters (plural)[15]
(often referred to as "alphabets" in informal Pakistani English)
[example needed]
حَرْف /hərf/[15] harf "letter of the alphabet" / handwriting / statement / blame / stigma[15] [example needed]

See also edit

References edit

  1. ^ "Constitution of the Republic of South Africa, 1996 - Chapter 1: Founding Provisions". www.gov.za. Retrieved 6 December 2014.
  2. ^ "Balti alphabet and pronunciation". omniglot.com. Retrieved 31 January 2023.
  3. ^ Bashir, Elena; Hussain, Sarmad; Anderson, Deborah (5 May 2006). "N3117: Proposal to add characters needed for Khowar, Torwali, and Burushaski" (PDF). ISO/IEC JTC1/SC2/WG2.
  4. ^ a b Project Fluency (7 October 2016). Urdu: The Complete Urdu Learning Course for Beginners: Start Speaking Basic Urdu Immediately (Kindle ed.). Createspace Independent Publishing Platform. p. Kindle Locations 66–67. ISBN 978-1539047803.
  5. ^ a b "Urdu". Omniglot.
  6. ^ "*positional chart*". Urdu: some thoughts about the script and grammar, and other general notes for students. Retrieved 28 February 2020 – via www.columbia.edu.
  7. ^ "Controversy over number of letters in Urdu alphabet". DAWN.COM. 15 June 2009.
  8. ^ Delacy 2003, pp. XV–XVI.
  9. ^ a b c "Urdu romanization" (PDF). The Library of Congress.
  10. ^ Geographical Names Romanization in Pakistan. UNGEGN, 18th Session. Geneva, 12–23 August 1996. Working Papers No. 85 and No. 85 Add. 1.
  11. ^ Bhatia, Tej K.; Khoul, Ashok; Koul, Ashok (2015). Colloquial Urdu: The Complete Course for Beginners. Routledge. pp. 41–42. ISBN 978-1-317-30471-5. Retrieved 30 June 2020.
  12. ^ a b c d e f g h i . www.user.uni-hannover.de. Archived from the original on 11 September 2019. Retrieved 29 February 2020.
  13. ^ a b c d "Extended Arabic Letter". unicode.org. Retrieved 6 April 2020.
  14. ^ a b c d e f "Based on ISO 8859-6". unicode.org. Retrieved 6 April 2020.
  15. ^ a b c d e f g h i j k l m n o . Urdu: Oxford Living Dictionaries. Archived from the original on 18 October 2016. Retrieved 15 March 2020.
  16. ^ Ballantyne, James Robert (1842). A Grammar of the Hindustani Language, with Brief Notices of the Braj and Dakhani Dialects. Madden & Company. p. 11.
  17. ^ Berggren, Olaf (2002). Scripts. Bibliotheca Alexandrina. p. 108.
  18. ^ Grierson, George Abraham. "Urdu Language Management". Language Information Services (LIS)-India. Retrieved 23 July 2022.
  19. ^ "Proposal of Inclusion of Certain Characters in Unicode" (PDF).
  20. ^ Delacy 2003, pp. 99–100.
  21. ^ "IBM 868 code page"
  22. ^ "Urdu Zabta Takhti" (PDF).
  23. ^ "Arabic" (PDF). unicode.org. Retrieved 7 April 2019.
  24. ^ "A dictionary, Hindustani and English". Dsal.uchicago.edu. 29 September 2009. Retrieved 18 December 2011.
  25. ^ "A dictionary, Hindustani and English". Dsal.uchicago.edu. Archived from the original on 15 December 2012. Retrieved 18 December 2011.
  26. ^ "Center for Research in Urdu Language Processing". Crulp.org. Retrieved 18 December 2011.
  27. ^ at the Wayback Machine
  28. ^ "مائِیکروسافٹ ڈاؤُن لوڈ مَرکَزWindows". Microsoft.com. Retrieved 18 December 2011.
  29. ^ "Ubuntu in Urdu « Aasim's Web Corner". Aasims.wordpress.com. Retrieved 18 December 2011.
  30. ^ "E-Urdu: How one man's plea for Nastaleeq was heard by Apple". The Express Tribune. 16 October 2014. Retrieved 29 March 2015.
  31. ^ "اردو میں نقل حرفی ۔ ایک ابتدائی تعارف : نبلٰی پیرزادہ". nlpd.gov.pk.
  32. ^ a b "Urdu: Oxford Living Dictionaries (Urdu to English Translation)". Urdu: Oxford Living Dictionaries. Retrieved 15 March 2020.[dead link]
  33. ^ "خلا سے زمین پر انگریزی کےحروف تہجی". BBC News اردو (in Urdu). 5 January 2016. Retrieved 7 May 2020.

Sources edit

  • Delacy, Richard (2003). Beginner's Urdu Script. McGraw-Hill.
  • Delacy, Richard (2010). Read and write Urdu script. McGraw-Hill. ISBN 9780071747462.
  • "Urdu romanization" (PDF). The Library of Congress.
  • Ishida, Richard. "Urdu script notes".

External links edit

  • Urdu alphabet
  • Urdu alphabet with Devanagari equivalents 11 September 2019 at the Wayback Machine
  • Hugo's Urdu Alphabet Page 16 July 2020 at the Wayback Machine
  • , a resource for Urdu calligraphy and script
  • Urdu Script Introduction from Columbia University
  • National Council for Promotion of Urdu Language 6 March 2018 at the Wayback Machine

urdu, alphabet, urdu, اردو, حروف, تہجی, romanized, urdū, ḥurūf, tahajjī, right, left, alphabet, used, writing, urdu, modification, persian, alphabet, which, itself, derived, from, arabic, script, official, status, republics, pakistan, india, south, africa, dis. The Urdu alphabet Urdu اردو حروف تہجی romanized urdu ḥuruf i tahajji is the right to left alphabet used for writing Urdu It is a modification of the Persian alphabet which itself is derived from the Arabic script It has official status in the republics of Pakistan India and South Africa The Urdu alphabet has up to 39 4 or 40 5 distinct letters with no distinct letter cases and is typically written in the calligraphic Nastaʿliq script whereas Arabic is more commonly written in the Naskh style Urdu Alphabetا ر د و ح ر وف ت ہ ج ی Urdu ḥuruf i tahajjiThe word Urdu written in the Urdu alphabetScript typeAbjadOfficial script Pakistan India South Africa 1 LanguagesUrdu Balti 2 Shina Khowar Burushaski 3 Related scriptsParent systemsEgyptian hieroglyphsProto SinaiticPhoenicianAramaicNabataeanArabicPerso ArabicUrdu AlphabetUnicodeUnicode rangeU 0600 to U 06FF U 0750 to U 077F U FB50 to U FDFF U FE70 to U FEFF This article contains phonetic transcriptions in the International Phonetic Alphabet IPA For an introductory guide on IPA symbols see Help IPA For the distinction between and see IPA Brackets and transcription delimiters This article contains Urdu text Without proper rendering support you may see unjoined letters running left to right or other symbols instead of Urdu script Usually bare transliterations of Urdu into the Latin alphabet called Roman Urdu omit many phonemic elements that have no equivalent in English or other languages commonly written in the Latin script Contents 1 History 2 Nastaliq 3 Alphabet 3 1 Letter names and phonemes 3 2 Additional characters and variations 3 2 1 Arabic Taʼ marbuṭah 3 2 2 Table 3 2 3 Hamza in Nastaliq 3 3 Digraphs 3 4 Differences from Persian alphabet 3 5 Retroflex letters 4 Vowels 4 1 Vowel chart 4 2 Alif 4 3 Waʾo 4 4 Ye 4 5 The 2 he s 4 6 Ayn 4 7 Nun Ghunnah 5 Diacritics 6 Iẓafat 7 Computers and the Urdu alphabet 7 1 Encoding Urdu in Unicode 7 2 Software 8 Romanization standards and systems 9 Glossary of key words from letter names 10 See also 11 References 12 Sources 13 External linksHistory editThe standard Urdu script is a modified version of the Perso Arabic script and has its origins in the 13th century Iran It is also related to Shahmukhi used for the Punjabi language varieties in Punjab Pakistan It is closely related to the development of the Nastaʻliq style of Perso Arabic script Despite the invention of the Urdu typewriter in 1911 Urdu newspapers continued to publish prints of handwritten scripts by calligraphers known as katibs or khush navees until the late 1980s The Pakistani national newspaper Daily Jang was the first Urdu newspaper to use Nastaʿliq computer based composition There are efforts under way to develop more sophisticated and user friendly Urdu support on computers and the internet Nowadays nearly all Urdu newspapers magazines journals and periodicals are composed on computers with Urdu software programs Other than the Indian subcontinent the Urdu script is also used by Pakistan s large diaspora including in the United Kingdom the United Arab Emirates the United States Canada Saudi Arabia and other places 5 Nastaliq edit nbsp Example showing Nastaliq s Persian proportion rules citation needed Main article Nastaliq Urdu is written in the Nastaliq style Persian نستعلیق Nastaʿliq The Nastaliq calligraphic writing style began as a Persian mixture of the Naskh and Ta liq scripts After the Muslim conquest of the Indian subcontinent Nastaʻliq became the preferred writing style for Urdu It is the dominant style in Pakistan and many Urdu writers elsewhere in the world use it Nastaʿliq is more cursive and flowing than its Naskh counterpart In the Arabic alphabet and many others derived from it letters are regarded as having two or three general forms each based on their position in the word though Arabic calligraphy can add a great deal of complexity But the Nastaliq style in which Urdu is written uses more than three general forms for many letters even in simple non decorative documents 6 Alphabet editThe Urdu script is an abjad script derived from the modern Persian script which is itself a derivative of the Arabic script As an abjad the Urdu script only shows consonants and long vowels short vowels can only be inferred by the consonants relation to each other While this type of script is convenient in Semitic languages like Arabic and Hebrew whose consonant roots are the key of the sentence Urdu is an Indo European language which requires more precision in vowel sound pronunciation hence necessitating more memorisation The number of letters in the Urdu alphabet is somewhat ambiguous and debated 7 Letter names and phonemes edit Name 8 Forms IPA Romanization Unicode Order UrduRoman Urdu Isolated Final Medial Initial ALA LC 9 Hunterian 10 A 12 B الف alif ا ـا ɑː ʔ silent C a a U 0627 1 1 1 بے be ب ـب ـبـ بـ b b b U 0628 2 2 2 پے pe پ ـپ ـپـ پـ p p p U 067E 3 3 3 تے te ت ـت ـتـ تـ t t t U 062A 4 4 4 ٹے ṭe ٹ ـٹ ـٹـ ٹـ ʈ ṭ t U 0679 5 5 5 ثے s e ث ـث ـثـ ثـ s s s U 062B 6 6 6 جيم jim ج ـج ـجـ جـ d ʒ j j U 062C 7 7 7 چے ce چ ـچ ـچـ چـ t ʃ c ch U 0686 8 8 8 بڑی حے baṛi ḥe ح ـح ـحـ حـ ɦ ḥ h U 062D 9 9 9 ہٹی حے hutti ḥe خے k he خ ـخ ـخـ خـ x k h kh U 062E 10 10 10 دال dal د ـد d d d U 062F 11 11 11 ڈال ḍal ڈ ـڈ ɖ ḍ d U 0688 12 12 12 ذال ẕal ذ ـذ z ẕ z U 0630 13 13 13 رے re ر ـر r r r U 0631 14 14 14 ڑے ṛe ڑ ـڑ ɽ D ṛ r U 0691 15 15 15 زے ze ز ـز z z z U 0632 16 16 16 ژے zhe ژ ـژ ʒ E zh zh U 0698 17 17 17 سین sin س ـس ـسـ سـ s s s U 0633 18 18 18 شین shin ش ـش ـشـ شـ ʃ sh sh U 0634 19 19 19 صاد ṣwad ص ـص ـصـ صـ s ṣ s U 0635 20 20 20 ضاد ẓwad ض ـض ـضـ ضـ z ẓ z U 0636 21 21 21 طوے t oʼe ط ـط ـطـ طـ t t t U 0637 22 22 22 ظوے z oʼe ظ ـظ ـظـ ظـ z z z U 0638 23 23 23 عین ʻain ع ـع ـعـ عـ ɑː oː eː ʔ ʕ silent ʻ ʻ citation needed U 0639 24 24 24 غین g hain غ ـغ ـغـ غـ ɣ g h gh U 063A 25 25 25 فے fe ف ـف ـفـ فـ f f f U 0641 26 26 26 قاف qaf ق ـق ـقـ قـ q q q U 0642 27 27 27 کاف kaf ک ـک ـکـ کـ k k k U 06A9 28 28 28 گاف gaf گ ـگ ـگـ گـ ɡ g g U 06AF 29 29 29 لام lam ل ـل ـلـ لـ l l l U 0644 30 30 30 میم mim م ـم ـمـ مـ m m m U 0645 31 31 31 نون nun ن ـن ـنـ نـ n ɲ ɳ ŋ n n U 0646 32 32 32 نون غن ہ nun g hunna ں ـں ـںـ ںـ D ṉ n U 06BA U 0658 F G 32a 33 واؤ waʼo و ـو ʋ uː ʊ o ː ɔː v u u o au w u u o au U 0648 33 33 34 ہے he ہ ـہ ـہـ ہـ ɦ ɑː eː h a e h a e U 06C1 H 34 34 35 چھوٹی ہے choṭi he 34a دو چشمی ہے do cashmi he ھ ـھ ـھـ ھـ ʰ or ʱ D h h U 06BE 35 34b 36 یے ye ی ـی ـیـ یـ j iː ɑː y i a y i a U 06CC 36 35 38 بڑی یے baṛi ye ے ـے ɛː eː D ai e ai e U 06D2 37 35b 39 ہمزہ hamzah ئ ـئ ـئـ ئـ ʔ or silent I ʼ yi ʼ yi U 0626 35a 37 J ء U 0621 0 Footnotes dictionary order 11 citation needed At the beginning of a word it can represent another vowel holding a vowel diacritic that would normally be held by the consonant preceding the vowel for examble ا ردو Urdu But the diacritic indicating which vowel is often omitted اردو like other short vowel diacritcs a b c d No Urdu word begins with ں ھ ڑ or ے citation needed Used mainly for Persian loanwords The version shown on the left is U 06BA which is used only at the end of words When it is used in the middle of a word it is a diacritic U 0658 which is usually omitted see below for further information on diacritic omission in Urdu Not present in dictionary order because it is not used at the beginning of words Sometimes choṭi he is used to refer to hey but choṭi he can also refer to the Arabic Persian variant a stylistic variation representing an equivalent letter but Persian and Arabic usually use U 0647 whereas Urdu uses U 06C1 for gōl hey 12 See also Urdu in Unicode Hamzah In Urdu hamzah is silent in all its forms except for when it is used as hamzah e izafat The main use of hamzah in Urdu is to indicate a vowel cluster citation needed Additional characters and variations edit Arabic Taʼ marbuṭah edit Taʼ marbuṭah is also sometimes considered the 40th letter of the Urdu alphabet though it is rarely used except for in certain loan words from Arabic Taʼ marbuṭah is regarded as a form of ta the Arabic version of Urdu te but it is not pronounced as such and when replaced with an Urdu letter in naturalised loan words it is usually replaced with Gol he Table edit Table of additional characters and variations Group Letter A Name see Glossary of key words Unicode 13 14 Nastaliq B Naskh with diacritics Roman Urdu or English 4 12 Alif آ آ الف مدہ ال ف م د ه 12 alif maddah 12 C U 0622 alef with madda above 14 Hamza D ء ء ہمزہ ہ م زه 12 hamzah U 0621 hamza 14 hamza on the line ــ ـ hamza diacritic C E U 0654 Hamza Above ئ ئ ہمزہ ہ م زه 12 hamzah U 0626 yeh with hamza above 14 ye hamza alif hamza ۓ ۓ baṛi ye hamza U 06D3 yeh barree with hamza above 13 ؤ ؤ واو م ہ م وز واو م ہ م وز 12 vav e mahmuz 12 U 0624 waw with hamza above 14 ۂ ۂ ۂ ـۂ U 06C2 heh goal with hamza above 13 or U 06C1 U 0654 Arabic F ۃ ۃ ۃ ـۃ Arabic تاء مربوطة Arabic ت اء م ر ب وط ة taʼ marbuṭah bound ta U 06C3 teh marbuta goal 13 ة ة ة ـة U 0629 teh marbuta 14 ت ت Arabic تاء مفتوحة Arabic ت اء م ف ت وح ة taʼ maftuḥah open ta U 062A Teh Footnotes dd Left Urdu Nastaliq Right Arabic Naskh or modern style The Nastaliq text will display in a different style if there is not an appropriate font installed on the machine a b Most vowel diacritics are omitted in most Urdu writing but Urdu writing usually does distinguish alif mad and include hamza over bari ye gol he and wow For example alif mad and bare alif in آزادی azadi ɑ zɑ d i freedom 15 are distinguished in most contexts See Hamzah in Nastaliq See Hamzah in Nastaliq see Arabic Taʼ marbuṭah above Hamza in Nastaliq edit Hamza can be difficult to recognise in Urdu handwriting and fonts designed to replicate it closely resembling two dots above as featured in ت Te and ق Qaf whereas in Arabic and Geometric fonts it is more distinct and closely resembles the western form of the numeral 2 two Digraphs edit The digraphs of aspirated consonants are as follows No Digraph 9 Transcription 9 IPA Example 1 بھ bh bʱ بھاری 2 پھ ph pʰ پھول 3 تھ th tʰ تھم 4 ٹھ ṭh ʈʰ ٹھنڈا 5 جھ jh d ʒʱ جھاڑی 6 چھ ch t ʃʰ چھتری 7 دھ dh dʱ دھوبی 8 ڈھ ḍh ɖʱ ڈھول 9 رھ rh rʱ تیرھواں 10 ڑھ ṛh ɽʱ اڑھائی 11 کھ kh kʰ کھانسی 12 گھ gh ɡʱ گھوڑا 13 لھ lh lʱ دولھا alternative of د لہا 14 مھ mh mʱ تمھیں alternative of تمہیں 15 نھ nh nʱ ننھا though arguably just a consonant cluster 16 وھ wh ʋʱ وھاب though arguably just a consonant cluster 17 یھ yh jʱ example needed Differences from Persian alphabet edit Urdu has more letters added to the Perso Arabic base to represent sounds not present in Persian which already has additional letters added to the Arabic base itself to represent sounds not present in Arabic The letters added are shown in the table below Letter IPA ٹ ʈ ڈ ɖ ڑ ɽ ں ے ɛ or e A separate do chashmi he letter ھ exists to denote a ʰ or a ʱ This letter is mainly used as part of the multitude of digraphs detailed in above sub section In addition to these letters Shahmukhi adds 2 consonants on top of Urdu alphabet to write Punjabi and Saraiki further adds 4 more consonants Retroflex letters edit nbsp Old Hindustani used four dots ٿ ڐ ڙ over three Arabic letters ت د ر to represent retroflex consonants 16 In handwriting those dots were often written as a small vertical line attached to a small triangle Subsequently this shape became identical to a small letter ط t oʼe 17 It is commonly and erroneously assumed that ṭaʾ itself was used to indicate retroflex consonants because of it being an emphatic alveolar consonant that Arabic scribes thought approximated the Hindustani retroflexes citation needed In modern Urdu called to e is always pronounced as a dental not a retroflex citation needed Vowels editThe Urdu language has ten vowels and ten nasalized vowels Each vowel has four forms depending on its position initial middle final and isolated Like in its parent Arabic alphabet Urdu vowels are represented using a combination of digraphs and diacritics Alif Waw Ye He and their variants are used to represent vowels Vowel chart edit Urdu does not have standalone vowel letters Short vowels a i u are represented by optional diacritics zabar zer pesh upon the preceding consonant or a placeholder consonant alif ain or hamzah if the syllable begins with the vowel and long vowels by consonants alif ain ye and wa o as matres lectionis with disambiguating diacritics some of which are optional zabar zer pesh whereas some are not madd hamzah Urdu does not have short vowels at the end of words This is a table of Urdu vowels Romanization Pronunciation Final Middle Initial a e N A ـ ا a aː ـ ا ـ ی ـ ہ ـ ا آ i ɪ N A ـ ا i iː ـ ى ـ یـ ا ی e eː ـے ـیـ اے ai ɛː ـ ے ـ یـ ا ے u ʊ N A ـ ا u uː ـ و ا و o oː ـو او au ɔː ـ و ا و Alif edit Alif is the first letter of the Urdu alphabet and it is used exclusively as a vowel At the beginning of a word alif can be used to represent any of the short vowels اب ab اسم ism اردو Urdu For long a at the beginning of words alif mad is used آپ ap but a plain alif in the middle and at the end بھاگنا bhagna Waʾo edit Waʾo is used to render the vowels u o u and au uː oː ʊ and ɔː respectively and it is also used to render the labiodental approximant ʋ Only when preceded by the consonant k he خ can waʾo render the u ʊ sound such as in خود k hud myself or not pronounced at all such as in خواب k haab dream This is known as the silent waʾo and is only present in words loaned from Persian 18 Ye edit Ye is divided into two variants choṭi ye little ye and baṛi ye big ye Choṭi ye ی is written in all forms exactly as in Persian It is used for the long vowel i and the consonant y Baṛi ye ے is used to render the vowels e and ai eː and ɛː respectively Baṛi ye is distinguishable in writing from choṭi ye only when it comes at the end of a word ligature Additionally Baṛi ye is never used to begin a word ligature unlike choṭi ye Letter s name Final Form Middle Form Initial Form Isolated Form چھوٹی يے Choṭi ye ـی ـیـ یـ ی بڑی يے Baṛi ye ـے ے The 2 he s edit He is divided into two variants gol he round he and do casmi he two eyed he Gol he ہ is written round and zigzagged and can impart the h ɦ sound anywhere in a word Additionally at the end of a word it can be used to render the long a or the e vowels ɑː or eː which also alters its form slightly on modern digital writing systems this final form is achieved by writing two he s consecutively Do casmi he ھ is written as in Arabic Naskh style as a loop in order to create the aspirate consonants and write Arabic words Letter s name Final Form Middle Form Initial Form Isolated Form گول ہے Gol he ـہ ـہـ ہـ ہ دو چشمی ہے Do casmi he ـھ ـھـ ھـ ھ Ayn edit Ayn in its initial and final position is silent in pronunciation and is replaced by the sound of its preceding or succeeding vowel Nun Ghunnah edit Vowel nasalization is represented by nun ghunna written after their non nasalized versions for example ہ ے when nasalized would become ہ یں In middle form nun ghunna is written just like nun and is differentiated by a diacritic called maghnoona or ulta jazm which is a superscript V symbol above the ن Examples Form Urdu Transcription Orthography ں ṉ End form میں maiṉ Middle form کن ول kaṉwalDiacritics editUrdu uses the same subset of diacritics used in Arabic based on Persian conventions Urdu also uses Persian names of the diacritics instead of Arabic names Commonly used diacritics are zabar Arabic fatḥah zer Arabic kasrah pesh Arabic dammah which are used to clarify the pronunciation of vowels as shown above Jazam ـ ـ Arabic sukun is used to indicate a consonant cluster and tashdid ـ ـ Arabic shaddah is used to indicate a gemination although it is never used for verbs which require double consonants to be spelled out separately Other diacritics include khari zabar Arabic dagger alif do zabar Arabic fathatan which are found in some common Arabic loan words Other Arabic diacritics are also sometimes used though very rarely in loan words from Arabic Zer e izafat and hamzah e izafat are described in the next section Other than common diacritics Urdu also has special diacritics which are often found only in dictionaries for the clarification of irregular pronunciation These diacritics include kasrah e majhool fathah e majhool dammah e majhool maghnoona ulta jazam alif e wavi and some other very rare diacritics Among these only maghnoona is used commonly in dictionaries and has a Unicode representation at U 0658 Other diacritics are only rarely written in printed form mainly in some advanced dictionaries 19 Iẓafat editIẓafat is a syntactical construction of two nouns where the first component is a determined noun and the second is a determiner This construction was borrowed from Persian A short vowel i is used to connect these two words and when pronouncing the newly formed word the short vowel is connected to the first word If the first word ends in a consonant or an ʿain ع it may be written as zer at the end of the first word but usually is not written at all If the first word ends in choṭi he ہ or ye ی or ے then hamza ء is used above the last letter ۂ or ئ or ۓ If the first word ends in a long vowel ا or و then a different variation of baṛi ye ے with hamza on top ئے obtained by adding ے to ئ is added at the end of the first word 20 Forms Example Transliteration Meaning ــ شیر پنجاب sher e Panjab the lion of Punjab ۂ ملکۂ دنیا malika e dunya the queen of the world ئ ولئ کامل wali e kamil perfect saint ۓ مۓ عشق mai e ishq the wine of love ئے روئے زمین ru e zamin the surface of the Earth صدائے بلند sada e buland a high voiceComputers and the Urdu alphabet editIn the early days of computers Urdu was not properly represented on any code page One of the earliest code pages to represent Urdu was IBM Code Page 868 which dates back to 1990 21 Other early code pages which represented Urdu alphabets were Windows 1256 and MacArabic encoding both of which date back to the mid 1990s In Unicode Urdu is represented inside the Arabic block Another code page for Urdu which is used in India is Perso Arabic Script Code for Information Interchange In Pakistan the 8 bit code page which is developed by National Language Authority is called Urdu Zabta Takhti اردو ضابطہ تختی UZT 22 which represents Urdu in its most complete form including some of its specialized diacritics though UZT is not designed to coexist with the Latin alphabet Encoding Urdu in Unicode edit Confusable glyphs in Urdu and Arabic script Characters in Urdu Characters in Arabic ہ U 06C1 ھ U 06BE ه U 0647 ی U 06CC ى U 0649 ي U 064A ک U 06A9 ك U 0643 Like other writing systems derived from the Arabic script Urdu uses the 0600 06FF Unicode range 23 Certain glyphs in this range appear visually similar or identical when presented using particular fonts even though the underlying encoding is different This presents problems for information storage and retrieval For example the University of Chicago s electronic copy of John Shakespear s A Dictionary Hindustani and English 24 includes the word بهارت bharat India Searching for the string بھارت returns no results whereas querying with the identical looking in many fonts string بهارت returns the correct entry 25 This is because the medial form of the Urdu letter do chashmi he U 06BE used to form aspirate digraphs in Urdu is visually identical in its medial form to the Arabic letter haʾ U 0647 phonetic value h In Urdu the h phoneme is represented by the character U 06C1 called gol he round he or chhoti he small he In 2003 the Center for Research in Urdu Language Processing CRULP 26 a research organisation affiliated with Pakistan s National University of Computer and Emerging Sciences produced a proposal for mapping from the 1 byte UZT encoding of Urdu characters to the Unicode standard 27 This proposal suggests a preferred Unicode glyph for each character in the Urdu alphabet Software edit The Daily Jang was the first Urdu newspaper to be typeset digitally in Nastaʻliq by computer There are efforts underway to develop more sophisticated and user friendly Urdu support on computers and on the Internet Nowadays nearly all Urdu newspapers magazines journals and periodicals are composed on computers via various Urdu software programmes the most widespread of which is InPage Desktop Publishing package Microsoft has included Urdu language support in all new versions of Windows and both Windows Vista and Microsoft Office 2007 are available in Urdu through Language Interface Pack 28 support Most Linux Desktop distributions allow the easy installation of Urdu support and translations as well 29 Apple implemented the Urdu language keyboard across Mobile devices in its iOS 8 update in September 2014 30 Romanization standards and systems editMain article Roman Urdu There are several romanization standards for writing Urdu with the Latin alphabet though they are not very popular because most fall short of representing the Urdu language properly Instead of standard romanization schemes people on Internet mobile phones and media often use a non standard form of romanization which tries to mimic English orthography The problem with this kind of romanization is that it can only be read by native speakers and even for them with great difficulty Among standardized romanization schemes the most accurate is ALA LC romanization which is also supported by National Language Authority Other romanization schemes are often rejected because either they are unable to represent sounds in Urdu properly or they often do not take regard of Urdu orthography and favor pronunciation over orthography 31 The National Language Authority of Pakistan has developed a number of systems with specific notations to signify non English sounds but these can only be properly read by someone already familiar with the loan letters citation needed Roman Urdu also holds significance among the Christians of Pakistan and North India Urdu was the dominant native language among Christians of Karachi and Lahore in present day Pakistan and Madhya Pradesh Uttar Pradesh Rajasthan in India during the early part of the 19th and 20th century and is still used by Christians in these places Pakistani and Indian Christians often used the Roman script for writing Urdu Thus Roman Urdu was a common way of writing among Pakistani and Indian Christians in these areas up to the 1960s The Bible Society of India publishes Roman Urdu Bibles that enjoyed sale late into the 1960s though they are still published today Church songbooks are also common in Roman Urdu However the usage of Roman Urdu is declining with the wider use of Hindi and English in these states Glossary of key words from letter names editTranslations and other uses of key words from Urdu letter names Letter name s Urdu word Examples of other uses Isolated form Urdu name Roman Urdu Urdu IPA Roman Urdu name English Translation Urdu Roman Urdu or IPA Translation ح ب ڑی حے baṛi ħe ب ڑی beɽi 15 baṛi bari big elder 15 بڑی آنت Baṛi ant large intestine ے ب ڑی يـے baṛi ye آنت Ant intestine ی چھوٹی یے chōṭi ye چھوٹی tʃʰoːʈi 15 choti small minor junior 15 ہ چھوٹی ہے chōṭi he چھوٹی آنت small intestine گول ہـے gōl he گول goːl 15 gōl round spherical vague silly obese 32 گول گپے gol gappay panipuri ھ دوچ ش می ہے dō casmi he دوچ ش می do cashmi two eyed citation needed دو چشمی دوربین do cashmi dorabin binoculars دوربین dorabin telescope دو do 2 two دو ایوانیت do ayvanit bicameralism چشم tʃeʃm 15 chashm the eye hope expectation 32 چشم cashm eye ں ن ون غ ن ہ nun e gunnah غ ن ہ ɣʊnnɑ 15 gunnah g hunnah nasal sound or twang 15 example needed آ ال ف م د ه alif maddah م د ه maddah Arabic example needed ؤ واو م ہ م وز vav e mahmuz م ہ م وز maehmuːz 15 mahmuz defective improper 15 example needed ء ا آ ب پ ت ٹ ث ب ج چ خ ح د ڈ ذ ر ڑ ز ژ س ش ص ض ط ظ ع غ ف ق ک گ ل م ن ں و ہ ھ ی ے حروف تہجی 33 haruf tahaji alphabet تہجی tahaji sequence citation needed example needed ح ر وف hʊruːf 15 haruf letters plural 15 often referred to as alphabets in informal Pakistani English example needed ح ر ف herf 15 harf letter of the alphabet handwriting statement blame stigma 15 example needed See also editNastaʻliq script Romanization of Urdu Urdu Braille Urdu Informatics Urdu keyboard Urdu WikipediaReferences edit Constitution of the Republic of South Africa 1996 Chapter 1 Founding Provisions www gov za Retrieved 6 December 2014 Balti alphabet and pronunciation omniglot com Retrieved 31 January 2023 Bashir Elena Hussain Sarmad Anderson Deborah 5 May 2006 N3117 Proposal to add characters needed for Khowar Torwali and Burushaski PDF ISO IEC JTC1 SC2 WG2 a b Project Fluency 7 October 2016 Urdu The Complete Urdu Learning Course for Beginners Start Speaking Basic Urdu Immediately Kindle ed Createspace Independent Publishing Platform p Kindle Locations 66 67 ISBN 978 1539047803 a b Urdu Omniglot positional chart Urdu some thoughts about the script and grammar and other general notes for students Retrieved 28 February 2020 via www columbia edu Controversy over number of letters in Urdu alphabet DAWN COM 15 June 2009 Delacy 2003 pp XV XVI a b c Urdu romanization PDF The Library of Congress Geographical Names Romanization in Pakistan UNGEGN 18th Session Geneva 12 23 August 1996 Working Papers No 85 and No 85 Add 1 Bhatia Tej K Khoul Ashok Koul Ashok 2015 Colloquial Urdu The Complete Course for Beginners Routledge pp 41 42 ISBN 978 1 317 30471 5 Retrieved 30 June 2020 a b c d e f g h i Urdu Alphabet www user uni hannover de Archived from the original on 11 September 2019 Retrieved 29 February 2020 a b c d Extended Arabic Letter unicode org Retrieved 6 April 2020 a b c d e f Based on ISO 8859 6 unicode org Retrieved 6 April 2020 a b c d e f g h i j k l m n o Urdu Oxford Living Dictionaries Urdu Oxford Living Dictionaries Archived from the original on 18 October 2016 Retrieved 15 March 2020 Ballantyne James Robert 1842 A Grammar of the Hindustani Language with Brief Notices of the Braj and Dakhani Dialects Madden amp Company p 11 Berggren Olaf 2002 Scripts Bibliotheca Alexandrina p 108 Grierson George Abraham Urdu Language Management Language Information Services LIS India Retrieved 23 July 2022 Proposal of Inclusion of Certain Characters in Unicode PDF Delacy 2003 pp 99 100 IBM 868 code page Urdu Zabta Takhti PDF Arabic PDF unicode org Retrieved 7 April 2019 A dictionary Hindustani and English Dsal uchicago edu 29 September 2009 Retrieved 18 December 2011 A dictionary Hindustani and English Dsal uchicago edu Archived from the original on 15 December 2012 Retrieved 18 December 2011 Center for Research in Urdu Language Processing Crulp org Retrieved 18 December 2011 Archive index at the Wayback Machine مائ یکروسافٹ ڈاؤ ن لوڈ م رک زWindows Microsoft com Retrieved 18 December 2011 Ubuntu in Urdu Aasim s Web Corner Aasims wordpress com Retrieved 18 December 2011 E Urdu How one man s plea for Nastaleeq was heard by Apple The Express Tribune 16 October 2014 Retrieved 29 March 2015 اردو میں نقل حرفی ایک ابتدائی تعارف نبل ی پیرزادہ nlpd gov pk a b Urdu Oxford Living Dictionaries Urdu to English Translation Urdu Oxford Living Dictionaries Retrieved 15 March 2020 dead link خلا سے زمین پر انگریزی کےحروف تہجی BBC News اردو in Urdu 5 January 2016 Retrieved 7 May 2020 Sources editDelacy Richard 2003 Beginner s Urdu Script McGraw Hill Delacy Richard 2010 Read and write Urdu script McGraw Hill ISBN 9780071747462 Urdu romanization PDF The Library of Congress Ishida Richard Urdu script notes External links edit nbsp Wikibooks has a book on the topic of Urdu Urdu alphabet Urdu alphabet with Devanagari equivalents Archived 11 September 2019 at the Wayback Machine Hugo s Urdu Alphabet Page Archived 16 July 2020 at the Wayback Machine calligraphyislamic com a resource for Urdu calligraphy and script Urdu Script Introduction from Columbia University National Council for Promotion of Urdu Language Archived 6 March 2018 at the Wayback Machine Retrieved from https en wikipedia org w index php title Urdu alphabet amp oldid 1220305941, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.