Template:Unichar/testcases - Wikiwand
For faster navigation, this Iframe is preloading the Wikiwand page for Template:Unichar/testcases.

Template:Unichar/testcases

From Wikipedia, the free encyclopedia

Each case is paired:

  • first the live version
  • then the sandbox version

Special code points: control, surrogate, private, non-character (Cc Cs Co Cn) in BMP

Note: only fixed non-characters are testable for now (explicitly defined cps, e.g. FFFF). "reserved" non-characters are not detected.
  • U+0000 <control-0000> (0decimal)
  • U+0000 <control-0000>
  • U+00A0   CHK (160decimal)
  • U+00A0   NO-BREAK SPACE
  • U+FFFD CHK (65533decimal)
  • U+FFFDREPLACEMENT CHARACTER
  • U+FFFE <noncharacter-FFFE> (65534decimal)
  • U+FFFE <noncharacter-FFFE>
  • U+FFFF <noncharacter-FFFF> (65535decimal)
  • U+FFFF <noncharacter-FFFF>
  • U+D800 <surrogate-D800> (55296decimal)
  • U+D800 <surrogate-D800>
  • U+DFFE <surrogate-DFFE> (57342decimal)
  • U+DFFE <surrogate-DFFE>
  • U+DFFF <surrogate-DFFF> (57343decimal)
  • U+DFFF <surrogate-DFFF>
  • U+E000 <private-use-E000> (57344decimal)
  • U+E000 <private-use-E000>
  • U+F8FF <private-use-F8FF> (63743decimal)
  • U+F8FF <private-use-F8FF>
  • U+FDD0 <noncharacter-FDD0> (64976decimal)
  • U+FDD0 <noncharacter-FDD0>
  • U+FFFF <noncharacter-FFFF> (65535decimal)
  • U+FFFF <noncharacter-FFFF>

Special code points (control, surrogate, private) not in BMP

  • Error using {{unichar}}: Input "" is not a hexadecimal value.
  • Lua error in Module:Unichar at line 65: code point in hexadecimal expected, got nil.
  • U+1FFFD 🿽 CHK (131069decimal)
  • U+1FFFD <reserved-1FFFD>
  • U+2FFFD 𯿽 CHK (196605decimal)
  • U+2FFFD <reserved-2FFFD>
  • U+2FFFE <noncharacter-2FFFE> (196606decimal)
  • U+2FFFE <noncharacter-2FFFE>
  • U+2FFFF <noncharacter-2FFFF> (196607decimal)
  • U+2FFFF <noncharacter-2FFFF>
  • U+EFFFE <noncharacter-EFFFE> (983038decimal)
  • U+EFFFE <noncharacter-EFFFE>
  • U+EFFFF <noncharacter-EFFFF> (983039decimal)
  • U+EFFFF <noncharacter-EFFFF>
  • U+EFFFD 󯿽 CHK (983037decimal)
  • U+EFFFD <reserved-EFFFD>
  • U+FFFFE <noncharacter-FFFFE> (1048574decimal)
  • U+FFFFE <noncharacter-FFFFE>
  • U+FFFFF <noncharacter-FFFFF> (1048575decimal)
  • U+FFFFF <noncharacter-FFFFF>
  • U+0000 <control-0000> (0decimal)
  • U+0000 <control-0000>

Spaces (Zs)

  • Space U+0020   SOME SPACE (32decimal · HTML &#32;)
  • Space U+0020 SPACE
  • Space U+00A0   SOME SPACE (160decimal · HTML &#160; · &nbsp;, &NonBreakingSpace;)
  • Space U+00A0   NO-BREAK SPACE
  • Space U+01680 OGHAM SOME SPACE (5760decimal · HTML &#5760;)
  • Space U+1680 OGHAM SPACE MARK
  • Space U+0180E SOME SPACE (6158decimal · HTML &#6158;)
  • Space U+180E MONGOLIAN VOWEL SEPARATOR
  • Space U+02002 SOME SPACE (8194decimal · HTML &#8194; · &ensp;)
  • Space U+2002 EN SPACE
  • Space U+2003 SOME SPACE (8195decimal · HTML &#8195; · &emsp;)
  • Space U+2003 EM SPACE
  • Space U+200D SOME SPACE (8205decimal · HTML &#8205; · &zwj;)
  • Space U+200D ZERO WIDTH JOINER
  • Space U+202F SOME SPACE (8239decimal · HTML &#8239;)
  • Space U+202F NARROW NO-BREAK SPACE
  • Space U+205F SOME SPACE (8287decimal · HTML &#8287; · &MediumSpace;)
  • Space U+205F MEDIUM MATHEMATICAL SPACE
  • Space U+2060 SOME SPACE (8288decimal · HTML &#8288; · &NoBreak;)
  • Space U+2060 WORD JOINER
  • Space U+3000   SOME SPACE (12288decimal · HTML &#12288;)
  • Space U+3000   IDEOGRAPHIC SPACE

Formatting characters, no glyph shown (Cf, Zl, Zp)

Zl, Zp:

  • U+2028 LSEP (8232decimal · HTML &#8232; · Zl=LSEP)
  • U+2028 LINE SEPARATOR
  • U+2029 PSEP (8233decimal · HTML &#8233; · Zp=PSEP)
  • U+2029 PARAGRAPH SEPARATOR

Cf:

  • U+00AD SHY (173decimal · HTML &#173; · &shy;)
  • U+00AD SOFT HYPHEN
  • U+0600 ؀ 0600 (1536decimal · HTML &#1536;)
  • U+0600 ؀ ARABIC NUMBER SIGN
  • U+0603 ؃ 0603 (1539decimal · HTML &#1539;)
  • U+0603 ؃ ARABIC SIGN SAFHA
  • U+06DD ۝ ARABIC END OF AYAH (1757decimal · HTML &#1757;)
  • U+06DD ۝ ARABIC END OF AYAH
  • U+070F SYRIAC ABBREVIATION MARK (SAM) (1807decimal · HTML &#1807;)
  • U+070F ܏ SYRIAC ABBREVIATION MARK
  • U+17B4 17B4 (6068decimal · HTML &#6068;)
  • U+17B4KHMER VOWEL INHERENT AQ
  • U+17B5 17B5 (6069decimal · HTML &#6069;)
  • U+17B5KHMER VOWEL INHERENT AA
  • U+200B 200B (8203decimal · HTML &#8203; · &NegativeMediumSpace;, &NegativeThickSpace;, &NegativeThinSpace;, &NegativeVeryThinSpace;, &ZeroWidthSpace;)
  • U+200B ZERO WIDTH SPACE
  • U+202A 202A (8234decimal · HTML &#8234;)
  • U+202A LEFT-TO-RIGHT EMBEDDING
  • U+202B BIDI RLE (8235decimal · HTML &#8235; · bidi:should not reverse this note)
  • U+202B RIGHT-TO-LEFT EMBEDDING
  • U+202E BIDI RLO (8238decimal · HTML &#8238; · bidi:should not reverse this note)
  • U+202E RIGHT-TO-LEFT OVERRIDE
  • U+2060 2060 (8288decimal · HTML &#8288; · &NoBreak;)
  • U+2060 WORD JOINER
  • U+206A 206A (8298decimal · HTML &#8298;)
  • U+206A INHIBIT SYMMETRIC SWAPPING
  • U+FEFF FEFF (65279decimal · HTML &#65279;)
  • U+FEFF ZERO WIDTH NO-BREAK SPACE
  • U+FFFA FFFA (65530decimal · HTML &#65530;)
  • U+FFFAINTERLINEAR ANNOTATION SEPARATOR

Planes 1 & 14:

  • U+110BD 110BD (69821decimal · HTML &#69821;)
  • U+110BD KAITHI NUMBER SIGN
  • U+1D173 1D173 (119155decimal · HTML &#119155;)
  • U+1D173 MUSICAL SYMBOL BEGIN BEAM
  • U+1D17A 1D17A (119162decimal · HTML &#119162;)
  • U+1D17A MUSICAL SYMBOL END PHRASE

Combining chars

  • Umlaut Error using {{unichar}}: Input "U+0308" is not a hexadecimal value.
  • Umlaut Lua error in Module:Unichar at line 75: code point in hexadecimal expected, got "U+0308".
  • Umlaut U+0308 ̈ COMBINING DIARESES
  • Umlaut U+0308 ◌̈ COMBINING DIAERESIS
  • Umlaut U+0308 ◌̈ COMBINING DIARESES
  • Umlaut U+0308 ◌̈ COMBINING DIAERESIS
  • Umlaut U+0308 COMBINING DIARESES
  • Umlaut U+0308 ◌̈ COMBINING DIAERESIS
  • Umlaut U+0308  ̈  COMBINING DIARESES (776decimal)
  • Umlaut U+0308 ◌̈ COMBINING DIAERESIS
  • Z comb=U+0302 COMBINING CIRCUMFLEX ACCENT (as in â)
  • Z comb=U+0302 ◌̂ COMBINING CIRCUMFLEX ACCENT
  • Z comb=U+0302  ̂  COMBINING CIRCUMFLEX ACCENT (as in â)
  • Z comb=U+0302 ◌̂ COMBINING CIRCUMFLEX ACCENT
  • Z comb=U+0302  ̂  COMBINING CIRCUMFLEX ACCENT (770decimal · as in â)
  • Z comb=U+0302 ◌̂ COMBINING CIRCUMFLEX ACCENT

No name, input |2=|

  • J no na: U+00A9 ©
  • J no na: U+00A9 © COPYRIGHT SIGN
  • J no na: U+00A9 ©
  • J no na: U+00A9 © COPYRIGHT SIGN
  • J no na: U+00A9 ©
  • J no na: U+00A9 © COPYRIGHT SIGN
  • J no na: U+00A9 ©
  • J no na: U+00A9 © COPYRIGHT SIGN
  • J no na: U+00A9 ©
  • J no na: U+00A9 © COPYRIGHT SIGN

Image overruling the font

Currently, the Rupee symbol is available in most fonts. Still, the pictural glyph can be tested (October 2011):

testig input

  • H no #1: Error using {{unichar}}: Input "" is not a hexadecimal value.
  • H no #1: Lua error in Module:Unichar at line 65: code point in hexadecimal expected, got nil.
  • I #1=blank: Error using {{unichar}}: Input "" is not a hexadecimal value.
  • I #1=blank: Lua error in Module:Unichar at line 65: code point in hexadecimal expected, got nil.
  • I #1=-1: Error using {{unichar}}: Input "-1" is not a hexadecimal value.
  • I #1=-1: Lua error in Module:Unichar at line 82: code point -1 out of range.
  • I #1=110001: U+110001 (1114113decimal)
  • I #1=110001: Lua error in Module:Unichar at line 82: code point 1114113 out of range.

Using different bracketed notes (dec, html, note)

  • K noBtxt: U+00B9 ¹ COPYRIGHT SIGN
  • K noBtxt: U+00B9 ¹ SUPERSCRIPT ONE
  • L1 note.U+0019 <control-0019> Bell character (Some note here)
  • L1 note.U+0019 <control-0019>
  • L1 note.U+0012 <control-0012> Error using {{unichar}}: Pagename expected in "nlink=" (Some note here)
  • L1 note.U+0012 <control-0012>
  • L2 note.U+0017 <control-0017> (Some note here)
  • L2 note.U+0017 <control-0017>
  • M dec.U+00B7 · BEE SEVEN (183decimal)
  • M dec.U+00B7 · MIDDLE DOT
  • A1 html.U+00A9 © CSIGN TC (HTML &#169; · &copy;, &COPY;)
  • A1 html.U+00A9 © COPYRIGHT SIGN
  • A2 html.U+07A9 ީ SOME CHAR (HTML &#1961;)
  • A2 html.U+07A9 ީ THAANA EEBEEFILI
  • B dec.html.U+00A9 © CSIGN AS (169decimal · HTML &#169; · &copy;, &COPY;)
  • B dec.html.U+00A9 © COPYRIGHT SIGN
  • C html.note.U+2E18 CSIGN WT (HTML &#11800; · noot)
  • C html.note.U+2E18INVERTED INTERROBANG
  • D dec.note.U+2018 CSIGN QW (8216decimal · nootxx)
  • D dec.note.U+2018LEFT SINGLE QUOTATION MARK
  • E dec.html.note.: U+2018 CSIGN QW (8216decimal · HTML &#8216; · &lsquo;, &OpenCurlyQuote; · nootxx)
  • E dec.html.note.: U+2018LEFT SINGLE QUOTATION MARK
  • F no notes txt: U+00A9 © CSIGN EW
  • F no notes txt: U+00A9 © COPYRIGHT SIGN

other

  • Copyright sign
  • COPYRIGHT SIGN
  • COPYRIGHT SIGN
  • COPYRIGHT SIGN
{{bottomLinkPreText}} {{bottomLinkText}}
Template:Unichar/testcases
Listen to this article