Template:Punctuation marks in Unicode/doc

General Category Px, punctuation

Listed are all code points in General Category P:
P, Punctuation mark
Value  Category (major, minor)     Type               Count  Remarks
Pd     Punctuation, dash           Graphic Character     25  Includes several hyphen characters
Ps     Punctuation, open           Graphic Character     75  Opening bracket characters
Pe     Punctuation, close          Graphic Character     77  Closing bracket characters
Pi     Punctuation, initial quote  Graphic Character     12  Opening quotation mark. Does not include the ASCII "neutral" quotation mark. May behave like Ps or Pe depending on usage
Pf     Punctuation, final quote    Graphic Character     10  Closing quotation mark. May behave like Ps or Pe depending on usage
Pc     Punctuation, connector      Graphic Character     10  Includes "_" underscore
Po     Punctuation, other          Graphic Character    593
P      Punctuation                                      798  Total
todo: check numbers; the per-category counts above sum to 802, while the big table has 798 code point entries
Numbers as of Unicode version 14.0, per § BabelMap
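
The counts above can be cross-checked with a short script. This is a minimal sketch, not the method used for the table: it assumes Python's standard unicodedata module, whose bundled Unicode version (unicodedata.unidata_version) may differ from 14.0, so the printed numbers may not match the table. The function name count_punctuation is made up for this example.

```python
import sys
import unicodedata
from collections import Counter


def count_punctuation():
    """Count code points per General Category P* in this Python's Unicode data."""
    counts = Counter()
    for cp in range(sys.maxunicode + 1):
        cat = unicodedata.category(chr(cp))
        if cat.startswith("P"):
            counts[cat] += 1
    return counts


counts = count_punctuation()
total = sum(counts.values())

# The Unicode version depends on the Python build, not on this page.
print("Unicode version:", unicodedata.unidata_version)
for cat in ("Pd", "Ps", "Pe", "Pi", "Pf", "Pc", "Po"):
    print(cat, counts[cat])
print("P total", total)
```

Comparing "P total" with the sum of the seven rows is a quick way to spot a mismatch like the one noted in the todo.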

TOC for this table

{{Punctuation marks in Unicode/TOC}} -- local, no pagename provided
By Unicode General Category 'P'
§ Pd, dash
§ Ps-Pe, start–end (open–close brackets)
§ Pi-Pf, initial–final quote
§ Pc, connector
§ Po, other

Parameters

|cp=        code point, hex. Determines the character shown, its name, and its anchors. Use normalised, uppercase notation such as "000A".
|link=      link to article, linked from the (first) name; optional
|gencat=    General Category, Px by list definition
|script=    character script property
|style=     large → double cell height, for example § U+104C
|technote=  maintenance info: is pair, is unbalanced pair, is vertical pair
|style=, |technote= are shown in {{/testcases}}
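
The normalised notation asked for in |cp= can be illustrated as follows. This is a sketch under assumptions: normalise_cp and describe are hypothetical helpers, not part of the template, and the name and General Category come from Python's unicodedata module, whose Unicode version may differ from the one used here. The script property is not available in unicodedata, so it is left out.

```python
import unicodedata


def normalise_cp(value):
    """Return a code point as uppercase hex, zero-padded to at least 4 digits."""
    text = value.strip().upper()
    if text.startswith("U+"):
        text = text[2:]
    number = int(text, 16)  # raises ValueError on non-hex input
    if not 0 <= number <= 0x10FFFF:
        raise ValueError("outside the Unicode code space")
    return f"{number:04X}"


def describe(value):
    """Return (normalised cp, character name, General Category)."""
    cp = normalise_cp(value)
    char = chr(int(cp, 16))
    # Unassigned code points have no name; fall back to an empty string.
    return cp, unicodedata.name(char, ""), unicodedata.category(char)


print(normalise_cp("a"))
print(describe("u+00b6"))
```

A value such as "a" or "u+00b6" is accepted loosely on input, but only the padded uppercase form would be passed to |cp=.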

For pairs (Pi-Pf quotes, Ps-Pe brackets), use {{../row-pair}}:

|cp2=       second code point
|gencat2=   second General Category (expected: Pf or Pe)
|script2=, |link2= unused, do not exist
|char= present for tests, could be removed when stable
{{Punctuation marks in Unicode/row
| cp       =
| link     =
| gencat   =
| script   =
| style    =
| technote =
| cp2      =
| gencat2  =
}}
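
When many rows need to be written, a call in the layout above can be generated rather than typed. The following is only a sketch: render_row is a hypothetical helper, the default template name is taken from the skeleton above, and the parameter values in the example (the bracket pair U+2995, U+2996) are illustrative.

```python
# Parameter order as in the skeleton above.
ORDER = ("cp", "link", "gencat", "script", "style", "technote", "cp2", "gencat2")


def render_row(template="Punctuation marks in Unicode/row", **params):
    """Render a template call; parameters that are not given are omitted."""
    unknown = set(params) - set(ORDER)
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")
    lines = ["{{" + template]
    for name in ORDER:
        if name in params:
            # Pad names to 8 characters so the "=" signs line up.
            lines.append(f"| {name:<8} = {params[name]}")
    lines.append("}}")
    return "\n".join(lines)


print(render_row(cp="2995", gencat="Ps", cp2="2996", gencat2="Pe"))
```

Rejecting unknown names catches parameters such as script2 or link2, which the template does not define.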

Note: |char= is for checks only, might be removed later on.

|TOC=yes will show table-TOC


|gencat=

Anchors

  • All marks are anchored twice: by code point (U+00B6) and by character (¶). The character anchor can be written as an entity value or as the character itself.
cp=00B6: § U+00B6, entity: § ¶, char: § ¶
Template:Punctuation marks in Unicode#U+00B6
cp=2995: § U+2995, entity: § ⦕, char: § ⦕
Template:Punctuation marks in Unicode#U+2995
cp=2996: § U+2996, entity: § ⦖, char: § ⦖
Template:Punctuation marks in Unicode#U+2996
When possible, the U+00B6 notation is preferred (for a pair, the first code point). It gives a nicer arrival point: the top of the table row.
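
The two anchors for a mark can be derived from its |cp= value. This is a small sketch, assuming the anchors are exactly the "U+XXXX" notation and the character itself, as in the examples above; anchors_for is a hypothetical helper and the page name is the one shown in those examples.

```python
def anchors_for(cp, page="Template:Punctuation marks in Unicode"):
    """Return (code point link target, character link target) for a hex cp."""
    number = int(cp, 16)
    code_anchor = f"U+{number:04X}"  # preferred form
    char_anchor = chr(number)        # same row, addressed by the character
    return f"{page}#{code_anchor}", f"{page}#{char_anchor}"


preferred, by_char = anchors_for("00B6")
print(preferred)
print(by_char)
```

For a pair, passing the first code point yields the preferred target.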

See also
