<!-- | |
NOTE: THIS IS A WORKING DOCUMENT. NOTHING IN HERE SHOULD BE | |
TAKEN AS FINAL. | |
This is an initial proposal for a Latin 1 and assorted | |
other characters entity set for SABLE. It is based on the | |
equivalent set for HTML 3. | |
--> | |
<!-- Portions of this text are copyright ISO: | |
(C) International Organization for Standardization 1986 | |
Permission to copy in any form is granted for use with | |
conforming SGML systems and applications as defined in | |
ISO 8879, provided this notice is included in all copies. | |
--> | |
<!-- Character entity set. Typical invocation: | |
<!ENTITY % HTMLlat1 PUBLIC | |
"-//W3O//ENTITIES W3 Latin 1 for HTML//EN"> | |
%HTMLlat1; | |
--> | |
<!-- Modified for use in HTML | |
$Id: sable-latin.ent,v 1.2 2001/04/04 13:12:35 awb Exp $ = | |
--> | |
<!-- Modified to add characters in the range &161; to &191; in | |
the ISO Latin-1 character set, which could only be referred = | |
to by numeric references. Entity names based on relevant entities in | |
ISO 8879-1986//ENTITIES Numeric and Special Graphic//EN" | |
Also added the standard lt gt amp entities from HTML 2.0 | |
Chris Lilley, 13 March 1995 | |
= | |
This covers all of Latin 1, but we are still unable to display a = | |
Trade Mark (TM) | |
--> | |
<!-- capital AE diphthong (ligature) --> | |
<!-- capital A, acute accent --> | |
<!-- capital A, circumflex accent --> | |
<!-- capital A, grave accent --> | |
<!-- capital A, ring --> | |
<!-- capital A, tilde --> | |
<!-- capital A, dieresis or umlaut mark --> | |
<!-- capital C, cedilla --> | |
<!-- capital Eth, Icelandic --> | |
<!-- capital E, acute accent --> | |
<!-- capital E, circumflex accent --> | |
<!-- capital E, grave accent --> | |
<!-- capital E, dieresis or umlaut mark --> | |
<!-- capital I, acute accent --> | |
<!-- capital I, circumflex accent --> | |
<!-- capital I, grave accent --> | |
<!-- capital I, dieresis or umlaut mark --> | |
<!-- capital N, tilde --> | |
<!-- capital O, acute accent --> | |
<!-- capital O, circumflex accent --> | |
<!-- capital O, grave accent --> | |
<!-- capital O, slash --> | |
<!-- capital O, tilde --> | |
<!-- capital O, dieresis or umlaut mark --> | |
<!-- capital THORN, Icelandic --> | |
<!-- capital U, acute accent --> | |
<!-- capital U, circumflex accent --> | |
<!-- capital U, grave accent --> | |
<!-- capital U, dieresis or umlaut mark --> | |
<!-- capital Y, acute accent --> | |
<!-- small a, acute accent --> | |
<!-- small a, circumflex accent --> | |
<!-- small ae diphthong (ligature) --> | |
<!-- small a, grave accent --> | |
<!-- small a, ring --> | |
<!-- small a, tilde --> | |
<!-- small a, dieresis or umlaut mark --> | |
<!-- small c, cedilla --> | |
<!-- small e, acute accent --> | |
<!-- small e, circumflex accent --> | |
<!-- small e, grave accent --> | |
<!-- small eth, Icelandic --> | |
<!-- small e, dieresis or umlaut mark --> | |
<!-- small i, acute accent --> | |
<!-- small i, circumflex accent --> | |
<!-- small i, grave accent --> | |
<!-- small i, dieresis or umlaut mark --> | |
<!-- small n, tilde --> | |
<!-- small o, acute accent --> | |
<!-- small o, circumflex accent --> | |
<!-- small o, grave accent --> | |
<!-- small o, slash --> | |
<!-- small o, tilde --> | |
<!-- small o, dieresis or umlaut mark --> | |
<!-- small sharp s, German (sz ligature) --> | |
<!-- small thorn, Icelandic --> | |
<!-- small u, acute accent --> | |
<!-- small u, circumflex accent --> | |
<!-- small u, grave accent --> | |
<!-- small u, dieresis or umlaut mark --> | |
<!-- small y, acute accent --> | |
<!-- small y, dieresis or umlaut mark --> | |
<!-- = | |
Ones that aren't accented characters, and so not in ISO Added Latin = | |
1. | |
umlaut. macron, acute, cedilla | |
were not in ISO Numeric and Special Graphic | |
either; I took their names from the numeric entity list in | |
http://www.hpl.hp.co.uk/people/dsr/html/latin1.html = | |
Chris Lilley, 13 March 1995 = | |
--> | |
<!-- inverted exclamation mark &161; --> | |
<!-- cent sign &162; --> | |
<!-- pound sterling sign &163; --> | |
<!-- general currency sign &164; --> | |
<!-- yen sign &165; --> | |
<!-- broken (vertical) bar &166; --> | |
<!-- section sign &167; --> | |
<!-- umlaut (dieresis) &168; --> | |
<!-- copyright sign &169; --> | |
<!-- ordinal indicator, feminine &170; --> | |
<!-- angle quotation mark, left &171; --> | |
<!-- not sign &172; --> | |
<!-- soft hyphen &173;--> | |
<!-- registered trademark &174; --> | |
<!-- macron &175; --> | |
<!-- degree sign &176; --> | |
<!-- plus-or-minus sign &177; --> | |
<!-- superscript two &178; --> | |
<!-- superscript three &179; --> | |
<!-- acute accent &180; --> | |
<!-- micro sign &181; --> | |
<!-- pilcrow (paragraph sign) &182; --> | |
<!-- middle dot (centred decimal point) &183; --> | |
<!-- cedilla accent &184; --> | |
<!-- superscript one --> | |
<!-- ordinal indicator, masculine --> | |
<!-- angle quotation mark, right --> | |
<!-- fraction one-quarter --> | |
<!-- fraction one-half --> | |
<!-- fraction three-quarters --> | |
<!-- inverted question mark --> | |
<!-- the odd ones tucked in amongst the accented letters --> | |
<!-- multiply sign --> | |
<!-- divide sign --> | |
<!-- ampersand --> | |
<!-- greater than --> | |
<!-- less than --> | |
<!-- double quote --> | |
<!-- | |
Should the dollar sign and such like also be given entity names? | |
There are suitable ones around. For example, people using = | |
internationalised keyboards in various countries might not be = | |
readily able to access some of these characters. | |
= | |
On such grounds, I suggest entity names for these: | |
! " # $ % ' ( ) * + , - . / : ; =3D ? [ \ ] ^ _ ` { | } ~ | |
Lastly, what about &tab; for those whose editors convert all | |
tabs to spaces? | |
--> | |