UKTeX Digest	Friday, 14 Sep 1990
		Volume 90 : Issue 31

Today's Topics:
   Extended TeX Font Encoding Scheme - Latin; request for comments

Moderator:       Peter Abbott
Editor:          David Osborne (University of Nottingham)

Latest TeXhax:    #59
Latest TeXmag:    V4 N3 
		       E d i t o r 's   N o t e

This is a special TUG90 Cork edition of the digest, announcing
a proposed font encoding scheme.  The next regular issue should
appear on Mon, 24th September.

Subject: Extended TeX Font Encoding Scheme - Latin; request for comments
Reply-To: Norbert Schwarz <p920021@dbohub01.bitnet>
Date: Fri, 14 Sep 90 10:41:46 BST
From: David Osborne <>

At the TUG90 meeting in Cork, the font-encoding Birds-of-a-Feather
meeting devised the following encoding scheme for using all 256
glyphs supported by the 8-bit facilities in TeX 3.0.

This is currently a draft for which comments are sought:
please direct all correspondence to Norbert Schwarz.

The TeX file which follows should be typeset using the Plain format,
and uses no extended characters.  It can be typeset using TeX 2.x.

%  This is the final definition of a 256 character code set for
%  TeX prepared by the TEX-CHAR working group, discussed at
%  the TeX conference at Cork.
%  Cork, September 12, 1990
% -----------------------------------------------------------------------

\advance\voffset by -1.5truecm
\advance\vsize by 3truecm


\leftline{\bf Extended \TeX\ Font Encoding Scheme --- Latin }
\hrule height 1pt
\leftline{finished in Cork, September 12, 1990 (corrected a typo in 
\newcount\m \newcount\n \newdimen\dim
\def\oct#1{\hbox{\rm\'{}\kern-.2em\it#1\/\kern.05em}} % octal constant
\def\hex#1{\hbox{\rm\H{}\tt#1}} % hexadecimal constant
\def\setdigs#1"#2{\gdef\h{#2}% \h=hex prefix; \0\1=corresponding octal
 \m=\n \divide\m by 64 \xdef\0{\the\m}%
 \multiply\m by-64 \advance\m by\n \divide\m by 8 \xdef\1{\the\m}}
  \setbox0=\hbox{\lower 2.3pt\hbox{\hex{\h x}}}\smash{\box0}\cr
 \ifnum\n<256 \m=\n \divide\m 16 \chardef\next=\m
 \ifnum\n=256 \let\next=\endchart\else\let\next=\morechart\fi
 \chartline \oddline \m=\1 \advance\m 1 \xdef\1{\the\m}
 \chartline \evenline}
\def\chartstrut{\lower4.5pt\vbox to14pt{}}
  \halign to\hsize\bgroup
    \chartstrut##\tabskip0pt plus10pt&
  \raise11.5pt\null&&&\hex 8&&\hex 9&&\hex A&&\hex B&
  &\hex C&&\hex D&&\hex E&&\hex F&\cr\egroup$$\par}
\def\:{\setbox0=\hbox{\csname ec\number\n\endcsname}%
  \box0\global\advance\n 1 }
  \advance\dim 2pt \dp0=\dim}

\font\san=cmss10 scaled \magstep1
\font\San=cmss10 scaled \magstep2
\def\eng{{\san r \hskip-6pt\j}}
\def\Eng{{\San r \hskip-7pt\j}}
\def\Thorn{{\San\lower4pt\hbox{l} \hskip-8.5pt\relax b}}
\def\thorn{{\san\lower3pt\hbox{l} \hskip-7pt\relax b}}
\def\eth{{\minisan \raise4pt\hbox{$\times$} \hskip-8.5pt\relax o}}
\def\ec#1{\expandafter\def\csname ec\number\n\endcsname{#1}\advance\n by 1{}}
% accents
%  '000-'014
\ec{\`{}} \ec{\'{}} \ec{\^{}}    \ec{\~{}}
\ec{\"{}} \ec{\H{}} \ec{\char23} \ec{\v{}}
\ec{\u{}} \ec{\={}} \ec{\.{}}    \ec{\c{}}
\ec{\ogonek{\ }}
% quotation marks
%  '015-'024
\ec{,} \ec{$\scriptstyle<$} \ec{$\scriptstyle>$} \ec{``} \ec{''}
\ec{,\hskip-1pt ,} \ec{$\scriptstyle\ll$} \ec{$\scriptstyle\gg$}
% dashes
% compound word mark
\ec{{\tt<\san cwm\tt>}}
% a small 0, to extend % into 0/00 and 0/000
% dotless i and j
% f ligatures
% visible space
% the ASCII character set
% hyphenation character
\ec{{\sevenrm (hyph.char) }-}
% accented letters, mainly from ISO Latin 2 (8859/2), and some more
%  '200-'237  uppercase
\ec{\u A} \ec{\ogonek{A}} \ec{\'C}        \ec{\v C}
\ec{\v D} \ec{\v E}       \ec{\ogonek{E}} \ec{\u G}
\ec{\'L}  \ec{L\kern-2pt'} \ec{\L}   \ec{\'N} 
\ec{\v N} \ec{Eng/\Eng}         \ec{\H O} \ec{\'R}
\ec{\v R} \ec{\'S}         \ec{\v S} \ec{\c S}
\ec{\v T} \ec{\c T}        \ec{\H U} \ec{\accent 23U}
\ec{\"Y}  \ec{\'Z}  \ec{\v Z}                          \ec{\.Z}
\ec{IJ}   \ec{\.I}  \ec{d\llap{\raise0.7ex\hbox{--}}}  \ec{\S}
%  '240-'277
\ec{\u a}        \ec{\ogonek{a}} \ec{\'c}        \ec{\v c}
\ec{d\kern-1pt'} \ec{\v e}       \ec{\ogonek{e}} \ec{\u g}
\ec{\'l} \ec{l\kern-1pt'} \ec{\l} \ec{\'n}
\ec{\v n} \ec{eng/\eng} \ec{\H o} \ec{\'r}
\ec{\v r} \ec{\'s}         \ec{\v s} \ec{\c s}
\ec{t\kern-1pt'} \ec{\c t} \ec{\H u} \ec{\accent 23u} 
\ec{\"y} \ec{\'z} \ec{\v z} \ec{\.z} 
\ec{ij}  \ec{!`}  \ec{?`}  \ec{{\tenu\$}}
% accented letters, mainly from ISO Latin 1
% '300-'337   uppercase characters
\ec{\`A} \ec{\'A} \ec{\^A} \ec{\~A}
\ec{\"A} \ec{\AA} \ec{\AE} \ec{\c C}
\ec{\`E} \ec{\'E} \ec{\^E} \ec{\"E}
\ec{\`I} \ec{\'I} \ec{\^I} \ec{\"I}
\ec{Eth/\rlap{\raise0.3ex\hbox{--}}D} \ec{\~N} \ec{\`O} \ec{\'O}
\ec{\^O} \ec{\~O} \ec{\"O} \ec{\OE}
\ec{\O}  \ec{\`U} \ec{\'U}    \ec{\^U}
\ec{\"U} \ec{\'Y} \ec{Thorn/\Thorn}  \ec{SS}
%  '340-'377   lowercase characters
\ec{\`a} \ec{\'a} \ec{\^a} \ec{\~a}
\ec{\"a} \ec{\aa} \ec{\ae} \ec{\c c}
\ec{\`e}  \ec{\'e}  \ec{\^e}  \ec{\"e}
\ec{\`\i} \ec{\'\i} \ec{\^\i} \ec{\"\i}
\ec{eth/\eth} \ec{\~n} \ec{\`o} \ec{\'o}
\ec{\^o} \ec{\~o} \ec{\"o} \ec{\oe}
\ec{\o}  \ec{\`u} \ec{\'u}   \ec{\^u}
\ec{\"u} \ec{\'y} \ec{thorn/\thorn} \ec{\ss}

\noindent{\bf A few words of explanation:}

\oct{000}--\oct{014} are accents. \oct{014} is an ogonek.
\oct{015}--\oct{024} are quotation marks.
\oct{030} is a small 0 to put after the per cent sign, to turn it into
a per thousand (\%$_0$) or per ten thousand (\%$_{00}$) sign.
\oct{027} ({\san cwm}) is a compound word mark 
(a zero-width invisible character) used e.g.\ for avoiding ligatures.
\oct{040} is a visible space.
\oct{042} is a {\it straight} double quotation mark.
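% Editorial sketch, not part of the TEX-CHAR proposal: one way the two
% special-purpose slots described above might be used.  It assumes the
% current font is laid out according to this table.  No such font is
% loaded in this file, so the macros are only defined here and never
% called.  The names \cwm and \permille are hypothetical.
\def\cwm{\char'027 }%        compound word mark, slot '027
\def\permille{\%\char'030 }% per cent sign followed by the small 0, slot '030
% Possible use:   shelf\cwm ful   (suppresses the fl ligature)
%                 5\permille      (per thousand sign after the 5)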

\noindent \oct{041}--\oct{176} follow the 7-bit ASCII code.
Some characters that --- at first glance --- appear to duplicate accent
characters usually have a different shape.

\noindent \oct{177} is the hyphen character (which may differ from
the dash, \oct{055}).
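% Editorial sketch, not part of the proposal: with a font laid out
% according to this table, hyphenation could be pointed at slot '177
% rather than the ASCII hyphen at '055 through TeX's \hyphenchar
% parameter.  The macro name \usehyphenslot is hypothetical, and #1 is
% assumed to be the font selector of such a font.
\def\usehyphenslot#1{\hyphenchar#1='177 }%
% Possible use:   \font\ecrm=<a font in this encoding>
%                 \usehyphenslot\ecrm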
\oct{201}, \oct{206}, \oct{241}, and \oct{246} are A's and E's with
ogonek accents.
\noindent The table has been sorted to reflect the {\tt\string\uppercase}
and {\tt\string\lowercase} mechanism for all characters.
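% Editorial sketch, not part of the proposal: in the table each
% lowercase letter in '240-'274 and '340-'377 sits exactly '40 above its
% uppercase partner, so the case tables could be filled by a loop such
% as the one below.  The names \ucslot, \lcslot and \setcasepairs are
% hypothetical.  Slots '235-'237 are left alone because the table does
% not pair them with '275-'277.  The macro is only defined here, not
% called, since codes above '177 require TeX 3.0.
\newcount\ucslot \newcount\lcslot
\def\setcasepairs#1#2{% #1, #2 = first and last uppercase slot of a block
  \ucslot=#1
  \loop
    \lcslot=\ucslot \advance\lcslot by '40
    \lccode\ucslot=\lcslot \uccode\ucslot=\ucslot
    \lccode\lcslot=\lcslot \uccode\lcslot=\ucslot
  \ifnum\ucslot<#2 \advance\ucslot by 1
  \repeat}%
% Possible use under TeX 3.0:
%   \setcasepairs{'200}{'234}  \setcasepairs{'300}{'337}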

\noindent This table shows the character code positions, but the
shapes are only approximations.


End of UKTeX Digest