uhtml \- convert foreign character set HTML file to unicode
HTML comes in various character set encodings
and has special forms to encode characters. To
make it easier to process html, uhtml is used
to normalize it to a unicode-only form.
Uhtml detects the character set of the html input
to convert it to utf, replacing html-entity forms
with their unicode character representations except for
The converted html is written to
standard output. If no
.I file
is given, it is read from standard input. If the
option is given, the detected character set is printed and
the program exits without conversion.
If character set detection fails, the default (utf)
is assumed. This default can be changed with the
.B /sys/src/cmd/uhtml.c
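The normalization described above (detect the character set, fall back to a default, convert to utf, replace entity forms) can be illustrated with a short Python sketch. This is not the code in uhtml.c. The detection method (byte order mark, then a meta charset attribute), the function names, and the set of characters whose entities are left alone are all assumptions made for illustration.

```python
import codecs
import re
from html.entities import html5

# Assumption: entities for characters that carry markup meaning stay encoded.
KEEP_CHARS = frozenset("<>&\"'")

# Assumption: a byte order mark is checked before any markup is inspected.
BOMS = (
    (codecs.BOM_UTF8, "utf-8-sig"),
    (codecs.BOM_UTF16_LE, "utf-16"),
    (codecs.BOM_UTF16_BE, "utf-16"),
)
META = re.compile(rb"""<meta[^>]*charset\s*=\s*["']?([A-Za-z0-9._:-]+)""", re.I)
ENTITY = re.compile(r"&(#[0-9]+|#[xX][0-9A-Fa-f]+|[A-Za-z][A-Za-z0-9]*);")


def detect_charset(raw, default="utf-8"):
    """Return a codec name for raw bytes, or default when detection fails."""
    for bom, name in BOMS:
        if raw.startswith(bom):
            return name
    m = META.search(raw[:4096])
    if m:
        name = m.group(1).decode("ascii")
        try:
            codecs.lookup(name)  # reject names Python has no codec for
            return name
        except LookupError:
            pass
    return default


def _decode(ref):
    """Map the text between '&' and ';' to a character, or None if unknown."""
    if ref[0] == "#":
        try:
            code = int(ref[2:], 16) if ref[1] in "xX" else int(ref[1:])
            if 0xD800 <= code <= 0xDFFF:  # surrogates cannot be encoded
                return None
            return chr(code)
        except (ValueError, OverflowError):
            return None
    return html5.get(ref + ";")


def replace_entities(text):
    """Replace entity forms by their characters, except those in KEEP_CHARS."""
    def sub(m):
        ch = _decode(m.group(1))
        if ch is None or ch in KEEP_CHARS:
            return m.group(0)  # leave unknown or markup-significant forms
        return ch
    return ENTITY.sub(sub, text)


def to_utf(raw, default="utf-8"):
    """Decode raw html with the detected charset and return utf-8 bytes."""
    text = raw.decode(detect_charset(raw, default), errors="replace")
    return replace_entities(text).encode("utf-8")
```

Passing a different default to to_utf plays the role of changing the fallback character set, and calling detect_charset alone corresponds to printing the detected character set without converting.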