GNU Aspell mkchardata perl script and Unicode data file
Version 0.2
----------------------------------------------------------------------

This version of mkchardata will only work for GNU Aspell 0.60 or
better.  It will not work for Aspell 0.50 or any of Aspell 0.51
snapshots before I changed the version to 0.60.

The mkchardata perl script will read in a textual reference table(s)
and convert them into Aspell character data file(s).  Its usage is

  mkchardata [--no-ascii] unicode.txt <textual reference table(s)>

It will convert each textual reference table to an Aspell character
data file.  It expect the table to be in the form

  0x?? 0x???? # ...

Where 0x?? is the 8-bit character value in hex and 0x???? is the
Unicode value.  Anything after the '#' is ignored.  It table may
alternatively have the form:

  =?? U+???? ...

As of Aspell 0.60 the following characters may also be remapped:

  02-1F (  2- 31) # Control characters
  41-5A ( 65- 90) # Uppercase Latin alphabet
  61-7A ( 97-122) # Lowercase Latin alphabet
  80-FF (128-255)

Giving you a total of 220 characters to work with.

If any of the Latin ASCII letters are remapped than the option
--no-ascii should be given so that ALL the remaining letters are also
mapped to the private use area.  If you don't do this you may get
"interesting" results when trying to spell check a document that
contains words written using those letters, ie it may flag part of a
word as incorrect.

If your language uses characters not found in iso-8859-1 (code points
U+00 to U+FF) you might want to look over unicode.txt and make sure
everything is correct for your language.  If you find any errors
please send them to me at kevina@gnu.org.

