This document covers the following topics:
On Windows, UNIX and OpenVMS platforms, Natural has internally been Unicode-enabled. This means that many structures containing strings have Unicode format now. For example, the Natural source area has now Unicode format. For this reason, Unicode data can be handled at runtime in the Natural I/O as well as in the Natural development environment when writing and cataloging Natural code.
Even if Natural is Unicode-enabled internally, all existing data currently has code page format. As a consequence, all this data is converted from code page format to Unicode format when used in Natural Version 6.2 or above. For example, if a source is opened with the program editor, a conversion from the code page file format to the Unicode source area format is performed. Even if you do not use the U format, this is of advantage: you can now see all language-specific characters, no matter which system code page is installed. However, the user is responsible for defining the correct code page information. See Migrating Existing Applications for more details.
When cataloging Natural objects, all constants which are not defined with the U prefix are converted to the code page of the corresponding source. If the source has UTF-8 format, these constants are converted to the default code page.
Notes:
USIZE
may need to be
increased with Natural Version 6.2 or above.
Unicode is fully supported in the local Natural for Windows environment.
The editors are Unicode-enabled and it is possible to enter all possible
characters. When saving the source, Natural first tries to convert the source
to the original code page. If this fails because the source contains characters
which are not found in this code page, further processing depends on the
setting of the parameter SUTF8
. If
SUTF8
is ON
, the source will be saved in
UTF-8 format. If SUTF8
is OFF
, the user
will be asked whether to save the source in the original code page or to cancel
the current save. If the user decides to save the source in the original code
page, the characters which are not found will be replaced with substitution
characters. In addition, it is possible to select a code page explicitly in the
Save
As dialog box.
The program editor has been enhanced in order to support the Unicode bidirectional algorithm.
The output window is also Unicode-enabled. When characters are entered via the keyboard, A format fields accept only the characters which are available in the default code page.
Full Unicode support is only available with SPoD and the Natural Web I/O Interface. SPoD is necessary for entering Unicode input in Natural sources; the same applies as described above for the local Natural for Windows environment. The Natural Web I/O Interface is necessary for Unicode I/O from Natural applications.
If Natural is used via a terminal emulation, all output will be converted from Unicode to the default code page before displaying it. Characters which are not available in the default code page will be replaced with the substitution character of the default code page. Similar input is only possible on base of the default code page.
Note:
Natural sources which have UTF-8 format can no longer be opened with
the native Natural for UNIX or Natural for OpenVMS editors.