Download Presentation

Open-Source Approaches to Unicode Enablement

An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.

The Unicode Consortium provides mapping tables for converting many of the more common character sets to Unicode. The CSets archive provides supplementary mapping tables for character sets and encodings that are not supplied by the Unicode Consortium.

Amsterdam, the Netherlands, March 2000

CRL’s Unicode Support

Unicode character information.

To facilitate development of Unicode-capable software, a simple character information and partial bi-directional reordering API and library was developed early on before standardization efforts really gained momentum. This is the UCData package and the Pretty Good Bidi Algorithm.

Amsterdam, the Netherlands, March 2000

CRL’s Unicode Support

Algorithms.

To further encourage independent development of Unicode capable software, a few basic text search algorithms were converted to use Unicode text. These include:

A Boyer-Moore string search routine.

A glob matching routine called Wildmat.

An almost minimal DFA regular expression routine.

Amsterdam, the Netherlands, March 2000

CRL’s Unicode Support

Other resources.

Some of the other resources made available by CRL are:

Code to test wchar_t type support in C/C++ compilers.

Keyboard arrangements for various languages that have been collected over the years.

Resource Availability.

All of the resources mentioned are freeware and can be found at http://crl.nmsu.edu/~mleisher/.

Amsterdam, the Netherlands, March 2000

International Library for Mozilla

Frank Tang

Netscape Communications

Mozilla

International Components for Unicode (ICU)

Helena Shih and Steven Loomis

IBM Unicode Technology Center

Unicode support in the Industry

Lack of a complete set of features in most implementations.

Inconsistent across different environments. Win32 vs. POSIX, for example.

Poor portability.

Unable to share the resources with other products.

Almost no extensibility and customization.

Not a concern for most companies when a product is first designed.

Amsterdam, the Netherlands, March 2000

Netfinity Server

ICU

Apple G3 Macintosh

ICU

IBM’s DB/2 Product

AS/400 e-Server 720

Microsoft NT Workstation

World Wide Web

Sun Ultra 60 Workstation

S/390 Server

Amsterdam, the Netherlands, March 2000

ICU Objectives

Quality Unicode & I18N support across platforms

Consistent results in both C/C++ and Java

Powerful, portable APIavailable to the Open-Source development community