Representing symbols, characters and letters that are used worldwide is no mean feat, but Unicode managed it - how? Tom Scott explains how the web has settled on a standard.
This page describes normalization forms for Unicode text. When implementations keep strings in a normalized form, they can be assured that equivalent strings have a unique binary representation. This page also provides examples, additional specifications regarding normalization of Unicode text, and information about conformance testing for Unicode normalization forms.
Content on the web can be authored using a variety of software applications. Even within a single site, the content may have been created using multiple authoring tools. For example, a web site that was created using Macromedia Dreamweaver might also include a page created using Microsoft Access' data access page feature, as well as a dynamic Flash movie that allows for language selection. In order for all of these files to serve the correct text, they need to be properly encoded.
The purpose of this article is to identify where some of the key functionality for encoding exists within some of the more popular web authoring applications.
Hints on sending out character encoding information using the HTTP charset parameter. Includes pointers on how to set up your server or send the appropriate header through scripting.
This tutorial gathers together and organizes pointers to articles that, taken together, help you understand how to handle the essential aspects of authoring (X)HTML and CSS related to characters and character encodings.
The tutorial will provide you with an understanding of key requirements for implementing writing systems in information technology. It will do this by examining real examples of a wide range of modern scripts to discover features that a computerized implementation must support.
The Unicode Standard is a character coding system designed to support the worldwide interchange, processing, and display of the written texts of the diverse languages and technical disciplines of the modern world. In addition, it supports classical and historical texts of many written languages.
MARC 21 specifications for record structure, character sets, and exchange media describes the structure of MARC 21 records, the character repertoires and encodings used, and the formatting of records for exchange on via electronic file transfer, diskettes, and magnetic tape. This document provides technical information on these topics and is intended for the use of personnel involved in the design and maintenance of systems for the exchange and processing of MARC 21 records.