Brief Political History / Development of Sinhala Characters / Evolution of Sinhala Characters / Three Sinhala (Hodiyas) without the Vowel Expansion / Development of Tamil Characters / Tamil Alphabet with the Vowel Expansion / Position of Sri Lanka Standard Institute / Difficulties Relating to the Current Software / Advantages of having a Sinhala and Tamil Character Matrix / Why we need complete Sinhala and Tamil Characters / News Paper Articles / References / Solution / Cartoons /Comments / Save our Language Sinhala/Computer literacy low / Sinhala Grammer & Reading / Some E mail from Linux Group / Catalogue of Palmleaves | |||||
SINHALA CHARACTER CODES FOR INFORMATION INTERCHANGE OBJECTIONS (2nd Revision )
Summary of a forum held on 2/11/2004 at SLSI Board Room.
Meeting between the SLSI and Sri Lanka Association of Printers was held on the November 2, 2004 at the SLSI Board Room. A forum of twenty members were present including Prof V M Samaranayake, Members of CSSL, Lake House, ICTA & SLSI. The meeting was conducted by Mr Rohan N Wijeratne. The President of Sri Lanka Association of Printers Mr. Keerthi Gunawardane and I pointed out that the figure one in the SLS 1134 does not contain some of the Sinhala characters. Due to the missing characters absolute locations cannot be derived. Therefore OCR ,SMS, Voice to text software cannot be developed. The SLSI committee said that characters not specified in Figure one are shown in codes and rules. quote from the SLS1134 page 9 ‘The codes for zero-width joiner (ZWJ), which is used to form conjunct letters, and zero-width non-joiner (ZWNJ), are not in this code page, but have the values 200D and 200C respectively.” Unquote Refer to http://www.unicode.org/alloc/CurrentAllocation.html This clearly shows that the areas specified in SLS 1134 under 2yyy are named in Unicode as Empty Columns in Assigned Blocks. (Copyrights Unicode coloured in red for better understanding) Examples are given below. Quote from Unicode ‘ 0240..024F 16 [Latin Extended-B] 2090..209F 16 [Superscripts and Subscripts] 20C0..20CF 16 [Currency Symbols] 20F0..20FF 16 [Combining Marks for Symbols] ‘ unquote SLS 1134 is an incomplete Standard because it gives locations in empty columns. For all locations a shape and an absolute address have to be specified. Lake House declared that they have all the characters but this is only for the use of their institution. A copy was handed to the President of the Sri Lanka Association of Printers. The SLSI committee stressed the locations given by the Unicode to Sri Lanka was 128. That Unicode was dictating terms to Sri Lanka about the number of characters. This argument cannot be accepted because the SLSI team was totally unaware of the total number of Sinhala characters. Unicode unconditionally accept the National Standard of a country. Asian Languages have more than thousands of characters which is in the Unicode. Dr Gihan Dias mentioned the possibility of using sub sections to expand the number of characters. I pointed out that the character code has to be redone from the beginning. With the present expansions of Operating System it is possible to have 64,000 characters. This permits to include any variations and opinions of the public. Discussion went on for 2 hours while SLSI took notes. SLSI announced that they would have a private meeting. The end result will be informed later. Donald Gaminitillake Colombo November 5, 2004 |
|||||