Processing Chinese Text with PHP

Chinese Characters and Fonts

Previous   Contents   Next  
References

Punctuation

Chinese punctuation is different to English and this is a common point for errors in parsing text files. Chinese text uses a 。for a period, a ,for a comma, 《》 or a number of different styles for quotation marks, and slightly different characters for colons and semi-colons. These represent a style to better visually balance the punctuation marks within Chinese text. They will likely need special attention when parsing text. Input editors will use these characters rather than the ASCII variants even when working with plain text files.

A · character is often used with foreign names to separate the given name from the surname. For example 乔森·吉拉尔 (Jason Kilar).


Previous   Contents   Next  
References

About 关于本网站 © chinesenotes.com 2007-2010. Please send comments to alex@chinesenotes.com.