CJKV Information Processing
Please login to add notes
Chapter 1: CJKV Information Processing Overview
- Writing Systems and Scripts
- Character Set Standards
- + Encoding Methods
- Input Methods
Basic Concepts and Terminology FAQ
- What Are All These Abbreviations and Acronyms?
- What Are Internationalization, Globalization, and Localization?
- What Are the Multilingual and Locale Models?
- What Is a Locale?
- What Is Unicode?
- How Are Unicode and ISO 10646 Related?
- What Are Row-Cell and Plane-Row-Cell?
- What Is a Unicode Scalar Value?
- Characters Versus Glyphs: What Is the Difference?
- What Is the Difference Between Typeface and Font?
- What Are Half- and Full-Width Characters?
- Latin Versus Roman Characters
- What Is a Diacritic Mark?
- What Is Notation?
- What Is an Octet?
- What Are Little- and Big-Endian?
- What Are Multiple-Byte and Wide Characters?
- Advice to Readers
Chapter 2: Writing Systems and Scripts
Chapter 3: Character Set Standards
- + NCS Standards
- National Coded Character Set Standards Overview
- ASCII Variations
- Chinese Character Set Standards—China
- Chinese Character Set Standards—Taiwan
- Chinese Character Set Standards—Hong Kong
- Chinese Character Set Standards—Singapore
- Japanese Character Set Standards
- Korean Character Set Standards
- Vietnamese Character Set Standards
- + International Character Set Standards
Character Set Standard Oddities
- Duplicate Characters
- Phantom Ideographs
- Incomplete Ideograph Pairs
- Simplified Ideographs Without a Traditional Form
- Fictitious Character Set Extensions
- Seemingly Missing Characters
- CJK Unified Ideographs with No Source
- Vertical Variants
- + Noncoded Versus Coded Character Sets
- + Information Interchange and Professional Publishing
- + Future Trends and Predictions
- + Advice to Developers
Chapter 4: Encoding Methods
Chapter 5: Input Methods
Chapter 6: Font Formats, Glyph Sets, and Font Tools
- Typeface Design
- + How Many Glyphs Can a Font Include?
- + Bitmapped Font Formats
- + Outline Font Formats
- + Glyph Sets
- + Ruby Glyphs
Host-Installed, Printer-Resident, and Embedded Fonts
- Installing and Downloading Fonts
- The PostScript Filesystem
- Mac OS X
- Mac OS 9 and Earlier
- Microsoft Windows—2000, XP, and Vista
- Microsoft Windows—Versions 3.1, 95, 98, ME, and NT4
- Unix and Linux
- X Window System
- Font and Glyph Embedding
- Cross-Platform Issues
- + Font Development Tools
- The Gaiji Problem
- SING—Smart INdependent Glyphlets
- Ideographic Variation Sequences
- XKP, A Gaiji Handling Initiative—Obsolete
- Adobe Type Composer (ATC)—Obsolete
- Composite Font Functionality Within Applications
- Gaiji Handling Techniques and Tricks
- Creating Your Own Rearranged Fonts
- Acquiring Gaiji Glyphs and Gaiji Fonts
- Advice to Developers
Chapter 7: Typography
- + Rules, Principles, and Techniques
- + Typographic Units and Measurements
- + Horizontal and Vertical Layout
- Line Breaking and Word Wrapping
- Character Spanning
- + Alternate Metrics
- + Line-Length Issues
- + Multilingual Typography
- + Glyph Substitution
- + Annotations
- + Typographic Applications
- Advice to Developers
Chapter 8: Output Methods
Chapter 9: Information Processing Techniques
- Language, Country, and Script Codes
- CLDR—Common Locale Data Repository
- + Programming Languages
- + Code Conversion Algorithms
- + Java Programming Examples
- + Miscellaneous Algorithms
- + Byte Versus Character Handling
- Character Sorting
- + Natural Language Processing
- Regular Expressions
- Search Engines
- + Code-Processing Tools
Chapter 10: OSes, Text Editors, and Word Processors
- + Viewing CJKV Text Using Non-CJKV OSes
- + Operating Systems
- Boot Camp—Run Windows on Apple Hardware
- CrossOver Mac—Run Windows Applications on Mac OS X
- GNOME—Linux and Unix
- KDE—Linux and Unix
- VMware Fusion—Run Windows on Mac OS X
- Wine—Run Windows on Unix, Linux, and Other OSes
- X Window System—Unix
- + Text Editors
- + Word Processors
- + Online Word Processors
- Advice to Developers
Chapter 11: Dictionaries and Dictionary Software
Chapter 12: Web and Print Publishing
- Line-Termination Concerns
- + Network Domains
- Content Versus Presentation
- + Web Publishing
- + HTML—HyperText Markup Language
- + XML—Extensible Markup Language
- CGI Programming Examples
- + Print Publishing
- Where to Go Next?
- Appendix A: Code Conversion Tables
- Appendix B: Notation Conversion Table
- Appendix C: Perl Code Examples
- Appendix D: Glossary
- Appendix E: Vendor Character Set Standards
- Appendix F: Vendor Encoding Methods
- Appendix G: Chinese Character Sets—China
- Appendix H: Chinese Character Sets—Taiwan
- Appendix I: Chinese Character Sets—Hong Kong
- Appendix J: Japanese Character Sets
- Appendix K: Korean Character Sets
- Appendix L: Vietnamese Character Sets
- Appendix M: Miscellaneous Character Sets
First published a decade ago, CJKV Information Processing quickly became the unsurpassed source of information on processing text in Chinese, Japanese, Korean, and Vietnamese. It has now been thoroughly updated to provide web and application developers with the latest techniques and tools for disseminating information directly to audiences in East Asia. This second edition reflects the considerable impact that Unicode, XML, OpenType, and newer operating systems such as Windows XP, Vista, Mac OS X, and Linux have had on East Asian text processing in recent years.
Written by its original author, Ken Lunde, a Senior Computer Scientist in CJKV Type Development at Adobe Systems, this book will help you:
Learn about CJKV writing systems and scripts, and their transliteration methods
Explore trends and developments in character sets and encodings, particularly Unicode
Examine the world of typography, specifically how CJKV text is laid out on a page
Learn information-processing techniques, such as code conversion algorithms and how to apply them using different programming languages
Process CJKV text using different platforms, text editors, and word processors
Become more informed about CJKV dictionaries, dictionary software, and machine translation software and services
Manage CJKV content and presentation when publishing in print or for the Web
Internationalizing and localizing applications is paramount in today's global market -- especially for audiences in East Asia, the fastest-growing segment of the computing world. CJKV Information Processing will help you understand how to develop web and other applications effectively in a field that many find difficult to master.
Test the closed alpha on paperc.com