CJK Decomposition Data
The CJK Decomposition Data File is a graphical analysis of the approx 75,000 Chinese/Japanese characters in Unicode. It has now moved to
its own project page
, with an updated version of the file, with all extension C character having the correct codepoint, and the extension D characters added.
Older versions of the data are still available