Message Boards Message Boards

Re-exploring the structure of Chinese character images

Posted 2 years ago

POSTED BY: Anton Antonov
12 Replies

In general, the two steps:

  1. Radical finding

  2. Dimension reduction with found radicals

can be iterated multiple times and the obtained radical basis can be used to represent any Chinese character. And, yes, I think that basis is going to be similar to the Cangjie input method.

I am not sure how fruitful or useful that representation would be. It depends on the purposes for its use. For example, in general the Machine Learning (ML) algorithms that provide good explanations are not that good for, say, ML classification or approximation.

POSTED BY: Anton Antonov

Just curious how close your reduced set of auto radicals compared to those of the Cang Jie Input radicals?

POSTED BY: Jack I Houng

Thank you for your feedback, Silvia!

[...] I'm always a big fan of traditional statistical analysis!

That comment provided additional motivation to "wrap up" a first version of the post "Handwritten Arabic characters classifiers comparison".

In that post I show how using dimension reduction and nearest neighbors we can get better classification results than using "standard" convolutional networks. Of course, we also get additional insight.

One central interest on characters and fonts (not just CJK) is the topological relationship between strokes. It's somehow related but beyond properties in image space.

  1. In the linked post I had to crop-&-resize the images in order to get better results. That is, in some sense, related to having hard time to "discover" the topology in Arabic characters.

  2. Another approach is to actually make some cheap transformations that expose the topology. That is, of course, not that trivial.

  3. People writing Chinese characters -- after years of schooling -- follow certain order of the strokes. I think that is much more fruitful type of data utilization. (And as far as I know it is used in the Chinese character completions in different OS'es.)

POSTED BY: Anton Antonov

Nice post, Anton! In terms of interpretable model, I'm always a big fan of traditional statistical analysis!

One central interest on characters and fonts (not just CJK) is the topological relationship between strokes. It's somehow related but beyond properties in image space. There are neural net approach to this topic, like FontRNN, etc. I guess they captured those topology nicely, so I wonder how can we "extract" the information, and what can we say from applying "traditional" statistical analysis on them. If we can say something meaningful from that, then we get insight and they become interpretable deep learning models.

POSTED BY: Silvia Hao

Thanks a lot for pinging me here!

POSTED BY: Silvia Hao

Other versions:

  1. WordPress blog post (HTML)

    • Made with the package M2MD.
  2. GitHub post (Markdown)

POSTED BY: Anton Antonov

This post elaborates on how to make the Rorschach animations: "Rorschach mask animations".

POSTED BY: Anton Antonov

Nice followup! I hope @Silvia Hao will see this. And I am also looking forward to the animated Rorschach post. :-)

POSTED BY: Vitaliy Kaurov

Can't wait to see it :)

POSTED BY: Ahmed Elbanna

Thank you for the recognition, Moderation Team!

POSTED BY: Anton Antonov

Just a teaser here -- I plan to use the code (and explanations) in this post as a basis for submitting animated Rorschach test images for the "Computational Art Contest 2022".

POSTED BY: Anton Antonov

enter image description here -- you have earned Featured Contributor Badge enter image description here Your exceptional post has been selected for our editorial column Staff Picks http://wolfr.am/StaffPicks and Your Profile is now distinguished by a Featured Contributor Badge and is displayed on the Featured Contributor Board. Thank you!

POSTED BY: EDITORIAL BOARD
Reply to this discussion
Community posts can be styled and formatted using the Markdown syntax.
Reply Preview
Attachments
Remove
or Discard

Group Abstract Group Abstract