GROUPS: Staff Picks | Data Science | Image Processing | Algebra | Graphics and Visualization | High-Performance Computing | Wolfram Language | Machine Learning | Natural Language Processing | Computational Humanities | Wolfram Function Repository
Re-exploring the structure of Chinese character images
by Anton Antonov, Accendo Data LLC
MathematicaForPrediction at WordPress | MathematicaForPrediction at GitHub
April 2022 | Version 0.8
Posted 4 months ago | 2760 Views | 12 Replies | 25 Total Likes
Introduction
In this notebook we show information retrieval and clustering techniques over images of the Unicode collection of Chinese characters. Here is the outline of the notebook's exposition:
1. Get Chinese character images.
2. Cluster "image vectors" and demonstrate that the obtained clusters have certain explainability elements.
3. Apply the Latent Semantic Analysis (LSA) workflow to the character set.
4. Show a visual thesaurus through a recommender system. (That uses Cosine similarity.)
5. Discuss graph and hierarchical clustering using LSA matrix factors.
6. Demonstrate approximation of "unseen" character images with an image basis obtained through LSA over a small set of (simple) images.
7. Redo character approximation with a more "interpretable" image basis.
Remark: This notebook started as an (extended) comment for the Community discussion "Exploring structure of Chinese characters through image processing", [SH1]. (Hence the title.)
Get Chinese character images
This code is a copy of the code in the original Community post by Silvia Hao, [SH1]:
In[]:= ClearAll[pipe, branch, branchSeq]
pipe = RightComposition;
branch = Through@*{##} &;
branchSeq = pipe[branch[##], Apply[Sequence]] &;

ClearAll[λ, ppImg]
(* assume I use my first monitor: *)
λ = 1/First[Lookup["Scale"][SystemInformation["Devices", "ConnectedDisplays"]]];
ppImg[magF_ : 1] := Function[
   Module[{nbMgfy = AbsoluteCurrentValue[EvaluationNotebook[], Magnification],
     $width = Part[ImageDimensions@#, 1]},
    Image[#, ImageSize -> magF λ $width/nbMgfy]]]

In[]:= Module[{fsize = 50, width = 64, height = 64},
  lsCharIDs = Map[FromCharacterCode[#, "Unicode"] &, 16^^4E00 - 1 + Range[width height]];
  ]

In[]:= charPage = Module[{fsize = 50, width = 64, height = 64},
    16^^4E00 - 1 + Range[width height] // pipe[
       FromCharacterCode[#, "Unicode"] &,
       Characters,
       Partition[#, width] &,
       Grid[#, Background -> Black, Spacings -> {0, 0}, ItemSize -> {1.5, 1.2},
         Alignment -> {Center, Center}, Frame -> All,
         FrameStyle -> Directive[Red, AbsoluteThickness[3 λ]]] &,
       Style[#, White, fsize, FontFamily -> "Source Han Sans CN", FontWeight -> "ExtraLight"] &,
       Rasterize[#, Background -> Black] &]
    ];
chargrid = charPage // ColorDistance[#, Red] & // Image[#, "Byte"] & // Sign // Erosion[#, 5] &;
lmat = chargrid // MorphologicalComponents[#, Method -> "BoundingBox", CornerNeighbors -> False] &;
chars = ComponentMeasurements[
     {charPage // ColorConvert[#, "Grayscale"] &, lmat},
     "MaskedImage", #Width > 10 &] // Values // Map@RemoveAlphaChannel;
chars = Module[{size = chars // Map@ImageDimensions // Max},
   ImageCrop[#, {size, size}] & /@ chars];
Here is a sample of the obtained images:
SeedRandom[33];
RandomSample[chars, 5]

Out[]= (a row of five randomly sampled character images)
Vector representation of images
Define a function that maps an image into a linear vector space (of pixels):
In[]:= Clear[ImageToVector];
ImageToVector[img_Image] := Flatten[ImageData[ColorConvert[img, "Grayscale"]]];
ImageToVector[img_Image, imgSize_] := Flatten[ImageData[ColorConvert[ImageResize[img, imgSize], "Grayscale"]]];
ImageToVector[___] := $Failed;
Show what the vector representations of the images look like:
Table[BlockRandom[
  img = RandomChoice[chars];
  ListPlot[ImageToVector[img], Filling -> Axis, PlotRange -> All,
   PlotLabel -> img, ImageSize -> Medium, AspectRatio -> 1/6],
  RandomSeeding -> rs], {rs, {33, 998}}]

Out[]= (two pixel-value list plots, each labeled with its character image)
Data preparation
In this section we represent the images in a linear vector space. (In which each pixel is a basis vector.)
Make an association with images:
aCImages = AssociationThread[lsCharIDs -> chars];
Length[aCImages]

Out[]= 4096
Make flat vectors from the images:
AbsoluteTiming[
 aCImageVecs = ParallelMap[ImageToVector, aCImages];
 ]

Out[]= {0.998162, Null}
Make matrix plots of a random sample of the image vectors:
SeedRandom[32];
MatrixPlot[Partition[#, ImageDimensions[aCImages〚1〛]〚2〛]] & /@ RandomSample[aCImageVecs, 6]

Out[]= (an association of matrix plots with keys 垖, 埄, 媭, 垝, 偭, 効)
Clustering over the image vectors
In this section we cluster "image vectors" and demonstrate that the obtained clusters have certain explainability elements. Expected Chinese character radicals are observed using image multiplication.
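Remark: To see why image multiplication reveals radicals: the characters are rendered white-on-black, so multiplying images keeps only the pixels that are bright in all of them, i.e. the strokes the images share. Here is a minimal sketch (the two characters are an illustrative choice of mine, both containing the "mouth" radical):

(* Multiply two characters that share the "mouth" radical; only the shared
   strokes survive, since non-common strokes multiply with black (0) pixels. *)
ImageMultiply[aCImages["叫"], aCImages["吃"]]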
Cluster the image vectors and show a summary of the cluster lengths:
SparseArray[Values@aCImageVecs]

Out[]= SparseArray (Specified elements: 4658698; Dimensions: {4096, 2500})
e
d
R
a
n
d
o
m
[
3
3
4
]
;
A
b
s
o
l
u
t
e
T
i
m
i
n
g
[
l
s
C
l
u
s
t
e
r
s
=
F
i
n
d
C
l
u
s
t
e
r
s
[
S
p
a
r
s
e
A
r
r
a
y
[
V
a
l
u
e
s
@
a
C
I
m
a
g
e
V
e
c
s
]
-
>
K
e
y
s
[
a
C
I
m
a
g
e
V
e
c
s
]
,
3
5
,
M
e
t
h
o
d
-
>
{
"
K
M
e
a
n
s
"
}
]
;
]
L
e
n
g
t
h
@
l
s
C
l
u
s
t
e
r
s
R
e
s
o
u
r
c
e
F
u
n
c
t
i
o
n
[
"
R
e
c
o
r
d
s
S
u
m
m
a
r
y
"
]
[
L
e
n
g
t
h
/
@
l
s
C
l
u
s
t
e
r
s
]
O
u
t
[
]
=
{
2
4
.
6
3
8
3
,
N
u
l
l
}
O
u
t
[
]
=
3
5
O
u
t
[
]
=
1
c
o
l
u
m
n
1
M
i
n
1
7
1
s
t
Q
u
7
9
.
7
5
M
e
a
n
1
1
7
.
0
2
9
M
e
d
i
a
n
1
1
8
3
r
d
Q
u
1
4
6
.
7
5
M
a
x
2
2
5
For each cluster:
◼ Take 30 different small samples of 7 images
◼ Multiply the images in each small sample
◼ Show the three "most black" multiplication results
SeedRandom[33];
Table[i -> TakeLargestBy[
    Table[ImageMultiply @@ RandomSample[KeyTake[aCImages, lsClusters〚i〛], UpTo[7]], 30],
    Total@ImageToVector[#] &, 3], {i, Length[lsClusters]}]

Out[]= (for each of the 35 clusters, the three darkest image products)
Remark: We can see that the clustering above produced "semantic" clusters -- most of the multiplied images show meaningful Chinese character radicals and their "expected positions."
Here is one of the clusters with the radical "mouth":
KeyTake[aCImages, lsClusters〚26〛]

Out[]= (images of the characters: 卟, 収, 叨, 叫, 叮, 叱, 叶, 叹, 叺, 叻, 叼, 叿, 吀, 吁, 吃, 吅, 吆, 吇, 吋, 吐, 吒, 吓, 吖, 吗, 吘, 吜, 吟, 吥, 吧, 听, 吭, 吽, 呀, 呌, 呍, 呓, 呕, 呛, 呜, 呞, 呟, 呧, 呪, 呫, 呮, 呯, 呵, 呷, 呸, 呺, 呾, 咀, 咊, 咋, 咍, 咓, 咔, 咟, 咡, 咥, 咶, 咺, 哏, 哙, 哣, 哩, 哹, 哻, 唂, 唈, 唔, 唱, 啀, 喣, 屔)
LSAMon application
In this section we apply the "standard" LSA workflow, [AA1, AA4].
Make a matrix with named rows and columns from the image vectors:
mat = ToSSparseMatrix[SparseArray[Values@aCImageVecs], "RowNames" -> Keys[aCImageVecs], "ColumnNames" -> Automatic]

Out[]= SparseArray (Specified elements: 4658698; Dimensions: {4096, 2500})
The following Latent Semantic Analysis (LSA) monadic pipeline is used in [AA1, AA2]:
SeedRandom[77];
AbsoluteTiming[
 lsaAllObj = LSAMonUnit[]⟹
    LSAMonSetDocumentTermMatrix[mat]⟹
    LSAMonApplyTermWeightFunctions["None", "None", "Cosine"]⟹
    LSAMonExtractTopics["NumberOfTopics" -> 60, Method -> "SVD", "MaxSteps" -> 15, "MinNumberOfDocumentsPerTerm" -> 0]⟹
    LSAMonNormalizeMatrixProduct[Normalized -> Right]⟹
    LSAMonEcho[Style["Obtained basis:", Bold, Purple]]⟹
    LSAMonEchoFunctionContext[ImageAdjust[Image[Partition[#, ImageDimensions[aCImages〚1〛]〚1〛]]] & /@ SparseArray[#H] &];
 ]

» Obtained basis:
» (a grid of the 60 topic-basis images)

Out[]= {7.60828, Null}
Remark: LSAMon's corresponding theory and design are discussed in [AA1, AA4].
Get the representation matrix:
W2 = lsaAllObj⟹LSAMonNormalizeMatrixProduct[Normalized -> Right]⟹LSAMonTakeW

Out[]= SparseArray (Specified elements: 245760; Dimensions: {4096, 60})
Get the topics matrix:
H = lsaAllObj⟹LSAMonNormalizeMatrixProduct[Normalized -> Right]⟹LSAMonTakeH

Out[]= SparseArray (Specified elements: 138002; Dimensions: {60, 2500})
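As a quick sanity check (a sketch of mine, not part of the original workflow): the product of the two factors should have the dimensions of the weighted character-by-pixel matrix, 4096 × 2500. (The reconstruction itself is only approximate, since just 60 topics are kept.)

(* W2: characters × topics (4096 × 60); H: topics × pixels (60 × 2500). *)
Dimensions[SparseArray[W2] . SparseArray[H]]

(* Expected: {4096, 2500} *)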
Cluster the reduced dimension representations and show a summary of the cluster lengths:
AbsoluteTiming[
 lsClusters = FindClusters[Normal[SparseArray[W2]] -> RowNames[W2], 40, Method -> {"KMeans"}];
 ]
Length@lsClusters
ResourceFunction["RecordsSummary"][Length /@ lsClusters]

Out[]= {2.33331, Null}

Out[]= 40

Out[]= (cluster-length summary: Min 26, 1st Qu 59, Median 96.5, Mean 102.4, 3rd Qu 142, Max 200)
Show cluster interpretations:
AbsoluteTiming[
 aAutoRadicals = Association@Table[i -> TakeLargestBy[
       Table[ImageMultiply @@ RandomSample[KeyTake[aCImages, lsClusters〚i〛], UpTo[8]], 30],
       Total@ImageToVector[#] &, 3], {i, Length[lsClusters]}];
 ]
aAutoRadicals

Out[]= {0.878406, Null}

Out[]= (for each of the 40 clusters, the three darkest image products)
Using FeatureExtraction
I experimented with clustering and approximation using WL's function FeatureExtraction. The results are fairly similar to the ones above; the timings are different (a few times slower).
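For reference, here is a minimal sketch of that experiment; the number of clusters (40) mirrors the K-means run above and is an assumption, not the exact setting used:

(* Learn a feature extractor from the character images and cluster
   the extracted feature vectors, analogous to the pixel-vector clustering above. *)
fe = FeatureExtraction[Values[aCImages]];
lsFEClusters = FindClusters[fe[Values[aCImages]] -> Keys[aCImages], 40, Method -> {"KMeans"}];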
Visual thesaurus
In this section we use Cosine similarity to find visual nearest neighbors of Chinese character images.
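As a reminder (a minimal sketch), the Cosine similarity of two image vectors is their dot product divided by the product of their norms; the two characters below are an illustrative choice:

(* Cosine similarity of two pixel vectors; 1 means identical direction. *)
cosineSimilarity[u_, v_] := u . v/(Norm[u] Norm[v]);
cosineSimilarity[aCImageVecs["團"], aCImageVecs["圓"]]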
In[]:= matPixels = WeightTermsOfSSparseMatrix[lsaAllObj⟹LSAMonTakeWeightedDocumentTermMatrix, "IDF", "None", "Cosine"];
matTopics = WeightTermsOfSSparseMatrix[lsaAllObj⟹LSAMonNormalizeMatrixProduct[Normalized -> Left]⟹LSAMonTakeW, "None", "None", "Cosine"];

In[]:= smrObj = SMRMonUnit[]⟹SMRMonCreate[<|"Topic" -> matTopics, "Pixel" -> matPixels|>];
Consider the character "團":
aCImages["團"]

Out[]= (the image of the character 團)
Here are the nearest neighbors for that character found by using both image topics and image pixels:
(*focusItem = RandomChoice[Keys@aCImages];*)
focusItem = {"團", "仼", "呔"}〚1〛;
smrObj⟹
  SMRMonEcho[Style["Nearest neighbors by pixel topics:", Bold, Purple]]⟹
  SMRMonSetTagTypeWeights[<|"Topic" -> 1, "Pixel" -> 0|>]⟹
  SMRMonRecommend[focusItem, 8, "RemoveHistory" -> False]⟹
  SMRMonEchoValue⟹
  SMRMonEchoFunctionValue[AssociationThread[Values@KeyTake[aCImages, Keys[#]], Values[#]] &]⟹
  SMRMonEcho[Style["Nearest neighbors by pixels:", Bold, Purple]]⟹
  SMRMonSetTagTypeWeights[<|"Topic" -> 0, "Pixel" -> 1|>]⟹
  SMRMonRecommend[focusItem, 8, "RemoveHistory" -> False]⟹
  SMRMonEchoFunctionValue[AssociationThread[Values@KeyTake[aCImages, Keys[#]], Values[#]] &];

» Nearest neighbors by pixel topics:
» value: <|團 -> 1., 圑 -> 0.831716, 圕 -> 0.826445, 圊 -> 0.75788, 圈 -> 0.73557, 圏 -> 0.718088, 圚 -> 0.691223, 圓 -> 0.670589|>
» (the corresponding character images with the same scores)
» Nearest neighbors by pixels:
» (character images with scores 1., 0.914661, 0.904474, 0.890879, 0.889363, 0.881746, 0.881071, 0.879139)
Remark: Of course, in the recommender pipeline above we can use both pixels and pixel topics. (With their contributions being weighted.)
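Here is a minimal sketch of such a blended recommendation; the equal 0.5 weights are an illustrative choice, not a tuned setting:

smrObj⟹
  SMRMonSetTagTypeWeights[<|"Topic" -> 0.5, "Pixel" -> 0.5|>]⟹
  SMRMonRecommend[focusItem, 8, "RemoveHistory" -> False]⟹
  SMRMonEchoValue;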
Graph clustering
In this section we demonstrate the use of graph communities to find similar groups of Chinese characters.
Here we take a sub-matrix of the reduced dimension matrix computed above:
In[]:= W = lsaAllObj⟹LSAMonNormalizeMatrixProduct[Normalized -> Right]⟹LSAMonTakeW;
Here we find the similarity matrix between the characters and remove entries corresponding to "small" similarities:
In[]:= matSym = Clip[W . Transpose[W], {0.78, 1}, {0, 1}];
Here we plot the obtained (clipped) similarity matrix:
MatrixPlot[matSym]

Out[]= (matrix plot of the clipped similarity matrix)
Here we:
◼ Take the array rules of the sparse similarity matrix
◼ Drop the rules corresponding to the diagonal elements
◼ Convert the keys of the rules into undirected graph edges
◼ Make the corresponding graph
◼ Find the graph's connected components
◼ Show the number of connected components
◼ Show a tally of the number of nodes in the components
gr = Graph[UndirectedEdge @@@ DeleteCases[Union[Sort /@ Keys[SSparseMatrixAssociation[matSym]]], {x_, x_}]];
lsComps = ConnectedComponents[gr];
Length[lsComps]
ReverseSortBy[Tally[Length /@ lsComps], First]

Out[]= 138

Out[]= {{1839, 1}, {31, 1}, {27, 1}, {16, 1}, {11, 2}, {9, 2}, {8, 1}, {7, 1}, {6, 5}, {5, 3}, {4, 8}, {3, 14}, {2, 98}}
Here we demonstrate that the clusters of Chinese characters make sense:
aPrettyRules = Dispatch[Map[# -> Style[#, FontSize -> 36] &, Keys[aCImages]]];
CommunityGraphPlot[Subgraph[gr, TakeLargestBy[lsComps, Length, 10]〚2〛],
  Method -> "SpringElectrical", VertexLabels -> Placed["Name", Above],
  AspectRatio -> 1, ImageSize -> 1000] /. aPrettyRules

Out[]= (community graph plot of the second-largest connected component)
Remark: By careful observation of the clusters and graph connections we can convince ourselves that the similarities are based on pictorial sub-elements (i.e. radicals) of the characters.
Hierarchical clustering
In this section we apply hierarchical clustering to the reduced dimension representation of the Chinese character images.
Here we pick a cluster:
lsFocusIDs = lsClusters〚12〛;
Magnify[ImageCollage[Values[KeyTake[aCImages, lsFocusIDs]]], 0.4]
Here is how we can make a dendrogram plot (not that useful here):
In[]:= (*smat = W2〚lsClusters〚13〛, All〛;
Dendrogram[Thread[Normal[SparseArray[smat]] -> Map[Style[#, FontSize -> 16] &, RowNames[smat]]], Top, DistanceFunction -> EuclideanDistance]*)
Here is a heat-map plot with hierarchical clustering dendrogram (with tool-tips):
gr = HeatmapPlot[W2〚lsFocusIDs, All〛, DistanceFunction -> {CosineDistance, None}, Dendrogram -> {True, False}];
gr /. Map[# -> Tooltip[Style[#, FontSize -> 16], Style[#, Bold, FontSize -> 36]] &, lsFocusIDs]

Out[]= (heat-map plot with a row dendrogram; the row labels have tooltips)
Remark: The plot above has tooltips with larger character images.
Representing all characters with smaller set of basic ones
In this section we demonstrate that a relatively small set of simpler Chinese character images can be used to represent (or approximate) the rest of the images.
Remark: We use the following heuristic: the simpler Chinese characters have the smallest amount of white pixels.
Obtain a training set of images -- that are the darkest -- and show a sample of that set:
{trainingInds, testingInds} = TakeDrop[Keys[SortBy[aCImages, Total[ImageToVector[#]] &]], 800];
SeedRandom[3];
RandomSample[KeyTake[aCImages, trainingInds], 12]

Out[]= (images of the characters: 冶, 孒, 亍, 刓, 伡, 呂, 古, 呔, 岳, 冎, 众, 宎)
Show all training characters with an image collage:
Magnify[ImageCollage[Values[KeyTake[aCImages, trainingInds]], Background -> Gray, ImagePadding -> 1], 0.4]
Apply LSA monadic pipeline with the training characters only:
SeedRandom[77];
AbsoluteTiming[
 lsaPartialObj = LSAMonUnit[]⟹
    LSAMonSetDocumentTermMatrix[SparseArray[Values@KeyTake[aCImageVecs, trainingInds]]]⟹
    LSAMonApplyTermWeightFunctions["None", "None", "Cosine"]⟹
    LSAMonExtractTopics["NumberOfTopics" -> 80, Method -> "SVD", "MaxSteps" -> 120, "MinNumberOfDocumentsPerTerm" -> 0]