Message Boards Message Boards

GROUPS:

Finding character name frequency in books using Mathematica?

Posted 3 months ago
870 Views
|
3 Replies
|
5 Total Likes
|

Hello! How might I find out how often a character's name appears in a book compared to another character? For example, if I wanted to see how often the name "Daisy" appears in the Great Gatsby as compared to the name "Gatsby"? I have tried WordCount but that doesn't work so far. Thanks for any help!

3 Replies

Try WordCounts instead of WordCount. For instance, if myGatsby is the variable holding the text of the book as a string, then you could use

WordCounts[DeleteStopwords[myGatsby]]

which produces an association whose keys are the words and whose values are their frequencies. In the copy I downloaded from Project Gutenberg Gatsby appears 199 times and Daisy 150 times.

Start with StringPosition and work from there. Take a look at the first half of

https://blog.wolfram.com/2017/12/07/tracking-a-descent-to-savagery-with-the-wolfram-language-plotting-sentiment-analysis-in-lord-of-the-flies/

where I did this with Lord of the Flies.

Take a look at this post: 400th anniversary of Shakespeare's death

https://community.wolfram.com/groups/-/m/t/842698

I recommend searching this forum, documentation, Wolfram blogs and other sites. There is a lot of useful information already published.

Reply to this discussion
Community posts can be styled and formatted using the Markdown syntax.
Reply Preview
Attachments
Remove
or Discard

Group Abstract Group Abstract