Get the sources for Wolfram's stop words and common words?

Posted 10 years ago

Hello everyone,

I'm doing some text analysis and making use of Mathematica's DeleteStopwords and WordList["CommonWords"].

Where can I find Wolfram's source for these?

Regards,

Gregory

POSTED BY: Gregory Lypny
3 Replies

Folks, please also take a look at

list = {"cat", "cow"};
StringDelete["cat dog cow horse", list]

which avoids splitting the string into words.
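One caveat worth keeping in mind: a plain list of strings matches substrings as well as whole words. A minimal sketch of one way to restrict the deletion to whole words, assuming WordBoundary is acceptable for the text at hand (the extra word "concatenate" and the name wholeWords are only for illustration):

```wolfram
list = {"cat", "cow"};

(* Plain strings match anywhere, so the "cat" inside "concatenate" is deleted too *)
StringDelete["cat dog cow horse concatenate", list]

(* Illustrative pattern: only match list entries that stand alone as words *)
wholeWords = WordBoundary ~~ (Alternatives @@ list) ~~ WordBoundary;
StringDelete["cat dog cow horse concatenate", wholeWords]
```

Either way the surrounding spaces are left in place, so some whitespace cleanup may still be needed afterwards.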

POSTED BY: Vitaliy Kaurov
Posted 10 years ago

Hi Marco,

Much obliged. I'll definitely have to create a custom stop-word list. Mathematica's is too broad and includes words that can carry positive or negative sentiment (e.g., "against"), which matters for text analysis.
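A rough sketch of what such a custom list might look like. It assumes that WordList["Stopwords"] is a reasonable starting point; whether DeleteStopwords uses exactly that list internally is not something confirmed in this thread. The names keep, customStops, and deleteCustomStops are made up for illustration:

```wolfram
(* Assumption: start from the built-in stop-word list *)
builtinStops = ToLowerCase[WordList["Stopwords"]];

(* Words to retain because they may carry sentiment *)
keep = {"against"};

(* Custom list = built-in list minus the words we want to keep *)
customStops = Complement[builtinStops, keep];

(* Split text into words and drop anything on the custom list *)
deleteCustomStops[text_String] :=
  Select[TextWords[text], ! MemberQ[customStops, ToLowerCase[#]] &];

deleteCustomStops["The committee voted against the proposal"]
```

Unlike DeleteStopwords, this returns a list of words rather than a string, which is usually what the next step of a sentiment analysis wants anyway.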

Regards,

Gregory

POSTED BY: Gregory Lypny
POSTED BY: Marco Thiel