# Get an updated value for WordFrequencyData?

Posted 1 year ago
3418 Views
|
3 Replies
|
2 Total Likes
|
 Hello community,I have two questions regarding WordFrequenceData[]:I noticed that the maximum date for this feature is 2008 (from 12 years ago), even in the new version 12.1. I understand that the data comes from the "Google Books English n-gram public dataset".I'm still trying to understand how this command (WordFrequenceData) works, so I may be missing something. Example: WordFrequencyData["computer", "TimeSeries", {1900, Now}] DateListPlot[%]  Now Today DateValue["Year"] WordFrequencyData["computer", "TimeSeries", {1900, Today}] WordFrequencyData["computer", "TimeSeries", {1900, DateValue["Year"]}]  My questions are: 1) Are there any estimates when this data will be updated?2) Is there any workaround for this? Maybe with WebSearch[] in any way?Thank you very much.
3 Replies
Sort By:
Posted 1 year ago
 I cannot help you further except noting that The original dataset is not newer than 2008 (the paper is from 2010) Mathematica's function WordFrequencyData considers only this data set. I suggest you read the documentation on WordFrequency WordData and related functions... bestyehuda
 There is additional functionality that allows you to analyze the raw text data As it is divided to 100 fragments you may use the following to download it as a first step and continue from there  Table[URLDownloadSubmit[ "http://commondatastorage.googleapis.com/books/syntactic-ngrams/eng/\ nodes." <> IntegerString[n, 10, 2] <> "-of-99.gz", "~/Downloads/" <> IntegerString[n, 10, 2] <> ".gz", HandlerFunctions -> <|"TaskFinished" -> Print|>], {n, 0, 98}] best