Group Abstract

Message Boards

WOLFRAM COMMUNITY

18.3K Views

42 Replies

18 Total Likes

View groups...

Follow this post

Share this post:

GROUPS:

[WSG20] Wolfram Language Basics from EIWL (Days 6, 7, 8, 9)

Abrita Chakravarty

Abrita Chakravarty, Wolfram U, Wolfram Research Inc.

Posted 6 years ago

During week two of the Wolfram Study Group Apr 2020 (WSG20) we are looking at the following topics from Stephen Wolfram's book Elementary Introduction to the Wolfram Language: Day 6: Chapter 16 Real-World Data, Chapter 35: Natural Language Understanding, Chapter 44: Importing and Exporting Day 7: Chapter 5: Operations on Lists, Chapter 13: Arrays, or Lists of Lists, Chapter 30: Rearranging Lists, Chapter 31: Parts of Lists Day 8: Chapter 34: Associations, Chapter 45: Datasets Day 9: Chapter 27: Applying Functions Repeatedly, Chapter 29: More about Pure Functions, Chapter 46: Writing Good Code, Chapter 47: Debugging Your Code We are looking at videos from the Wolfram U interactive course on EIWL and also working on simple exercises and mini projects. Feel free to post questions on the material we covered in these sessions here.

POSTED BY: Abrita Chakravarty

42 Replies

Sort By:

Rohit Namjoshi

Posted 6 years ago

Shorter version WordList[] // Select[Complement[letters, Characters[#]] == {} &] (* {"chutzpa", "chutzpah"} *)

POSTED BY: Rohit Namjoshi

Muhammad Ali

Posted 6 years ago

@Abrita Chakravarty First unlike `WordList`, `DictionaryLookup` supports patterns. Moreover `DictionaryLookup` seems to have more words than the `WordList` (probably a bug). WordList[]//Length(39176) DictionaryLookup[]//Length(92518) Complement[WordList[],DictionaryLookup[]]({}) Here is my solution: Clear[pangram]; pangram[characters_List]:=Module[{$characterSet="["<>StringJoin@characters<>"]"},DictionaryLookup@RegularExpression[StringJoin["(?="<>$characterSet<>#<>")"&/@characters,$characterSet]]]; pangram[{"a","o","l","r","w","f","m"}]({"wolfram"}) pangram[{"z","t","c","a","u","h","p"}]({"chutzpa","chutzpah"}) pangram[{"e","t","l","a","b"}]({"ablate","ballet","battle","beatable","bleat","eatable","table","tablet"}*) Its orders of magnitude faster than doing surgery on `WordList[]` or `DictionaryLookup[]` alone.

@Abrita Chakravarty First unlike WordList, DictionaryLookup supports patterns. Moreover DictionaryLookup seems to have more words than the WordList (probably a bug).

WordList[]//Length(*39176*)
DictionaryLookup[]//Length(*92518*)
Complement[WordList[],DictionaryLookup[]](*{}*)

Here is my solution:

Clear[pangram];
pangram[characters_List]:=Module[{$characterSet="["<>StringJoin@characters<>"]*"},DictionaryLookup@RegularExpression[StringJoin["(?="<>$characterSet<>#<>")"&/@characters,$characterSet]]];
pangram[{"a","o","l","r","w","f","m"}](*{"wolfram"}*)
pangram[{"z","t","c","a","u","h","p"}](*{"chutzpa","chutzpah"}*)
pangram[{"e","t","l","a","b"}](*{"ablate","ballet","battle","beatable","bleat","eatable","table","tablet"}*)

Its orders of magnitude faster than doing surgery on WordList[] or DictionaryLookup[] alone.

POSTED BY: Muhammad Ali

Valeriu Ungureanu

Valeriu Ungureanu, Moldova State University

Posted 6 years ago

Some other codes applying operations on sets: l = {"z", "t", "c", "a", "u", "h", "p"}; Select[ContainsAll[l]@Characters@# &]@WordList[] or Select[SubsetQ[Characters@#, l] &]@WordList[] or Select[Intersection[Characters@#, l] == Sort@l &]@WordList[] or Cases[x_ /; ContainsAll[l]@Characters@x]@WordList[] or Cases[x_ /; SubsetQ[Characters@x, l]]@WordList[] or Cases[x_ /; Intersection[Characters@x, l] == Sort@l]@WordList[] Sure, we can apply also the function If[].

Some other codes applying operations on sets:

l = {"z", "t", "c", "a", "u", "h", "p"};
Select[ContainsAll[l]@Characters@# &]@WordList[]

Select[SubsetQ[Characters@#, l] &]@WordList[]

Select[Intersection[Characters@#, l] == Sort@l &]@WordList[]

Cases[x_ /; ContainsAll[l]@Characters@x]@WordList[]

Cases[x_ /; SubsetQ[Characters@x, l]]@WordList[]

Cases[x_ /; Intersection[Characters@x, l] == Sort@l]@WordList[]

Sure, we can apply also the function If[].

POSTED BY: Valeriu Ungureanu

Rohit Namjoshi

Posted 6 years ago

For at least once letters = {"z", "t", "c", "a", "u", "h", "p"}; WordList[] // Map[{#, Complement[letters, Characters[#]]} &] // Select[Last@# == {} &] // Map[First] (* {"chutzpa", "chutzpah"} *}

POSTED BY: Rohit Namjoshi

Valeriu Ungureanu

Valeriu Ungureanu, Moldova State University

Posted 6 years ago

If every letter is used exactly once, then the following code solves the problem: In[1] := Intersection[StringJoin /@ Permutations[{"z", "t", "c", "a", "u", "h", "p"}], WordList[]] Out[1] = {"chutzpa"} or, thanks the Rohit's idea to use the function Complements: In[2]:= l = {"z", "t", "c", "a", "u", "h", "p"}; If[Complement[l, Characters@#] == {}, #, Nothing] & /@ WordList[] Out[2]= {"chutzpa", "chutzpah"}

If every letter is used exactly once, then the following code solves the problem:

In[1] :=  Intersection[StringJoin /@ Permutations[{"z", "t", "c", "a", "u", "h", "p"}],
          WordList[]]
Out[1] = {"chutzpa"}

or, thanks the Rohit's idea to use the function Complements:

In[2]:= l = {"z", "t", "c", "a", "u", "h", "p"};

If[Complement[l, Characters@#] == {}, #, Nothing] & /@ WordList[]

Out[2]= {"chutzpa", "chutzpah"}

POSTED BY: Valeriu Ungureanu

Abrita Chakravarty

Abrita Chakravarty, Wolfram U, Wolfram Research Inc.

Posted 6 years ago

Another possibility: d = Dataset[Association[ "Padmé Amidala" -> <\|"ID" -> "19435", "Planet" -> "Naboo"\|>, "Bail Organa" -> <\|"ID" -> "11114", "Planet" -> "Alderaan"\|>, "Mon Mothma" -> <\|"ID" -> "10712", "Planet" -> "Chandrila"\|>]] Apply[Association, d[#, "ID"] -> <\|"Name" -> #, "Planet" -> d[#, "Planet"]\|> & /@ Keys[d]]

Another possibility:

d = Dataset[Association[
   "Padmé Amidala" -> <|"ID" -> "19435", "Planet" -> "Naboo"|>, 
   "Bail Organa" -> <|"ID" -> "11114", "Planet" -> "Alderaan"|>, 
   "Mon Mothma" -> <|"ID" -> "10712", "Planet" -> "Chandrila"|>]]

Apply[Association, d[#, "ID"] -> <|"Name" -> #, "Planet" -> d[#, "Planet"]|> & /@ Keys[d]]

POSTED BY: Abrita Chakravarty

Valeriu Ungureanu

Valeriu Ungureanu, Moldova State University

Posted 6 years ago

My non-ideal code, that may be checked: data = Dataset@<\| "Padmé Amidala" -> <\|"ID" -> "19435", "Planet" -> "Naboo"\|>, "Bail Organa" -> <\|"ID" -> "11114", "Planet" -> "Alderaan"\|>, "Mon Mothma" -> <\|"ID" -> "10712", "Planet" -> "Chandrila"\|>\|>; name = Keys@Normal[data]; id = Values@Normal[data[All, 1]]; planet = Values@Normal[data[All, 2]]; Dataset@Association@ Array[id[[#]] -> <\|"Name" -> name[[#]], "Planet" -> planet[[#]]\|> &, Length@data]

My non-ideal code, that may be checked:

data = Dataset@<|
"Padmé Amidala" -> <|"ID" -> "19435", "Planet" -> "Naboo"|>, 
"Bail Organa" -> <|"ID" -> "11114", "Planet" -> "Alderaan"|>, 
"Mon Mothma" -> <|"ID" -> "10712", "Planet" -> "Chandrila"|>|>;
name = Keys@Normal[data];
id = Values@Normal[data[All, 1]];
planet = Values@Normal[data[All, 2]];
Dataset@Association@
Array[id[[#]] -> <|"Name" -> name[[#]], "Planet" -> planet[[#]]|> &,
Length@data]

POSTED BY: Valeriu Ungureanu

Muhammad Ali

Posted 6 years ago

Here is my clunky attempt at it. Probably there is a better, more humane way to do this.

POSTED BY: Muhammad Ali

Muhammad Ali

Posted 6 years ago

POSTED BY: Muhammad Ali

Valeriu Ungureanu

Valeriu Ungureanu, Moldova State University

Posted 6 years ago

My code for today (day 7) challenge: Last /@ Position[Characters@WordList[], "e"] // Histogram

POSTED BY: Valeriu Ungureanu

Muhammad Ali

Posted 6 years ago

@Valeriu Ungureanu In my experience when working with `Entities` it is advisable to look for both `CanonicalName` and `CommonName`. `CanonicalName` is the actual field name in the database so in many databases they are kept as short and cryptic to avoid naming conflicts. As an example, if you create your own SQL database and map it onto the EntityFramework using `RelationalDatabase`, you will see that database field names are mapped to `CanonicalName`. If you restrict your searching to `CanonicalName` only you will miss out on many relevant properties. Regarding the question, it didn't specify what name it was referring to (that's why I did an exhaustive search using the `Or` in my code) but I agree one can become pedantic and take "w" as a literal for the lower case. But I don't think that was the intention of the question.

POSTED BY: Muhammad Ali

Abrita Chakravarty

Abrita Chakravarty, Wolfram U, Wolfram Research Inc.

Posted 6 years ago

Small or capital "w". But good catch - interesting that "WaterArrivals" and "Workforce" have different common names. For this challenge all canonical names beginning with "w" should be accepted.

POSTED BY: Abrita Chakravarty

Muhammad Ali

Posted 6 years ago

@Abrita Chakravarty Cases[ EntityProperties["Country"], x_/;Or@@StringMatchQ[ {CanonicalName[x],CommonName[x]}, "W*", IgnoreCase->True ] ]

POSTED BY: Muhammad Ali

Abrita Chakravarty

Abrita Chakravarty, Wolfram U, Wolfram Research Inc.

Posted 6 years ago

@Moses Paul Thanks for your reply. We are looking for EntityProperties that start with "w", not Country Entities. Try: EntityProperties["Country"] @Muhammad Ali Thanks for your reply. Can you also post the code you used to get to this answer? @Valeriu Ungureanu Thanks for your reply. You want to modify the pattern a tad bit - so only property names starting with "w" are caught. @Rohit Namjoshi Nice one.

POSTED BY: Abrita Chakravarty

Rohit Namjoshi

Posted 6 years ago

There are nine Entity["Country"]["Properties"] // Select[ToLowerCase[StringTake[CanonicalName@#, 1]] == "w" &] (* {EntityProperty["Country", "WageAndSalariedWorkers"], EntityProperty["Country", "WageAndSalariedWorkersFraction"], EntityProperty["Country", "WagesCostIndex"], EntityProperty["Country", "WaterArea"], EntityProperty["Country", "WaterArrivals"], EntityProperty["Country", "WaterProductivity"], EntityProperty["Country", "WaterwayLength"], EntityProperty["Country", "Workforce"], EntityProperty["Country", "WPI"]} *)

There are nine

 Entity["Country"]["Properties"] // 
   Select[ToLowerCase[StringTake[CanonicalName@#, 1]] == "w" &]

(*
{EntityProperty["Country", "WageAndSalariedWorkers"], 
 EntityProperty["Country", "WageAndSalariedWorkersFraction"], 
 EntityProperty["Country", "WagesCostIndex"], 
 EntityProperty["Country", "WaterArea"], 
 EntityProperty["Country", "WaterArrivals"], 
 EntityProperty["Country", "WaterProductivity"], 
 EntityProperty["Country", "WaterwayLength"], 
 EntityProperty["Country", "Workforce"], 
 EntityProperty["Country", "WPI"]}
*)

POSTED BY: Rohit Namjoshi

Abrita Chakravarty

Abrita Chakravarty, Wolfram U, Wolfram Research Inc.

Posted 6 years ago

Yet another way: DeleteMissing[{#, PartOfSpeech[#]} & /@ StringJoin /@ Permutations[{"z", "t", "c", "a", "u", "h", "p"}], 1, Infinity]

POSTED BY: Abrita Chakravarty

Valeriu Ungureanu

Valeriu Ungureanu, Moldova State University

Posted 6 years ago

It looks important to define before the variable l: In[1]:= l = {"z", "t", "c", "a", "u", "h", "p"}; DictionaryLookup[x__ /; SubsetQ[Characters@x, l]] Out[1]= {"chutzpa", "chutzpah"}

POSTED BY: Valeriu Ungureanu

Muhammad Ali

Posted 6 years ago

Try your code on {"a", "o", "l", "r", "w", "f", "m"} DictionaryLookup[x__ /; SubsetQ[Characters@x, {"a", "o", "l", "r", "w", "f", "m"}]] {"flamethrower", "flamethrowers", "flatworm", "flatworms", "mayflower", "mayflowers", "wolfram"} It produces incorrect result.

POSTED BY: Muhammad Ali

Abrita Chakravarty

Abrita Chakravarty, Wolfram U, Wolfram Research Inc.

Posted 6 years ago

[WSG20] Wolfram Language Basics from EIWL (Days 6, 7, 8, 9)

POSTED BY: Abrita Chakravarty

Valeriu Ungureanu

Valeriu Ungureanu, Moldova State University

Posted 6 years ago

So, we can construct other codes with DictionaryLookup[], such as for example: In[1]:= DictionaryLookup[x__ /; SubsetQ[Characters@x, l]] Out[1]= {"chutzpa", "chutzpah"}

POSTED BY: Valeriu Ungureanu

Muhammad Ali

Posted 6 years ago

Taking the complement other way shows that `WordList` is a subset of `DictionaryLookup` so `DictionaryLookup` is a safe choice.

POSTED BY: Muhammad Ali

Valeriu Ungureanu

Valeriu Ungureanu, Moldova State University

Posted 6 years ago

DictionaryLookup[] contains more words, really: In[1]:= Complement[DictionaryLookup[], WordList[]] // Length Out[1]= 53342

POSTED BY: Valeriu Ungureanu

Abrita Chakravarty

Abrita Chakravarty, Wolfram U, Wolfram Research Inc.

Posted 6 years ago

No, a pangram would be a word that uses ONLY the letters from the given set - BUT could use a letter more than once. So for the letters, {"a", "o", "l", "r", "w", "f", "m"}, "flamethrower" would not be considered the pangram.

POSTED BY: Abrita Chakravarty

Muhammad Ali

Posted 6 years ago

@Abrita Chakravarty I have a query regarding the question. For the letters, {"a", "o", "l", "r", "w", "f", "m"}, can the pangram apart from containing at least one of each letter from the list also contain some of the remaining letters? Like for example "flamethrower" contains all of the letters from the list but also contains "e" twice, "h" and "t" once which are not part of the list. So is "flamethrower" also regarded as a pangram of this list in addition to "wolfram"?

POSTED BY: Muhammad Ali

Valeriu Ungureanu

Valeriu Ungureanu, Moldova State University

Posted 6 years ago

An identical version with Cases: In[1]:= l = {"z", "t", "c", "a", "u", "h", "p"}; Cases[x_ /; Complement[l, Characters@x] == {}]@WordList[] Out[1]= {"chutzpa", "chutzpah"}

POSTED BY: Valeriu Ungureanu

Abrita Chakravarty

Abrita Chakravarty, Wolfram U, Wolfram Research Inc.

Posted 6 years ago

Daily Challenge (Day 9): A pangram is usually understood as a sentence in which every letter of the alphabet is used at least once. For the letters {"a", "o", "l", "r", "w", "f", "m"}, a pangram would be a word that uses all of the seven letters e.g. "Wolfram". Use WL to find a pangram from the letters {"z", "t", "c", "a", "u", "h", "p"}.

POSTED BY: Abrita Chakravarty

Abrita Chakravarty

Abrita Chakravarty, Wolfram U, Wolfram Research Inc.

Posted 6 years ago

Daily Challenge (Day 8): Use WL code to restructure the following dataset, so the "ID" is used as the row-identifier instead of the name for each row. Dataset[ <\|"Padmé Amidala" -> <\|"ID" -> "19435", "Planet" -> "Naboo"\|>, "Bail Organa" -> <\|"ID" -> "11114", "Planet" -> "Alderaan"\|>, "Mon Mothma" -> <\|"ID" -> "10712", "Planet" -> "Chandrila"\|>\|>] Expected output:

POSTED BY: Abrita Chakravarty

Valeriu Ungureanu

Valeriu Ungureanu, Moldova State University

Posted 6 years ago

Another modification of the code: Histogram@Flatten@Map[Union, StringPosition[WordList[], "e"], 2] or Histogram@Flatten[Union @@@ StringPosition[WordList[], "e"]]

POSTED BY: Valeriu Ungureanu

Abrita Chakravarty

Abrita Chakravarty, Wolfram U, Wolfram Research Inc.

Posted 6 years ago

Another solution DeleteCases[StringPosition[WordList[], "e"], {}][[All, 1, 1]] // Histogram

POSTED BY: Abrita Chakravarty

Valeriu Ungureanu

Valeriu Ungureanu, Moldova State University

Posted 6 years ago

Thank you, Rohit! You are right! At the moment, I can modify the code to obtain the correct result, but it will not be as short as the precedent: Last /@ Flatten[StringPosition["e"]@WordList[], 1] // Histogram

POSTED BY: Valeriu Ungureanu

Rohit Namjoshi

Posted 6 years ago

Hi Valeriu, The counts are doubled in this solution because `StringPosition` returns a pair of numbers.

POSTED BY: Rohit Namjoshi

Valeriu Ungureanu

Valeriu Ungureanu, Moldova State University

Posted 6 years ago

A shorter code: Histogram@Flatten@StringPosition[WordList[], "e"] With such code, day 7 challenge becomes a simple problem :)

POSTED BY: Valeriu Ungureanu

Valeriu Ungureanu

Valeriu Ungureanu, Moldova State University

Posted 6 years ago

Agree!

POSTED BY: Valeriu Ungureanu

Abrita Chakravarty

Abrita Chakravarty, Wolfram U, Wolfram Research Inc.

Posted 6 years ago

Daily Challenge (Day 7): This one is from EIWL Chapter 31-- Make a histogram of where the letter “e” occurs in the words in WordList[ ]

POSTED BY: Abrita Chakravarty

Abrita Chakravarty

Abrita Chakravarty, Wolfram U, Wolfram Research Inc.

Posted 6 years ago

Here's another solution: StringCases[#[[2]] & /@ EntityProperties["Country"], StartOfString ~~ "w" ~~ ___, IgnoreCase -> True] // Flatten

POSTED BY: Abrita Chakravarty

Valeriu Ungureanu

Valeriu Ungureanu, Moldova State University

Posted 6 years ago

Thank you, Rohit, for your suggestion!

POSTED BY: Valeriu Ungureanu

Valeriu Ungureanu

Valeriu Ungureanu, Moldova State University

Posted 6 years ago

Hi Muhammad, There is a difficulty with your code. I have executed it: In[61]:= Cases[EntityProperties["Country"], x_ /; Or @@ StringMatchQ[{CanonicalName[x], CommonName[x]}, "W", IgnoreCase -> True]] Out[61]= {EntityProperty["Country", "WageAndSalariedWorkers"], EntityProperty["Country", "WageAndSalariedWorkersFraction"], EntityProperty["Country", "WagesCostIndex"], EntityProperty["Country", "WaterArea"], EntityProperty["Country", "WaterArrivals"], EntityProperty["Country", "WaterProductivity"], EntityProperty["Country", "WaterwayLength"], EntityProperty["Country", "Workforce"], EntityProperty["Country", "WPI"]} and found that for 2 of your results their Common Names do not start with small letter "w". So, for "WaterArrivals"* the Common Name is "arrivals by sea" and for "Workforce" the Common Name is "employment" It looks so that the code must be modified because, according to formulation, property names must begin with small letter "w". At least, such observation did @Abrita Chakravarty

Hi Muhammad,

There is a difficulty with your code. I have executed it:

In[61]:= Cases[EntityProperties["Country"], 
 x_ /; Or @@ 
   StringMatchQ[{CanonicalName[x], CommonName[x]}, "W*", 
    IgnoreCase -> True]]

Out[61]= {EntityProperty["Country", "WageAndSalariedWorkers"], 
 EntityProperty["Country", "WageAndSalariedWorkersFraction"], 
 EntityProperty["Country", "WagesCostIndex"], 
 EntityProperty["Country", "WaterArea"], 
 EntityProperty["Country", "WaterArrivals"], 
 EntityProperty["Country", "WaterProductivity"], 
 EntityProperty["Country", "WaterwayLength"], 
 EntityProperty["Country", "Workforce"], 
 EntityProperty["Country", "WPI"]}

and found that for 2 of your results their Common Names do not start with small letter "w". So, for

"WaterArrivals" the Common Name is "arrivals by sea"

and for

"Workforce" the Common Name is "employment"

It looks so that the code must be modified because, according to formulation, property names must begin with small letter "w". At least, such observation did @Abrita Chakravarty

POSTED BY: Valeriu Ungureanu

Rohit Namjoshi

Posted 6 years ago

Hi Valeriu, The question asks for 'beginning with the letter "w"'. You need to anchor the pattern to the start of the string or it will match "w" anywhere in the name. StartOfString ~~ "w" \| "W" ~~ ___

POSTED BY: Rohit Namjoshi

Valeriu Ungureanu

Valeriu Ungureanu, Moldova State University

Posted 6 years ago

POSTED BY: Valeriu Ungureanu

Muhammad Ali

Posted 6 years ago

The following 9: {EntityProperty["Country", "WageAndSalariedWorkers"], EntityProperty["Country", "WageAndSalariedWorkersFraction"], EntityProperty["Country", "WagesCostIndex"], EntityProperty["Country", "WaterArea"], EntityProperty["Country", "WaterArrivals"], EntityProperty["Country", "WaterProductivity"], EntityProperty["Country", "WaterwayLength"], EntityProperty["Country", "Workforce"], EntityProperty["Country", "WPI"]}

The following 9:

{EntityProperty["Country", "WageAndSalariedWorkers"], 
 EntityProperty["Country", "WageAndSalariedWorkersFraction"], 
 EntityProperty["Country", "WagesCostIndex"], 
 EntityProperty["Country", "WaterArea"], 
 EntityProperty["Country", "WaterArrivals"], 
 EntityProperty["Country", "WaterProductivity"], 
 EntityProperty["Country", "WaterwayLength"], 
 EntityProperty["Country", "Workforce"], 
 EntityProperty["Country", "WPI"]}

POSTED BY: Muhammad Ali

Moses Paul

Moses Paul, Student

Posted 6 years ago

Entity["Country", "WestBank"]["Properties"] Entity["Country", "WesternSahara"]["Properties"]

POSTED BY: Moses Paul

Abrita Chakravarty

Abrita Chakravarty, Wolfram U, Wolfram Research Inc.

Posted 6 years ago

Daily challenge (Day 6): List the EntityProperties for the "Country" entity that have a name beginning with the letter "w"? How many did you find?

POSTED BY: Abrita Chakravarty

Reply to this discussion

Reply Preview

Attachments

Remove Add a file to this post

Follow this discussion

or Discard

Feedback