Please join Functional Dataflow Coding Session #4 live Tue 5ET on https://www.twitch.tv/wolfram
Direct link: https://wolfr.am/FtPdPfKZ
If you missed the previous sessions, you can watch them through the Wolfram YouTube Channel: https://wolfr.am/FCPMQHCM
This channel is related to my forthcoming book "Functional Dataflow" based largely on functional approaches to data wrangling and structuring in WL rather than on specific statistical methods using a real world dataset
We'll discuss two self-contained topics and a wrap up that combines them.
A new implementation of trie operators that can take a function parameter to gather statistics as the trie is built from a list of lists.
Such tries have a variety of applications such as preprocessing documents into suffix tries for efficient string matching - algorithms leveraging tries can match in O(s+t) time vs the naive O(s x t) method.
Efficient indexing with statistics of general sequence data, such as mobile app user flows.
Experimental Flag Identicons: visual representations to tag and distinguish data layers based on filesystem paths, so that data layers sharing the same path prefix will have common elements (see screenshot)
Wrap up showing that the trie operator's use associating the graphics strips of the identicons with each path component (screenshot 2)
fileTokens [ funcTrie[AssociationMap[ flagIdenticon /* smallStripGr /* Framed]]] // Normal