I have been doing some research on modeling word usage to build predictive analytics based on behaviors, and I am sharing some random slides from that work with you.
A brief introduction from Wikipedia on what an n-gram is: http://en.wikipedia.org/wiki/N-gram
“In the fields of computational linguistics and probability, an n-gram is a contiguous sequence of n items from a given sequence of text or speech. The items can be phonemes, syllables, letters, words or base pairs according to the application. The n-grams typically are collected from a text or speech corpus. An n-gram of size 1 is referred to as a “unigram”; size 2 is a “bigram” (or, less commonly, a “digram”); size 3 is a “trigram”. Larger sizes are sometimes referred to by the value of n, e.g., “four-gram”, “five-gram”, and so on.”
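To make the definition concrete, here is a minimal Python sketch of pulling word n-grams out of a piece of text and counting them. It is only an illustration and not the method behind our models: the helper name `ngrams` and the sample sentence are made up for this example.

```python
from collections import Counter


def ngrams(items, n):
    """Return every contiguous run of n items as a tuple (hypothetical helper)."""
    return [tuple(items[i:i + n]) for i in range(len(items) - n + 1)]


# Made-up sample text, split into words on whitespace.
words = "the quick brown fox jumps over the quick dog".split()

unigram_counts = Counter(ngrams(words, 1))  # n = 1: single words
bigram_counts = Counter(ngrams(words, 2))   # n = 2: adjacent word pairs

# Show the most frequent bigram and how often it occurs.
print(bigram_counts.most_common(1))
```

Counting n-grams like this over a dated corpus, year by year, is the general idea behind the usage-over-time curves in the slides.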
So do we think “Social” is new? Not for over 200 years! It actually seems to be in a slight decline now.
What if we could look at word usage and determine stock prices?
Or why supply chain is a critical investment opportunity?
What if we could see that farming and water have a relationship?
What if we can see technology changing in front of our eyes and know what comes next?
And why the word “Aggregate” has been critical for 200 years or more?
And why “Climate change” in the last 50 years has become an issue?
Whatever you derive is interesting. But word usage reflects behaviors, which are pivotal to creating analytical models that can potentially be leveraged for anything from buying equities to understanding change. That’s part of my work with a very smart team. I’d love to hear your thoughts.
You should try https://books.google.com/ngrams; it is very interesting.
Disclosure: Our work is not part of any work or research at Google.