The project topic is to build and use co-occurrence network from
the Google N-gram data provided by Google Inc. This data is huge having more than
trillion words. We used the unigrams and bigrams for our project. The goal of the alpha stage of the project is to
build the network and the goal of the beta stage is to explore the network finding
specific paths. A co-occurrence network links together words that have occurred together in some piece of writing.
Each word represents a node in the network, and the edges between the words indicate that they have occurred together.
Note that the edges are directed, and show the order in which the words have occurred.
The weights on the edges are the frequency of the two words occurring together in that order