Probabilistic Methods for Realtime Streams
Realtime analysis is a challenge for modern data systems. The probabilistic methods explored in this report offer highly efficient models for extracting value from streams of data as they are generated. This orders-of-magnitude improvement in efficiency enables many new applications and modes of computation.
CliqueStream displays popular topics across Reddit as nodes in a force-directed graph where the connections are based on similarity of word use over all comments and tweets. You can also select groups of keywords and view their distribution across subreddits.
This is a report and prototype preview. For full access to all of our reports and prototypes contact us about becoming a subscriber.