Tag Archives: python

Pattern web mining module

CLIPS (Computational Linguistics & Psycholinguistics) has released a new module for web mining for Python called Patterns.

Quoting from the CLIPS site:

It bundles tools for data retrieval (Google + Twitter + Wikipedia API, web spider, HTML DOM parser), text analysis (rule-based shallow parser, WordNet interface, syntactical + semantical n-gram search algorithm, tf-idf + cosine similarity + LSA metrics) and data visualization (graph networks).

Visit their site to get the download.