Pattern
Pattern is a web mining module for the Python programming language.
It bundles tools for data retrieval (Google + Twitter + Wikipedia API, web spider, HTML DOM parser), text analysis (rule-based shallow parser, WordNet interface, syntactical + semantical n-gram search algorithm, tf-idf + cosine similarity + LSA metrics), clustering and classification (k-means, k-NN, SVM), and data visualization (graph networks).
The module is bundled with 30+ example scripts and 350+ unit tests.
Go to: http://www.clips.ua.ac.be/pages/pattern
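To make the "tf-idf + cosine similarity" item above more concrete, here is a minimal sketch of the idea in plain Python 3, using only the standard library. It is not Pattern's implementation and does not use Pattern's API. The function names (tf_idf_vectors, cosine_similarity), the weighting formula (relative term frequency times log(N / document frequency)), and the sample sentences are all assumptions chosen for illustration.

```python
import math
from collections import Counter


def tf_idf_vectors(documents):
    """Return one sparse {term: weight} dict per tokenized document.

    Hypothetical helper for illustration; weight = tf * log(N / df).
    """
    n = len(documents)
    # Document frequency: number of documents containing each term.
    df = Counter(term for doc in documents for term in set(doc))
    vectors = []
    for doc in documents:
        counts = Counter(doc)
        total = len(doc)
        vectors.append({
            term: (count / total) * math.log(n / df[term])
            for term, count in counts.items()
        })
    return vectors


def cosine_similarity(u, v):
    """Cosine of the angle between two sparse vectors (0.0 if either is all zeros)."""
    dot = sum(w * v.get(term, 0.0) for term, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


# Made-up sample documents, tokenized by whitespace.
docs = [
    "the cat sat on the mat".split(),
    "the cat chased the mouse".split(),
    "stock markets fell sharply today".split(),
]
vectors = tf_idf_vectors(docs)
sim_cats = cosine_similarity(vectors[0], vectors[1])
sim_unrelated = cosine_similarity(vectors[0], vectors[2])
print(sim_cats, sim_unrelated)
```

The two sentences about cats share weighted terms and so score above zero, while the first and third share no terms and score exactly zero. A real library would typically add tokenization, stop-word handling, and smoothing on top of this.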
Orange
Open source data visualization and analysis for novices and experts. Data mining through visual programming or Python scripting. Components for machine learning. Add-ons for bioinformatics and text mining. Packed with features for data analytics.
Go to: http://orange.biolab.si/
Install
$ sudo pip install Orange
$ sudo pip install pattern
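After running the install commands, a quick way to check that both packages can be found is sketched below. It assumes Python 3 and that the packages are importable under the top-level names pattern and Orange. The helper is_installed is a hypothetical name introduced here, not part of either library.

```python
import importlib.util


def is_installed(module_name):
    """Return True if a top-level module can be located, without importing it."""
    return importlib.util.find_spec(module_name) is not None


# Assumed import names for the two packages installed above.
for name in ("pattern", "Orange"):
    status = "found" if is_installed(name) else "missing"
    print(f"{name}: {status}")
```

If either line reports "missing", the package was probably installed for a different Python interpreter than the one running this script.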
WRITTEN BY
- manager@
Data Analysis, Text/Knowledge Mining, Python, Cloud Computing, Platform