[B! clustering] bob3のブックマーク

bob3 id:bob3

clusteringに関するbob3のブックマーク (5)

Mahout: Canopy and K-means Clustering - hamadakoichi blog
MahoutのCanopyとK-Meansを用い、Canopy生成しCanopy Centroidを用いたK-Means Clustering実行できる Driverの実装法を解説します。次のようなコマンドライン呼び出しで、質の良い Canopy+K-Meansの一連のClusteringの手続きを実行できます。以下では、org.apache.mahout.clustering.canopykmeans packageを作り、追加実装しコンパイルしたJobファイルを $MAHOUT_HOME/bin/mahout-core-0.4-job.jar に置いています。MahoutのVersionは最新の0.4です。実行例 $HADOOP_HOME/bin/hadoop jar $MAHOUT_HOME/bin/mahout-core-0.4-job.jar \ org.apache.maho
bob3 2011/02/05
clustering

canopy

mahout
リンク
Mahout Canopy Clustering - #TokyoWebmining 9
「第５回データマイニング+WEB 勉強会＠東京－WEB解析・最適化祭り－」の講師資料です(10/06/20) hamadakoichi 濱田晃一
bob3 2011/01/19
clustering

hadoop

mahout

canopy
リンク
Cluster Analysis in R
R offers a wide range of functions for cluster analysis</a >, including hierarchical agglomerative, partitioning, and model-based approaches. Although there is no definitive solution for determining the optimal number of clusters to extract, several approaches are available. Data Preparation Prior to clustering data, you may want to remove or estimate missing data and rescale variables for compara
bob3 2010/03/25
clustering

R
リンク
クラスター分析の光と闇
三中信宏農業環境技術研究所 minaka@affrc.go.jp http://cse.niaes.affrc.go.jp/minaka/ ●はじめに――"To classify is human" 分類（classification）は，秩序（order）を求める人間の基本的欲求に由来する行為である．分類の対象は生物であっても非生物であってもかまわない．個物としての多様な対象物のありようを理解するために私たちはつねに分類し続けている．人間は生まれながらの分類者（classifier）である．採集狩猟をしながら進化してきた過去数十万年の間に，ヒトは自然界に存在するパターンを認知し理解するように，認知カテゴリー化および帰納的推論の様式を自然淘汰によって獲得してきたと考えられる．分類とは外在する離散的実体を発見する行為であると考えるのはまちがいである．むしろ，時空的に連続する外界を人間が理
bob3 2010/02/24
今一度、読み直したい。

分析

research

clustering
リンク
Partitioning Around Medoids (Pam)
7.1.1 Partitioning Around Medoids (Pam) Compared to the well - known kmeans algorithm, .Pam has the following features: It operates on the dissimilarity matrix of the given data set or when it is presented with an n ´ p data matrix, the algorithm first computes a dissimilarity matrix. It is more robust, because it minimizes a sum of dissimilarities instead of a sum of squared Euclidean dis
bob3 2010/02/23
PAMとシルエット

clustering

data
リンク
1