
Machine Learning Notes (Washington University) - Clustering Specialization - Week Four


1. Probabilistic clustering model

  • Hard assignments (as in k-means) do not tell the full story; a probabilistic model captures the uncertainty in cluster membership.
  • k-means only considers the cluster centers, so it handles overlapping clusters, clusters of disparate sizes, and differently shaped clusters poorly.
  • A probabilistic model can learn weights on dimensions.
  • It can even learn cluster-specific weights on dimensions.

 

2. Gaussian distribution

A 1-D Gaussian is fully specified by its mean μ and variance σ².

A 2-D Gaussian is fully specified by its mean vector μ and covariance matrix Σ.

In d dimensions the density is N(x | μ, Σ) = exp(−(1/2) (x − μ)ᵀ Σ⁻¹ (x − μ)) / ((2π)^(d/2) |Σ|^(1/2)).

 

Thus our Gaussian mixture model is defined by the parameters {πk, μk, Σk} for each of the k clusters.
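As a minimal illustration (not from the course notes), a single Gaussian component's density can be evaluated with SciPy; the mean vector and covariance matrix below are made-up example values.

```python
# Minimal sketch: evaluate a 2-D Gaussian density with SciPy.
import numpy as np
from scipy.stats import multivariate_normal

mu = np.array([0.0, 1.0])          # mean vector (hypothetical values)
Sigma = np.array([[2.0, 0.5],
                  [0.5, 1.0]])     # covariance matrix (hypothetical values)

x = np.array([0.5, 0.5])           # one 2-D observation
print(multivariate_normal.pdf(x, mean=mu, cov=Sigma))
```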

 

3. EM (Expectation Maximization)

What if we knew the cluster parameters {πk, μk, Σk}?

Then we could compute the responsibilities:

rik = πk N(xi | μk, Σk) / Σj πj N(xi | μj, Σj)

 

rik is the responsibility cluster k takes for observation i.

It is the probability that observation i is assigned to cluster k, given the model parameters and the observed value xi.

πk is the prior (mixture) probability of being from cluster k.

N(xi | μk, Σk) is the Gaussian density of cluster k evaluated at xi.
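A minimal sketch of this E-step in Python, assuming the data X is an N × d array and the current parameters are given; the helper and variable names are my own, not the course's.

```python
import numpy as np
from scipy.stats import multivariate_normal

def e_step(X, weights, means, covs):
    """Responsibilities r[i, k] = P(cluster k | observation x_i, current parameters)."""
    N, K = X.shape[0], len(weights)
    r = np.zeros((N, K))
    for k in range(K):
        # unnormalized responsibility: pi_k * N(x_i | mu_k, Sigma_k)
        r[:, k] = weights[k] * multivariate_normal.pdf(X, mean=means[k], cov=covs[k])
    r /= r.sum(axis=1, keepdims=True)  # normalize over clusters for each observation
    return r
```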

 

What if we knew the cluster soft assignments rik? Then we could re-estimate the parameters:

Nk^soft = Σi rik   (the total responsibility claimed by cluster k)

πk = Nk^soft / N

μk = (1 / Nk^soft) Σi rik xi

Σk = (1 / Nk^soft) Σi rik (xi − μk)(xi − μk)ᵀ

where N is the total number of observations.
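A minimal sketch of the corresponding M-step, assuming the responsibilities r come from the E-step sketch above; again the names are my own.

```python
import numpy as np

def m_step(X, r):
    """Re-estimate mixture weights, means, and covariances from soft assignments r."""
    N, d = X.shape
    soft_counts = r.sum(axis=0)                    # N_k^soft, one per cluster
    weights = soft_counts / N                      # updated mixture weights pi_k
    means = (r.T @ X) / soft_counts[:, None]       # responsibility-weighted means mu_k
    covs = []
    for k in range(r.shape[1]):
        diff = X - means[k]
        covs.append((r[:, k, None] * diff).T @ diff / soft_counts[k])
    return weights, means, covs
```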

 

The procedure for the iterative algorithm:

1. initialize the parameters {πk, μk, Σk}

2. estimate cluster responsibilities given the current parameter estimates (E-step)

3. maximize the likelihood given the soft assignments (M-step)

4. repeat steps 2 and 3 until convergence
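Putting the pieces together, here is a simplified sketch of the loop, assuming the e_step and m_step helpers sketched above; a real implementation would also check a convergence criterion on the log-likelihood rather than running a fixed number of iterations.

```python
import numpy as np

def fit_gmm(X, K, n_iter=100, seed=0):
    rng = np.random.default_rng(seed)
    N, d = X.shape
    # 1. initialize: K random data points as means, uniform weights, identity covariances
    means = X[rng.choice(N, size=K, replace=False)]
    weights = np.full(K, 1.0 / K)
    covs = [np.eye(d) for _ in range(K)]
    for _ in range(n_iter):
        r = e_step(X, weights, means, covs)      # 2. E-step: soft assignments
        weights, means, covs = m_step(X, r)      # 3. M-step: maximize likelihood
    return weights, means, covs
```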

 

Notes:

EM is a coordinate-ascent algorithm.

EM converges to a local mode of the likelihood.

There are many ways to initialize the EM algorithm, and the choice matters for both the convergence rate and the quality of the local mode reached:

  • randomly choose k centroids
  • pick centers sequentially, as in k-means++
  • initialize from a k-means solution
  • grow the mixture model by splitting until k clusters are formed

Preventing overfitting:

  • Do not let any variance go to zero; add a small amount to the diagonal of each covariance estimate (see the sketch below).
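A one-line way to implement this regularization, assuming the covariance update from the M-step sketch above; the epsilon value is an arbitrary example.

```python
import numpy as np

def regularize(cov, eps=1e-6):
    # add a small constant to the diagonal so no variance collapses to zero
    return cov + eps * np.eye(cov.shape[0])
```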

 


Original post: http://www.cnblogs.com/climberclimb/p/6931296.html
