[Google Deep Learning 笔记] Logistic Classification

时间：2016-05-12 12:33:22 阅读：232 评论：0 收藏：0 [点我收藏+]

标签：

Logistic Classification

Github工程地址：https://github.com/ahangchen/GDLnotes
欢迎star，有问题可以到Issue区讨论
官方教程地址
 视频/字幕下载

simple but important classifier

技术分享

之所以这样建模，是因为线性公式是最简单的数学模型，仅此而已。

Input: X (e.g. the pixels in an image)
Apply a linear function to X
- Giant matrix multiply
- Take inputs as a big vector
- Multiply input vector with a matrix, W means weights
- b means biased term
- Machine learning adjust weights and bias for the best prediction
Output: Y, predictions for per output class
- Y is a vector, represents the probability of each label
- 好的预测中，正确的label的概率应当更接近1
- 往往得到的Y一开始不是概率，而是一些具体值（scores/logits），所以需要转换，by：
Softmax回归模型：Wikipedia

技术分享

技术分享

正确预测结果应当是只有一个label成立，其他label不成立。这种情况下，预测概率最大的则是最可能的结果。

Example: take this test

one hot encoding在label很多的情况下not work well，因为output vector到处都是0，很稀疏，因此效率低
- solved by embeddings
好处：可以measure我们与理想情况之间的距离（compare two vectors）

分类器输出：[0.7 0.2 0.1] \<=> 与label对应的真实情况：[1 0 0]
Compare two vectors: cross-entropy
D(S, L) != D(L, S)

Remember: Label don’t log, for label zero