zoukankan      html  css  js  c++  java
  • 关于“Unsupervised Deep Embedding for Clustering Analysis”的优化问题

    关于“Unsupervised Deep Embedding for Clustering Analysis”的优化问题

    作者:凯鲁嘎吉 - 博客园 http://www.cnblogs.com/kailugaji/

        Deep Embedding Clustering (DEC)和Improved Ceep Emdedding Clustering (IDEC)被相继提出,但关于参数的优化问题,作者并未详细给出,于是乎自己推导了一遍,但是发现和这两篇文章的推导结果不一致,不知道问题出在哪?下面,相当于给出一道数学题,来求解目标函数关于某个参数(以聚类中心为例)的偏导问题。

    问题描述

    已知

    [L=sumlimits_{i}^{N}{sumlimits_{j}^{c}{{{p}_{ij}}log frac{{{p}_{ij}}}{{{q}_{ij}}}}}]

    [{{q}_{ij}}=frac{{{(1+{{left| {{z}_{i}}-{{mu }_{j}} ight|}^{2}})}^{-1}}}{sum olimits_{j}{{{(1+{{left| {{z}_{i}}-{{mu }_{j}} ight|}^{2}})}^{-1}}}}]

    [{{p}_{ij}}=frac{q_{ij}^{2}/sum olimits_{j}{{{q}_{ij}}}}{sum olimits_{j}{(q_{ij}^{2}/sum olimits_{j}{{{q}_{ij}}})}}]

    固定${p}_{ij}$, 求

    [frac{partial L}{partial {{mu }_{j}}}]

    问题求解

    根据链式法则

    [frac{partial L}{partial {{mu }_{j}}}=frac{partial L}{partial {{q}_{ij}}}frac{partial {{q}_{ij}}}{partial {{mu }_{j}}}]

    [frac{partial L}{partial {{q}_{ij}}}=frac{partial left( {{p}_{ij}}log frac{{{p}_{ij}}}{{{q}_{ij}}} ight)}{partial {{q}_{ij}}}=frac{partial left( {{p}_{ij}}log {{p}_{ij}}-{{p}_{ij}}log {{q}_{ij}} ight)}{partial {{q}_{ij}}}=-frac{{{p}_{ij}}}{{{q}_{ij}}}]

    [frac{{partial {q_{ij}}}}{{partial {mu _j}}} = sumlimits_i^N {frac{{partial frac{{{{(1 + {{left| {{z_i} - {mu _j}} ight|}^2})}^{ - 1}}}}{{sum olimits_j {{{(1 + {{left| {{z_i} - {mu _j}} ight|}^2})}^{ - 1}}} }}}}{{partial {mu _j}}}}  = sumlimits_i^N {left( {frac{{partial {{(1 + {{left| {{z_i} - {mu _j}} ight|}^2})}^{ - 1}}}}{{partial {mu _j}}}frac{1}{{sum olimits_j {{{(1 + {{left| {{z_i} - {mu _j}} ight|}^2})}^{ - 1}}} }} + {{(1 + {{left| {{z_i} - {mu _j}} ight|}^2})}^{ - 1}}frac{{partial frac{1}{{sum olimits_j {{{(1 + {{left| {{z_i} - {mu _j}} ight|}^2})}^{ - 1}}} }}}}{{partial {mu _j}}}} ight)} ]

    其中

    [frac{{partial {{(1 + {{left| {{z_i} - {mu _j}} ight|}^2})}^{ - 1}}}}{{partial {mu _j}}} =  - {(1 + {left| {{z_i} - {mu _j}} ight|^2})^{ - 2}} cdot left( { - 2({z_i} - {mu _j})} ight) = 2({z_i} - {mu _j}) cdot {(1 + {left| {{z_i} - {mu _j}} ight|^2})^{ - 2}}]

    [frac{{partial frac{1}{{sum olimits_j {{{(1 + {{left| {{z_i} - {mu _j}} ight|}^2})}^{ - 1}}} }}}}{{partial {mu _j}}} =  - frac{{2({z_i} - {mu _j}) cdot {{(1 + {{left| {{z_i} - {mu _j}} ight|}^2})}^{ - 2}}}}{{{{left( {sum olimits_j {{{(1 + {{left| {{z_i} - {mu _j}} ight|}^2})}^{ - 1}}} } ight)}^2}}}]

    所以

    [frac{{partial {q_{ij}}}}{{partial {mu _j}}} = sumlimits_i^N {(frac{{2 cdot ({z_i} - {mu _j}) cdot {{(1 + {{left| {{z_i} - {mu _j}} ight|}^2})}^{ - 2}}}}{{sum olimits_j {{{(1 + {{left| {{z_i} - {mu _j}} ight|}^2})}^{ - 1}}} }} - frac{{2 cdot ({z_i} - {mu _j}) cdot {{(1 + {{left| {{z_i} - {mu _j}} ight|}^2})}^{ - 2}} cdot {{(1 + {{left| {{z_i} - {mu _j}} ight|}^2})}^{ - 1}}}}{{{{left( {sum olimits_j {{{(1 + {{left| {{z_i} - {mu _j}} ight|}^2})}^{ - 1}}} } ight)}^2}}})}  = sumlimits_i^N {left( {2 cdot ({z_i} - {mu _j}) cdot {{(1 + {{left| {{z_i} - {mu _j}} ight|}^2})}^{ - 1}} cdot {q_{ij}} - 2 cdot ({z_i} - {mu _j}) cdot {{(1 + {{left| {{z_i} - {mu _j}} ight|}^2})}^{ - 1}} cdot q_{ij}^2} ight)} { m{ = }}sumlimits_i^N {left( {2 cdot ({z_i} - {mu _j}) cdot {{(1 + {{left| {{z_i} - {mu _j}} ight|}^2})}^{ - 1}} cdot {q_{ij}} cdot (1 - {q_{ij}})} ight)} ]

    求导结果

    [frac{{partial L}}{{partial {mu _j}}} = frac{{partial L}}{{partial {q_{ij}}}}frac{{partial {q_{ij}}}}{{partial {mu _j}}} = sumlimits_i^N {left( { - frac{{{p_{ij}}}}{{{q_{ij}}}} cdot 2 cdot ({z_i} - {mu _j}) cdot {{(1 + {{left| {{z_i} - {mu _j}} ight|}^2})}^{ - 1}} cdot {q_{ij}} cdot (1 - {q_{ij}})} ight)}  = sumlimits_i^N {left( {2 cdot ({z_i} - {mu _j}) cdot {{(1 + {{left| {{z_i} - {mu _j}} ight|}^2})}^{ - 1}} cdot {p_{ij}} cdot ({q_{ij}} - 1)} ight)} ]

    原文结果

    不知道问题出在哪?求广大网友指正~

    参考文献

    [1] Deep Clustering Algorithms - 凯鲁嘎吉 博客园

    [2] Xie J, Girshick R, Farhadi A. Unsupervised deep embedding for clustering analysis[C]//International conference on machine learning. 2016: 478-487.

    [3] Guo X, Gao L, Liu X, et al. Improved deep embedded clustering with local structure preservation[C]//IJCAI. 2017: 1753-1759.

    作者:凯鲁嘎吉
    本文版权归作者和博客园共有,欢迎转载,但未经作者同意必须在文章页面给出原文链接,否则保留追究法律责任的权利。
  • 相关阅读:
    FCK常用Js,获取FCK内容,统计FCK字数,向FCK写入指定代码
    asp 点击链接 下载图片文件
    使用微软的 Visual Studio International Pack 1.0 进行网站简体与繁体的互转和得到汉字、拼音、笔画
    mysql alter 语句用法,添加、修改、删除字段等
    C#指定窗口显示位置的方法
    Soukey采集软件源码
    [转](收藏)《博客园精华集》分类索引
    YUI CSS Foundation讲座 博客文库 博客园
    sql group by 和having
    sql 多表查询
  • 原文地址:https://www.cnblogs.com/kailugaji/p/14300272.html
Copyright © 2011-2022 走看看