zoukankan      html  css  js  c++  java
  • (原)欧氏距离与余弦距离

    转载请注明出处:

     https://www.cnblogs.com/darkknightzh/p/12013741.html

    网上参考资料一大堆,自己也总结一下吧。

    两向量$mathbf{A}=[{{a}_{1}},cdots ,{{a}_{n}}]$,$mathbf{B}=[{{b}_{1}},cdots ,{{b}_{n}}]$,这两个向量之间的欧式距离为:

    $Euc\_dist={{left| mathbf{A}-mathbf{B} ight|}_{2}}=sqrt{sumlimits_{i=1}^{n}{{{({{a}_{i}}-{{b}_{i}})}^{2}}}}=sqrt{sumlimits_{i=1}^{n}{(a_{i}^{2}-2centerdot {{a}_{i}}centerdot {{b}_{i}}+b_{i}^{2})}}=sqrt{sumlimits_{i=1}^{n}{a_{i}^{2}}+sumlimits_{i=1}^{n}{b_{i}^{2}}-2centerdot sumlimits_{i=1}^{n}{{{a}_{i}}centerdot {{b}_{i}}}}$

    这两个向量之间的余弦相似度Cos_sim为:

    $Cos\_sim ext{=}frac{mathbf{A}centerdot {{mathbf{B}}^{T}}}{{{left| mathbf{A} ight|}_{2}}centerdot {{left| mathbf{B} ight|}_{2}}}=frac{sumlimits_{i=1}^{n}{{{a}_{i}}centerdot {{b}_{i}}}}{sqrt{sumlimits_{i=1}^{n}{a_{i}^{2}}}centerdot sqrt{sumlimits_{i=1}^{n}{b_{i}^{2}}}}$

    余弦距离为:

    $Cos\_dis ext{=}1-Cos\_sim ext{=}1 ext{-}frac{sumlimits_{i=1}^{n}{{{a}_{i}}centerdot {{b}_{i}}}}{sqrt{sumlimits_{i=1}^{n}{a_{i}^{2}}}centerdot sqrt{sumlimits_{i=1}^{n}{b_{i}^{2}}}}$

    若这两个向量均已归一化,即${{left| mathbf{A} ight|}_{2}}=sqrt{sumlimits_{i=1}^{n}{a_{i}^{2}}} ext{=}1$,${{left| mathbf{B} ight|}_{2}}=sqrt{sumlimits_{i=1}^{n}{b_{i}^{2}}} ext{=}1$,则:

    $Euc\_dist ext{=}sqrt{1+1-2centerdot sumlimits_{i=1}^{n}{{{a}_{i}}centerdot {{b}_{i}}}} ext{=}sqrt{2centerdot 1-sumlimits_{i=1}^{n}{{{a}_{i}}centerdot {{b}_{i}}}}$

    $Cos\_dis ext{=}1 ext{-}sumlimits_{i=1}^{n}{{{a}_{i}}centerdot {{b}_{i}}}$

    进而:

    $Euc\_dis{{t}^{2}} ext{=}2centerdot Cos\_dis ext{=}2centerdot (1-Cos\_sim)$

    $Cos\_sim=1-frac{1}{2}Euc\_dis{{t}^{2}}$

    另外:

    欧式距离越小(越接近0),两向量越相似。

    余弦距离越小(越接近0),两向量越相似。

    余弦相似度越大(越接近1),两向量越相似。

  • 相关阅读:
    查找质数的算法优化版
    一些新闻链接
    spring的自动装配基础
    Spring中的Autowired注解和Resource注解的区别
    动态规划:01背包 复习
    贪心问题 POJ 2393 Yogurt factory
    穷竭搜索:POJ 3187 Backward Digit Sums
    穷竭搜索: POJ 2718 Smallest Difference
    何凯文每日一句打卡||DAY1~DAY3
    贪心问题:区间覆盖 POJ 1328 Rader Installation
  • 原文地址:https://www.cnblogs.com/darkknightzh/p/12013741.html
Copyright © 2011-2022 走看看