  • 20170605 Summary of a Computer Vision Talk

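    I attended a talk on computer vision titled "computer vision ++: Where do we go from here?" and am writing a short summary. I know little about computer vision, not even at a beginner's level, but I have recently become interested in it. The talk mainly covered applications.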

    The talk mentioned David Marr (neuroscientist), apparently a major figure in the field, who wrote the book Vision (listed on Douban and Amazon).

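    Two tasks on which algorithms have surpassed human performance: 1. face recognition, on the LFW dataset; 2. image classification, on the ImageNet dataset.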

    The two datasets:

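    1. LFW (Labeled Faces in the Wild), an internationally recognized public benchmark for face recognition. It was created by the University of Massachusetts in 2007 to evaluate face recognition algorithms under unconstrained conditions, and it is the most widely used evaluation set in the field.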

    2. ImageNet
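
    The talk summary does not say how results on these two benchmarks are scored. As a rough sketch, assuming LFW-style evaluation is framed as pair verification (same person or not) and ImageNet-style evaluation as top-k classification accuracy, the two metrics might be computed as below. The function names, the cosine-similarity threshold, and the toy data are all hypothetical and chosen only for illustration.

```python
from math import sqrt


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def verification_accuracy(pairs, labels, threshold):
    """Fraction of face pairs classified correctly as same/different.

    pairs: list of (embedding_a, embedding_b) tuples (hypothetical features).
    labels: list of bools, True when both images show the same person.
    threshold: a pair is predicted "same" when similarity >= threshold.
    """
    correct = 0
    for (a, b), same in zip(pairs, labels):
        predicted_same = cosine_similarity(a, b) >= threshold
        correct += int(predicted_same == same)
    return correct / len(pairs)


def top_k_accuracy(scores, true_labels, k=5):
    """Fraction of samples whose true class is among the k highest scores."""
    hits = 0
    for row, truth in zip(scores, true_labels):
        # Class indices sorted from highest score to lowest.
        ranked = sorted(range(len(row)), key=lambda i: row[i], reverse=True)
        hits += int(truth in ranked[:k])
    return hits / len(true_labels)


# Toy data, invented for illustration only.
toy_pairs = [
    ([1.0, 0.0], [0.9, 0.1]),  # nearly parallel embeddings
    ([1.0, 0.0], [0.0, 1.0]),  # orthogonal embeddings
    ([0.6, 0.8], [0.8, 0.6]),  # similar embeddings, but different people
]
toy_labels = [True, False, False]

toy_scores = [[0.1, 0.7, 0.2], [0.5, 0.3, 0.2]]
toy_truth = [2, 1]

if __name__ == "__main__":
    print(verification_accuracy(toy_pairs, toy_labels, threshold=0.5))
    print(top_k_accuracy(toy_scores, toy_truth, k=2))
```

    In practice the embeddings and class scores would come from a trained model, and the threshold would be tuned on held-out pairs rather than fixed by hand.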

    Applications

    Image classification

    Liu J, Luo J, Shah M. Recognizing realistic actions from videos[C]// Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on. IEEE, 2009:1996-2003.

    Cao L, Luo J, Liang F, et al. Heterogeneous feature machines for visual recognition[C]// IEEE, International Conference on Computer Vision. IEEE, 2010:1095-1102.

    Action recognition

    Li Q, Qiu Z, Yao T, et al. Action Recognition by Learning Deep Multi-Granular Spatio-Temporal Video Representation[C]// ACM on International Conference on Multimedia Retrieval. ACM, 2016:159-166.

    Vision + Language

    Unsupervised Alignment of Actions in Video with Text Descriptions

    You Q, Jin H, Wang Z, et al. Image Captioning with Semantic Attention[J]. 2016:4651-4659.

    Li Y, Song Y, Cao L, et al. TGIF: A New Dataset and Benchmark on Animated GIF Description[C]// IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 2016:4641-4650.

    What is image captioning?

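    My understanding is that it means extracting information from an image. Since "caption" refers to a subtitle or short text description, the task should be to recognize what is in an image and describe it in text. It should look roughly like this (Image Captioning - Kiran Vodrahalli):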

    [Figure: image captioning example]

    Unsupervised alignment of video and text

    Naim I, Song Y C, Liu Q, et al. Unsupervised alignment of natural language instructions with video segments[C]// Twenty-Eighth AAAI Conference on Artificial Intelligence. AAAI Press, 2014:1558-1564.

    Image/Video Captioning with visual and semantic attention

    You Q, Jin H, Wang Z, et al. Image Captioning with Semantic Attention[J]. 2016:4651-4659.

    Affective computing

    Affective Computing: An Important Direction in the Development of Artificial Intelligence

    You Q, Luo J, Jin H, et al. Building a Large Scale Dataset for Image Emotion Recognition: The Fine Print and The Benchmark[J]. 2016.

    You Q, Cao L, Jin H, et al. Robust Visual-Textual Sentiment Analysis: When Attention meets Tree-structured Recursive Neural Networks[C]// ACM, 2016:1008-1017.

    Research on selfies

    Chen T, Chen Y, Luo J. A Selfie is Worth a Thousand Words: Mining Personal Patterns behind User Selfie-posting Behaviours[J]. 2017.

    Using big data to find the features of best-selling clothing:

    Chen K T, Luo J. When Fashion Meets Big Data: Discriminative Mining of Best Selling Clothing Features[J]. 2017.

    Medical applications

    Helping novice doctors practice surgery

  • Original post: https://www.cnblogs.com/shanchuan/p/8150277.html