  • Video tagging systems based on DNNs

    Need:

    1. As the volume of video on mobile phones keeps growing, what do people actually get from it? Many videos contain something interesting, such as a short comedy clip. A viewer who notices something interesting and wants to know more may never search for it online, because they have forgotten it by the time the video ends. If advertisers place advertisements in the video ahead of time, users can get that information much more conveniently. This is useful for both advertisers and users.
    2. Users keep many videos on their phones, and most of them probably mark meaningful moments. Some users will want a tool that tags the meaningful objects in these videos or tells them what an object is. Our video tagging system would handle this work efficiently.

    Approach:

    1. The video tagging project can be divided into two steps. The first is key frame localization. The second is object classification or object detection.
    2. Key frame localization can be done with conventional methods, such as splitting the video on HOG features, or with other methods. This is a little challenging because there is no very efficient way to get truly accurate key frames. I think it is a program optimization problem.
    3. Object classification can be done with a deep convolutional neural network classifier or another state-of-the-art deep learning method. The problem is that there may not be enough labels, so this can be a research problem.
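
    The two steps above can be sketched roughly as follows. This is only a minimal illustration under stated assumptions, not the project's implementation: the descriptor is a simplified gradient-orientation histogram (HOG-like, but without cells or block normalization), a new key frame starts when the L1 distance to the previous key frame's descriptor exceeds a threshold, and `classify` is a hypothetical callable standing in for the DNN classifier. The names `orientation_histogram`, `find_key_frames`, `tag_video` and the default `threshold` and `fps` values are all chosen for illustration.

```python
import math
from typing import Callable, List, Sequence, Tuple

# A grayscale frame as rows of pixel intensities (assumed input format).
Frame = Sequence[Sequence[float]]


def orientation_histogram(frame: Frame, bins: int = 9) -> List[float]:
    """Simplified HOG-like descriptor: one magnitude-weighted histogram of
    unsigned gradient orientations over the whole frame, L1-normalized."""
    hist = [0.0] * bins
    height, width = len(frame), len(frame[0])
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            # Central differences approximate the image gradient.
            gx = frame[y][x + 1] - frame[y][x - 1]
            gy = frame[y + 1][x] - frame[y - 1][x]
            magnitude = math.hypot(gx, gy)
            if magnitude == 0.0:
                continue
            angle = math.atan2(gy, gx) % math.pi  # unsigned orientation
            index = min(int(angle / math.pi * bins), bins - 1)
            hist[index] += magnitude
    total = sum(hist)
    return [v / total for v in hist] if total > 0 else hist


def find_key_frames(frames: Sequence[Frame], threshold: float = 0.3) -> List[int]:
    """Step 1: return indices of frames whose descriptor differs from the
    previous key frame's descriptor by more than `threshold` (L1 distance)."""
    if not frames:
        return []
    key_indices = [0]  # the first frame always opens a segment
    reference = orientation_histogram(frames[0])
    for i in range(1, len(frames)):
        current = orientation_histogram(frames[i])
        distance = sum(abs(a - b) for a, b in zip(reference, current))
        if distance > threshold:
            key_indices.append(i)
            reference = current
    return key_indices


def tag_video(
    frames: Sequence[Frame],
    classify: Callable[[Frame], str],  # hypothetical stand-in for the DNN
    fps: float = 25.0,
    threshold: float = 0.3,
) -> List[Tuple[float, str]]:
    """Step 2: classify each key frame and return (timestamp_seconds, label)."""
    return [(i / fps, classify(frames[i])) for i in find_key_frames(frames, threshold)]
```

    Passing the classifier in as a callable keeps the key frame step independent of whichever network is eventually chosen, and returning timestamps with the labels gives the temporal information that a tagged video needs.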

    Benefit:

    1. Everyone can conveniently get merchandise information from the tagged video, which is processed by the mobile application.
    2. Some people will use it to summarize meaningful moments and find meaningful objects.

    Competitors:

    According to my survey, one video tagging system has already been released on the internet. The website is named “Clarifai”. It can tag a video and report when each object appears in it, and its classification accuracy is very high. So it is our main competitor.

                                                                                                             10/18/2015

      Fuchen Long

  • Original post: https://www.cnblogs.com/aidoer/p/4892399.html