  • Video tagging systems based on DNNs

    Need:

    1. With the rapid growth of video on mobile phones, what do people actually get from these videos? Many videos contain something interesting, such as a short comedy clip. If a viewer notices something interesting and wants to know more about it, they may never search for it online, because they have forgotten the details by the time the video ends. If advertisers could attach information to the video ahead of time, it would be much easier for the viewer to find out more. That would benefit both advertisers and users.
    2. Users keep many videos on their phones, and most of them probably mark meaningful moments. Some users therefore want a tool that tags the meaningful objects in those videos, or that looks up information about those objects. Our video tagging system would handle this work efficiently.

    Approach:

    1. The video tagging project can be divided into two steps: key frame localization, followed by object classification or object detection.
    2. Key frame localization can be implemented with conventional methods, such as splitting the video by HOG features. This is a little challenging, because there is no very efficient way to find truly accurate key frames. I think it is a program optimization problem.
    3. Object classification can be implemented with a deep convolutional neural network classifier or another state-of-the-art deep learning method. The problem is that there may not be enough labels, so this could be a research problem.
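
    The two steps above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the project's implementation: the descriptor is a simplified, global gradient-orientation histogram (HOG-like, not full block-normalized HOG), the function names, the threshold and the `classify` callback are all hypothetical, and the classifier is a stand-in for whatever DNN would actually be used.

```python
import numpy as np


def orientation_histogram(frame, bins=9):
    """Global gradient-orientation histogram of a grayscale frame.

    A simplified, HOG-like descriptor (assumption: no cells or block
    normalization, unlike real HOG).
    """
    gy, gx = np.gradient(frame.astype(np.float64))
    magnitude = np.hypot(gx, gy)
    # Unsigned orientation folded into [0, pi].
    angle = np.mod(np.arctan2(gy, gx), np.pi)
    hist, _ = np.histogram(angle, bins=bins, range=(0.0, np.pi), weights=magnitude)
    total = hist.sum()
    return hist / total if total > 0 else hist


def find_key_frames(frames, threshold=0.3):
    """Return indices of frames whose descriptor differs from the last
    key frame's descriptor by more than `threshold` (L1 distance).

    The threshold value is an arbitrary placeholder.
    """
    key_frames = []
    last = None
    for i, frame in enumerate(frames):
        desc = orientation_histogram(frame)
        if last is None or np.abs(desc - last).sum() > threshold:
            key_frames.append(i)
            last = desc
    return key_frames


def tag_video(frames, classify, fps=30.0, threshold=0.3):
    """Step 1: localize key frames. Step 2: classify each key frame.

    `classify` is a hypothetical callback (frame -> label) standing in
    for a DNN classifier. Returns (timestamp in seconds, label) pairs.
    """
    return [(i / fps, classify(frames[i]))
            for i in find_key_frames(frames, threshold)]


if __name__ == "__main__":
    # Synthetic "video": three frames with a horizontal ramp, then three
    # frames with a vertical ramp, so there is one obvious scene change.
    ramp = np.tile(np.arange(16, dtype=np.float64), (16, 1))
    frames = [ramp] * 3 + [ramp.T] * 3
    print(find_key_frames(frames))
    print(tag_video(frames, classify=lambda f: "placeholder-label", fps=1.0))
```

    Comparing each frame against the last key frame, rather than against the previous frame, keeps slow drifts from being missed. A real system would likely need a more robust descriptor and a tuned threshold.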

    Benefit:

    1. Everyone can conveniently get merchandise information from a tagged video processed by the mobile application.
    2. Some people will use it to summarize meaningful moments and find meaningful objects.

    Competitors:

    My survey found one video tagging system that has already been released online. The website is called “Clarifai”. It can tag a video and return temporal information about the objects in it, and its classification accuracy is very high, so it is our main competitor.

                                                                                                             10/18/2015

      Fuchen Long

  • Original post: https://www.cnblogs.com/aidoer/p/4892399.html