The world's largest 3D dataset is now public: 10,800 labeled panoramas

Middlebury dataset http://vision.middlebury.edu/stereo/data/

KITTI dataset: introduction and usage https://blog.csdn.net/solomon1558/article/details/70173223

http://www.dataguru.cn/article-12197-1.html

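You won't want to miss this: one of the largest public 3D datasets in the world.

The author, Matt Bell, is co-founder and chief strategy officer of Matterport, a provider of 3D scanning solutions. In this article, Bell describes in his own words the dataset Matterport has released. Let's follow along.

Along the way, Matterport has seen the enormous power of 3D datasets across many areas of deep learning. We have worked in this field for a long time and want to share part of our data with researchers. Excitingly, researchers from Stanford, Princeton, TUM and elsewhere teamed up to label a large number of spaces and released the labeled data as the Matterport 3D dataset.

This is currently one of the largest public 3D datasets in the world, and its annotations are what make it significant.

Large 2D datasets such as ImageNet and COCO, created around 2010, were the tools behind high-accuracy 2D image classification systems. We hope that a combined 3D+2D dataset like Matterport's can likewise improve the perception and understanding of AI systems and drive 3D research forward.

Matterport's impact across the industry is substantial: from augmented reality and robotics to 3D reconstruction and better understanding of 3D imagery, we keep pushing forward.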

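Inside the dataset "magic box"

The dataset contains 10,800 aligned panoramas (RGB + depth), drawn from 194,400 RGB-D images of 90 building-scale scenes. All images were captured with Matterport's Pro 3D camera.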
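The three figures in this paragraph can be related with simple arithmetic. The sketch below only divides the numbers quoted above; the variable names are illustrative, and the per-scene figure is an average, since the text does not say how panoramas are distributed across scenes.

```python
# Figures quoted in the paragraph above.
panoramas = 10_800
rgbd_images = 194_400
scenes = 90

# RGB-D images per panorama, and average panoramas per scene.
images_per_panorama = rgbd_images / panoramas
panoramas_per_scene = panoramas / scenes

print(images_per_panorama, panoramas_per_scene)
```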

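The 3D models of these scenes have been annotated with instance-level object segmentation. You can interactively explore a variety of Matterport 3D reconstructions at https://matterport.com/gallery.

A few ways to unlock it

We are pleased to say that this dataset is very practical. Below I describe several directions Matterport is pursuing.

Internally, we have already used this dataset to build a system that segments the spaces users capture into rooms and classifies them. The system performs well: it can tell different room types apart (for example, a kitchen and a dining room) even when no door or partition separates them.

We are also exploring deep learning methods to fill in areas that the 3D sensor cannot reach. This lets users quickly capture large open spaces such as warehouses, shopping malls, commercial real estate, and factories, as well as new types of rooms.

Consider a simple example. Here our algorithm uses color and partial depth to predict depth values and surface orientations (normal vectors) for regions that are too far away for the depth sensor to reach.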
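To make "surface orientation (normal vector)" concrete, here is a minimal sketch of how normals can be estimated from a depth map that is already known. It is not Matterport's learned predictor: it uses plain finite differences, treats the depth map as a height field on a unit pixel grid (ignoring camera intrinsics), and the function name `normals_from_depth` is made up for illustration.

```python
import math


def normals_from_depth(depth):
    """Estimate unit surface normals from a depth grid with finite differences.

    `depth` is a list of equal-length rows of floats. The surface is treated
    as z = depth[v][u] on a unit pixel grid (an orthographic simplification).
    """
    rows, cols = len(depth), len(depth[0])
    normals = []
    for v in range(rows):
        row = []
        for u in range(cols):
            # Central differences inside the grid, one-sided at the borders.
            u0, u1 = max(u - 1, 0), min(u + 1, cols - 1)
            v0, v1 = max(v - 1, 0), min(v + 1, rows - 1)
            dz_du = (depth[v][u1] - depth[v][u0]) / max(u1 - u0, 1)
            dz_dv = (depth[v1][u] - depth[v0][u]) / max(v1 - v0, 1)
            # A surface z = f(u, v) has the unnormalized normal
            # (-dz/du, -dz/dv, 1); the sign convention is a choice.
            nx, ny, nz = -dz_du, -dz_dv, 1.0
            length = math.sqrt(nx * nx + ny * ny + nz * nz)
            row.append((nx / length, ny / length, nz / length))
        normals.append(row)
    return normals


# A flat wall at constant depth, and a wall receding to the right.
flat = normals_from_depth([[2.0] * 3 for _ in range(3)])
ramp = normals_from_depth([[float(u) for u in range(4)] for _ in range(3)])
print(flat[1][1], ramp[1][1])
```

A learned model like the one described above has to produce these same two quantities, depth and normals, for pixels where no sensor depth exists at all, which is why color is used as an additional cue.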

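We can also use it to separate the individual objects in the spaces users capture. Unlike today's 3D models, these fully segmented models can identify the objects in a space fairly precisely. This unlocks many uses, including automatically generating a detailed list of a space's contents and features, and automatically previewing how different furniture would look in the space.

We also have a modest goal: to make any space indexable, searchable, sortable, and understandable, so that users can find what they want.

Say you want a place for a vacation with three large bedrooms, a modern kitchen, a built-in fireplace in the living room, a balcony overlooking a pond, and a floor-to-ceiling window. We can do that.

Or say you want to inventory all the furniture in an office, or check whether the pipes on a construction site match the CAD model. That is easy too.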
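Once spaces carry room and object labels, a query like the vacation-home example reduces to filtering structured records. The sketch below assumes a hypothetical record layout (`Space`, `room_counts`, `features`) and made-up sample data; none of it is Matterport's actual schema or API.

```python
from dataclasses import dataclass, field


@dataclass
class Space:
    """Hypothetical summary of one labeled space."""
    name: str
    room_counts: dict                       # e.g. {"bedroom": 3}
    features: set = field(default_factory=set)


def find_spaces(spaces, min_rooms=None, required_features=()):
    """Return spaces with at least the given room counts and all features."""
    min_rooms = min_rooms or {}
    required = set(required_features)
    return [
        s for s in spaces
        if all(s.room_counts.get(room, 0) >= n for room, n in min_rooms.items())
        and required <= s.features
    ]


# Made-up catalog for illustration only.
catalog = [
    Space("lake house", {"bedroom": 3, "kitchen": 1},
          {"fireplace", "balcony", "pond_view"}),
    Space("city flat", {"bedroom": 1, "kitchen": 1}, {"balcony"}),
]

matches = find_spaces(catalog, {"bedroom": 3}, {"fireplace", "pond_view"})
print([s.name for s in matches])
```

The hard part is producing the labels in the first place; that is what the instance-level annotations in the dataset are meant to train.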

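The paper also demonstrates a range of other use cases, including improving feature matching with deep-learned features, estimating surface normals from 2D images, and identifying architectural features and objects in voxel-based models.

Our next steps

As mentioned above, the data, code, and paper are available for you to use. We would love to hear how you use them, and we look forward to collaborating with research institutions on projects.

If you are interested in 3D and in even larger datasets, you are welcome to join us. Thanks to everyone who took part in the project.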

Finally, the dataset address:

https://niessner.github.io/Matterport/

Code:

https://github.com/niessner/Matterport

Paper download:

https://arxiv.org/pdf/1709.06158.pdf

Welcome to the 3D world!

Computer vision · Common datasets · 3D

Multiview

3D Photography Dataset

Multiview stereo data sets: a set of images

Multi-view Visual Geometry group’s data set

Dinosaur, Model House, Corridor, Aerial views, Valbonne Church, Raglan Castle, Kapel sequence

Oxford reconstruction data set (building reconstruction)

Oxford colleges

Multi-View Stereo dataset (Vision Middlebury)

Temple, Dino

Multi-View Stereo for Community Photo Collections

Venus de Milo, Duomo in Pisa, Notre Dame de Paris

IS-3D Data

Dataset provided by Center for Machine Perception

CVLab dataset

CVLab dense multi-view stereo image database

3D Objects on Turntable

Objects viewed from 144 calibrated viewpoints under 3 different lighting conditions

Object Recognition in Probabilistic 3D Scenes

Images from 19 sites collected from a helicopter flying around Providence, RI, USA. The imagery covers approximately a full circle around each site.

Multiple cameras fall dataset

24 scenarios recorded with 8 IP video cameras. The first 22 scenarios contain a fall and confounding events; the last 2 contain only confounding events.

CMP Extreme View Dataset

15 wide baseline stereo image pairs with large viewpoint change, provided ground truth homographies.

KTH Multiview Football Dataset II

This dataset consists of 8000+ images of professional footballers during a match of the Allsvenskan league. It consists of two parts: one with ground truth pose in 2D and one with ground truth pose in both 2D and 3D.

Disney Research light field datasets

This dataset includes: camera calibration information, raw input images we have captured, radially undistorted, rectified, and cropped images, depth maps resulting from our reconstruction and propagation algorithm, depth maps computed at each available view by the reconstruction algorithm without the propagation applied.

CMU Panoptic Studio Dataset

Multiple people social interaction dataset captured by 500+ synchronized video cameras, with 3D full body skeletons and calibration data.

4D Light Field Dataset

24 synthetic scenes. Available data per scene: 9x9 input images (512x512x3), ground truth (disparity and depth), camera parameters, disparity ranges, evaluation masks.

RGB-D dataset roundup: List of RGBD datasets https://blog.csdn.net/aaronmorgan/article/details/78335436

Original link: http://www.cnblogs.com/alexanderkun/p/4593124.html

This is an incomplete list of datasets which were captured using a Kinect or similar devices. I initially began it to keep track of semantically labelled datasets, but I have now also included some camera tracking and object pose estimation datasets. I ultimately aim to keep track of all Kinect-style datasets available for researchers to use.

Where possible, links have been added to project or personal pages. Where I have not been able to find these, I have used a direct link to the data.

Please send suggestions for additions and corrections to me at m.firman <at> cs.ucl.ac.uk.

This page is automatically generated from a YAML file, and was last updated on 26 November, 2014.

Turntable data

These datasets capture objects under fairly controlled conditions. Bigbird is the most advanced in terms of quality of image data and camera poses, while the RGB-D object dataset is the most extensive.

RGBD Object dataset

Introduced: ICRA 2011

Device: Kinect v1

Description: 300 instances of household objects, in 51 categories. 250,000 frames in total.

Labelling: Category and instance labelling. Includes auto-generated masks, but no exact 6DOF pose information.
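The list says it is generated automatically from a YAML file, but the file's schema is not shown. As a rough sketch, assuming each record has already been parsed into a dictionary whose keys match the labels used above (`name` and the helper `render_entry` are hypothetical), an entry could be rendered like this:

```python
def render_entry(entry):
    """Render one dataset record in the plain 'Label: value' layout."""
    lines = [entry["name"], ""]
    # Emit the fields in a fixed order, skipping any that are missing.
    for key in ("Introduced", "Device", "Description", "Labelling"):
        if key in entry:
            lines += [f"{key}: {entry[key]}", ""]
    return "\n".join(lines).rstrip()


# Record transcribed from the entry above; the dict layout is an assumption.
entry = {
    "name": "RGBD Object dataset",
    "Introduced": "ICRA 2011",
    "Device": "Kinect v1",
    "Description": "300 instances of household objects, in 51 categories. "
                   "250,000 frames in total",
    "Labelling": "Category and instance labelling. Includes auto-generated "
                 "masks, but no exact 6DOF pose information.",
}

print(render_entry(entry))
```

Keeping the records in one structured file and generating the page from it means a new dataset only has to be added in one place.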

Bigbird dataset

Segmentation and pose estimation under controlled conditions

Object segmentation dataset

Willow Garage Dataset

'3D Model-based Object Recognition and Segmentation in Cluttered Scenes'

'A Global Hypotheses Verification Method for 3D Object Recognition'

'Model Based Training, Detection and Pose Estimation of Texture-Less 3D Objects in Heavily Cluttered Scenes'

Kinect data from the real world

RGBD Scenes dataset

RGBD Scenes dataset v2

'Object Disappearance for Object Discovery'

'Object Discovery in 3D scenes via Shape Analysis'

Cornell-RGBD-Dataset

NYU Dataset v1

NYU Dataset v2

'Object Detection and Classification from Large-Scale Cluttered Indoor Scans'

SUN3D

B3DO: Berkeley 3-D Object Dataset

SLAM, registration and camera pose estimation

TUM Benchmark Dataset

Microsoft 7-scenes dataset

IROS 2011 Paper Kinect Dataset

'When Can We Use KinectFusion for Ground Truth Acquisition?'

DAFT Dataset

ICL-NUIM Dataset

'Automatic Registration of RGB-D Scans via Salient Directions'

Stanford 3D Scene Dataset

Tracking

Princeton Tracking Benchmark

Datasets involving humans: Body and hands

Cornell Activity Datasets: CAD-60 and CAD-120

RGB-D Person Re-identification Dataset

Sheffield Kinect Gesture (SKIG) Dataset

RGB-D People Dataset

50 Salads

Microsoft Research Cambridge-12 Kinect gesture data set

UR Fall Detection Dataset

RGBD-HuDaAct

Human3.6M

Datasets involving humans: Head and face

Biwi Kinect Head Pose Database

Eurecom Kinect Face Dataset

3D Mask Attack Dataset

Biwi 3D Audiovisual Corpus of Affective Communication - B3D(AC)^2

ETH Face Pose Range Image Data Set