zoukankan      html  css  js  c++  java
  • Mongodb——GridFS


    GridFS用于存储和恢复那些超过16M(BSON文件限制)的文件。

    GridFS将文件分成大块,将每个大块存储为单独的文件.GridFS中限制chunk最大为256k。GridFS使用两个collection存储,一个存储chunks,一个存储元数据(metadata)。
    fs.files和fs.chunks


    When should I use GridFS?
    http://docs.mongodb.org/manual/faq/developers/#faq-developers-when-to-use-gridfs


    file Collection:具体形式如下
    {
      "_id" : <ObjectID>,
      "length" : <num>,
      "chunkSize" : <num>
      "uploadDate" : <timestamp>
      "md5" : <hash>

      "filename" : <string>,
      "contentType" : <string>,
      "aliases" : <string array>,
      "metadata" : <dataObject>,
    }

    Documents in the files collection contain some or all of the following fields. Applications may create additional arbitrary fields:

    files._id
        The unique ID for this document. The _id is of the data type you chose for the original document. The default type for MongoDB documents is BSON ObjectID.

    files.length
        The size of the document in bytes.

    files.chunkSize
        The size of each chunk. GridFS divides the document into chunks of the size specified here. The default size is 256 kilobytes.

    files.uploadDate
        The date the document was first stored by GridFS. This value has the Date type.

    files.md5
        An MD5 hash returned from the filemd5 API. This value has the String type.

    files.filename
        Optional. A human-readable name for the document.

    files.contentType
        Optional. A valid MIME type for the document.

    files.aliases
        Optional. An array of alias strings.

    files.metadata
        Optional. Any additional information you want to store.


    The chunks Collection:举例如下
    {
      "_id" : <string>,
      "files_id" : <string>,
      "n" : <num>,
      "data" : <binary>
    }

    A document from the chunks collection contains the following fields:
    chunks._id
        The unique ObjectID of the chunk.

    chunks.files_id
        The _id of the “parent” document, as specified in the files collection.

    chunks.n
        The sequence number of the chunk. GridFS numbers all chunks, starting with 0.

    chunks.data
        The chunk’s payload as a BSON binary type.

    GridFS Index

    GridFS使用chunks中files_id和n域作为混合索引,files_id是父文档的_id,n域包含chunk的序列号,该值从0开始。
    GridFS索引支持快速恢复数据。

    cursor = db.fs.chunks.find({files_id: myFileID}).sort({n:1});

    如果没有建立索引,可以使用下列shell命令:
    db.fs.chunks.ensureIndex( { files_id: 1, n: 1 }, { unique: true } );

    Example Interface:

    // returns default GridFS bucket (i.e. "fs" collection)
    GridFS myFS = new GridFS(myDatabase);

    // saves the file to "fs" GridFS bucket
    myFS.createFile(new File("/tmp/largething.mpg"));

    接口支持额外的GridFS buckets
    // returns GridFS bucket named "contracts"
    GridFS myContracts = new GridFS(myDatabase, "contracts");

    // retrieve GridFS object "smithco"
    GridFSDBFile file = myContracts.findOne("smithco");

    // saves the GridFS file to the file system
    file.writeTo(new File("/tmp/smithco.pdf"));

  • 相关阅读:
    HDU 3280 Equal Sum Partitions(二分查找)
    提高效率—编程中的技巧
    开发心得
    使用OpenCV读、操作、写图像并与bash合作对某个文件夹下全部图像进行相似处理
    观察者模式
    ORA-00932: inconsistent datatypes: expected
    Struts1应用、实现简单计算器、使用DispatchAction、显示友好的报错信息、使用动态Form简化开发
    Apple Swift编程语言新手教程
    I NEED A OFFER! (hdu 1203 01背包)
    端午遥想
  • 原文地址:https://www.cnblogs.com/bigbigtree/p/3156554.html
Copyright © 2011-2022 走看看