zoukankan      html  css  js  c++  java
  • 信息检索关键词部分

     

    Key word

    1

    信息检索(Information Retrieval, IR      数据检索(data retrieval

    相关性(relevance                      推送(Push                

    超空间(hyperspace                    拉出(pulling

    文献逻辑表示(视图)(logical view of the document

    检索任务(retrieval task                   检索(retrieval

    过滤(filtering                         全文本(full text

    词干提取(stemming                    文本操作(text operation

    标引词(indexing term                  信息检索策略(retrieval strategy

    光学字符识别(Optical Character Recognition, OCR

    跨语言(cross-language                  倒排文档(inverted file

    检出文献(retrieved document            相关度(likelihood

    信息检索的人机交互界面(human-computer interaction, HCI

    检索模型与评价(Retrieval Model & Evaluation)文本图像(textual images 

    界面与可视化(Interface & Visualization   书目系统(bibliographic system

    多媒体建模与检索(Multimedia Modeling & Searching

    数字图书馆(Digital Library              检索评价(retrieval evaluation

    标准通用标记语言(Standard Generalized Markup Language, SGML

    标引和检索(indexing and searching       导航(Navigation

    并行和分布式信息检索(parallel and distribution IR

    模型与查询语言(model and query language)导航(Navigation

    有效标引与检索(efficient indexing and searching

     

    2

    特别检索(ad hoc retrieval     过滤(filtering     集合论(set theoretic             代数(algebraic                   概率(probabilistic     路由选择(routing

    用户需求档(user profile           阙值(threshold      权值(weight 

    语词加权(term-weighting          相似度(similarity   相异度(dissimilarity

    域建模(domain modeling          叙词表(thesaurus   扁平(flat

    广义向量空间模型(generalized vector space model          神经元(neuron    

    潜语义标引模型(latent semantic indexing model            邻近结点(proximal node       

    贝叶斯信任度网络(Bayesian belief network                结构导向(structure guided

    结构化文本检索(structured text retrieval, STR          推理网络(inference network

    扩展布尔模型(extended Boolean model              非重叠链表(non-overlapping list

     

    3

    检索性能评价(retrieval performance evaluation      会话(interactive session

    查全率(R, Recall Ratio)                             信息性(Informativeness

    查准率(P, Precision Ratio)                            面向用户(user-oriented

    漏检率(O, Omission Ratio)                           新颖率(novelty ratio

    误检率(M, Miss Ratio)                               用户负担(user effort

    相对查全率(relative recall                         覆盖率(coverage ratio

    参考测试集(reference test collection                优劣程度(goodness

    查全率负担(recall effort                          主观性(subjectiveness 

    信息性测度(informativeness measure

     

    4

    检索单元(retrieval unit     字母表(alphabet          分隔符(separator       

    复合性(compositional      模糊布尔(fuzzy Boolean    模式(pattern

    SQL(Structured Query Language, 结构化查询语言)   布尔查询(Boolean query   

    参照(reference    半结合(semijoin            标签(tag                   

    有序包含(ordered inclusion                      无序包含(unordered inclusion                                       

    CCL(Common Command Language, 通用命令语言)    树包含(tree inclusion

    布尔运算符(Boolean operator                    searching allowing errors容错查询

    Structured Full-text                                relevance feedback 相关反馈

    Query Language (SFQL) (结构化全文查询语言)     extended patterns扩展模式            

    CD-RDx Compact Disk Read only Data exchange (CD-RDx)(只读磁盘数据交换)

    WAIS (广域信息服务系统Wide Area Information Service) 

    visual query languages. 查询语言的可视化               查询语法树(query syntax tree

     

    5

    query reformulation 查询重构 query expansion 查询扩展                                 term reweighting 语词重新加权                相似性叙词表(similarity thesaurus

    User Relevance Feedback用户相关反馈         the graphical interfaces 图形化界面

    簇(cluster   检索同义词(searchonym    local context analysis局部上下文分析

     

    6

    文献(document           样式(style       元数据(metadata

    Descriptive Metadata 描述性元数据                Semantic Metadata 语义元数据

    intellectual property rights 知识产权                content rating 内容等级

    digital signatures数字签名                         privacy levels 权限

    electronic commerce电子商务                     

    都柏林核心元数据集(Dublin Core Metadata Element Set

    通用标记语言(SGMLstandard general markup language                                

    机读目录记录(Machine Readable Cataloging Record, MARC

    资源描述框架(Resource Document Framework, RDF)                    XML(eXtensible Markup Language, 可扩展标记语言

    HTMLHyperText Markup Language, 超文本标记语言)

    Tagged Image File Format (TIFF标签图像文件格式)

    Joint Photographic Experts Group (JPEG) Portable Network Graphics (PNG新型位图图像格式)

     

    7

    分隔符(separator                   连字符(hyphen

    排除表(list of stopwords             词干提取(stemming

    波特(porter                        词库(treasury of words

    受控词汇表(controlled vocabulary     索引单元(indexing component

    文本压缩text compression               压缩算法compression algorithm

    注释(explanation                    统计方法(statistical method

    赫夫曼(Huffman                    压缩比(compression ratio

    数据加密Encryption                   半静态的(semi-static

    词汇分析lexical analysis                排除停用词elimination of stopwords

     

    8

    半静态(semi-static191                   词汇表(vocabulary192           

    事件表(occurrence192                   inverted files倒排文档     

    suffix arrays后缀数组                      signature files签名档

    块寻址(block addressing193              索引点(index point199

    起始位置(beginning199                  Vocabulary search词汇表检索

    Retrieval of occurrences 事件表检索          Manipulation of occurrences事件表操作

    散列变换(hashing205                    误检(false drop205

    查询语法树(query syntax tree207           布鲁特-福斯算法简称BFBrute-Force

    故障(failure210    移位-或(shift-or    位并行处理(bit-parallelism212

    顺序检索(sequential search220            原位(in-place227

     

    9

    并行计算(parallel computing           SISD (单指令流单数据流)

    SIMD (单指令流多数据流)             MISD (多指令流单数据流)

    MIMD (多指令流多数据流)            分布计算(distributed computing

    颗粒度(granularity231                多任务(multitasking

    I/Oinput/output233                  标引器(indexer

     映射(map233                      命中列表(hit-list

    全局语词统计值(global term statistics   线程(thread

    算术逻辑单元(arithmetic logic unit, ALU 中介器(broker

    虚拟处理器(virtual processor240

    分布式信息检索(distributed information retrieval)249

    文献收集器(gatherer                 主中介器(central broker254

     

    10

    信息可视化(information visualization        图标(icon260

    颜色凸出显示(color highlighting            焦点+背景(focus-plus-context

    画笔和链接(brushing and linking           魔术透镜(magic lenses

    移动镜头和调焦(panning and zooming       弹性窗口(elastic window

    概述及细节信息(overview plus details        高亮色显示(highlight

    信息存取任务(information access tasks       文献替代(document surrogate

    常见问题(FAQ, Frequently Asked Question)      群体性推荐(social recommendation

    上下文关键词(keyword-in-context, KWIC      伪相关反馈(pseudo-relevance feedback

    重叠式窗口(overlapping window            工作集(working set

     

    11/12

    多媒体信息检索(Multimedia Information Retrieval, MIR 超类(superclass

    半结构化数据(semi-structured data                     数据片(data blade

    可扩充型系统(extensible type system                    相交(intersect

    动态服务器(dynamic server                            叠加(overlaps

    档案库服务器(archive server                           聚集(center

    逻辑结构(logical structure                             词包含(contain word

    例子中的查询(query by example                        路径名(path-name

    通过图像内容查询(Query by Image Content, QBIC       图像标题(image header

    主要成分分析(Principal Component Analysis, PCA       精确匹配(exact match

    潜语义标引(Latent Semantic Indexing, LSI              基于内容(content-based

    范围查寻(Range Query

     

    13

    exponential growth指数增长                Distributed data 数据的分布性  

    volatile data 不稳定数据                   redundant data 冗余数据   

    Heterogeneous data异构数据               分界点(cut point373 

    Centralized Architecture集中式结构         收集器-标引器(crawler-indexer373

    Wanderers 漫步者     Walkers 步行者     Knowbots 知识机器人

    Distributed Architecture分布式结构         gatherers 收集器    

    brokers 中介器                           the query interface 查询界面  

    the answer interface响应界面               PageRank 网页级别   

    Crawling the Web漫游Web                 breadth-first 广度优先   

    depth-first fashion 深度优先                Indicesindex pl.)索引   

    Web Directories 网络目录                  Metasearchers元搜索引擎 

    Teaching the User用户培训                 颗粒度(granularity384   

    超文本推导主题检索(Hypertext Included Topic Search, HITS380                              

    Specific queries专指性查询                 Broad queries 泛指性查询

    Vague queries模糊查询                    Searching using Hyperlinks使用超链接搜索

    Web Query Languages查询语言             Dynamic Search 动态搜索  

    Software Agents 软件代理鱼式搜索(fish search

    鲨鱼搜索(shark search)拉出/推送(pull/push393  

    门户(portal395                         Duplicated data 重复数据    

     

    14

    联机公共检索目录(online public access catalog, OPAC397

    化学文摘(Chemical Abstract, CA399      生物学文摘(Biological Abstract, BA

    工程索引(Engineering Index,EI

    国会图书馆分类法(Library of Congress Classification408

    杜威十进分类法(Dewey Decimal Classification408

    联机计算机图书馆中心(Online Computer Library Center, OCLC409

    机读目录记录(Machine Readable Cataloging Record, MARC409

     

    15

    NSF (National Science Foundation, 美国国家科学基金会)

    NSNANational Aeronautics and Space Administration 美国航空航天局)

    数字图书馆创新项目(Digital Libraries Initiative, DLI415

    5Sstream,信息流structure,结构space, 空间scenario, 场景society社会)416

    基于数字化对象标识符(Digital Object Identifier, DOI420

    都柏林核心(Dublin Core, DC430       数字图书馆(Digital Library, DL

    资源描述框架(Resource Document Framework, RDF)431

    text encoding initiative (TEI) (文本编码创新项目)431

     

    v
    作者:不老神仙
    本文版权归作者和博客园共有,欢迎转载,但未经作者同意必须保留此段声明,且在文章页面明显位置给出原文连接,否则保留追究法律责任的权利。
  • 相关阅读:
    美团面试,360面试 ,滴滴面试,阿里面试,百度面试,京东面试,搜狗面试:
    Maven 3-Maven依赖版本冲突的分析及解决小结 (阿里,美团,京东面试)
    maven snapshot和release版本的区别
    Maven 生命周期 和插件
    Maven pom 文件解释
    Zookeeper原理架构
    sublime 支持PHP语法提示
    Zen Coding 用法
    让浏览器屏蔽js
    淘宝设计师入门:设计师SDK环境配置
  • 原文地址:https://www.cnblogs.com/allanbolt/p/1489828.html
Copyright © 2011-2022 走看看