Elasticsearch mapping: document similarity algorithms

Elasticsearch allows you to configure a scoring algorithm, or similarity, per field. The similarity setting provides a simple way of choosing a similarity algorithm other than the default BM25, such as TF/IDF.

Similarities are mostly useful for text fields, but can also apply to other field types.

Custom similarities can be configured by tuning the parameters of the built-in similarities. For more details about these expert options, see the similarity module.
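
As a rough illustration of what tuning the parameters of a built-in similarity can look like, the sketch below builds a request body that defines a custom BM25 variant in the index settings and refers to it by name from a field mapping. The names `my_bm25` and `title` and the values chosen for `k1` and `b` are assumptions made for this example only; the similarity module documentation remains the authority on the available parameters.

```python
import json


def custom_bm25_index_body(similarity_name="my_bm25", k1=1.2, b=0.5):
    """Build an index-creation body with a custom BM25 similarity.

    similarity_name, k1 and b are illustrative values, not recommendations.
    """
    return {
        "settings": {
            "index": {
                "similarity": {
                    # A named similarity based on the built-in BM25 type.
                    similarity_name: {"type": "BM25", "k1": k1, "b": b}
                }
            }
        },
        "mappings": {
            "my_type": {
                "properties": {
                    # Hypothetical field that uses the custom similarity.
                    "title": {"type": "text", "similarity": similarity_name}
                }
            }
        },
    }


if __name__ == "__main__":
    # Print the JSON that would be sent as the body of `PUT my_index`.
    print(json.dumps(custom_bm25_index_body(), indent=2))
```

Lowering `b` reduces how strongly field length affects the score, which is one reason someone might define such a variant for short fields.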

The only similarities that can be used out of the box, without any further configuration, are:

BM25
The Okapi BM25 algorithm. The algorithm used by default in Elasticsearch and Lucene. See Pluggable Similarity Algorithms for more information.
classic
The TF/IDF algorithm which used to be the default in Elasticsearch and Lucene. See Lucene’s Practical Scoring Function for more information.
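
To make the difference between the two similarities more concrete, here is a simplified sketch of how a single term could be scored under each. It follows the commonly cited forms of Okapi BM25 and of Lucene's practical scoring function, but it deliberately ignores boosts, query normalization and the coordination factor, and the exact IDF variants differ between Lucene versions, so treat it as an approximation rather than what Elasticsearch computes internally. The function names and default parameter values are assumptions for this example.

```python
import math


def bm25_term_score(freq, doc_len, avg_doc_len, doc_count, doc_freq,
                    k1=1.2, b=0.75):
    """Approximate BM25 score of one term in one document."""
    # Rare terms (small doc_freq) receive a larger IDF weight.
    idf = math.log(1 + (doc_count - doc_freq + 0.5) / (doc_freq + 0.5))
    # Length normalization: longer-than-average documents are penalized.
    length_norm = k1 * (1 - b + b * doc_len / avg_doc_len)
    # Term frequency saturates: the fraction never exceeds k1 + 1.
    return idf * freq * (k1 + 1) / (freq + length_norm)


def classic_term_score(freq, doc_len, doc_count, doc_freq):
    """Approximate classic TF/IDF score of one term in one document."""
    tf = math.sqrt(freq)                           # grows without bound
    idf = 1 + math.log(doc_count / (doc_freq + 1))
    field_norm = 1 / math.sqrt(doc_len)            # shorter fields score higher
    # idf appears squared in the practical scoring function.
    return tf * idf * idf * field_norm


if __name__ == "__main__":
    for freq in (1, 2, 10, 100):
        bm25 = bm25_term_score(freq, 100, 100, 1000, 10)
        classic = classic_term_score(freq, 100, 1000, 10)
        print(f"freq={freq:>3}  bm25={bm25:.3f}  classic={classic:.3f}")
```

Running the loop shows the practical difference: the BM25 score flattens out as the term frequency rises, while the classic score keeps growing with the square root of the frequency.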

The similarity can be set on the field level when a field is first created, as follows:
PUT my_index
{
  "mappings": {
    "my_type": {
      "properties": {
        "default_field": {
          "type": "text"
        },
        "classic_field": {
          "type": "text",
          "similarity": "classic"
        }
      }
    }
  }
}

The default_field uses the BM25 similarity.

The classic_field uses the classic similarity (i.e. TF/IDF).
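
When several fields need different similarities, the request body for a mapping like this one can also be generated rather than written by hand. The helper below is a minimal sketch under that assumption: `build_similarity_mapping` is a hypothetical name, it only handles `text` fields, and a value of `None` means "leave the similarity unset so the default applies".

```python
import json


def build_similarity_mapping(field_similarities, type_name="my_type"):
    """Build a mappings body from {field name: similarity name or None}."""
    properties = {}
    for field, similarity in field_similarities.items():
        field_mapping = {"type": "text"}
        if similarity is not None:
            # Only emit the setting when a non-default similarity is requested.
            field_mapping["similarity"] = similarity
        properties[field] = field_mapping
    return {"mappings": {type_name: {"properties": properties}}}


if __name__ == "__main__":
    body = build_similarity_mapping(
        {"default_field": None, "classic_field": "classic"}
    )
    # This JSON is intended as the body of `PUT my_index`.
    print(json.dumps(body, indent=2))
```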

Reference: https://www.elastic.co/guide/en/elasticsearch/reference/current/similarity.html

Original article: https://www.cnblogs.com/bonelee/p/6472719.html