我会建议使用match_phrase,而不是使用带有前导和尾随通配符的查询字符串。即使是标准分析仪也应该能够正确地将slu split分成令牌,所以不需要通配符。
curl -XPUT "localhost:9200/slugs/doc/1" -d '{"slug": "my-super-string"}'
echo
curl -XPUT "localhost:9200/slugs/doc/2" -d '{"slug": "my-other-string"}'
echo
curl -XPUT "localhost:9200/slugs/doc/3" -d '{"slug": "my-little-string"}'
echo
curl -XPOST "localhost:9200/slugs/_refresh"
echo
echo "Searching for my"
curl "localhost:9200/slugs/doc/_search?pretty=true&fields=slug" -d '{"query" : { "match_phrase": {"slug": "my"} } }'
echo
echo "Searching for my-super"
curl "localhost:9200/slugs/doc/_search?pretty=true&fields=slug" -d '{"query" : { "match_phrase": {"slug": "my-super"} } }'
echo
echo "Searching for my-other"
curl "localhost:9200/slugs/doc/_search?pretty=true&fields=slug" -d '{"query" : { "match_phrase": {"slug": "my-other"} } }'
echo
echo "Searching for string"
curl "localhost:9200/slugs/doc/_search?pretty=true&fields=slug" -d '{"query" : { "match_phrase": {"slug": "string"} } }'
或者,您也可以创建自己的分析,将只蛞蝓分成记号“ - ”
curl -XDELETE localhost:9200/slugs
curl -XPUT localhost:9200/slugs -d '{
"settings": {
"index": {
"number_of_shards": 1,
"number_of_replicas": 0,
"analysis": {
"analyzer" : {
"slug_analyzer" : {
"tokenizer": "slug_tokenizer",
"filter" : ["lowercase"]
}
},
"tokenizer" :{
"slug_tokenizer" : {
"type": "pattern",
"pattern": "-"
}
}
}
}
},
"mappings" :{
"doc" : {
"properties" : {
"slug" : {"type": "string", "analyzer" : "slug_analyzer"}
}
}
}
}'
你好imotov,感谢您的回答。 – Vinc
随着match_phrase我必须有一个完全匹配有结果。 所以我尝试了match_phrase_prefix,效果很好,但是如果我搜索“super-s”,我还需要一个“match_phrase_suffix”,我想要“我的超级字符串”。 事实上,我想有一个简单的通配符,比如'* -str *',它可以匹配任何包含“-str”的slug 这是我遇到问题的“ - ”字符。任何时候,我有一个我的查询,我没有得到任何结果。 – Vinc
哦,我明白了。然后它是这样的:http://stackoverflow.com/questions/6467067/how-to-search-for-a-part-of-a-word-with-elasticsearch/6471449#6471449 – imotov