2016-08-24

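I have a custom analyzer that adds additional terms from an ontology. I also want to stem the terms before indexing. The following index settings and mappings, taken from the elasticsearch-head plugin, show how the terms are analyzed and indexed in Elasticsearch: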

{ 
    "state": "open", 
    "settings": { 
     "index": { 
      "refresh_interval": "1000s", 
      "number_of_shards": "5", 
      "creation_date": "1471931611750", 
      "analysis": { 
       "filter": { 
        "owlfilter": { 
         "type": "owl", 
         "indexName": "ontoowl", 
         "expansionType": "RDFSLABEL", 
         "owlFile": "/home/tannys/elasticsearch-2.3.0/ontologyWorkTrial/myownowl.owl" 
        } 
       }, 
       "analyzer": { 
        "owlanalyzer": { 
         "filter": ["owlfilter","porter_stem"], 
         "type": "custom", 
         "tokenizer": "standard" 
        } 
       } 
      }, 
      "number_of_replicas": "1", 
      "uuid": "d8Ub8A0eSm65geMK_bpdvw", 
      "version": {"created": "2030099"} 
     } 
    }, 
    "mappings": { 
     "mytype": { 
      "properties": { 
       "nameortitle": { 
        "search_analyzer": "standard", 
        "analyzer": "owlanalyzer", 
        "store": true, 
        "type": "string" 
       }, 
       "description": { 
        "search_analyzer": "standard", 
        "analyzer": "owlanalyzer", 
        "store": true, 
        "type": "string" 
       } 
      }, 
      "aliases": [ ] 
     } 
    } 
} 

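Ironically, the results were better before I added the porter_stem filter, so I am not sure what went wrong. I would like to see the terms that were actually indexed. How can I inspect what the analyzer produced, the way Luke does for Lucene? Any guidance is appreciated.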

Answer

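You can use the Term Vectors API. It returns the terms indexed for the fields of a given document. You can also use the Multi termvectors API to view the terms of several documents in the same way.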
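
As a rough illustration, the two requests could be built from Python along the following lines. This is only a sketch under some assumptions: the index name `myindex`, the document ids, and the node address `http://localhost:9200` are hypothetical and not taken from the question, and the endpoint paths follow the Elasticsearch 2.x layout (`/{index}/{type}/{id}/_termvectors` and `/{index}/{type}/_mtermvectors`). The type `mytype` and the field names come from the mapping above.

```python
import json
import urllib.request

# Assumption: a local node on the default port; adjust for your cluster.
ES_URL = "http://localhost:9200"


def termvectors_request(index, doc_type, doc_id, fields):
    """Build (path, body) for a single-document term vectors request."""
    path = "/{}/{}/{}/_termvectors".format(index, doc_type, doc_id)
    # term_statistics adds document and total term frequencies per term.
    body = {"fields": list(fields), "term_statistics": True}
    return path, body


def mtermvectors_request(index, doc_type, doc_ids, fields):
    """Build (path, body) for a multi-document term vectors request."""
    path = "/{}/{}/_mtermvectors".format(index, doc_type)
    body = {"ids": list(doc_ids), "parameters": {"fields": list(fields)}}
    return path, body


def send(path, body, base_url=ES_URL):
    """POST the JSON body to the node and return the decoded response."""
    req = urllib.request.Request(
        base_url + path,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read().decode("utf-8"))


def indexed_terms(response, field):
    """Return the sorted terms reported for one field of a term vectors response."""
    return sorted(response["term_vectors"][field]["terms"])


# Example usage (needs a running node; "myindex" and id "1" are hypothetical):
# path, body = termvectors_request("myindex", "mytype", "1",
#                                  ["nameortitle", "description"])
# print(indexed_terms(send(path, body), "nameortitle"))
```

The terms listed this way are what owlanalyzer actually wrote to the index, so they can be compared with what the standard search_analyzer produces for the same text. If the indexed terms are stemmed and the query terms are not, that mismatch may be worth checking as a possible cause of the worse results.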