2014-10-26 67 views
0
String prop = "austen.prop"; 
Properties props = StringUtils.propFileToProperties(prop); 
String to = props.getProperty("serializeTo"); 
props.setProperty("serializeTo", "C:\\ner-jxy-model.ser.gz"); 
SeqClassifierFlags flags = new SeqClassifierFlags(props); 
CRFClassifier<CoreLabel> crf = new CRFClassifier<CoreLabel>(flags); 
crf.train(); 

,这是我austen.prop我训练斯坦福NER程序,但没有得到模型文件

#location of the training file 
trainFile = train.tsv 
#location where you would like to save (serialize to) your 
#classifier; adding .gz at the end automatically gzips the file, 
#making it faster and smaller 
serializeTo = ner-model.ser.gz 

#structure of your training file; this tells the classifier 
#that the word is in column 0 and the correct answer is in 
#column 1 
map = word=0,answer=1 

#these are the features we'd like to train with 
#some are discussed below, the rest can be 
#understood by looking at NERFeatureFactory 
useClassFeature=true 
useWord=true 
useNGrams=true 
#no ngrams will be included that do not contain either the 
#beginning or end of the word 
noMidNGrams=true 
useDisjunctive=true 
maxNGramLeng=6 
usePrev=true 
useNext=true 
useSequences=true 
usePrevSequences=true 
maxLeft=1 
#the next 4 deal with word shape features 
useTypeSeqs=true 
useTypeSeqs2=true 
useTypeySequences=true 
wordShape=chris2useLC 

我训练斯坦福NER程序,但没有得到模型文件NER-model.ser。 GZ。

但它工作时,我直接使用crf分类。

+0

我恰好具有我同样的问题。你发现了什么错?我无法在文档中找到任何线索。 – cheseaux 2014-11-12 14:49:04

+0

在你的prop文件中,我可以看到你想要序列化到某个文件,然后以编程方式给它另外一个名字...为什么是这样? – MiNdFrEaK 2016-01-22 04:57:49

回答

1

你需要()crf.train后,这行代码

crf.serializeClassifier("path/to/model.ser.gz"); 
相关问题