2017-08-16 37 views
0

ATG - Endeca Baseline更新失败,ART应用程序中出现以下 错误。但部分索引是成功的。Endeca - 基线更新失败 - 从记录存储中读取错误

附加CAS日志也用于corressponding错误。

Aug 15, 2017 12:23:37 PM com.endeca.soleng.eac.toolkit.script.Script runBeanShellScript 
SEVERE: Crawl 'ART-last-mile-crawl' failed with error: Problem running full acquisition on data source for ART-last-mile-crawl: Error reading from Record Store ART-data: malformed input around byte 10. 
Occurred while executing line 11 of valid BeanShell script: 
[[ 

8|  Dgidx.cleanDirs(); 
9| 
10|  // run crawl and archive any changes in dvalId mappings 
11|  CAS.runBaselineCasCrawl("ART-last-mile-crawl"); 
12|  CAS.archiveDvalIdMappingsForCrawlIfChanged("ART-last-mile-crawl"); 
13| 
14|  // archive logs and run the indexer 

]] 

Aug 15, 2017 12:23:37 PM com.endeca.soleng.eac.toolkit.Controller execute 
SEVERE: Caught an exception while invoking method 'run' on object 'BaselineUpdate'. Releasing locks. 
java.lang.reflect.InvocationTargetException 
     at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) 
     at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) 

在CAS的错误日志:

2017-08-15 12:23:36,485 ERROR [ART-data] [cas-ART-last-mile-crawl-worker-1] com.endeca.itl.recordstore.impl.RecordStoreImpl: Error executing method RecordStoreImpl.readRecords() 
com.endeca.itl.recordstore.RecordStoreException: malformed input around byte 10 
     at com.endeca.itl.recordstore.impl.ReadCursor.read(ReadCursor.java:81) 
     at com.endeca.itl.recordstore.impl.RecordStoreImpl.readRecords(RecordStoreImpl.java:480) 
     at sun.reflect.GeneratedMethodAccessor106.invoke(Unknown Source) 
     at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) 
     at java.lang.reflect.Method.invoke(Method.java:497) 
     at com.endeca.itl.service.ServicePublisher$1.invoke(ServicePublisher.java:121) 
     at com.sun.proxy.$Proxy57.readRecords(Unknown Source) 
     at com.endeca.itl.recordstore.RecordStoreReader.fetchNextChunk(RecordStoreReader.java:267) 
     at com.endeca.itl.recordstore.RecordStoreReader.hasNext(RecordStoreReader.java:244) 
     at com.endeca.itl.extension.source.merger.RecordStoreMergerDataSourceRuntime$RecordStoreReadSession.runFull(RecordStoreMergerDataSourceRuntime.java:252) 
     at com.endeca.itl.extension.source.merger.RecordStoreMergerDataSourceRuntime.runFullAcquisition(RecordStoreMergerDataSourceRuntime.java:148) 
     at com.endeca.itl.util.CasExtensionRegistry$ContextClassLoaderDataSourceExtensionRuntime$2.doWork(CasExtensionRegistry.java:220) 
     at com.endeca.itl.util.CasExtensionRegistry$ContextClassLoaderDataSourceExtensionRuntime$2.doWork(CasExtensionRegistry.java:218) 
     at com.endeca.itl.plugin.ThreadContextRunner.run(ThreadContextRunner.java:136) 
     at com.endeca.itl.plugin.ThreadContextRunner.run(ThreadContextRunner.java:89) 
     at com.endeca.itl.util.CasExtensionRegistry$ContextClassLoaderDataSourceExtensionRuntime.runFullAcquisition(CasExtensionRegistry.java:218) 
     at com.endeca.itl.executor.extension.ExtensionDataSourceProcessor.processRecord(ExtensionDataSourceProcessor.java:104) 
     at com.endeca.itl.executor.extension.IncrementalDataSourceProcessor.processRecord(IncrementalDataSourceProcessor.java:106) 
     at com.endeca.itl.executor.TaskManager$2.work(TaskManager.java:166) 
     at com.endeca.itl.executor.WorkExecutor$WorkRunnable.run(WorkExecutor.java:194) 
     at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) 
     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) 
     at java.lang.Thread.run(Thread.java:745) 
     at com.endeca.itl.util.LoggingContextAwareThread.run(LoggingContextAwareThread.java:71) 
Caused by: java.io.UTFDataFormatException: malformed input around byte 10 
     at java.io.DataInputStream.readUTF(DataInputStream.java:656) 
     at java.io.DataInputStream.readUTF(DataInputStream.java:564) 
     at com.endeca.itl.recordstore.impl.storage.RecordStorageEntry.load(RecordStorageEntry.java:114) 

如何分流和调试进一步将是有益的任何输入

+0

看起来像损坏的数据,不能使用UTF-8转换。是在这个环境中工作过的基线还是最近开始的? –

+0

感谢@AjayAgrawal的回应。直到有一天它工作良好。 ./recordstore-cmd.sh read-baseline -a ART-data -c即使记录计数也会产生相同的错误。 “退出错误,失败,字节10周围的输入格式错误” –

+0

我会建议初始化服务并查看它是否有效。似乎像记录存储损坏.. –

回答

2

上述错误是完成以下后解决脚步。

  1. 第1步 - 在添加以下更改后导出和导入CAS配置。 <ignoreInvalidRecords>true</ignoreInvalidRecords>

    recordstore-cmd.sh get-configuration -a ART-data -f dataConfig.xml 
    recordstore-cmd.sh set-configuration -a ART-data -f dataConfig.xml 
    
  2. 第二步 - 相同的变化,在本记录存储配置文件进行下<CAS_WS>/workspace/state/ART-data

    <?xml version="1.0" encoding="UTF-8" standalone="yes"?> 
    <recordStoreConfiguration xmlns="http://recordstore.itl.endeca.com/"> 
        <changePropertyNames/> 
        <idPropertyName>record.id</idPropertyName> 
        <ignoreInvalidRecords>true</ignoreInvalidRecords><!-- newly added --> 
        <jdbmSettings/> 
    </recordStoreConfiguration> 
    
  3. 步骤3 - 将cas_output<Endeca_apps>/ART/data/cas_output用备份替换(2天过去)那是可用的。

经过上述步骤,索引是直接从后端启动(通过调用脚本)。一旦这个索引成功,索引就从发电机中调用,并且成功。