我试图让XML解析器后数组元素如下:logstash过滤器:让所有的数组元素为新事件
filter {
xml {
source => "message"
target => "xmldata"
store_xml => "false"
xpath => ["/OMA/ESMLog/LogEntry/Index/text()","index"]
xpath => ["/OMA/ESMLog/LogEntry/Status/text()","status"]
xpath => ["/OMA/ESMLog/LogEntry/TimeStampRaw/text()","timestampraw"]
xpath => ["/OMA/ESMLog/LogEntry/Description/text()","description"]
}
mutate {
remove_field => [ "message", "inxml", "xmldata" ]
}
mutate {
replace => {
"index" => "%{[index][0]}"
"status" => "%{[status][0]}"
"timestampraw" => "%{[timestampraw][0]}"
"description" => "%{[description][0]}"
}
}
date {
match => [ "timestampraw", "UNIX" ]
}
}
正如你可以看到我能够从阵列中获得每一个第一要素,但是我怎样才能从数组中获取所有元素作为新事件? 因此,我希望将每个'LogEntry'元素看作XML中的新事件。 这里一些示例XML(原始XML来自OMSA):
<?xml version="1.0" encoding="UTF-8"?>
<OMA>
<ESMLog>
<LogEntry>
<Index>0</Index>
<Status>2</Status>
<TimeStamp>Tue Nov 3 07:22:57 2015</TimeStamp>
<TimeStampRaw>1446535377</TimeStampRaw>
<Description>The system board Mem2 temperature is within range.</Description>
</LogEntry>
<LogEntry>
<Index>1</Index>
<Status>3</Status>
<TimeStamp>System Boot</TimeStamp>
<TimeStampRaw>1446535378</TimeStampRaw>
<Description>The system board Mem2 temperature is less than the lower warning threshold.</Description>
</LogEntry>
<LogEntry>
<Index>2</Index>
<Status>2</Status>
<TimeStamp>Mon Nov 2 14:17:09 2015</TimeStamp>
<TimeStampRaw>1446473829</TimeStampRaw>
<Description>Drive 0 is installed in disk drive bay 1.</Description>
</LogEntry>
<LogEntry>
<Index>3</Index>
<Status>4</Status>
<TimeStamp>Mon Nov 2 14:17:04 2015</TimeStamp>
<TimeStampRaw>1446473824</TimeStampRaw>
<Description>Drive 0 is removed from disk drive bay 1.</Description>
</LogEntry>
<LogEntry>
<Index>4</Index>
<Status>2</Status>
<TimeStamp>Mon Nov 2 14:15:54 2015</TimeStamp>
<TimeStampRaw>1446473754</TimeStampRaw>
<Description>Drive 0 is installed in disk drive bay 1.</Description>
</LogEntry>
<LogEntry>
<Index>5</Index>
<Status>4</Status>
<TimeStamp>Mon Nov 2 13:58:54 2015</TimeStamp>
<TimeStampRaw>1446472734</TimeStampRaw>
<Description>Drive 0 is removed from disk drive bay 1.</Description>
</LogEntry>
<LogEntry>
<Index>6</Index>
<Status>2</Status>
<TimeStamp>Fri Feb 5 11:07:27 2010</TimeStamp>
<TimeStampRaw>1265368047</TimeStampRaw>
<Description>Drive 0 is installed in disk drive bay 1.</Description>
</LogEntry>
<LogEntry>
<Index>7</Index>
<Status>2</Status>
<TimeStamp>Fri Feb 5 11:07:08 2010</TimeStamp>
<TimeStampRaw>1265368028</TimeStampRaw>
<Description>Drive 0 in disk drive bay 1 is operating normally.</Description>
</LogEntry>
<LogEntry>
<Index>8</Index>
<Status>4</Status>
<TimeStamp>Fri Feb 5 11:07:07 2010</TimeStamp>
<TimeStampRaw>1265368027</TimeStampRaw>
<Description>Drive 0 is removed from disk drive bay 1.</Description>
</LogEntry>
<LogEntry>
<Index>9</Index>
<Status>4</Status>
<TimeStamp>Fri Jan 29 09:33:27 2010</TimeStamp>
<TimeStampRaw>1264757607</TimeStampRaw>
<Description>Fault detected on drive 0 in disk drive bay 1.</Description>
</LogEntry>
<LogEntry>
<Index>10</Index>
<Status>2</Status>
<TimeStamp>Mon Feb 25 16:14:15 2008</TimeStamp>
<TimeStampRaw>1203956055</TimeStampRaw>
<Description>Log cleared.</Description>
</LogEntry>
<NumRecords>11</NumRecords>
</ESMLog>
<ObjStatus>2</ObjStatus>
<SMStatus>0</SMStatus>
</OMA>
这里是我做出的解决方案,通过Jettro的例子:
filter {
xml {
source => "message"
target => "xmldata"
store_xml => "false"
xpath => ["/OMA/ESMLog//LogEntry","logentry"]
}
mutate {
remove_field => [ "message", "inxml", "xmldata" ]
}
split {
field => "[logentry]"
}
xml {
source => "logentry"
store_xml => "false"
xpath => ["/LogEntry/Index/text()","index"]
xpath => ["/LogEntry/Status/text()","status"]
xpath => ["/LogEntry/TimeStampRaw/text()","timestampraw"]
xpath => ["/LogEntry/Description/text()","description"]
}
mutate {
replace => {
"index" => "%{[index][0]}"
"status" => "%{[status][0]}"
"timestampraw" => "%{[timestampraw][0]}"
"description" => "%{[description][0]}"
}
}
date {
match => [ "timestampraw", "UNIX" ]
}
mutate {
remove_field => [ "logentry" , "timestampraw" ]
}
}
看来拆分后开始创建“循环“并处理来自更深层部分的所有数组。 谢谢
你忘了广告你的输出吗?我没有看到你的帖子中有任何 – Sidewinder94
@ Sidewinder94我没有添加任何内容,因为输出对代码没有问题。我正在寻找解决方案来开发过滤器,而不是修复它。 Jetjet的回答非常有帮助,我能够做出必要的改变。我用解决方案更新了这篇文章。谢谢 – peep