2016-09-07 96 views
1

我的问题是,我有一个具有相同值的两倍像一个XML文件:删除从XML文件中的重复标签

<ns:html xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:ns="2" release="1"> 
    <ns:Name>A 
     <ns:ID>ANI-2016-05-02T21:01Z</ns:ID> 
     <ns:CreationDate>2016-05-02T21:01:40</ns:CreationDate> 
     <ns:Subname>A2 
      <ns:Total>5000</ns:Total> 
      <ns:type>ANI</ns:type> 
     </ns:Subname> 
    </ns:Name> 

    <ns:Name>A 
     <ns:ID>ANI-2016-05-02T21:01Z</ns:ID> 
     <ns:CreationDate>2016-05-02T21:01:40</ns:CreationDate> 
     <ns:Subname>A2 
      <ns:Total>5000</ns:Total> 
      <ns:type>ANI</ns:type> 
     </ns:Subname> 
    </ns:Name> 

    <ns:Name>A 
     <ns:ID>ANI-2016-08-04T21:01Z</ns:ID> 
     <ns:CreationDate>2016-04-08T21:01:40</ns:CreationDate> 
     <ns:Subname>A2 
      <ns:Total>5000</ns:Total> 
      <ns:type>ANI</ns:type> 
     </ns:Subname> 
    </ns:Name> 
</ns:html> 

我的问题是如何使用XML来从XML文件中删除重复值以便在新的XML文件中获得以下结果。支票可以在ID的基础上完成。

<ns:html xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:ns="2" release="1"> 
    <ns:Name>A 
     <ns:ID>ANI-2016-05-02T21:01Z</ns:ID> 
     <ns:CreationDate>2016-05-02T21:01:40</ns:CreationDate> 
     <ns:Subname>A2 
      <ns:Total>5000</ns:Total> 
      <ns:type>ANI</ns:type> 
     </ns:Subname> 
    </ns:Name> 

    <ns:Name>A 
     <ns:ID>ANI-2016-05-02T21:01Z</ns:ID> 
     <ns:CreationDate>2016-05-02T21:01:40</ns:CreationDate> 
     <ns:Subname>A2 
      <ns:Total>5000</ns:Total> 
      <ns:type>ANI</ns:type> 
     </ns:Subname> 
    </ns:Name> 
</ns:html> 

我曾尝试以下的事情:

首先,我使用的例子作为发现here

## SETUP ENVIRONMENT 
# Find "Advanced Monitoring Agent" service and use path to locate files 
$gfimaxagent = Get-WmiObject Win32_Service | 
       Where-Object { $_.Name -eq 'Advanced Monitoring Agent' } 
$gfimaxexe = $gfimaxagent.PathName 
$gfimaxpath = Split-Path $gfimaxagent.PathName.Replace([char]34,"") -Parent #"Wordpress syntax highlighter bug 
$XmlFile = "C:\Users\Desktop\Test.xml" 
$Output = "C:\Users\Desktop\result.xml" 

[xml]$XmlContent = Get-Content $XmlFile 
$XmlPath = "checks" 
$Property = "uid" 
$XmlValues = @{} 
foreach ($XmlElement in $XmlContent.$XmlPath.ChildNodes) 
{ 
    $ElementValues = "" #"Wordpress syntax highlighter bug 
    foreach($XmlValue in $XmlElement.ChildNodes | Sort-Object name) 
    { 
     $ElementValues = $ElementValues + $XmlValue.Name + $XmlValue.InnerText 
    } 
    $XmlValues[$XmlElement.$Property] = $ElementValues 
} 

$XmlDuplicates = @{} 
foreach ($XmlValue in $XmlValues.Values) 
{ 
    $Items = @($XmlValues.Keys | Where { $XmlValues[$_] -eq $XmlValue }) 
    if ($Items.Count -gt 1) 
    { 
     if (!($XmlDuplicates[$Items[0]])) { $XmlDuplicates[$Items[0]] = $Items } 
    } 

} 

foreach ($XmlDuplicate in $XmlDuplicates.Keys) 
{ 
    for ($i = 1; $i -lt $XmlDuplicates[$XmlDuplicate].Count; $i++) 
    { 
     $XPath = "//" + $XmlPath + "/*[@" + $Property +"=" + $XmlDuplicates[$XmlDuplicate][$i]+"]" 
     $ChildToBeRemoved = $XmlContent.SelectSingleNode($XPath) 
     $ChildToBeRemoved.ParentNode.RemoveChild($ChildToBeRemoved) 
    } 
} 

$XmlContent.Save($Output) 

问题是即使我改变了源仍然没有载入我的文件但来自网站的原始示例文件。

至于第二我也用下面的代码试了一下:

[xml]$XmlDocument1 = Get-Content -Path C:\Users\EX27740\Desktop\testdubbel.xml 

$softwareVersionsArray = $catalogXML.catalog.software | 
    Group-Object name | 
    ForEach-Object {$_.Group[0]} 

$filename = ' C:\Users\EX27740\Desktop\Resultaat.xml' 
$catalogXML.Save($filename) 

但得到一个错误:通过XML节点

At line:8 Char:1 Cannot call a null-valued expression

+0

你能提供一个根节点的例子吗? –

+0

SO不是其他人为你工作的地方。你到目前为止试过了什么(显示你的代码),以及你需要什么*特定的问题(在你的代码中)? –

+0

嗨对不起,添加thodse钛现在加入他们 – Jarno343

回答

0

一种方法是循环并删除任何重复

[xml]$xml = @" 
<ns:html xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:ns="2" release="1"> 
    <ns:Name>A 
     <ns:ID>ANI-2016-05-02T21:01Z</ns:ID> 
     <ns:CreationDate>2016-05-02T21:01:40</ns:CreationDate> 
     <ns:Subname>A2 
      <ns:Total>5000</ns:Total> 
      <ns:type>ANI</ns:type> 
     </ns:Subname> 
    </ns:Name> 

    <ns:Name>A 
     <ns:ID>ANI-2016-05-02T21:01Z</ns:ID> 
     <ns:CreationDate>2016-05-02T21:01:40</ns:CreationDate> 
     <ns:Subname>A2 
      <ns:Total>5000</ns:Total> 
      <ns:type>ANI</ns:type> 
     </ns:Subname> 
    </ns:Name> 

    <ns:Name>A 
     <ns:ID>ANI-2016-08-04T21:01Z</ns:ID> 
     <ns:CreationDate>2016-04-08T21:01:40</ns:CreationDate> 
     <ns:Subname>A2 
      <ns:Total>5000</ns:Total> 
      <ns:type>ANI</ns:type> 
     </ns:Subname> 
    </ns:Name> 
</ns:html> 
"@ 

cls 
[email protected]() 
$xml.html.Name | ForEach-Object { 
    if($CreationDates -contains $_.CreationDate) { 
    [void]$_.ParentNode.RemoveChild($_) 
    } else { 
    $CreationDates += $_.CreationDate 
    } 
} 
#$CreationDates 
$xml.html.Name 
+0

谢谢完美 – Jarno343