2012-01-18 75 views
0

我把GPS数据存储为.tcx文件。 这是一个XML文件(以下乞讨文件)python 3.x xml解析类似于plistlib?

<?xml version="1.0" encoding="utf-8"?> 
<TrainingCenterDatabase xmlns="http://www.garmin.com/xmlschemas/TrainingCenterDatabase/v2" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:tp1="http://www.garmin.com/xmlschemas/TrackPointExtension/v1" xmlns:gpx="http://www.topografix.com/GPX/1/1" xsi:schemaLocation="http://www.garmin.com/xmlschemas/TrainingCenterDatabase/v2 http://www.garmin.com/xmlschemas/TrainingCenterDatabasev2.xsd"> 
    <Activities> 
     <Activity Sport="Other"> 
      <Id>2012-01-17T11:44:35Z</Id> 
      <Lap StartTime="2012-01-17T11:44:35Z"> 
       <TotalTimeSeconds>0</TotalTimeSeconds> 
       <DistanceMeters>0</DistanceMeters> 
       <Calories>0</Calories> 
       <Intensity>Active</Intensity> 
       <TriggerMethod>Manual</TriggerMethod> 
       <Track> 
        <Trackpoint> 
         <Time>2012-01-17T11:44:35Z</Time> 
         <Position> 
          <LatitudeDegrees>59.720211518183351</LatitudeDegrees> 

唯一类似的事情,我与已经使用该格式类似苹果.plists工作,虽然信息嵌套在<dictionary>标签,我相信内。

凡下面会给我的嵌套字典...

import plistlib 
pl = plistlib.readPlist('/Users/name/Documents/file.plist') 

for sub_dict in pl: 
    print(sub_dict['keyA']) 
    print(sub_dict['keyD']) 
    print(sub_dict['keyE']) 
    print(sub_dict['keyG']) 

我知道xml.dom.minidom,etree和LXML的,但我有麻烦工作如何得到输出相同上面的plistlib模块给了我。

我的最终目标是能够将两个数据集中的选定键合并在一起。一步一个脚印的时间...

编辑-----------------

我有一些工作:

from xml.dom.minidom import parse 
doc = parse('/Users/name/Documents/GPS/gps.tcx') 
lat = doc.getElementsByTagName("LatitudeDegrees") 
time = doc.getElementsByTagName("Time") 

for x in lat: 
    print(x.firstChild.data) 
+0

的输出是什么应该是什么样子的? – unutbu 2012-01-18 19:53:06

+0

串...?即。 '“59.920211518183351”''和'2012-01-17T11:44:35Z' – beoliver 2012-01-18 20:08:21

回答

1

我不得不将结束标签添加到您发布的XML中,以便lxml解析器可以解析它。 完成之后,可以使用致电doc.xpath的方式提取时间和纬度方面的数据。

import lxml.etree as ET 
import io 

content='''<?xml version="1.0" encoding="utf-8"?> 
<TrainingCenterDatabase xmlns="http://www.garmin.com/xmlschemas/TrainingCenterDatabase/v2" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:tp1="http://www.garmin.com/xmlschemas/TrackPointExtension/v1" xmlns:gpx="http://www.topografix.com/GPX/1/1" xsi:schemaLocation="http://www.garmin.com/xmlschemas/TrainingCenterDatabase/v2 http://www.garmin.com/xmlschemas/TrainingCenterDatabasev2.xsd"> 
    <Activities> 
     <Activity Sport="Other"> 
      <Id>2012-01-17T11:44:35Z</Id> 
      <Lap StartTime="2012-01-17T11:44:35Z"> 
       <TotalTimeSeconds>0</TotalTimeSeconds> 
       <DistanceMeters>0</DistanceMeters> 
       <Calories>0</Calories> 
       <Intensity>Active</Intensity> 
       <TriggerMethod>Manual</TriggerMethod> 
       <Track> 
        <Trackpoint> 
         <Time>2012-01-17T11:44:35Z</Time> 
         <Position> 
          <LatitudeDegrees>59.920211518183351</LatitudeDegrees> 
</Position> 
</Trackpoint> 
</Track> 
</Lap> 
</Activity> 
</Activities> 
</TrainingCenterDatabase> 
''' 

doc = ET.fromstring(content) 

ns = {'ns':'http://www.garmin.com/xmlschemas/TrainingCenterDatabase/v2'} 
for trackpoint in doc.xpath('//ns:Trackpoint', namespaces = ns): 
    print(trackpoint.xpath('(ns:Time|ns:Position/ns:LatitudeDegrees)/text()', namespaces = ns)) 

产生

['2012-01-17T11:44:35Z', '59.920211518183351'] 
+0

我看起来像我将安装lxml :),抱歉只给xml文件的开头。我一直在寻找玩下面的东西......把它放到for循环中......'from xml.dom.minidom import parse''doc = parse('/ Users/name/Documents/GPS/gps.tcx ')print(doc.getElementsByTagName(“LatitudeDegrees”)[1] .firstChild.data)' – beoliver 2012-01-18 22:07:03