Search code examples
pythonxmlminidom

python minidom read xml


I have this xml :

<?DOMParser ?> 
<logbook:LogBook xmlns:logbook="http://www/logbook/1.0"  version="1.2">
<visits>
<visit>
    <general>
        <startDateTime>2014-01-10T12:22:39.166</startDateTime>
        <endDateTime>2014-03-11T13:51:31.480</endDateTime>
    </general> 
</visit>
<visit>
<general>
    <startDateTime>2013-01-10T12:22:39.166</startDateTime>
    <endDateTime>2013-03-11T13:51:31.480</endDateTime>
</general>
</visit>
</visits>
</logbook:LogBook>

I want to have this output from xml:

startDateTime           | endDateTime           |
-----------------------|-----------------------|
2014-01-10T12:22:39.166|2014-03-11T13:51:31.480|
-----------------------|-----------------------|
2013-01-10T12:22:39.166|2013-03-11T13:51:31.480|

I am using minidom , so I wrote:

 import xml.dom.minidom as minidom
 doc=minidom.parse('test.xml')
 general=doc.getElementsByTagName('general')[0]
 startDateTime=general.getAttribute('startDateTime')
 print(startDateTime)

This return empty string to me. any help here?


Solution

  • I prefer to use the ElementTree class:

    from StringIO import StringIO
    from xml.etree.ElementTree import ElementTree
    
    et = ElementTree(None, StringIO("""<?DOMParser ?>
    <logbook:LogBook xmlns:logbook="http://www/logbook/1.0"  version="1.2">
    <visits>
      <visit>
        <general>
            <startDateTime>2014-01-10T12:22:39.166</startDateTime>
            <endDateTime>2014-03-11T13:51:31.480</endDateTime>
        </general>
      </visit>
    <visit>
    <general>
        <startDateTime>2013-01-10T12:22:39.166</startDateTime>
        <endDateTime>2013-03-11T13:51:31.480</endDateTime>
    </general>
    </visit>
    </visits>
    </logbook:LogBook>"""))
    
    for general in et.iterfind('visits/visit/general'):
        start = general.findtext('startDateTime')
        finish = general.findtext('endDateTime')
        print start, finish
    

    Outputs:

    2014-01-10T12:22:39.166 2014-03-11T13:51:31.480
    2013-01-10T12:22:39.166 2013-03-11T13:51:31.480