天天看点

Python使用xml.etree.ElementTree解析xml文件

示例一、

data.xml

<?xml version="1.0" encoding="utf-8"?>

<info>
	<intro>保存用户的信息</intro>
	<list id='001'>
		<head>jiaoshe_15</head>
		<name>小丸子</name>
		<number>12345678</number>
		<age>44</age>
		<sex>女</sex>
		<hobby>上网,打游戏</hobby>
		<note>我是一个来自广东深圳的女十,我期待和喜爱运动的你成为朋友</note>
	</list>
	<list>
		<head>jiaoshe_12</head>
		<name>功夫老伯</name>
		<number>aa33678</number>
		<age>78</age>
		<sex>男</sex>
		<hobby>功夫</hobby>
		<note>我是一个老伯,我期待和喜爱运动的你成为朋友</note>
	</list>
</info>
           

readxml.py

#!/usr/bin/env python
# -*- coding: UTF-8  -*-
'''
Created on 2010-4-14

@author: Administrator
'''
import os
import xml.etree.ElementTree as ET

#读取xml文件
def load_xml_file(filename):
    root = ET.parse(filename).getroot()
    #获取文件描述
    intro = root.find('intro').text
    print intro
    all_users = root.findall('list')
    for user in all_users:
        head = user.find('head').text
        name = user.find('name').text
        sex = user.find('sex').text
        print head,name,sex


if __name__ == '__main__':

    workpath = os.getcwd()
#    print workpath
    load_xml_file(r'%s\webUI\data\data.xml' % workpath)
           

输出结果

保存用户的信息
jiaoshe_15 小丸子 女
jiaoshe_12 功夫老伯 男
           

示例二、

1、需要解析的文件app.xml

<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE beans PUBLIC "-//SPRING//DTD BEAN//EN" "http://www.springframework.org/dtd/spring-beans.dtd">
<beans>
 <!-- 建立数据源 -->
 <bean id="dataSource" class="org.apache.commons.dbcp.BasicDataSource">
  <!-- 数据库驱动,我这里使用的是Mysql数据库 -->
  <property name="driverClassName">
   <value>com.mysql.jdbc.Driver</value>
  </property>
  <!-- 数据库地址,这里也要注意一下编码,不然乱码可是很郁闷的哦! -->
  <property name="url">
   <value>
       jdbc:mysql://localhost:3306/tie?useUnicode=true&characterEncoding=utf-8
   </value>
  </property>
  <!-- 数据库的用户名 -->
  <property name="username">
   <value>root</value>
  </property>
  <!-- 数据库的密码 -->
  <property name="password">
   <value>123</value>
  </property>
 </bean>
 <!-- 把数据源注入给Session工厂 -->
 <bean id="sessionFactory"
  class="org.springframework.orm.hibernate3.LocalSessionFactoryBean">
  <property name="dataSource">
   <ref bean="dataSource" />
  </property>
  <!-- 配置映射文件 -->
  <property name="mappingResources">
   <list>
    <value>com/alonely/vo/User.hbm.xml</value>
   </list>
  </property>
 </bean>
 <!-- 把Session工厂注入给hibernateTemplate -->
 <!-- 解释一下hibernateTemplate:hibernateTemplate提供了很多方便的方法,在执行时自动建立 HibernateCallback 对象,例如:load()、get()、save、delete()等方法。 -->
 <bean id="hibernateTemplate"
  class="org.springframework.orm.hibernate3.HibernateTemplate">
  <constructor-arg>
   <ref local="sessionFactory" />
  </constructor-arg>
 </bean>
 <!-- 把DAO注入给Session工厂 -->
 <bean id="userDAO" class="com.alonely.dao.UserDAO">
  <property name="sessionFactory">
   <ref bean="sessionFactory" />
  </property>
 </bean>
 <!-- 把Service注入给DAO -->
 <bean id="userService" class="com.alonely.service.UserService">
  <property name="userDAO">
   <ref local="userDAO" />
  </property>
 </bean>
 <!-- 把Action注入给Service -->
 <bean name="/user" class="com.alonely.struts.action.UserAction">
  <property name="userService">
   <ref bean="userService" />
  </property>
 </bean>
</beans>
           

2、解析的代码etXml.py

# -*- coding:UTF-8 -*-

import xml.etree.ElementTree as ET

#获取所有<bean>节点
def getAllBean():
    root = ET.parse('app.xml').getroot()
    beans = root.findall('bean')

    #显示所有bean
    for bean in beans:
#        print ET.tostring(bean, 'UTF-8')
#        print '==========================================='
        pass

    return beans;

#获取所有property节点,以字典的的形式返回,属性名/property节点
def getAllProperty():
    beans = getAllBean()
    propList = {}
    for bean in beans:
        propertysE = bean.findall('property')
        for prop in propertysE:
            proname = prop.get('name')
            propList[proname] = prop
#            print ET.tostring(prop, 'UTF-8')
#            print '==========================================='
    return propList


#根据bean的属性id获取bean的class属性值
def getBeanClassAttr(beanid):
    bean = getBeanById(beanid)
    return bean.get('class')#获取节点属性为class的值

#根据id获取bean节点
def getBeanById(beanid):
    beans = getAllBean()
    for bean in beans:
        id = bean.get('id')#获取属性名为id的值
        if beanid == id:
            return bean

#根据bean的属性id获取其所有property节点
def getPropertysByBeanid(beanid):
    bean = getBeanById(beanid)
    props = bean.findall('property')
    return props;

#根据bean的属性id和property的名称获取实际值
def getPropertyValue(beanid,proname):
    props = getPropertysByBeanid(beanid)
    for prop in props:
        name = prop.get('name')
        if proname == name:
            valueE = prop.find('value')
            if valueE != None:
                return valueE.text


if __name__ == '__main__':

    print getPropertyValue('userService','userDAO')