advertisement
javaboutique
Search Tips
Articles  |   Tutorials  |   Reviews  |   Tools  |   by Category  |   by Date  |   by Name  |   Submit  |   Source  |   Forums  |  
javaboutique
Browse DevX


Partners & Affiliates











advertisement

Tutorials : Digesting XML documents :

Digesting XML documents

by Keld H. Hansen

Introduction

When it comes to parsing XML documents there are several ways to proceed. One is to use SAX, which is an event-driven tool, that allows you to catch precisely the data you need. SAX is pretty low-level, however, and you'd therefore often prefer a tool like JDOM, which builds an in-core tree structure of the XML document. Using various method calls you may manipulate the tree as you like.

Another approach is to let either Castor or XMLBeans convert the XML document into a linked structure of Java beans, which are easy to access from your program. 

If you're only interested in parts of an XML document, or you don't care about fancy tree structures, Digester from Jakarta Commons could be an option. It allows you to extract the parts of the XML document you need, and puts few restrictions on the way you store data in your program. In general it's simpler to use than the other tools I mentioned.

It resembles SAX since it links the various XML elements to methods in your programs, but it's much simpler to use than SAX. Digester has a programming API, but also has a possibility of using an XML configuration file to describe how processing should be done. It furthermore implements a very open architecture allowing you to define your own processing rules by coding separate plug-ins.

If you're interested in another XML-tool for your toolbox then you're welcome to read on. First I'll tell you how to install Digester, then we'll look into a few basic examples, and finally we'll build a Struts web application which will process and show data from an RSS (Really Simple Syndication--or Rich Site Summary) feed.

Installing Digester

To run Digester you'll need a jar-file for Digester plus 3 additional jar-files from other Jakarta Commons projects: Beanutils, Collections and Logging. All projects can be downloaded from the same Jakarta download page. After the download you must place the jar files in your classpath and you're ready to run.

The Digester design

When using Digester there are some simple rules you must know and follow:

Determine what data you'll need from the XML document
You may choose to extract anything from a single data value to all data
 
Create Java classes (if you don't have them already) to hold the data you extract
You must also have methods in these classes that can be used for storing the data. The standard bean setter-methods are fine for non-Collection type of data. Collections may be handled by "add-methods", which adds one element to a Collection. The examples following shortly show how it works.
 
Digester identifies each XML element by a simple string pattern
Let's take a simple XML document like this:

<A>
   <B>data for B</B>
   <C>
      <D>data for D</D>
   </C>
</A>

To identify the B-element Digester uses the string "A/B".
The data in D is referred to by "A/B/C/D".
The strings are called "Element patterns". More on this in the examples below.
 
Rules for element matching
When an XML element is matched to a pattern you must specify what should happen in your program. This is done by telling which Java objects should be created or which Java methods should be called.   
 
If possible use the same names for XML elements and attributes as for bean properties 
  This just makes coding simpler.

How to Add Java Applets to Your Site

New on the Java Boutique:

New Review:

Time Management Made Easy with the Quartz Enterprise Job Scheduler
Why not just use the Java timer API? This open source scheduling API boasts simplicity, ease-of-integration, a well-rounded feature set, and it's free!

New Applet:

Reverse Complement
Reverse Complement is a simple applet that converts DNA or RNA sequences into three useful formats.

Elsewhere on internet.com:

WebDeveloper Java
Lots of Java information on webdeveloper.com

WDVL Java
Thorough Java resource at the Web Developer's Virtual Library.

ScriptSearch Java
Hundreds of free Java code files to download.

jGuru: Your View of the Java Universe
Customizable portal with online training, FAQs, regular news updates, and tutorials.

 Avaya Developer Showcase
 MSDN Spotlight
 PHP for Windows Showcase
XML error: undefined entity at line 34
advertisement
Receive Articles via our XML/RSS feed
Receive Articles via our XML/RSS feed

JavaBytes
Internet Cyclone
This powerful, easy-to-use, internet optimizer is for Windows 95, 98, ME, NT, 2000 and XP. It's designed to automatically optimize your Windows settings, boosting your Internet connection up to 200%.

IBM Brings Developers Into the Cloud
Apache at 10: You Can't Buy Us
Microsoft's CodePlex Foundation Moving Forward
Apple Claims 100,000 Apps, Google Analyzes Them
Nokia Latest to Play Opera Mobile 10 Browser
PayPal Opens Up Payment Platform to Devs
Ubuntu Linux 9.10 'Karmic Koala' Starts Its Climb
IBM Links Rational Developer Tools, Tivoli Apps
Libraries Give Vista Apps a Windows 7 Look
Ubuntu: The 'Default Alternative' to Windows?

Delivering Web-based Embedded Fonts in CSS 3
Adobe Helps PHP Developers Create Rich Internet Applications
Java Developers Finding a Home at Adobe Flex
Virtualization Delivers a Dynamic Infrastructure
Consuming XML Web Services in iPhone Applications
Build a More Agile Business with IBM
POJO-Based Solutions for LDAP Access: One Good, One Better
IBM Offers Enhanced Measurement and Management for Energy Usage
IBM Helps Transformation to an Information-Based Enterprise
Top Five Touch UI-Related Design Guidelines

Advertising Info  |   Member Services  |   Contact Us  |   Help  |   Feedback  |   Site Map  |   Network Map  |   About

internet.commediabistro.comJusttechjobs.comGraphics.com

Search:

WebMediaBrands Corporate Info

Legal Notices, Licensing, Reprints, Permissions, Privacy Policy.
Advertise | Newsletters | Shopping | E-mail Offers | Freelance Jobs