
JVM Languages
Easy DOM Parsing in Java
By Eric Bruno, July 25, 2011
A few handy routines make it much simpler to parse XML.Related Reading
More Insights
White Papers
More >>Reports
- Enterprise Cybersecurity Plans in a Post-Pandemic World
- 2021 Data Breach Investigations Report (DBIR)
Webcasts
- How Data Breaches Happen and What To Do When They Happen To You
- Incorporating a Prevention Mindset into Threat Detection and Response

Currently we allow the following HTML tags in comments:
Single tags
These tags can be used alone and don't need an ending tag.
<br>
Defines a single line break
<hr>
Defines a horizontal line
Matching tags
These require an ending tag - e.g. <i>italic text</i>
<a>
Defines an anchor
<b>
Defines bold text
<big>
Defines big text
<blockquote>
Defines a long quotation
<caption>
Defines a table caption
<cite>
Defines a citation
<code>
Defines computer code text
<em>
Defines emphasized text
<fieldset>
Defines a border around elements in a form
<h1>
This is heading 1
<h2>
This is heading 2
<h3>
This is heading 3
<h4>
This is heading 4
<h5>
This is heading 5
<h6>
This is heading 6
<i>
Defines italic text
<p>
Defines a paragraph
<pre>
Defines preformatted text
<q>
Defines a short quotation
<samp>
Defines sample computer code text
<small>
Defines small text
<span>
Defines a section in a document
<s>
Defines strikethrough text
<strike>
Defines strikethrough text
<strong>
Defines strong text
<sub>
Defines subscripted text
<sup>
Defines superscripted text
<u>
Defines underlined text
Dr. Dobb's encourages readers to engage in spirited, healthy debate, including taking us to task. However, Dr. Dobb's moderates all comments posted to our site, and reserves the right to modify or remove any content that it determines to be derogatory, offensive, inflammatory, vulgar, irrelevant/off-topic, racist or obvious marketing or spam. Dr. Dobb's further reserves the right to disable the profile of any commenter participating in said activities.
![]() |
To upload an avatar photo, first complete your Disqus profile. | View the list of supported HTML tags you can use to style comments. | Please read our commenting policy. |
How to use these helper functions if I have more than one Executive Nodes in the XML Document?
I'm not sure if it has improved. However, I often do the same as you suggest here, and walk through XML. This doesn't scale, however, but it is an option in some cases.
I found that early versions of Java had performance issues with the DOM XML (I was mostly using XPath). It was enough that I switched to doing my own tree walking operations to get a 2 to 3 times performance increase. Have the newer versions done anything to change this?
I find this article deeply disturbing in the number of bad practices that it naively encourages:
* Completely ignores namespaces throughout.
* Uses equalsIgnoreCase when comparing names, even though XML names are supposed to be case-sensitive.
* Uses internal classes of the Xerces implementation instead of the standard JAXP methods.
* getNodeValue() stops at the first text node. Even for simple content there could be multiple text nodes, embedded comments, etc., not to mention CDATA sections, entity references, etc.
I cringe every time I see these sort of mistakes in the wild and am very disappointed to find them in a Dr. Dobbs article.
What happens when the xml contains the same element name in different sections of the XML?