When the DOM parser returns the content of an element through getTextContent(), the HTML entities are automatically translated into the corresponding characters.
So far I haven't found a way to disable this automatic translation, so I am using org.apache.commons.lang.StringEscapeUtils to translate the special characters back into HTML entities.
This workaround is not really a solution.
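To make the behaviour concrete, here is a minimal sketch using only the JDK. The class name EntityDemo and the helpers textOf and escapeBasic are made up for illustration. escapeBasic is a hand-written stand-in that covers only four characters, not the real StringEscapeUtils, which handles many more entities.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.xml.sax.InputSource;

class EntityDemo {

    // Parses the markup and returns the text content of the root element.
    // The parser has already decoded entity references at this point.
    static String textOf(String xml) throws Exception {
        DocumentBuilder builder =
                DocumentBuilderFactory.newInstance().newDocumentBuilder();
        Document doc = builder.parse(new InputSource(new StringReader(xml)));
        return doc.getDocumentElement().getTextContent();
    }

    // Hypothetical stand-in for the library call: re-escapes only
    // the characters &, <, > and the double quote.
    static String escapeBasic(String s) {
        StringBuilder out = new StringBuilder(s.length());
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            switch (c) {
                case '&': out.append("&amp;"); break;
                case '<': out.append("&lt;"); break;
                case '>': out.append("&gt;"); break;
                case '"': out.append("&quot;"); break;
                default:  out.append(c);
            }
        }
        return out.toString();
    }

    public static void main(String[] args) throws Exception {
        String text = textOf("<p>a &lt; b &amp; c</p>");
        System.out.println(text);              // a < b & c
        System.out.println(escapeBasic(text)); // a &lt; b &amp; c
    }
}
```

One likely reason the round trip feels unsatisfying: it cannot tell how a character was written in the source. A numeric reference such as `&#60;` is also decoded to `<`, so re-escaping turns it into `&lt;` rather than restoring the original text.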
Tuesday, July 1, 2008