Using the DTDHandler and EntityResolver
In this section of the tutorial, we'll carry on a short discussion of the two remaining SAX event handlers: DTDHandler and EntityResolver. The DTDHandler is invoked when the parser encounters an unparsed entity or a notation declaration in the DTD. The EntityResolver comes into play when a URN (public ID) must be resolved to a URL (system ID).
The DTDHandler API
In the section Referencing Binary Entities you saw a method for referencing a file that contains binary data, like an image file, using MIME data types. That is the simplest, most extensible mechanism to use. For compatibility with older SGML-style data, though, it is also possible to define an unparsed entity.
The NDATA keyword defines an unparsed entity, like this:
    <!ENTITY myEntity SYSTEM "..URL.." NDATA gif>
The NDATA keyword says that the data in this entity is not parsable XML data, but is instead data that uses some other notation. In this case, the notation is named "gif". The DTD must then include a declaration for that notation, which would look something like this:
    <!NOTATION gif SYSTEM "..URL..">
When the parser sees an unparsed entity or a notation declaration, it does nothing with the information except to pass it along to the application using the DTDHandler interface. That interface defines two methods:
    notationDecl(String name, String publicId, String systemId)
    unparsedEntityDecl(String name, String publicId, String systemId, String notationName)
The notationDecl method is passed the name of the notation and either the public or system identifier, or both, depending on which is declared in the DTD. The unparsedEntityDecl method is passed the name of the entity, the appropriate identifiers, and the name of the notation it uses.
Note: The DTDHandler interface is implemented by the DefaultHandler class.
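As a minimal sketch of the two callbacks, the class below extends DefaultHandler and records each declaration the parser passes along. The class name, the sample document, and the example.com URLs are assumptions made for illustration; they are not part of the tutorial.

```java
import java.io.StringReader;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical handler: records what the parser reports through DTDHandler.
class DtdDeclarationRecorder extends DefaultHandler {
    // notation name -> system ID (null if only a public ID was declared)
    final Map<String, String> notations = new LinkedHashMap<>();
    // unparsed entity name -> name of the notation it uses
    final Map<String, String> unparsedEntities = new LinkedHashMap<>();

    @Override
    public void notationDecl(String name, String publicId, String systemId) {
        notations.put(name, systemId);
    }

    @Override
    public void unparsedEntityDecl(String name, String publicId,
                                   String systemId, String notationName) {
        unparsedEntities.put(name, notationName);
    }

    // Illustrative document with an internal DTD subset (URLs are made up).
    static final String SAMPLE =
        "<?xml version=\"1.0\"?>\n"
        + "<!DOCTYPE doc [\n"
        + "  <!ELEMENT doc EMPTY>\n"
        + "  <!NOTATION gif SYSTEM \"http://example.com/viewers/gif\">\n"
        + "  <!ENTITY logo SYSTEM \"http://example.com/logo.gif\" NDATA gif>\n"
        + "]>\n"
        + "<doc/>\n";

    // Parses the given XML text and returns the handler with its recorded declarations.
    static DtdDeclarationRecorder record(String xml) {
        try {
            DtdDeclarationRecorder handler = new DtdDeclarationRecorder();
            // SAXParser.parse registers the DefaultHandler as the DTDHandler too.
            SAXParserFactory.newInstance().newSAXParser()
                .parse(new InputSource(new StringReader(xml)), handler);
            return handler;
        } catch (Exception e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        DtdDeclarationRecorder h = record(SAMPLE);
        System.out.println("notations: " + h.notations);
        System.out.println("unparsed entities: " + h.unparsedEntities);
    }
}
```

The parser never opens the unparsed entity itself; what the application does with the recorded identifiers is entirely up to it.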
Notations can also be used in attribute declarations. For example, the following declaration requires notations for the GIF and PNG image-file formats:
    <!ELEMENT image EMPTY>
    <!ATTLIST image
        ...
        type NOTATION (gif | png) "gif"
    >
Here, the type attribute is declared as being either gif or png. The default, if neither is specified, is gif.
Whether the notation reference is used to describe an unparsed entity or an attribute, it is up to the application to do the appropriate processing. The parser knows nothing at all about the semantics of the notations. It only passes on the declarations.
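One way an application might do that processing for a notation-typed attribute is sketched below: it remembers each notation's system identifier and, for every image element, looks up the identifier that matches the type attribute. The class name, the gallery document, and the example.com URLs are hypothetical. The sketch also declares image with #PCDATA content rather than EMPTY, because the XML 1.0 validity rules do not allow a NOTATION attribute on an element declared EMPTY.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical handler: maps each image's "type" notation to its declared system ID.
class NotationAttributeDemo extends DefaultHandler {
    // notation name -> system ID, filled in while the DTD is read
    final Map<String, String> notationSystemIds = new HashMap<>();
    // one entry per <image> element, in document order
    final List<String> imageNotationIds = new ArrayList<>();

    @Override
    public void notationDecl(String name, String publicId, String systemId) {
        notationSystemIds.put(name, systemId);
    }

    @Override
    public void startElement(String uri, String localName,
                             String qName, Attributes atts) {
        if ("image".equals(qName)) {
            // The parser supplies the DTD default ("gif") when the attribute is absent.
            String notation = atts.getValue("type");
            imageNotationIds.add(notationSystemIds.get(notation));
        }
    }

    // Illustrative document (all names and URLs are made up).
    static final String SAMPLE =
        "<?xml version=\"1.0\"?>\n"
        + "<!DOCTYPE gallery [\n"
        + "  <!ELEMENT gallery (image*)>\n"
        + "  <!ELEMENT image (#PCDATA)>\n"
        + "  <!NOTATION gif SYSTEM \"http://example.com/viewers/gif\">\n"
        + "  <!NOTATION png SYSTEM \"http://example.com/viewers/png\">\n"
        + "  <!ATTLIST image type NOTATION (gif | png) \"gif\">\n"
        + "]>\n"
        + "<gallery><image/><image type=\"png\"/></gallery>\n";

    // Parses the given XML text and returns the handler with its collected results.
    static NotationAttributeDemo run(String xml) {
        try {
            NotationAttributeDemo handler = new NotationAttributeDemo();
            SAXParserFactory.newInstance().newSAXParser()
                .parse(new InputSource(new StringReader(xml)), handler);
            return handler;
        } catch (Exception e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(run(SAMPLE).imageNotationIds);
    }
}
```

The lookup works because the DTD is processed before any element content, so every notation has been reported by the time the first startElement call arrives.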
The EntityResolver API
The EntityResolver API lets you convert a public ID (URN) into a system ID (URL). Your application may need to do that, for example, to convert something like href="urn:/someName" into "http://someURL".
The EntityResolver interface defines a single method:
    resolveEntity(String publicId, String systemId)
This method returns an InputSource object, which can be used to access the entity's contents. Converting a URL into an InputSource is easy enough. But the URL that is passed as the system ID will be the location of the original document which is, as likely as not, somewhere out on the Web. To access a local copy, if there is one, you must maintain a catalog somewhere on the system that maps names (public IDs) into local URLs.
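The catalog idea can be sketched as a resolver backed by an in-memory map. This is only an illustration under stated assumptions: the class name, the register method, and whatever public IDs and local URLs you put in the map are hypothetical, and a real catalog would normally be loaded from a file rather than built in code.

```java
import java.util.HashMap;
import java.util.Map;
import org.xml.sax.EntityResolver;
import org.xml.sax.InputSource;

// Hypothetical resolver: maps public IDs to local URLs through a simple catalog.
class CatalogResolver implements EntityResolver {
    // public ID -> URL of a local copy
    private final Map<String, String> catalog = new HashMap<>();

    // Adds one catalog entry (illustrative helper, not part of SAX).
    void register(String publicId, String localUrl) {
        catalog.put(publicId, localUrl);
    }

    @Override
    public InputSource resolveEntity(String publicId, String systemId) {
        String localUrl = (publicId == null) ? null : catalog.get(publicId);
        if (localUrl == null) {
            // Returning null asks the parser to open the system ID itself.
            return null;
        }
        // An InputSource built from a URL string lets the parser open that URL.
        InputSource source = new InputSource(localUrl);
        source.setPublicId(publicId);
        return source;
    }

    public static void main(String[] args) {
        CatalogResolver resolver = new CatalogResolver();
        // Both identifiers below are made up for the example.
        resolver.register("-//Example//DTD Sample//EN", "file:///tmp/dtds/sample.dtd");
        InputSource hit = resolver.resolveEntity(
            "-//Example//DTD Sample//EN", "http://example.com/sample.dtd");
        System.out.println(hit.getSystemId());
    }
}
```

A resolver like this would be registered on the parser with XMLReader.setEntityResolver before parsing begins.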