Activity Report 4, XML Project

2001-03-07
XML Project/JSA


1. Background

In June 12, 2000, We reported a preliminary XML-DTD for ISO standards. The DTD has been posted on the ITSIG web page of "04. Final deliverables", as:

In July 07, 2000, We reported XML tools (editor, parser and browser). It has also been posted on the web page as Activity report 3 dealing with revising the DDT was submitted in Oct. 2000. It was reported as a working document in the Oct. 2000 ITSIG plenary as shown in the email:
From: "Read-Grosfort Jennifer"
To: ...
Sent: Wednesday, October 18, 2000 5:35 PM
Subject: Add working document for October ITSIG plenary

> Please find attached a report on the XML project.  This report [ITSIG
> 03/200 (Add. 1)] has been added to the compilation of working document
> and the zip file which are posted on the ISODOCserver.

2. XML-DTD

2.1 Original SGML-DTD and its XMLization

The XML-DTD was developed being based on the SGML-DTD "ITSIG exchange DTD, version 0.94" provided by ISO in Feb. 2000.

Major modifications for XMLization are:

  1. Comments within declaration are extracted to be separate comment statements.
  2. Characters are adjusted according to the XML rule of case sensitivity.
  3. All tags are described without omission.
  4. Each element is defined separately.
  5. All the attribute values are quoted.
  6. Attribute types NAME, NUMBER, and NUTOKEN are changed into NMTOKEN.
  7. The location of #PCDATA is changed into a head of declaration.
  8. Content models are modified without & separators.
  9. Characters defined by SDATA are changed into simple strings.
  10. Elements declarations are changed without exceptions.
  11. URIs are specified to the public identifies.

2.2 Modularization

The XML-DTD was modularized for a more feasible DTD exchange. The modularization was carried out considering the logical structure of the original SGML-DTD (see Figure 1).

Figure 1 Referencing relationship between the files of the original SGML-DTD
ITSIG/stdex94.dtd (MATH, ARTWORK, Figure, Terminology)
   +--- ITSIG/m12083.dtd (Formula)
   +--- ITSIG/tl93a.dtd (Terminology)
   +--- ITSIG/isonet10.dtd (ISONET)
   +--- ITSIG/se9573.dtd (Entities)
   |      +--- ent9573/isolat1.ent, ent9573/isolat2.ent, ent9573/isonum.ent,
   |      +--- ent9573/isodia.ent, ent9573/isopub.ent, ent9573/isobox.ent,
   |      +--- ent9573/isotech.ent, ent9573/isogrk1.ent, ent9573/isogrk2.ent,
   |      +--- ent9573/isogrk3.ent, ent9573/isogrk4.ent, ent9573/isocyr1.ent,
   |      +--- ent9573/isocyr2.ent, ent9573/isoamsa.ent, ent9573/isoamsb.ent,
   |      +--- ent9573/isoamsc.ent, ent9573/isoamsn.ent, ent9573/isoamso.ent,
   |      +--- ent9573/isoamsr.ent, ent9573/isomfrk.ent, ent9573/isomopf.ent,
   |      +--- ent9573/isomscr.ent, ent9573/isocs.ent
   +--- ITSIG/calstab.dtd (Table)
   +--- ITSIG/stdb94.dtd (NOTATION, parameter entities,
                          structure, displayed elements,
                          terminology list, figure, table, formula)

All the XML-DTD module files are zipped and attached as attach1.zip. They are shown bellow, where a file with the extension ".dtd" is a driver or a DTD module translated from the corresponding original SGML-DTD file and a file with ".mod" is a new DTD file developed for the modularized XML-DTD representation.

(1) stdex.dtd [DTD Driver]
(2) stdex-model.mod [Model Module]
(3) stdex-profile.mod [Profile Module]
(4) isonet10.dtd [Isonet Module]
(5) se9573.dtd [Entity]
(6) stdex-base.mod [Base Element Module]
(7) stdex-notation.mod [Notation]
(8) stdex-tpage.mod [Title Page]
(9) stdex-lpage.mod [Last Cover Page]
(10) stdex-toc.mod [Table of Contents]
(11) stdex-index.mod [Index]
(12) stdex-foreword.mod [Foreword]
(13) stdex-intro.mod [Introduction]
(14) stdex-body.mod [Body]
(15) stdex-annex.mod [Annex]
(16) stdex-nest.mod [Nested Subdivisions]
(17) stdex-disp.mod [Displayed Components]
(18) stdex-tl-simple.mod [Terminology List Simple Module]
(19) stdex-tl.mod [Terminology List Module]
(20) stdex-inline.mod [Inline Components]
(21) stdex-artwork-simple.mod [Artwork Simple Module]
(22) stdex-artwork.mod [Artwork Module]
(23) stdex-ref.mod [Referential Components]
(24) stdex-float.mod [Float Components]
(25) stdex-figure-default.mod [Figure Default Module]
(26) stdex-figure.mod [Figure Module]
(27) stdex-table.mod [Table]
(28) calstab.dtd [Cals Table]
(29) stdex-specific.mod [Very Specific Components]
(30) stdex-math-simple.mod [Math Simple Module]
(31) stdex-math.mod [Math Module]
(32) stdex-math-extension.mod [Math Extension Module]
(33) stdex-tol.mod [Tolerance]
(34) stdex-chem.mod [Chemistry]
(35) stdex-listing.mod [Listing Module]

The referencing relationship of the modules is illustrated in Figure 2.

Figure 2 Referencing relationship of the module files consisting of the XML-DTD
stdex.dtd [DTD Driver]
   +--- stdex-model.mod [Model Module]
   +--- stdex-profile.mod [Profile Module]
   |      +--- isonet10.dtd [Isonet Module]
   |              +--- se9573.dtd [Entity]
   +--- stdex-base.mod [Base Element Module]
   |      +--- stdex-notation.mod [Notation]
   |      +--- stdex-tpage.mod [Title Page]
   |      +--- stdex-lpage.mod [Last Cover Page]
   |      +--- stdex-toc.mod [Table of Contents]
   |      +--- stdex-index.mod [Index]
   |      +--- stdex-foreword.mod [Foreword]
   |      +--- stdex-intro.mod [Introduction]
   |      +--- stdex-body.mod [Body]
   |      +--- stdex-annex.mod [Annex]
   |      +--- stdex-nest.mod [Nested Subdivisions]
   |      +--- stdex-disp.mod [Displayed Components]
   |      +--- stdex-tl.mod [Terminology List]
   |      +--- stdex-inline.mod [Inline Components]
   |      |      +--- stdex-artwork.mod [Artwork]
   |      +--- stdex-ref.mod [Referential Components]
   |      +--- stdex-float.mod [Float Components]
   |      |      +--- stdex-figure.dtd [Figure]
   |      |      +--- stdex-table.dtd [Table]
   |      |             +--- calstab.dtd [Cals Table]
   |      +--- stdex-specific.mod [Very Specific Components]
   |      |      +--- stdex-math.mod [Math]
   |      |             +--- stdex-math-extension.mod [Math Extension Module]
   |      |      +--- stdex-tol.mod [Tolerance]
   |      |      +--- stdex-chem.mod [Chemistry]
   |      +--- se9573.dtd [Entity]
   +--- stdex-listing.mod [Listing Module]

NOTE The values of gtext attributes are generated by a particular processor. Those attributes generation and rendering can be done, for example, by an XSL processor.

2.3 Samples of XML document instances

Three samples of XML document instances are attached as a zipped file, attach2.zip, where the two document instance files:

test1.xml
term1.xml
are sample XML documents translated from test1.sgml and term1.sgml respectively. The test1.sgml and term1.sgml are included in the set of original SGML-DTD files.

Another sample in the attach2.zip is an XMLized International Standard ISO 15178.

3. Final Report of XML-DTD

As described in the Activity report 3, the final report of the XML-DTD will be submitted including the discussion of the XML-Study Group Meeting in Geneva, March 2001.

The final report entitled with "XML DTD for International Standards and Technical Reports" will consist of some clauses of the document "ITSIG exchange DTD, version 0.94" and explanation of the XML-DTD modules. For making its drafting works more efficient, XML Project/JSA wishes to get a word form or SGML form of the "ITSIG exchange DTD, version 0.94".

4. XSLT translation into HTML

For easy rendering of XMLized ISO standards, we tried to translate the XML documents into HTML documents by using XSLT.

The attach3.zip includes an XSLT specification for the translation.

The attach4.zip includes the two two HTML document instance files:

test1.html
term1.html
translated from the two XML document instance files:
test1.xml
term1.xml
respectively, in accordance with the XSLT specification.