|

Cover Pages Archive
|
|
SGML and XML News |
- October 30, 2000]
RDF Schema Explorer for Querying/Validating/Extending RDF Models. Wolfram Conen recently announced RDF
Schema Explorer -- a service based on Jan Wielemaker's SWI-Prolog (3.4.0), his SGML/RDF
parser, and an adaptation of his CGIServ code. RDF Schema Explorer can be used
as follows: "(1) You can feed some RDF into the Explorer, either by keying it
directly into the text field below or by uploading a file. This will be parsed with Jan
Wielemaker's SWI-Prolog parser and the resulting triples will be asserted to the fact
base. (2) You can check/validate your model against the rule set provided in version 1.2
of the paper 'A
logical interpretation of RDF.' (3) You can query the model repeatedly by using the
provided rule/fact set and by providing your own additional rules/queries. (4) You can
define/extend the semantics for your own predicates directly from within your RDF
document, along the guidelines presented in RDF Semantic Extensions.
Version 1.2 of the paper 'A logical interpretation of RDF' is currently under public
review in the Semantic Web (SEWEB) area of the web site Electronic
Transactions on Artificial Intelligence (ETAI), where it is possible to
comment/discuss the paper." For related resources, see "Resource Description Framework
(RDF)."
- [October 27, 2000]
Electronic Book Exchange (EBX) Working Group Publishes Digital Rights Specification. The
Technical Committee of the Electronic
Book Exchange (EBX) Working Group has published an initial draft digital rights
standard "for protecting copyright in electronic books and for distributing
electronic books among publishers, distributors, retailers, libraries, and
consumers." The Electronic Book Exchange (EBX) Working Group "is an organization
of companies, organizations, and individuals developing a standard for protecting
copyright in electronic books and for distributing electronic books among publishers,
distributors, retailers, libraries, and consumers. The draft EBX specification
accommodates a variety of content formats for electronic books, including Open eBook
Publication Structure and Adobe Portable Document Format (PDF). The EBX Working Group
operates under the auspices of the Book Industry Study
Group." The draft specification is The Electronic Book Exchange System (EBX).
Version 0.8. July 2000 Draft, 109 pages. [An interim, incomplete draft and has not been
approved as a standard by the EBX Working Group.] "[This is] the complete technical
specifications for the Electronic Book Exchange (EBX) system for interoperable
applications and devices that use public-key cryptography for copyright protection and
distribution of electronic books. The EBX system is being developed by the EBX Working
Group, whose members are Adobe Systems Incorporated, the American Library Association,
Audible, ContentGuard, DigitalOwl.com, Glassbook, GlobalMentor, Nokia, RightsMarket.com,
SoftLock.com, Thomson Consumer Electronics, Versaware, and Yankee Rights Management."
This document describes the Electronic Book Exchange (EBX) system. The EBX system defines
the way in which electronic books (e-books) are distributed from publishers to booksellers
and distributors, from booksellers to consumers, between consumers and between consumers
and libraries. It describes the basic requirements of electronic book reading devices and
the electronic books themselves. It also describes how these 'trusted' components interact
to form a comprehensive copyright protection system that both protects the intellectual
property of authors and publishers as well as describes the capabilities required by
consumers. In addition, the model describes in general how products and revenue for those
products are generated and managed." While the The EBX system does not define a
specific 'content' file format, it does define vouchers, which are encoded in XML.
"EBX is primarily concerned with the creation and transfer of digital objects called
vouchers. A voucher is an electronic description of e-book permissions transferred from
one book owner in the network to another book owner. EBX vouchers are encoded in
XML." A sample EBX
voucher encoded in XML is available for inspection. See in this connection the feature
article by Mark Walter and Mike Letts "Mad Scramble for
Mindshare In Digital Rights Management. [Digital Rights Management: Peacekeepers
Needed]," in The
Seybold Report on Internet Publishing Volume 5, Number 2 (October 2000), pages
9-15. For DRM in relation to XML, see: (1) Extensible Rights Markup Language (XrML);
(2) Digital Property Rights Language
(DPRL); (3) Electronic Book
Exchange (EBX) Working Group; (4) Open
Digital Rights Language (ODRL); (5) Open eBook Initiative; (6) "IOTP Requirements for
Digital-Right Trading."
[October 27, 2000]
Payment API for Internet Open Trading Protocol Version 1.
Members of the Internet Open Trading Protocol Working Group have published a
specification for the "Payment
API for v1.0 Internet Open Trading Protocol (IOTP)." IETF Internet Draft. TRADE
Working Group. By Hans-Bernhard Beykirch, Werner Hans, Masaaki Hiroya, and Yoshiaki
Kawatsura. Reference: 'draft-ietf-trade-iotp-v1.0-papi-02.txt'. September 2000. "The
Internet Open Trading Protocol provides a data exchange format for trading purposes while
integrating existing pure payment protocols seamlessly. This motivates the multiple
layered system architecture which consists of at least some generic IOTP application core
and multiple specific payment modules. This document addresses the common interface
between the IOTP application core and the payment modules, enabling the interoperability
between these kinds of modules. Furthermore, such an interface provides the foundations
for a plug-in-mechanism in actual implementations of IOTP application cores. Such
interfaces exist at the Consumers', the Merchants' and the Payment Handlers' installations
connecting the IOTP application core and the payment software components/legacy systems. .
.The Payment API is formalized using the Extensible Markup Language (XML). It defines
wrapper elements for both the input parameters and the API function's response. In
particular, the response wrapper provides common locations for Error Codes and Error
Descriptions. It is anticipated that this description reflects the logical structure of
the API parameter and might be used to derive implementation language specific API
definitions..." Relevant XML DTDs are presented in the draft document. The Internet Open Trading Protocol
"provides an interoperable framework for Internet commerce. It is optimized for the
case where the buyer and the merchant do not have a prior acquaintance and is payment
system independent. It will be able to encapsulate and support payment systems such as
SET, Mondex, CyberCash's CyberCoin, DigiCash's e-cash, GeldKarte, etc. IOTP is able to
handle cases where such merchant roles as the shopping site, the payment handler, the
deliverer of goods or services, and the provider of customer support are performed by
different Internet sites." See related specifications referenced in "Internet Open Trading Protocol
(IOTP)."
[October 26, 2000]
XML Encoding for Sumerian Literary Texts.
A recent communiqué from the University of Oxford Electronic Text Corpus of Sumerian Literature
reports on initial work toward the creation of XML DTDs for transliteration-level encoding
and publishable translations for a large online corpus of Sumerian texts. This markup
language design is part of a broader project endeavor to analyze the digital library
corpus in order to document and describe aspects of its style, lexis, grammar and
register. The goal of the Sumerian Literature project, now substantially completed, has
been "to produce a 'collected works' of over 400 poetic compositions of the classical
[Sumerian] literature, equipped with translations. This standardised, electronically
searchable SGML corpus, which is based to a large degree on published materials, comprises
some 400 literary compositions of the Isin/Larsa/Old Babylonian Period, amounting to
approximately 40,000 lines of verse (excluding Emesal cult songs, literary letters, and
magical incantations). The full
catalogue can be found on the project web site. The online compositions are presented
in single-line composite text format (in a standardised transliteration) with
newly-prepared English prose translations, and a full bibliographical database, thereby
making available for the first time a collected works of Sumerian literature. The corpus
is freely available to anyone who wishes to use it via this World Wide Web site... The
literature written in Sumerian is the oldest human poetry that can be read, dating from
approximately 2100 to about 1650 BC. The main 'classical' corpus can be very roughly
estimated at 50,000 lines of verse, including narrative poetry, praise poetry, hymns,
laments, prayers, songs, fables, didactic poems, debate poems and proverbs. The majority
of this has been reconstructed during the past fifty years from thousands of often
fragmentary clay tablets inscribed in cuneiform writing. Relatively few compositions are
yet published in satisfactory modern editions. Much is scattered throughout a large number
of journals and other publications. Several important poems must still be consulted in
twenty-year-old unpublished doctoral dissertations, some with translations which have now
become unusable because of progress in our knowledge of the language. Major compositions
have not yet been edited at all. The slow progress of research, with little organised
collaboration until recently, means that Sumerian literature has [hitherto] remained
inaccessible to the majority of those who might wish to read or study it, and virtually
unknown to a wider public." This University of Oxford project is representative of a
large number of academic digital library projects which are migrating from SGML to
XML-based markup for document structuring and delivery. The extraordinary "Perseus Project" --
with its thousands of SGML-encoded Greek and Latin texts -- provides another example. For
additional description and references on the Oxford project, see "Electronic Text Corpus of Sumerian
Literature (ETCSL)." The development of online reference materials for ancient
civilizations presents significant challenge to the extent that the ancient writing
systems (especially cuneiform and hieroglyphic) are very complex, and the literary
traditions very rich recensionally. Standards efforts have been painfully slow, but
collaborative work is now being done in several research initiatives. See a partial
listing of projects in the document "Encoding and Markup for
Texts of the Ancient Near East." Readers are encouraged to send notification concerning related ancient language
projects.
[October 25, 2000]
Transaction Authority Markup Language (XAML).
From a recent announcement: "Leading proponents of e-business interoperability
Bowstreet, Hewlett-Packard Company, IBM, Oracle Corporation, and Sun Microsystems, today
announced they are leading an initiative to define a vendor-neutral industry standard that
will enable the coordination and processing of on-line, multi-party transactions in the
rapidly emerging world of XML-based web services. The initiative is called XAML (Transaction Authority Markup Language). [The XAML
initiative addresses] coordinated processing of transaction-supporting web services
between internal fulfillment services (the chemical provider's inventory system) and
external services such as: (1) An insurance policy service to insure the product being
shipped; (2) A financing service to ensure payment according to vendor terms; (3) A
transportation service to guarantee timely shipment/delivery of product; (4) A regulatory
service to ensure compliance with government safety requirements. The XAML standard will:
(1) Provide a specification for the XML message interfaces and interaction models of web
services to support the coordination and processing of multi-stage transactions on the
Internet; (2) Specify interfaces and protocols that preserve investment and strengths in
transaction monitors and resources; (3) Specify interfaces and protocols that can be
'added on' to existing and emerging web service interfaces and protocols; (4) Specify
interaction models for software systems to provide business-level transactions that
coordinate the processing of multiple distributed web services; (5) Build on existing and
emerging industry standards. The XAML initiative is so-named because it is an extension of
XML, the common language of e-commerce, which supports transactional semantics as defined
by the widely adopted standard for two-phase commit, XA (Transaction Authority). XAML
intends to provide a means for transaction supporting web services to participate in
higher-level business transactions. The XAML proposal will be submitted to one or several
standards bodies that may include the W3C, OASIS (Organization for the Advancement of
Structured Information Standards) and/or the IETF (Internet Engineering Task Force)."
For other details, see: (1) the "XAML
Transaction Authority Markup Language White Paper" and (2) the full text of the announcement:
"Bowstreet, HP, IBM, Oracle and Sun Microsystems Join Forces to Create Standard for
e-Business Transactions Across the Internet. XAML proposal focuses on creating XML
standard to guarantee multi-vendor transactional integrity across web services... There
are emerging standards for web services (such as SOAP, ebXML, XP, UDDI, e-Speak and WSDL),
and there are existing transaction management standards (such as XA and JTA); XAML ties
the two groups of standards together..." For other references, see "Transaction Authority Markup
Language (XAML)."
[October 25, 2000]
XLink Markup Name Control.
W3C has published a NOTE under the title XLink Markup Name Control.
Reference: W3C Note 24-October-2000, edited by [W3C XML Linking Working Group co-chairs] Eve Maler (Sun Microsystems) and Daniel Veillard (W3C). Document abstract: "This
document proposes a possible XML Schema-based solution to the need to use XLink in
XML-based languages such as XHTML 1.0." The note addresses the particular problem of
'namespaces' when attempting to upgrade existing document markup to be interpreted as
XLink syntax. "Currently, XLink
requires applications to recognize a particular set of attribute names in the XLink
namespace in order to do their work... [suppose] you already have some marked-up
information that provides some of the same kinds of linking information that XLink is
designed to provide: in order to incorporate XLink usage directly into the existing
vocabulary as a first-class construct, you would have to force the vocabulary to undergo a
backwards-incompatible change from href to xlink:href. XLink's
attributes must have namespace prefixes on them because of the way XML namespaces work;
'global' attributes that can be attached to any element must be prefixed because they
cannot identify themselves in any other way..." The NOTE's proposed solution builds
upon a suggestion from Henry Thompson of the W3C XML Schema Working Group. A future
version of W3C XLink might allow applications "to take advantage of XML Schema
datatypes instead, or in addition, as a way to recognize Schema-XLink data. The idea is
that any attribute name could be used, as long as the attribute were 'marked' with an
appropriate datatype, made available through a post-schema-validation information set or
by other means. . . If Schema-XLink were to define such datatypes, it could provide a
normative XML Schema module that merely contains a series of type definitions. Note,
however, that as of this writing, XML Schema does not have facilities to specify
additional normative constraints of the style that XLink needs; prose would still be
needed to specify the combinations of attribute types that are expected to appear on
particular 'XLink element types'..." For related references, see "XML Linking Language."
[October 25, 2000]
Using CSS for Everything.
As part of the W3C style activity,
the W3C CSS Working Group has produced a "work in
progress" Working Draft specification Syntax of CSS Rules in HTML's
"style" Attribute. Reference: W3C Working Draft
25-October-2000, edited by Tantek Çelik
(Microsoft) and Bert Bos (W3C). Document abstract:
"HTML provides a 'style' attribute on most elements, to hold a
fragment of a style sheet that applies to those elements. One of the
possible style sheet languages is CSS. This draft describes the
syntax of the CSS fragment that can be used in the 'style'
attribute." The WD illustrates how one can directly express and
control processing behaviors on an element-by-element basis
throughout a document (whether visual, aural, tactile, or other
behaviors) through the use of a globally-defined style
attribute. Thus, from the examples offered, one can: (1) specify
display behavior by "setting properties on the element itself,
[using] no pseudo-elements or pseudo-classes; (2) colorize a first
letter (by "setting properties on the element, as well as on
the first letter of the element, by means of the ':first-letter'
pseudo-element"; (3) regulate other appearances and effects, by
"setting properties on a source anchor for each of its dynamic
states, using pseudo-classes." The Working Draft document
"defines both the simple case (only properties on the element
itself), as well as the more complex case (properties on the
element's pseudo-elements and pseudo-classes), and generalizes the cascading order rule
for "the case where the inline fragment contains inline
rule-sets: the declarations are treated the same as if they occured
in the same order at the end of the author's style sheet with a
specificity equal to that of a selector with one ID-selector and as
many pseudo-elements and pseudo-classes as in the inline
rule-set." The working draft would appear to extend the life of
HTML 4.0's style attribute, along with the HTML META HTTP-EQUIV Content-Style-Type
selector mechanism, since the processing specification principle is
to be applied generally to any XML vocabularies (document types) in
which designers want to be able to control processing directly from
within the document instance. The WD states: "This document
recommends that any future XML based languages which have
presentational information (whether visual, aural, tactile, or
other) also add a STYLE attribute which similarly permits
the user to use CSS to style the document and elements in documents
written in that language." For one person's doubts about this
apparent notion of a 'global use' attribute without (apparent)
namespace declaration mechanism, see the W3C comment
list. For other references on CSS, see "W3C Cascading Style Sheets."
- [October 24, 2000]
XML Schema Becomes a W3C Candidate Recommendation.A W3C press release announces the publication of
XML Schema as a W3C Candidate
Recommendation. "The World Wide Web Consortium (W3C) has issued XML Schema as a
W3C Candidate Recommendation. Advancement of the document to Candidate Recommendation is
an invitation to the Web development community at large to make implementations of XML
Schema and provide technical feedback. Simply defined, XML Schemas define shared markup
vocabularies and allow machines to carry out rules made by people. They provide a means
for defining the structure, content and semantics of XML documents. 'Databases, ERP and
EDI systems all know the difference between a date and a string of text, but before today,
there was no standard way to teach your XML systems the difference. Now there is,'
declared Dave Hollander, co-chair of the W3C XML Schema Working Group and CTO of Contivo,
Inc. 'W3C XML Schemas bring to XML the rich data descriptions that are common to other
business systems but were missing from XML. Now, developers of XML ecommerce systems can
test XML Schema's ability to define XML applications that are far more sophisticated in
how they describe, create, manage and validate the information that fuels B2B ecommerce.'
By bringing datatypes to XML, XML Schema increases XML's power and utility to the
developers of electronic commerce systems, database authors and anyone interested in using
and manipulating large volumes of data on the Web. By providing better integration with
XML Namespaces, it makes it easier than it has ever been to define the elements and
attributes in a namespace, and to validate documents which use multiple namespaces defined
by different schemas. XML Schema introduces new levels of flexibility that may accelerate
the adoption of XML for significant industrial use. For example, a schema author can build
a schema that borrows from a previous schema, but overrides it where new unique features
are needed. his principle, called inheritance, is similar to the behavior of Cascading
Style Sheets, and allows the user to develop XML Schemas that best suit their needs,
without building an entirely new vocabulary from scratch. XML Schema allows the author to
determine which parts of a document may be validated, or identify parts of a document
where a schema may apply. XML Schema also provides a way for users of ecommerce systems to
choose which XML Schema they use to validate elements in a given namespace, thus providing
better assurance in ecommerce transactions and greater security against unauthorized
changes to validation rules. Further, as XML Schema are XML documents themselves, they may
be managed by XML authoring tools, or through XSLT. . . Candidate Recommendation is W3C's
public call for implementation, an explicit invitation for W3C members and the developer
community at large to review the XML Schema specification and build their own XML Schemas.
This period of implementations and reporting allows the editors to learn how developers
outside of the Working Group might use them, and where there may be ambiguities for
implementors. Public testing and implementation contribute to a more robust XML Schema,
and to more widespread use." The CR specification is published in three parts: (1) XML Schema Part 1: Structures.
W3C Candidate Recommendation 24-October-2000, edited by Henry S. Thompson (University of Edinburgh), David Beech (Oracle Corp.), Murray Maloney (for Commerce One), and Noah Mendelsohn (Lotus Development
Corporation). Part 1 defines the XML Schema definition language, "which offers
facilities for describing the structure and constraining the contents of XML 1.0
documents, including those which exploit the XML Namespace facility. The schema language,
which is itself represented in XML 1.0 and uses namespaces, substantially reconstructs and
considerably extends the capabilities found in XML 1.0 document type definitions (DTDs).
This specification depends on XML Schema Part 2: Datatypes. Appendix A
supplies a normative "Schema for Schemas"; Appendix F contains a non-normative
"DTD for Schemas"; Appendix J gives brief summaries of the substantive changes
to this specification since the public working draft of 7 April 2000. (2) XML Schema Part 2: Datatypes.
W3C Candidate Recommendation 24-October-2000, exited by Paul V. Biron (Kaiser Permanente, for Health Level
Seven) and Ashok Malhotra (IBM). Part 2 of the
specification for the XML Schema language "defines facilities for defining datatypes
to be used specifications. The datatype language, which is itself represented in XML 1.0,
provides a superset of the capabilities found in XML 1.0 document type definitions (DTDs)
for specifying datatypes on elements and attributes." Appendix A provides the
normative "Schema for Datatype Definitions" and Appendix B gives the
non-normative "DTD for Datatype Definitions." (3) XML Schema Part 0: Primer.
W3C Candidate Recommendation 24-October-2000, edited by
David C. Fallside (IBM). "XML Schema Part 0: Primer is a
non-normative document intended to provide an easily readable description of the XML
Schema facilities and is oriented towards quickly understanding how to create schemas
using the XML Schema language. XML Schema Part 1: Structures and XML
Schema Part 2: Datatypes provide the complete normative description of the XML
Schema language -- this primer describes the language features through numerous examples
which are complemented by extensive references to the normative texts." In connection
with this CR publication, Henry Thompson announced the availablility of a self-installing version of XSV, the
W3C/University of Edinburgh XML Schema validator; 'WIN32 for now, UN*X coming soon'. See
also: (1) Testimonials for XML
Schema Candidate Recommendation; (2) the longer memo from Henry S. Thompson (Janet
Daly) explaining why the I18N WG dissented from the specification's treatment of dates
and times, and the CR exit criteria; (3) W3C
XML Schema; and (4) full references in "XML Schemas."
- [October 24, 2000]
Rapid Progress on XML Topic Maps. Several subgroups
working within the TopicMaps.org initiative are making noteworthy progress: these include
the Interchange Syntax Subgroup (ISS), Conceptual Model Subgroup (CMS), Authoring Group
(AG), XTM Syntax Subgroup, and XTM Modeling Subgroup. Murray Altheim (Sun Microsystems) recently
announced the availability of minutes from the October
13-15 XTM meeting in Swindon. This document is available on the XTM Repository web site, which provides
also the current 'discussion DTD',
the Topicmaps.Org Charter and By-laws,
and other relevant resources. The eGroups mailing list 'xtm-wg@egroups.com' serves as host for XTM
email. The XTM Conceptual Model Subgroup (CMS) is attempting to explicate the relationship
between the Topic Maps and RDF models, and has participated in a number of discussions
with W3C's RDF design teams; the XTM syntax may be recommended for the interchange
('serialization') of RDF statements. See further references in "(XML) Topic Maps."
- [October 24, 2000]
XSLTMark: XSLT Benchmark and Compliance Testing Suite.Eugene Kuznetsov has announced the availability of
XSLTMark, an XSLT benchmark and a small compliance testing suite. "XSLTMark Version 1.1.0 is available now and
is the first release to the general public. The XSLTMark test cases have been designed to
challenge processors with a variety of tasks and input conditions in order to provide a
well-rounded benchmark and to facilitate analysis of which processors perform best for
various applications. XSLTMark measures performance in four major categories: (1) Pattern
Matching - this category covers XSLT template pattern matching and template instantiation.
This performance category is important to stylesheets with many template rules and with
many expected apply-template invocations. (2) XPath Selection - this category covers
nodeset selection through the evaluation of XPath path expressions. This performance
category is crucial for stylesheets that contain lots of xpath expressions, particularly
ones with predicates. (3) XPath Library Functions - this category covers the execution of
XPath library functions, particularly the frequently used string functions. This category
is most important to stylesheets that perform a lot of string processing. (4) XSLT Control
- this category covers the control structures defined by XSLT elements, including variable
and parameter handling. This category is most relevant for stylesheets that perform tricky
calculations involving calling templates with parameters. . . There are about 40 different
testcases in this release; see documentation for descriptions and several third-party
credits. A variety of java and C/C++ processors are supported, and drivers for other XSLT
engines are easy to add. Source and makefiles are being released (with an emphasis on
Linux X86, although Win32 X86 and Solaris SPARC are also supported, and other platforms
should be fairly straightforward). We are also making available some initial benchmark
results for several popular and well-regarded XSLT processors. We welcome comments,
benchmark results submissions and new test drivers for other XSLT processors. DataPower's
XSLTMark is the first comprehensive benchmark for measuring the performance of XSL
processors. It can be used to test the XSLT performance of XSL processors for XML-to-XML
and XML-to-HTML transformations. It also provides basic compliance testing to ensure that
benchmark results are not distorted by incorrectly functioning processors. The benchmark
is a java application that uses a 'Driver' class to communicate with the XSL processor
under test. Both java and native (C/C++) processors are supported, with driver modules
available for many popular XSLT engines on a variety of platforms. XSLTMark is currently
being used for performance and compliance testing at DataPower, but also has a core suite
of tests to yield benchmark figures for external comparison purposes. The tool features:
(1) Processing throughput measurement; (2) Normalized score calculation; (3) Balanced test
suite; (4) Optional standards compliance testing; (5) Support for processors written in
both java and C/C++; (6) Cross-platform operation; (7) Test drivers for most popular XSLT
processors [XT (James Clark), Saxon (Michael Kay), Transformiix (Mozilla), Xalan-J
(Apache), Xalan-C++ (Apache), MSXML (Microsoft)]. For related topics, see "XSLT/XPath
Conformance."
- [October 24, 2000]
W3C Specification for Modularization of XHTML Advances to Candidate Recommendation. The
W3C specification for the Modularization of XHTML
has been promoted to the status of a Candidate
Recommendation. Reference: W3C Candidate Recommendation 20-October-2000, edited by
Robert Adams (Intel Corporation), Murray Altheim (Sun Microsystems), Frank Boumphrey (HTML
Writers Guild), Sam Dooley (IBM), Shane McCarron (Applied Testing and Technology),
Sebastian Schnitzenbaumer (Mozquito Technologies), and Ted Wugofski (Phone.com). The new
Candidate Recommendation "specifies an abstract modularization of XHTML and an
implementation of the abstraction using XML Document Type Definitions (DTDs). This
modularization provide a means for subsetting and extending XHTML, a feature needed for
extending XHTML's reach onto emerging platforms." Status: This version of XHTML
"incorporates some comments from the Last Call Working Draft review period. A diff-marked
version from the Last Call draft is available for comparison purposes. Major changes
in this version include: (1) Re-integration of the Building document into this document;
(2) Incorporation of the Henry Thompson/Dan Connolly XML Namespace handling process with
substantial additions by the Math and HTML working groups; (3) Complete worked examples
including modules and miniature DTDs; (4) Minor restructuring of abstract module
definitions, including the creation of a 'style attribute module', a 'name identification
module' and a 'target' module; (5) Tweaking of some of the module contents based on review
comments. On 20 October 2000, this document enters a Candidate Recommendation review
period. From that date until 17-November-2000, W3C members are encouraged to review and
implement this specification and return comments to w3c-html-editor@w3.org. W3C is looking for
testimonials from users of this specification. Additionally, experience using all of the
modules is being sought to create a coverage table of the use of each module. These two
criteria are needed to advance this specification to Proposed Recommendation."
Available from W3C as a Single
HTML file, Postscript
version, PDF
version, ZIP
archive, or Gzip'd
TAR archive. See additional references in "XHTML and 'XML-Based' HTML
Modules."
- [October 24, 2000]
Update of XGMML (Extensible Graph Markup and Modeling Language) Schema. John Punin has announced an updated version of the XML
Schema for XGMML (Extensible Graph Markup and Modeling Language). The revised XML Schema is based on the W3C
XML Schema Working Draft 22-September-2000; it has been validated using the XSV Validator version 1.166/1.77. XGMML (Extensible Graph Markup and
Modeling Language) "is an XML application based on GML which is used for graph
description. XGMML uses tags to describe nodes and edges of a graph. The purpose of XGMML
is to make possible the exchange of graphs between differents authoring and browsing tools
for graphs. The conversion of graphs written in GML to XGMML is trivial. Using XSL with
XGMML allows the translation of graphs to different formats. XGMML was created to be used
for the WWWPAL System that visualizes web sites as a graph. Web Robots can navigate
through a web site and save the graph information as an XGMML file. XGMML, as any other
XML application, can be mixed Robots can navigate through a web site and save the graph
information as an XGMML file. XGMML, as any other XML application, can be mixed with other
markup languages to describe additional graph, node and/or edge information." The XGMML 1.0 Draft Specification
is available online. See further reference in "Extensible Graph Markup and
Modeling Language (XGMML)."
- [October 24, 2000]
Pixy System 2 Astronomical Image Software Uses RELAX/Relaxer. Murata Makoto recently announced the availability
of Pixy System 2 from the MISAO Project. The
MISAO Project "aims to make much use of images taken all over the world for searching
and tracking astronomical remarkable objects." Pixy (Practical Image eXamination and
Inner-objects Identification system) "is an automated astronomical image examination
system developed by Seiichi Yoshida, used in the cource of new object survey of the MISAO
Project. It automatically detects all stars from an image, collates them with star data
recorded in catalogs such as GSC, USNO-A2.0, etc., and finds out new objects or variable
stars. It also prints out all astrometric and photometric data of all detected stars, so
it is also useful for astrometry of minor planets or photometry of variable stars."
Pixy System 2 is implemented using Java language; it runs both on a Windows PC and a UNIX
workstation. The software is available for download, and its API documentation may be read
online. Pixy System 2 "heavily uses RELAX/Relaxer: the class files in the net.aerith.misao.xml.relaxer
package are created by Relaxer
from the RELAX files." Relaxer is a Java
class generator that operates on a XML document defined by a RELAX grammer. In the new
version of Pixy, one may 'save the examination result in XML file, and review of the
desktop from the XML file'.
- [October 19, 2000]
Research Information Exchange Markup Language (RIXML).
A recent announcement from
a group of industry financial leaders describes the formation of RIXML.org, created to
support the development of "an open protocol to improve the process of categorizing,
aggregating, comparing, sorting, and distributing global financial research." The
fifteen founding members of RIXML.org include five Asset Managers and ten Broker-Dealers.
Details: "A group of major financial firms announced the formation of RIXML.org, a
global industry association of buy-side and sell-side firms whose mission is to develop an
open standard for the electronic exchange of investment research. The new specification,
to be known as RIXML (Research Information eXchange Markup Language), will be based on
XML, the emerging standard for data sharing between applications. RIXML will provide a
structure for parsing and classifying investment research in a way that enables recipients
to access information in a customizable format through standard sorting and filtering
criteria. Once developed through the collaborative efforts of the association's members,
the RIXML specification will be made available for use by firms within the financial
services industry as well as other interested parties." See (1) the text of the
announcement: "Financial
Industry Leaders Join Forces to Develop a Global Standard for Investment Research",
and (2) "Research Information
Exchange Markup Language (RIXML)."
- [October 18, 2000]
New Working Draft for W3C Extensible Stylesheet Language (XSL) Version 1.0. Max Froumentin (W3C XSL Staff Contact) announced the release
of a new working draft for Extensible
Stylesheet Language (XSL) Version 1.0. Reference: W3C Working Draft
18-October-2000. By Sharon Adler (IBM), Anders Berglund (IBM), Jeff Caruso (Pageflex), Stephen Deach (Adobe), Paul Grosso (ArborText), Eduardo Gutentag (Sun), Alex Milowski (Lexica), Scott Parnell (Xerox), Jeremy Richman (BroadVision), and Steve Zilles (Adobe). Document overview: "This
specification defines the Extensible Stylesheet Language (XSL). XSL is a language for
expressing stylesheets. Given a class of arbitrarily structured XML documents or data
files, designers use an XSL stylesheet to express their intentions about how that
structured content should be presented; that is, how the source content should be styled,
laid out, and paginated onto some presentation medium, such as a window in a Web browser
or a hand-held device, or a set of physical pages in a catalog, report, pamphlet, or book.
Formatting is enabled by including formatting semantics in the result tree. Formatting
semantics are expressed in terms of a catalog of classes of formatting objects. The nodes
of the result tree are formatting objects. The classes of formatting objects denote
typographic abstractions such as page, paragraph, table, and so forth. Finer control over
the presentation of these abstractions is provided by a set of formatting properties, such
as those controlling indents, word- and letter-spacing, and widow, orphan, and hyphenation
control. In XSL, the classes of formatting objects and formatting properties provide the
vocabulary for expressing presentation intent. The XSL processing model is intended to be
conceptual only. An implementation is not mandated to provide these as separate processes.
Furthermore, implementations are free to process the source document in any way that
produces the same result as if it were processed using the conceptual XSL processing
model." WD status: "This version supersedes the previous draft released on March 27, 2000. The
working group is issuing this interim public draft as it sets out a number of changes made
in response to comments received on the Last Call draft. The Working Group intends to
submit a revised version of this specification for publication as a Candidate
Recommendation in the near future. Items under consideration for change for Candidate
Recommendation include the name of the font-height-override-before and
font-height-override-after properties. Discussion is invited and comments can be sent to
the editors at xsl-editors@w3.org." The WD is
available in several formats: PDF, XML file, HTML single file and .ZIP file. See related
references in "Extensible
Stylesheet Language (XSL)."
[October 18, 2000]
jUDDI: Bowstreet Hosts Open Source Java-based UDDI Toolkit Development on SourceForge.
Bowstreet recently announced
'jUDDI' as "the industry's first implementation of broad industry-initiated standard
to link e-businesses to the 'Yellow Pages' of B2B web services. The jUDDI implementation,
available immediately, comes after Ariba, IBM and Microsoft unveiled a draft specification
of the UDDI standard. Bowstreet has introduced
jUDDI as free, open source software that is available for anyone to use. UDDI -- which
stands for Universal Description, Discovery and Integration -- is designed to make it easy
for businesses to create partnerships and new business models using platform-neutral
application components called web services. The initiative will create a distributed
registry, or Yellow Pages, for publishing, finding and using web services that companies
wish to offer to the marketplace. Bowstreet's jUDDI (pronounced 'Judy') is an open source
Java-based toolkit for developers to make their applications UDDI-ready. jUDDI-enabled
applications will be able to look up a web service in a UDDI registry. A retail chain, for
example, could use the toolkit to jUDDI-enable its online catalog. With jUDDI, the catalog
could call another company's shopping cart and a third company's transaction web service,
creating an instant web-based store. Companies will eventually create many connections
like this, spawning "business webs," or dynamic collections of businesses, on a
massive scale. jUDDI and UDDI will complement DSML (Directory Services Markup Language) --
the directory services standard launched last year by Bowstreet, IBM, Microsoft, Oracle
and the Sun-Netscape Alliance. Directories provide users with a powerful way to manage web
services, including web services published in UDDI registries. Bowstreet sees synergy
between DSML and UDDI and will actively explore a relationship between the two
specifications, according to Tauber, who is chairman of the DSML 2.0 working group. The jUDDI project is hosted at
SourceForge and available as downloadable software from www.juddi.org.
jUDDI is the latest in a long line of Bowstreet's industry firsts that advance
intercompany interoperability on the Internet. 'UDDI, Microsoft's .NET, HP's e-Speak,
ebXML, DSML and a host of other initiatives confirm what Bowstreet customers already
know,' said Bob Crowley, Bowstreet's president and chief executive officer. 'They know
that plug-and-play e-commerce is possible and inevitable for the 21st century, because
they're doing it.' Bowstreet, a founding advisor to the UDDI initiative, was one of the
first companies to recognize the importance of web services and act on it commercially. In
1998, the company announced a software architecture for deploying and managing web
services across multiple vendor platforms." For references on UDDI, see "Universal Description, Discovery,
and Integration (UDDI)."
[October 18, 2000]
Revised Working Draft for W3C's Platform for Privacy Preferences 1.0 Specification.
As part of the W3C's P3P Activity, the P3P
Specification Working Group has released a new 'last call' working draft for the The Platform for Privacy Preferences 1.0
(P3P1.0) Specification. Reference: W3C Working Draft 18-October-2000, edited by
Massimo Marchiori (W3C/MIT/UNIVE). Description:
"The Platform for Privacy Preferences Project (P3P) enables Web sites to express
their privacy practices in a standard format that can be retrieved automatically and
interpreted easily by user agents. P3P user agents will allow users to be informed of site
practices (in both machine- and human-readable formats) and to automate decision-making
based on these practices when appropriate. Thus users need not read the privacy policies
at every site they visit. The P3P1.0 specification defines the syntax and semantics of P3P
privacy policies, and the mechanisms for associating policies with Web resources. P3P
policies consist of statements made using the P3P vocabulary for expressing privacy
practices. P3P policies also reference elements of the P3P base data schema -- a standard
set of data elements that all P3P user agents should be aware of. The P3P specification
includes a mechanism for defining new data elements and data sets, and a simple mechanism
that allows for extensions to the P3P vocabulary. P3P policies use an XML encoding of the
P3P vocabulary to identify the legal entity making the representation of privacy practices
in a policy, enumerate the types of data or data elements collected, and explain how the
data will be used. In addition, policies identify the data recipients, and make a variety
of other disclosures including information about dispute resolution, and the address of a
site's human-readable privacy policy." Appendices 4 and 5 of the Working Draft
provide the 'XML Schema Definition'
and the 'XML DTD Definition'. Status:
This Last Call Working Draft is submitted for review by W3C members and other interested
parties; the last call review period ends 31 October 2000. "Following this Last Call
period, the Working Group intends to submit this specification for publication as a
Candidate Recommendation." For related references, see "Platform for Privacy Preferences
(P3P) Project."
[October 17, 2000]
Systems Biology Markup Language (SBML).
The Caltech ERATO Kitano Systems Biology Project is developing the Systems Biology
Markup Language (SBML), using XML and UML for representation and modeling of the
information components in the system. The research team is attempting to specify "a
common, model-based description language for systems biology simulation software; we call
this the Systems Biology Markup Language (SBML). The overall goal is to develop an open
standard that will enable simulation software to communicate and exchange models,
ultimately leading to the ability for researchers to run simulations and analyses across
multiple software packages. SBML is the result of merging the most obvious
modeling-language features of BioSpice, DBSolve, E-Cell, Gepasi, Jarnac, StochSim, and
Virtual Cell. The description language is encoded in XML, the Extensible Markup Language.
The XML encoding of the description language can define a file format; however, at this
time, we are focusing on using the XML-based description language as an interchange format
for use in communications between programs. Appendix B [in the principal specification]
contains the current version of this XML schema. As XML Schemas are difficult to read and
absorb by human readers, we define the proposed data structures using a succinct graphical
notation based on a subset of UML, the Unified Modeling Language. . . The SBML
representation language is organized around five categories of information: model,
compartment, geometry, specie, and reaction. Not all ofthese will be needed by every
simulation package; rather, the intent is to cover the range of data structures needed by
the collection of all of the simulators examined so far..." For other description and
references, see "Systems Biology
Markup Language (SBML)."
- [October 16, 2000]
Gnome XML Library's libxml-2.2.5 Supports XPointer and XPath.
Daniel Veillard (W3C) posted an announcement for the
release of libxml-2.2.5 in the Gnome XML library with XPointer and XPath
implementations, including an initial testsuite. "I usually don't post announces of
new releases of libxml to xml-dev, but this this version has XPointer support which was
requested previously here I think it makes sense: Libxml is the XML C library developped
for the Gnome project, it allow to parse, manipulate, and save XML and HTML documents but
it does not expose a GUI interface. Here are some key points about libxml (a.k.a.
gnome-xml): (1) Libxml exports Push and Pull type parser interfaces for both XML and HTML.
(2) Libxml can do DTD validation at parse time, using a parsed document instance, or with
an arbitrary DTD. (3) Libxml now includes a nearly complete XPath and XPointer
implementations. (4) It is written in plain C, making as few assumptions as possible, and
sticking closely to ANSI C/POSIX for easy embedding. Works on Linux/Unix/Windows (5) Basic
support for HTTP and FTP client allowing to fetch remote resources (6) The design is
modular, most of the extensions can be compiled out. (7) The internal document
repesentation is as close as possible to the DOM interfaces. (8) Libxml also has a SAX
like interface; the interface is designed to be compatible with Expat. (9) This library is
released both under the W3C IPR and the GNU LGPL; use either at your convenience. URLs for
libxml include http://xmlsoft.org/ and ftp://xmlsoft.org." For related XLink/XPointer tools,
see "XML Linking and Addressing
Languages (XPath, XPointer, XLink)."
- [October 16, 2000]
XML Developers' Day Call for Presentations.
Marion Elledge (GCA) has posted a call for presentations
in connection with XML Developers' Day at the XML 2000 Conference. "Since 1984, the
fall GCA SGML/XML conference has been the one annual must-attend event for the structured
markup community. XML was first introduced to the world at this conference in 1996, and
the event continues to be a focal point for meetings of XML-related OASIS, W3C,
IDEAlliance, and ISO working groups. For information on the conference, held this year in
Washington, D.C., see http://www.gca.org/attend/2000_conferences/XML_2000/.
XML Developers' Day on Monday 4 December is intended for conference attendees with a
special interest in the latest XML tools and advanced techniques. If you have applications
that feature innovative uses of XML, this is your chance to share your accomplishments
with other advanced workers. Proposals of 1-3 paragraphs clearly describing the
presentation should be sent in plain text directly to the chair of the XML Dev Day track, Jon Bosak. Submissions must be mailed no later than
Monday, 23-October-2000." See the text of the announcement for details.
- [October 16, 2000]
HL7's Clinical Document Architecture (CDA).
A recent announcement
from Health Level Seven reports on the progress of the Clinical Document Architecture
(CDA): "Health Level Seven, Inc. (HL7) successfully
balloted what it believes to be the first XML-based standard for healthcare -- the
Clinical Document Architecture (CDA). The CDA, which was until recently known as the
Patient Record Architecture (PRA), provides an exchange model for clinical documents (such
as discharge summaries and progress notes) -- and brings the healthcare industry closer to
the realization of an electronic medical record. The CDA Standard is expected to be
published as an ANSI approved standard by the end of the year. By leveraging the use of
XML, the HL7 Reference
Information Model (RIM) and coded vocabularies, the CDA makes documents both
machine-readable-so they are easily parsed and processed electronically-and
human-readable-so they can be easily retrieved and used by the people that need them. CDA
documents can be displayed using XML-aware Web browsers or wireless applications such as
cell phones, as shown by Nokia at the HIMSS 2000 demonstration. The CDA is only the first
example of HL7's commitment to the advancement of XML-based e-healthcare technologies
within the clinical, patient care domain. Along with the CDA, HL7 is developing XML-based
Version 3 messages. These Version 3 messages enhance the usability of HL7 by offering
greater precision and less optionality, conformance profiles that will help guarantee
compliance, coded attributes linked to standard vocabularies, and an explicit,
comprehensive, and open information model-the HL7 RIM. All this, packaged in a
standardized XML syntax for ease of interoperability. In 1999, HL7 also successfully
balloted a recommendation for sending V2.3.1 messages using XML encoding. In 2001, HL7
will ballot, as a normative standard, a methodology for producing HL7 approved DTDs for
Version 2.4 and previous versions. Said Stan Huff, chair of the HL7 board of directors:
'XML is an encoding that complements the semantic content provided by the HL7 RIM,
allowing users to exploit all the possibilities of the Internet. The extensibility
inherent in XML is resulting in an explosion of schemas and DTDs from diverse sources,
which actually decreases the ability to provide plug and play applications. The
development of a model-based, standardized and industry-accepted application of XML, as
provided by HL7, will help decrease the cost of integration, and improve the reliability
and consistency of communications between disparate systems and enterprises.' HL7's
history with the Web and XML stretches back to the inception of the technologies. The
organization is a long-standing and active member of the World Wide Web Consortium-the
creators and keepers of XML. It has also exchanged sponsor memberships with OASIS, a
non-profit, international consortium that operates XML.org, a global XML industry portal
used to collect and distribute XML schemas." For other information, see "Health Level Seven XML Patient
Record Architecture."
- [October 14, 2000]
StarOffice Software 'Open' Source Available at OpenOffice.org.
A recent announcement from Sun announces the availability of StarOffice source code as
'open' source, and the decision to adopt XML to replace the old binary file format; the
project is dedicated to establishing open productivity XML-based file formats and
language-independent component APIs. "The source code for StarOffice software is now
available under the GNU Public License at OpenOffice.org. Sun has also made the StarOffice
APIs and XML file formats available as well, in an effort to drive standardization across
office productivity suites. Developers around the world now have the freedom to use
StarOffice technology to best suit their needs, whether to improve their own products,
build new value-added products on top of the StarOffice suite, improve existing technology
in StarOffice software, or contribute new StarOffice components to the open source
community. This move opens up the office productivity market to unlimited possibilities
for innovation. In one of the largest actions of its kind, Sun is working with the leaders
of the free software and open source community to make the source code for its StarOffice
software suite freely available under the GNU General Public License (GPL). In addition,
Sun will commit the efforts of its development team, as well as the resources of a $14
billion global company, to work side by side with members of the community to continue to
develop the code at OpenOffice.org, a site hosted
and managed by CollabNet. No longer will any one company determine what is best for the
market or the user, but the market will decide and users will choose. No longer will files
and documents wear the cement shoes of a single vendor or operating system, but standards
will flourish and compatibility reign across platforms. For the first time, a commercial
grade, full-featured office suite will be opened up to the innovative input of the global
developer community." [On the XML File Format:]
"We adopted XML to replace the old binary file format and become the OpenOffice.org
suite's new native file format. Our goals were twofold: to have a complete specification
encompassing all components, and to provide an open standard for office documents. One
single XML format applies to different types of documents -- e.g., the same
definition applies for tables in texts and in spreadsheets. XML is ideal as an open
standard because of the free availability of XML specifications and DTDs, and XML's
support for XSL, XSLT, Xlink, SVG, MathML, and many other important and emerging
standards. Beside replacing the binary file format with XML, the OpenOffice.org suite will
use XML internal for exchanging any type of content between the different applications.
OpenOffice.org provides today an infrastructure for using different XML components. The
XML-Parser and the XML-Printer are all implemented as components. Every of these component
support the Simple API for XML (SAX). This infrastructure will allow in the future to
dynamically configure a pipelines of different XML components, like XML-Parser,
XSLT-Processor, etc. to process XML-Input and Output. This will allow transformation of
XML-Data into different formats on the fly, without storing intermediate files and parse
them again for every transformation step. See the latest draft of the XML File Format Specification; the
XML DTDs
are available through the CVS access. There are many benefits to making StarOffice
software open source, including: (1) Higher quality product. Since there are more
developers on the project fixing bugs, there will be fewer bugs. (2) Faster development
time. Leveraging the efficiencies of the open source model, the community will get access
to new features sooner. (3) Ports to any platform. Since the code is open, anyone can port
the StarOffice code to any platform. (4) Many languages. It will be possible to localize
StarOffice software to any language the community has knowledge of. (5) Standard APIs. A
single API set for manipulating and extending StarOffice software. (6) Standard file
formats. XML will allow any XML-capable program to read StarOffice files. (7) More
templates and sample documents. By building a community, users will be able to share
sample documents, document templates, and macros, making it easier to produce
professional-quality content. . . With XML file formats and language-independent APIs,
OpenOffice.org ushers in an era of compatibility, giving developers the power to innovate
and build new applications that easily work together, regardless of platform. End users
will be able to choose from an array of powerful, free software, assured that their work
is transportable and can be shared with anyone. Sun will continue to drive the development
of the OpenOffice.org source code and distribute its own certified, StarOffice branded
version of the OpenOffice.org software for free. To ensure consumer confidence and promote
uniformity, OpenOffice.org will also allow other companies the opportunity to license the source for
commercial release under a royalty-free Sun Industry Standards Source License
(SISSL) that requires only that they maintain compatibility with the GPL reference
implementation. Companies that meet this requirement may also qualify for and license the
StarOffice brand for use on their product. . . As promised, Sun Microsystems and CollabNet have worked together to build the
infrastructure to put the StarOffice code into the open source arena on October 13, 2000.
The CVS repository is up and running, and the code is now available for checkout and
download. A complete set of technical documentation is available, including a guide to the
projects, whitepapers, a 'build
guide,' and a porting guide..." See (1) the Technical
Overview, (2) the main development web site,
and (3) "StarOffice and
XML."
- [October 13, 2000]
W3C Acknowledges XML Messaging Specification (XMSG).
The W3C has acknowledged receipt of
a submission for XMSG - XML
Messaging Specification. Reference: W3C Note 13-October-2000, by R. Alexander Milowski (of Lexica, LLC). Document abstract: "XMSG is a
specification for using XML to send messages that contain a set of XML documents, embedded
non-XML data, and references to non-XML documents in a fashion that supports scalable
transactions and operates on a participant model." The submission forms a multi-part
document consisting of: (1) the XMSG (XML Messaging)
Specification, (2) an XMSG DTD,
(3) an XMSG Schema, and (4) the XMSG Schema Documentation. Lexica,
LLC requests in the submission that the W3C Consortium include the submission as
consideration in the XML Protocol Activity. Description: The XML messaging specification
"is based on the basic principle of providing a simple way to transport multiple XML
documents within one logical XML construct without dictating any layered semantics of a
messaging protocol that might be layered on top. The general philosophy is to provide the
general structure upon which messaging protocols for specific business or technological
purposes can be layered allowing the identification of that messaging intent but not
dictating the exact syntax and semantics of the subject message. In this way, manifests,
metadata, and other messaging specific constructs can be tailored to specific vertical
markets or technology applications. In general, the idea of an XML message presented by
this specification is three-fold: (1) A pair or triplet of participants involved in the
message are identified by URI values. (2) Metadata may be associated with the message
itself. (3) A set of documents is contained and identified by URI allowing for document
specific metadata. The goals of this specification are (1) To provide the ability to
transport multiple documents and references to associated data objects within a single
document (a 'message') and preserve their identity. (2) To provide the ability to
associate metadata with both the documents and the message without modifying the original
document or schemas for those documents. (3) To provide the ability to transport non-XML
data as a document within the message. (4) To provide a simple way to accomplish XML
messaging." W3C Team Comment
on the NOTE has been provided by Yves Lafon, W3C lead
for XML Protocol Activity: "The submission provides a description of using XML to
send MIME mail like messages that contain XML documents, non-XML data and
references to other documents. In XMSG, a message consists of information about the
message itself, such as the origin, the destination, a unique ID used to identify and
track it, as well as management-oriented information, such as its priority, expire time
and receipt management. XML Documents are embedded using a special tag in the message
format, with an ID to provide easy reference inside the message. Even if errors codes
remain application-specific, having classes of errors may be helpful for
intermediaries..." [cache]
- [October 13, 2000]
Redfoot RDF Store/Viewer/Editor Framework.
James Tauber (Director XML Technology, Bowstreet) has posted an announcement to the W3C
'www-rdf-interest@w3.org' mailing list for the release of Redfoot Version 0.9.0.
"Redfoot is a store/viewer/editor framework for RDF that includes peer-to-peer
communication between stores. It is written in Python by James Tauber and Daniel Krech,
with open source development hosted on SourgeForge. "At present, Redfoot includes:
(1) an RDF database; (2) a query API for RDF with numerous higher-level query functions;
(3) an RDF parser and serializer; (4) a simple HTTP server providing a web interface for
viewing and editing RDF; (5) the beginnings of a peer-to-peer architecture for
communication between different RDF databases. Although the peer-to-peer functionality is
embryonic, the RDF viewing/editing capabilities are of beta quality... In the future,
Redfoot will hopefully include: (1) a full peer-to-peer architecture for discovery of RDF
statements; (2) an inference engine; (3) a fully customizable UI; (4) connectors for
mapping non-RDF data into RDF triples; (5) sample applications built on top of Redfoot.
Redfoot is written in pure Python and is being tested on Python 1.6 and 2.0b1 (soon
2.0b2). Redfoot makes extensive use of callbacks as a means of processing RDF structures
rather than building large temporary data structures in memory. For other details, see the
development documentation.
For related resources, see "Resource
Description Framework (RDF)."
- [October 13, 2000]
JDF Specification Draft Spiral Version 4.0.
A level 4.0 draft specification has been published for the XML-based Job Definition
Format (JDF) and its counterpart, the Job Messaging Format (JMF). JDF is an open,
extensible, XML-based print workflow specification framework. "Four companies
prominent in the graphic arts industry -- Adobe, Agfa, HEIDELBERG, and MAN Roland -- have
united to create this extensible, XML-based format built upon the existing technologies of
CIP3's Print Production Format (PPF) and Adobe's Portable Job Ticket Format (PJTF). JDF
provides three primary benefits to the printing industry. Unlike any previous format, it
has the ability to unify the pre-press, press, and post-press aspects of any printing job.
It also provides the means to bridge the communication gap between production services and
Management Information Systems (MIS). And finally, it is able to carry out both of these
functions no matter what system architecture is already in place, and no matter what tools
are being used to complete the job. In short, JDF is extremely versatile and
comprehensive. JMF messages are most often encoded in pure XML, without an additional
MIME/Multipart wrapper. Only controllers that support JDF job submission via the message
channel must support MIME for messages. Appendix A of the 389-page specification lists a
number of commonly used JDF data types and structures and their XML encoding, based upon
the W3C XML Schema datatypes. Data types are simple data entities such as strings, numbers
and dates. They have a very straightforward string representation and are used as XML
attribute values. Data structures, on the other hand, describe more complex structures
that are built from the defined data types, such as colors..." For references, see "Job Definition Format (JDF)."
For related initiatives, see: (1) Printing
Industry Markup Language (PrintML); (2) PML: Markup Language for Paper
and Printing; (3) XML for
Publishers and Printers (XPP). See also the PrintTalk Consortium and PrintCafe's eProduction eCommerce eXchange (PCX), now
being described primarily as 'a framework for integrating industry standards' supporting
XML-based specifications for the printing and publishing supply chain. The PrintTalk
implementation supports use of the proposed Job Definition Format (JDF) standard for its
job specification semantic and Commercial eXtensible Markup Language (cXML) to define the
business objects; four of thirteen business objects have been defined so far.
- [October 13, 2000]
Tutorials and Reference for XPointer and Extended-XLink.
Jiri Jirat recently announced the availability of
tutorial resources for the W3C XML Linking specifications. These materials are posted on
the Zvon.org web site along with a collection of
related tutorials covering XSLT, SOAP, XUL, CSS, Namespaces, etc. The online XPointer reference
"allows easy access to definitions of locations, errors and functions, with links to
relevant examples in XPointer tutorial. The XPointer
tutorial explains the concepts of XPointer using more than 30 examples. It is aimed at
'ordinary' user, who will use XPointer mainly in the href attribute of XLink. A
tutorial for extended-type
XLink has also been added. The Zvon XLink reference has been
updated with cross-references to examples for extended-type XLink." In this
connection, note Daniel Veillard's reminder that the W3C specifications for XPointer and XLink
are currently in Candidate Recommendation stage at W3C, and that the XML Linking Working
Group is seeking implementation reports for XPointer and XLink. The CR stage "is
dedicated to implementors, and the specifications are allowed to pursue their way toward
the final Recommendation status only if the prerequisite of implementability have been
verified." Implementation feedback for XPointer and Xlink may be sent to the publicly archived
mailing list www-xml-linking-comments@w3.org.
Review comments may also be sent to the XML Linking Working Group co-chairs, Eve Maler and Daniel
Veillard. Some examples of XLink/XPointer implementations are provided in "XML Linking Language."
- [October 13, 2000]
W3C Publishes CSS Mobile Profile 1.0 for Mobile Devices.
W3C has issued CSS
Mobile Profile 1.0 as a working draft to define a subset of CSS2 features that
provides a minimal guarantee of interoperability on mobile devices. Reference: Working
Draft 13-October-2000, by Ted Wugofski
(Phone.com), Doug Dominiak (Motorola), and
Peter StarkEricsson). Document abstract:
"This specification defines a subset of the Cascading Style Sheets Level 2
specification tailored to the needs and constraints of mobile devices." The Working
Draft of the CSS Mobile Profile specification has been published by the W3C CSS Working
Group part of the Style activity. The document
supplies a "profile of the Cascading Style Sheets, level 2 (CSS2) specification
appropriate for mobile devices such as wireless phones. Conformance to this profile means
that a user agent supports, at minimum, the features defined in this specification per the
CSS2 conformance. CSS2 specifies how developers can author style sheets for presenting
documents across multiple devices and media types. While this is very important, it is
also important that authors have an understanding of what features are supported on these
different devices. Likewise, it is important that similar devices operate in a similar
manner. Otherwise, authors will need to develop style sheets for each version of each
device -- raising the cost of content development and decreasing interoperability. The CSS
Mobile Profile specifies a conformance profile for mobile devices, identifying a minimum
set of properties, values, selectors, and cascading rules. The resulting CSS Mobile
Profile is very similar to CSS1." Section 3 provides a tabular summary of CSS Mobile
Profile selector syntax. The CSS Mobile Profile uses the same syntax as specified in CSS2,
with a subset of values; in general, the CSS Mobile Profile uses the same cascading rules
as in CSS2. A CSS Mobile Profile conforming user agent must also be able to process
media-dependent stylesheets as specified in CSS2." For related specifications, see "W3C Cascading Style Sheets."
- [October 12, 2000]
IBM Licenses New XML Technologies.
From an IBM announcement: "IBM today made seven new alpha technologies, including
six based on the XML (eXtensible Markup Language) standard, available for licensing
through alphaWorks, IBM's free, on-line resource for developers. Today's announcement
brings the total number of alpha technologies available for licensing on the alphaWorks
site to 13. The first six were launched together with the IBM licensing initiative in
August. The move has been welcomed by developers who have requested that the free 90-day
trial license model expand to commercial purchase rights. New technologies available for
licensing include Xeena, a visual
XML editor that can be used with XML document type descriptions (DTDs). The popular Xeena
editor was downloaded tens of thousands of times with numerous requests for licenses
through the alphaWorks home page. Other XML technologies now available as a part of
alphaWorks' new licensing initiative include: (1) XML EditorMaker - A
text-based XML document editor. This tool automatically creates visual, Java-based XML
editors that developers can easily use to create and modify XML documents, increasing
development and deployment speed of XML-based documents. (2) XML Productivity Kit, which
allows for rapid integration of XML documents into a Java development environment. (3) XTransGen, which
enables developers to easily define and store the mapping relationship between two XML
document types (DTDs). Once this initial translation is completed, XML documents can be
converted quickly. (4) XML Lightweight
Extractor for defining sources of information for a particular XML document. This
information can be stored and recalled dynamically to populate XML documents with the
appropriate data. It works with any JDBC-compliant relational database. (5) XML Master, a tool for creating custom,
Java-based logic for the manipulation of XML documents. Developers can build programming
frameworks for a particular XML document type and then automatically generate Java code
that can be imported into a Java development environment (e.g., VisualAge for
Java). The seventh new technology is the Remote Method Invocation for Microsoft Internet
Explorer 4.X (RMI for IE4), a package that provides support for the Microsoft JVM (Java
Virtual Machine) not included in older versions of Explorer. . . developerWorks, IBM's free, on-line
collection of content and resources, enables developers worldwide to build better software
and to enhance their technical skills by offering a wealth of tools, tutorials, code,
tips, news, white papers and how-to articles focused on open standards and cross platform
development. Committed to providing the most informative, reliable and accurate technical
information by tapping into IBM and industry leaders; developerWorks content is valuable
to developers regardless of their application development tool of choice. A major
component of developerWorks is alphaWorks,
IBM's emerging technology broker. alphaWorks provides early adopters and innovators direct
access to IBM's 'alpha' technologies through free download and commercial licenses. Both
IBM sites respond to the needs of developers by providing relevant technical content and
cutting-edge emerging technology."
- [October 12, 2000]
IBM's Agent Building and Learning Environment (ABLE).
IBM's Agent Building and Learning
Environment (ABLE) "is a toolkit from the IBM T.J. Watson Research Center for
developing hybrid intelligent software agents and agent applications in Java. The update
provides new neural and Bayesian learning algorithms, GUI enhancements, XML rule parsing,
bug fixes, and documentation on adding custom beans. . . ABLE provides a set of reusable
JavaBean components, called AbleBeans, along with several flexible interconnection methods
for combining those components to create software agents. AbleBeans implement data access,
filtering and transformation, learning, and reasoning capabilities. Function-specific
AbleAgents are provided for classification, clustering, prediction, and genetic search.
Application-specific agents can be constructed using one or more of these AbleBeans.
AbleAgents are situated in their environment through the use of sensors and effectors,
which provide a generic mechanism for linking them to Java applications. A GUI-based
interactive development environment, the Able Agent Editor, is provided to assist in the
construction of AbleAgents using AbleBean components. In the ABLE framework, an agent is
an autonomous software component. It could be running on its own thread of control or
could be called synchronously by another agent or process either through a direct method
call or by sending an event. By combining one or more AbleBeans, agents can be extremely
lightweight (e.g. a few lines of Java code) or can be relatively heavy weight, using
multiple forms of inferencing (e.g. fuzzy rule systems, forward and backward
chaining) and learning (e.g. neural classification, prediction, and clustering). . . ABLE
is meant to make your life easier if (1) you are an application developer, by providing a
set of intelligent beans, and a editor for combining them into agents. (2) you are doing
research on intelligent agents, by providing a flexible Java framework for combining the
ABLE beans with your algorithms or ideas about how agents should be constructed."
- [October 12, 2000]
Proposed OASIS Technical Committee on Entity Resolution.
A recent announcement
released by Karl Best (OASIS - Director,
Technical Operations) describes a proposed 'Entity Resolution' technical committee, to be
formed under the rules of the Technical
Committee Process as announced
in early October. The new committee would continue work begun under the SGML Open
Technical Resolution on Entity Management (entity catalog formats, formal system
identifiers, etc.), updating this work to cover XML. "A new OASIS technical committee
is being formed. The Entity Resolution TC has been proposed by Lauren Wood, SoftQuad
Software Inc.; Norman Walsh, Sun Microsystems; Paul Grosso, Arbortext, Inc.; and John
Cowan, Reuters Health. The request for a new TC meets the requirements of the OASIS TC
process. . . The objective of the Entity Resolution TC is to provide facilities to address
issue A of the OASIS catalog specification (TR 9401). These facilities
will take into account new XML features and delete those features of TR 9401 that are only
applicable to SGML, as well as those features applicable only to issue B in TR 9401.
Deliverables: The Entity Resolution TC will produce a Committee Specification that uses
XML syntax and provides a DTD (potentially also an XML Schema) for that syntax. This
specification will be ready by August 2001. The Entity Resolution TC intends to submit the
Committee Specification as an OASIS Standard after sufficient implementation experience
has been gathered." Note also that the formation of a technical committee for
'Customer Information Quality' was announced in February: "The objective of the
Technical Committee (TC) on Customer Information Quality (CIQ) formed by OASIS is to
deliver XML standards for customer profile/information management to the industry. The
Customer Information Quality TC has been proposed by Ram
Kumar, Cognito, Inc; Vincent Buller, AND Data Solutions; John Bennett, Parlo.com; and
Graham Lobsey, Cognito, Inc." See also the list of active OASIS TCs.
On entity resolution, see the topic "SGML/XML Entity
Types, and Entity Management," and the following section "Catalogs, Formal
Public Identifiers, Formal System Identifiers."
- [October 12, 2000]
Revised IETF/W3C XML-Signature Syntax and Processing Specification.
The IETF/W3C XML Signature Working
Group has issued an updated Last Call Working Draft for the XML-Signature Syntax and
Processing specification. Reference: W3C Working Draft 12-October-2000, edited
by Donald Eastlake, Joseph Reagle, and David
Solo. The document "specifies XML digital signature processing rules and syntax.
XML Signatures provide integrity, message authentication, and/or signer authentication
services for data of any type, whether located within the XML that includes the signature
or elsewhere. Enveloped or enveloping signatures are over data within the same XML
document as the signature; detached signatures are over data external to the signature
element. More specifically, this specification defines an XML signature element type and
an XML signature application; conformance requirements for each are specified by way of
schema definitions and prose respectively. This specification also includes other useful
types that identify methods for referencing collections of resources, algorithms, and
keying and management information. The XML Signature is a method of associating a key with
referenced data (octets); it does not normatively specify how keys are associated with
persons or institutions, nor the meaning of the data being referenced and signed.
Consequently, while this specification is an important component of secure XML
applications, it itself is not sufficient to address all application security/trust
concerns, particularly with respect to using signed XML (or other data formats) as a basis
of human-to-human communication and agreement. Such an application must specify additional
key, algorithm, processing and rendering requirements." Document status: This WD
represents an "update to the second last call version, with an abbreviated last call
termination date of October 20, 2000 (5 weeks in total). This update includes minor
editorial changes, reference to the latest Canonical XML, as well as an adoption of the
latest XML Schema specification. Barring substantive comment, we will request Candidate
Recommendation status as soon as possible following the Canonical XML request. However, we
do wish to ensure that readers are aware of following three substantive changes in the
second last call: (1) We've changed the Reference Processing Model (section 4.3.3.1). to
permit the presentation and acceptance of XML node-sets between Transforms (and resulting
from some URI References) when appropriate; we accomplish this by heavily relying upon the
XPath specification but still do NOT require a conformant XPath implementation. (2) We've
revised the treatment of pre-pended algorithm object identifier within the encoded RSA
SignatureValue by the PKCS1 algorithm (section 6.4.2). (3) We've revised the X509Data
element (section 4.4.4) to clarify the treatment of certificate 'bags' and CRLs within
that structure." See references in "XML Digital Signature (Signed XML
- IETF/W3C)."
- [October 11, 2000]
XML Adoption in the UK's e-Government Interoperability Framework (e-GIF).
One of the three key policy decisions in the UK 'e-GIF' program is identified as
the "adoption of XML as the primary standard for data integration and presentation on
all public sector systems...the adoption of XML (Extensible Mark-up Language) and XSL
(Extensible Stylesheet Language) form the cornerstone of the government data
interoperability and integration strategy." Some details of the "Data
integration policies" are highlighted in the "Policies and technical standards"
section of the e-GIF report: The "UK Government policy is to use: (1) XML and XML
schemas for data integration; (2) UML, RDF and XML for data modelling and description
language; (3) XSL, DOM and XML for data presentation." The model also identifies the
use of GML (Geospatial Markup
Language) as defined by Open Geographic Council. "XML products will be written so
as to comply with the recommendations of the World Wide Web Consortium (W3C). Where
necessary the government will base the work on the draft W3C standards but will avoid the
use of any product specific XML extensions that are not being considered for open
standardisation within the W3C. Centrally agreed XML schemas are approved through the UK
GovTalk processes..." According to an announcement of the plan
by Cabinet Office Minister Ian McCartney, "e-GIF is a key plank in the Government's
drive to get all its services online by 2005 and cut bureaucracy within the public sector.
Speaking at London's QE2 Centre, Mr McCartney launched the e-Government Interoperability
Framework (e-GIF) - a piece of policy which will help IT systems across the whole public
sector to communicate smoothly with each other. There are two main benefits the policy
will bring: (1) Creating 24-hour one-stop Government: e-GIF is key to creating one-stop
Government where services are available 24-hours a day from a single electronic point of
access. For example, the UK online portal - built around e-GIF standards - will offer
services around life episodes, giving the user information they need about a particular
experience such as having a baby or learning to drive. (2) Banishing bureaucracy in
Government: Step-up the red-tape revolution within Government, moving the public sector
away from traditional paper-based ways of working by electronically joining up information
across a range of Government departments and organisations. Again this is built around
e-GIF standards... The main thrust of the framework is to adopt the Internet and World
Wide Web standards for all government systems. There is a strategic decision to adopt XML
and XSL as the core standard for data integration and presentation. This includes the
definition and central provision of XML schemas for use throughout the public sector. The
e-GIF also adopts standards that are well supported in the market place. It is a pragmatic
strategy that aims to reduce cost and risk for government systems whilst aligning them to
the global Internet revolution. Specifying policies and standards in themselves is not
enough. Successful implementation will mean the provision of support, best practice
guidance, toolkits and centrally agreed data schemas. To provide this, the government has
launched the UK GovTalk initiative. This is a Cabinet Office led, joint government and
industry forum for generating and agreeing XML data schemas for use throughout the public
sector... The primary role of the UK GovTalk Group is to promote the production and
management of the XML schemas necessary to support data interoperability requirements of
the e-government strategy. XML schemas will be developed by specialist groups, established
to support specific projects, or by open submission to the UK GovTalk web site either in
response to a Request for Proposals or as an unsolicited proposal. In each case, the UK
GovTalk Group will manage the acceptance, publication, and any subsequent change requests
for the schema. XML schemas that have been accepted by the group will be published on
www.govtalk.gov.uk and will be open for public comment and requests for change. The Portal
Data Schemas Project has been established by the UK GovTalk Group to manage the generation
and timely delivery of the agreed XML data schemas required for government services
delivered through the Portal. The XML data schemas required for the portal services will
be the first outputs of the Portal Data Schemas Project and will be agreed through the
GovTalk processes as a prioritised delivery. The scope of the e-GIF includes
intradepartmental systems and the interactions between: UK Government department and other
UK Government departments, UK Government and wider public sector, UK Government and
foreign governments (UK/EC, UK/US etc), UK Government and businesses world wide, and UK
Government and citizens. UK Government includes central government departments and their
agencies, local government and the devolved administrations. The wider public sector
includes non departmental public bodies (NDPBs) and the National Health Service. The e-GIF
standards are mandated on all new systems. Legacy systems which need to link to the
Government Secure Intranet (GSI), Government Portal (Gateway and UK Online), the Knowledge
Network or other systems, which are part of electronic service delivery, will need to
comply with these standards." For other references, see "e-Government Interoperability
Framework (e-GIF)."
- [October 11, 2000]
Electronic Transactions on Artificial Intelligence (ETAI) Features
"The Semantic Web" Department.
Guus Schreiber (Department of Social
Science Informatics, University of Amsterdam) posted an announcement inviting
submissions for a new area of the electronic journal Electronic Transactions on
Artificial Intelligence (ETAI) entitled "The Semantic Web." The new semantic Web area
"is concerned with modeling semantics of web information, and covers theory, methods,
and applications. . . Tim Berners-Lee coined the vision of a 'semantic web' in which
background knowledge is stored on the meaning or content of web resources through the use
of machine-processable metadata. The semantic web should be able to support automated
services based on these descriptions of semantics. The semantic or "knowledge"
web is seen as a key factor in finding a way out of the growing problems of traversing the
expanding web space, where currently most web resources can only be found through
syntactic matches (e.g., keyword search). This ETAI area is targeted at all
research efforts aimed at constructing, maintaining and using such a knowledge-intensive
information and service web. Not surprisingly, our field is interdisciplinary in its very
nature covering various aspects dealt with in various communities of Artificial
Intelligence and Computer Science. It covers aspects from knowledge engineering, databases
and information systems, knowledge representation, information retrieval, digital
libraries, multi-agent systems, natural-language processing, and machine learning. We
envisage paper submissions falling within at least one of the following categories:
Metadata, knowledge markup, and formal annotations of web information; Information
extraction, automatic and semi-automatic generation of meta data for web information;
Knowledge representation for the web; Generic and heuristic reasoning methods for the web;
Integration of databases in the knowledge web; Interoperability of web services at the
semantic and pragmatic levels; Standard ontologies for content description of web
information; Distributed ontologies, knowledge composition and transformation; Scalability
of knowledge-intensive web services; Content-based information retrieval; Knowledge
retrieval; Tool environments, development methodologies, case studies and applications for
and of the knowledge web; Web-based knowledge management and electronic commerce. ETAI (Electronic Transactions on Artificial Intelligence)
makes submitted articles directly available on-line and promotes public discussions on the
submissions. Each year, accepted articles are also collected in a printed volume, mainly
for library use. Area editors include Dan Brickley (University of Bristol), Dieter Fensel
(Free University of Amsterdam), Yolanda Gil (ISI), Jim Hendler (University of Maryland /
DARPA), Ora Lassila (OKIA), Deborah McGuinness (Stanford University), Robert Meersman
(Free University of Brussels), and Guus Schreiber (University of Amsterdam)." See: "XML and 'The Semantic
Web'."
- [October 11, 2000]
Extensible Programming Language (XPL).
Michael Lauzon posted an announcement
for the development of Extensible Programming Language (XPL). "XPL is an open source
initiative, and is also an application of XML, it is a new programming language that will
be based on XML as an application of XML. XPL is conceived as a framework or meta-language
for defining XML document types which operate as programming languages. [Rationale:] The
practice of programming stands to benefit by exploiting the evident virtues of XML: its
cross-platform availability, its open textual format, its extension over a very large
class of data; structures, and the networking infrastructure available to it. The practice
of XML document exchange stands to benefit by exploiting the public body of
programming-language concepts and applications, by bringing programming architectures into
XML itself; XML is a meta-data language: the goal of XPL is to be a meta-process language.
. . To learn more about it please go to http://www.xplatypus.com/.
The programming language will be partly derived from Miva & Cold Fusion. Interested
parties may consult: (1) the eGroups mailing
list, (2) the XPL draft
specification, (3) the XPL FAQ document,
(4) the [Jonathan Burns'] annotated
version of Paul Prescod's document "Why the Web needs Groves" and (5) the
list of script tags.
| |