|

Cover Pages Archive
|
|
SGML and XML News |
[May 25, 2001]
New OpenTravel Alliance (OTA) Specification Supports
Cross-Industry Availability and Booking Capabilities.
More than 140 leaders representing all aspects of the travel
industry recently met for an OpenTravel Alliance (OTA) Advisory
Forum and approved the release of a new draft customer profile
specification. Version 2001A of the OpenTravel Alliance
Message Specifications document "merges the OTA and
Hospitality Industry Technology Integration Standards (HITIS)
respective customer profiles into one comprehensive profile to
expedite the imminent adoption of this new OTA specification. The
Version 2001A public release also includes a major infrastructure
change that incorporates emerging Extensible Markup Language (XML)
capabilities: it adopts a portion of the messaging structure
developed by the Electronic Business XML (ebXML) Transport, Routing
and Packaging project team. Version 2001B has also been released for
member review; it includes the complete set of revised HITIS
standards for the hotel industry coupled with availability and
booking specifications to serve airline, car rental, package
holidays from consumer to supplier, and travel insurance industries.
The OTA specifications now include the principal specification
document, W3C XML Schema and schema fragments conforming to the W3C
Candidate Recommendation, XML DTDs, UML Model diagrams, and a data
dictionary. The OTA's five working groups, together with an
interoperability committee to coordinate their efforts, are
developing open Internet-compatible messages using XML data terms:
air, car, hotel, leisure supplier, and non-supplier." [Full
context]
[May 25, 2001]
XPath Tester Provides Graphical Test Environment for XPath
Expressions.
Paul Brown (FiveSight
Technologies, Inc.) announced the release of an 'XPath Tester' tool
which provides a graphical test environment for XPath expressions.
XPath Tester "is an XML productivity tool that wraps the XPath
API in XalanJ2. It allows the user to load an XML document, enter a
query, and see the resulting NodeSet match highlighted or the value
displayed. The tool provides a real-time mode that will apply the
query as you type. The Xalan-J2 XSLT Processor [Version 2.1.0] from
the Apache XML Project is used as the underlying XPath engine. The
tool is available for download as a single executable JAR file that
can be run via double-click in Windows or via java -jar
xpathtester_1_x.jar [x replaced by the appropriate
digit)] on any platform." XPath Tester is made publicly
available under the GNU General Public License (GPL). Peter
Flynn nominated the XPath Tester application for [first] prize
as "Most Useful Tool." [Full
context]
[May 24, 2001]
Edge Side Includes (ESI) Specification Receives Broad Support.
Several leading companies recently announced support for Edge
Side Includes (ESI) as an open specification for managing content
transparently across Application Server solutions, Content
Infrastructure, Content Management Systems, and Content Delivery
Network industries. Edge Side Includes (ESI) "accelerates
dynamic Web-based applications by defining a simple markup language
to describe cacheable and non-cacheable Web page components that can
be aggregated, assembled, and delivered at the network edge. The
Edge Side Includes (ESI) specification defines an XML-based markup
language that provides a means to assemble resources in HTTP
clients. Unlike other in-markup languages, ESI is designed to
leverage client tools like caches to improve end-user perceived
performance, reduce processing overhead on the origin server, and
enhanced availability. ESI allows for dynamic content assembly at
the edge of the network, whether it is in a Content Delivery
Network, end-user's browser, or in a 'Reverse Proxy' right next to
the origin server. ESI is primarily intended for processing on
surrogates (intermediaries that operate on behalf of the origin
server, also known as 'Reverse Proxies') that understand the ESI
language. However, its application is not restricted to these
devices. ESI also specifies a content invalidation protocol for
transparent content management across ESI-compliant solutions, such
as application servers and content delivery networks. The ability to
assemble dynamic pages from individual page fragments means that
only non-cacheable or expired fragments need to be fetched from the
origin Web site, thereby lowering the need to retrieve complete
pages and decreasing the load on the Web site's content generation
infrastructure." [Full
context]
[May 22, 2001]
Johns Hopkins and Leading Medical Societies Announce MedBiquitous
Consortium for XML Medical Standards.
A recent announcement from Johns Hopkins University School of
Medicine describes the creation of a new technology consortium for
education and collaboration in the area of medical science.
"Johns Hopkins has joined with many leading professional
medical societies to create the MedBiquitous Consortium, a group
dedicated to creating technology standards and software for
education and collaboration in online medical communities. The
MedBiquitous Consortium is a non-profit international organization
dedicated to creating a comprehensive XML framework for professional
medical societies. The goal is to enable individual citizens,
medical societies, governmental, and industry members to build on
this framework to create thriving online communities and Web-based
services for physicians. The MedBiquitous Consortium's immediate
focus is on defining a set of XML (Extensible Markup Language)
standards and software requirements for communications within and
among professional medical societies and related organizations. The
standards developed by the Consortium are freely available
throughout the industry, thus enabling the development of
interoperable software tools and Web services. The Consortium is
also creating a library of Java software tools for Consortium
members." Key working standards identified by the group include
XML, Java, and UML: "The Consortium will create XML standards
that allow organizations to better exchange data; it will create
modular Java software components and applications based on the XML
standards; it will use UML to architect Java software components and
applications. MedBiquitous will thus provide the vehicle for a
consensus-building process of defining an XML vocabulary for
professional medical societies. The XML vocabulary will support a
wide range of activities including organizational membership and
participation, specialty certification, continuing education, online
meetings, scientific abstracts, journal article submission and
administration, clinical trials, and data registries for tracking
medical outcomes. As is the case with other industries, the creation
of these XML standards will allow the development within
professional medicine of interoperable Web software tools and
services." [Full
context]
[May 22, 2001]
IBM Global Services and IBM WebSphere Platform to Support IBM's
Web Services Infrastructure.
IBM recently announced a wide-scale program of support for
"production-ready infrastructure software and services to
enable Web Services," involving open Internet standards and
secure management of high-volume transactions and integrating
complex business processes. Also announced is systems integration
support from IBM Global Services to help businesses build Web
services applications using IBM's infrastructure software. To
support the growth of business-process integration, projected to
approach $50 billion by 2005, IBM is enabling all of its middleware
infrastructure software with comprehensive support for open Internet
standards to enable the development of Web services applications.
The support spans IBM's entire middleware portfolio, including DB2,
Lotus, Tivoli and WebSphere software. Software
for developing, publishing, hosting and deploying Web services
includes WebSphere Application Server Version 4, which supports the
variety of open standards and technologies necessary to develop,
publish and deploy Web services applications, including: Universal
Description Discovery and Integration (UDDI); Simple Object Access
Protocol (SOAP); Java2 Enterprise Edition (J2EE); Web Services
Description Language (WSDL); and enhanced integration of leading XML
technologies. Software enabling Web Services applications to manage
and access data includes DB2/XML Extender, which enables Web
services applications to access data stored in DB2 or any other
database, providing businesses with greater ease and efficiency in
accessing, analyzing and managing information across multiple
computing systems. Software for availability, performance and
security of web services applications involves the Tivoli Manager
for WebSphere Application Server, which provides a single point of
control to enable businesses to manage the distributed environments
that use WebSphere Application Server to host Web services
applications. Software for enabling web collaboration, knowledge
management, and e-learning includes the Lotus Web Services
Enablement Kit with the Lotus Domino Application Server; the
collaboration, workflow, and messaging capabilities of Domino can be
accessed as Web services by adding SOAP interfaces and WSDL
descriptions to new or existing Domino applications. The built-in
support for Web, XML, and Java technology provides a straightforward
approach for enabling custom Domino applications as Web services,
with workflow, knowledge discovery system, and instant
messaging." IBM is also developing an XML-based 'Web
Services Flow Language (WSFL)' which describes how Web services
may be composed as 'usage patterns' and 'interaction patterns' into
new Web services to support business processes. [Full
context]
[May 22, 2001] IBM XML Schema Quality Checker
Supports the W3C XML Schema Recommendation.
A communiqué from Bob
Schloss (IBM Research) reports on the availability of an updated
IBM XML Schema Quality Checker tool from IBM alphaWorks. The new
version of this downloadable tool (Version 1.0.17, 05/21/2001)
assists users who are creating XML Schemas conforming to the May 2,
2001 W3C Recommendation. The updated release "contains fixes
bugs, adds a checker to verify that identity constraint definitions
(Key, KeyRef and Unique) are consistent with the type of the element
declaration where they appear, and updates the default stylesheet
used to view the error reports. The XML Schema Quality Checker is a
Java program which takes as input an XML Schema written in the W3C
XML schema language and diagnoses improper uses of the schema
language. Where the appropriate action to correct the schema is not
obvious, the diagnostic message may include a suggestion about how
to make the fix. For XML Schemas which are composed of numerous
schema documents connected via <include>, <import>,
or <redefine> element information items, a full
schema-wide checking is performed. The tool can also be run in batch
mode to quality-check multiple XML schemas in a single run."
Schloss reports that the team is continuing to work on more complete
checking of the consistency of identity-constraint definitions and
on additional improvements; they welcome feedback and suggestions.
The tool has been produced by the IBM XML Application Development
team, including Achille Fokoué, Bob Schloss, Tom Gallivan, and
Roberto Galnares. [Full
context]
[May 21, 2001]
Ontopia Knowledge Suite Features Topic Map Engine and TM
Navigator Framework.
A product announcement from Ontopia AS describes the commercial
release of the Ontopia Knowledge Suite Version 1.0, with support for
the ISO 13250 and XML Topic Map (XTM) specifications. "The
Ontopia Knowledge Suite (OKS) version 1.0 comprises a full-featured
Topic Map Engine written in 100% Java, and the Ontopia Topic Map
Navigator, a framework for building web-based topic map delivery
applications. The new Navigator can be tested online or downloaded
free. The OKS will become a complete suite of tools for managing and
using topic maps, as new components are released during the coming
months. Topic Mapping is a new paradigm for organizing, retrieving,
and navigating information resources. Through the provision of a
'knowledge layer' that is independent of the information resources
themselves, topic maps help capture and manage corporate memory,
improve indexing, and enable the integration of information that
spans multiple, disparate repositories. Topic Maps are an
international standard, defined by the ISO, and can also be
interchanged using the Extensible Markup Language (XML) defined by
the W3C. Ontopia AS, a providers of advanced Topic Map technology
and co-creators of the Topic Map standards, offers technology
supporting added value to web portals, content management systems,
knowledge-based intranet applications, and enterprise application
integration." [Full
context]
[May 21, 2001]
BOX Tool Generates XML DTDs and Vector Graphics Diagrams from
UML/XMI.
A posting from Christian
Nentwich announces the release of a software tool called BOX
('Browsing Objects in XML') which "reads UML models in XMI and
exports the contained diagrams in vector graphics form, including
SVG and VML. The BOX tool includes, amongst other things, (1) An
implementation of the UML metamodel [mainly Foundation/Core, not
behavioral packages], in the uml package; (2) A parser for
XMI; (3) An additional parser for diagram information specific to
the Unisys exporter, in the unisys package; (4) Several
exporters in the export package, which you have to manually
call at the moment; (5) Heuristics for reconstructing diagrams from
the rather poor information made public by the exporter; (6) Sample
UML models." BOX was written for a research project in
1998-2001; though currently unmaintained and underdocumented, it is
licensed as free software under the GNU General Public License. A
research paper on 'Browsing Objects in XML' from 1999 describes BOX
as a "a portable, distributed, and interoperable approach to
browsing UML models with off-the-shelf browser technology; its
approach to browsing UML models leverages XML and related
specifications, such as the Document Object Model (DOM), the XML
Metadata Interchange (XMI) and a Vector Graphic Markup Language
(VML). BOX translates a UML model that is represented in XMI into
VML. BOX has been successfully evaluated in two industrial case
studies which used BOX to make extensive domain and enterprise
object models available to a large number of stakeholders over a
corporate intranets and the Internet. We discuss why XML and the BOX
architecture can be applied to other software engineering notations
and argue that the approach taken in BOX can be applied to other
domains that already started to adopt XML and have a need for
graphic representation of XML information. These include browsing
gene sequences, chemical molecule structures, and conceptual
knowledge representations." [Full
context]
[May 20, 2001]
OOPSLA Workshop on 'Objects, <XML> and Databases'.
A communiqué from Akmal B.
Chaudhri announces a call for papers in connection with the
OOPSLA 2001 Workshop on 'Objects, <XML> and Databases', to be
held October 14/15, 2001 in Tampa Bay, Florida. The workshop will be
hosted in conjunction with the OOPSLA
2001 Conference (ACM Conference on Object-Oriented Programming,
Systems, Languages, and Applications). Topics of interest for the
workshop include "the use, management and delivery of XML, web
and semi-structured data using object technology and database
systems; for example, the efficient storage, retrieval and querying
of XML from database systems, application and database evolution
issues relating to changes in structure of XML documents, case
studies or experience reports, benchmarks and performance, early
adopter experiences, standards issues, etc. The goal is to bring
together academics, practitioners, users and vendors to discuss the
use and synergy between Objects, XML and Databases. Many commercial
systems built today are increasingly using these technologies
together and it is important to understand the various research and
practical issues. The wide range of participants will help the
various communities understand both specific and common problems...
For many organizations already using object-orientation with
database management systems, XML data adds a new dimension that
brings considerable flexibility and promise, but also adds new
uncertainties and issues as to how to effectively manage that data.
The recent trend towards XML servers, native XML databases and
support for XML in existing relational databases is a testimony to
the importance of this issue for the vendor community as well."
[Full
context]
[May 19, 2001]
Workshop on E-Business and the Intelligent Web.
An updated workshop schedule is available for the IJCAI 2001
Workshop on E-Business and the Intelligent Web, to be held in
Seattle, WA, August 5, 2001 as part of the Seventeenth International
Joint Conference on Artificial Intelligence. An invited paper on
"Standardizing XML Rules" will be presented by Benjamin N.
Grosof of the MIT Sloan School of Management. The workshop is set
against the backdrop of two key trends: "(1) The e-business
community is creating new infrastructures to support high-level
business-to-business and business-to-consumer activities on the web.
(2) The so-called 'semantic web' movement is developing a range of
proposals aimed at supporting intelligent information-seeking and
information-management operations over the web. In the first trend,
the main foci are on defining a new generation of electronic data
interchange protocols, mostly based on XML (prominent initiatives
include BizTalk, eCo, OASIS, and RosettaNet) and on creating new
kinds of e-business services such as agent-mediated B2B e-commerce,
and knowledge-driven customer relationship management. In the second
trend, the emphasis is on enriching the web's data markup languages
with knowledge representation features, to permit inference over the
content of web pages (prominent initiatives include DAML, OIL, and
RDF). Its goals include the production of internet-scale inference
mechanisms, knowledge markup languages, and active
information-seeking services. The goal of the workshop is to bring
together members of the e-business and intelligent web communities
to consider the current state of the two fields, and identify
strengths, weaknesses, opportunities, and threats in the interaction
between the two areas." [Full
context]
[May 19, 2001]
W3C XML Key Management Services Workshop.
W3C has announced a call for papers and registration in
connection with a workshop on XML Key Management Services, to be
held July 19, 2001 in Redwood City, California. The goal of the
workshop is "to consider the requirements for simple key
resolution and trust services for XML security applications, the
degree to which the XKMS specification satisfies those requirements,
and to determine if there is sufficient focus and interest to
propose a W3C activity in this area." The workshop will focus
upon 'questions of key trustworthiness' which hitherto have not been
in scope for W3C activities under the XML Signature and XML
Encryption work. The workshop organizers believe the broader
questions of trust not yet fully accounted for in W3C activity,
including confidence in a key, "are critical to secure XML
applications, reliable XML protocols and trusted Web services.
Topics likely to be discussed at this workshop include: (1) XML
Security application key management requirements (2) Dependencies
upon XML Protocol, Web services, XML Query, Semantic Web, and
transport protocols [e.g., HTTP]; (3) Security considerations
resulting from a specification; (4) The scope of any resulting W3C
Activity." Workshop attendance will be limited to forty-five
(45) participants, based upon the quality of position papers
submitted and [preference for] membership in a W3C Working Group. [Full
context]
[May 19, 2001]
RELAX Core Approved as an ISO Technical Report.
A posting from Murata
Makoto (Project Editor of RELAX Core at ISO/IEC JTC1) reports on
the result of an ISO/IEC JTC1 ballot of May 10, 2001 which approved
the RELAX Core specification as ISO/IEC Technical Report 22250-1.
RELAX Core is part of the larger RELAX [Regular Language Description
for XML] specification, which currently includes RELAX Namespace,
and will later have RELAX Data. RELAX Core is "a simple
language for describing XML tag sets, viz., for describing
markup languages containing a single XML namespace; the RELAX
Namespace is used to describe markup languages containing more than
a single XML namespace consisting of more than one RELAX Core
document." RELAX Core "was originally created as a JIS TR
and was then submitted to ISO via the fast-track procedure. The
final text will be prepared in an upcoming meeting in August 2001.
The latter half, RELAX Namespace, has been approved as JIS TR and
will soon be submitted to ISO." In a parallel universe, RELAX
Core and TREX (Tree Regular Expressions for XML) are to be unified,
since the two are very similar as structure-validation languages.
The unified TREX/RELAX language will be called RELAX NG [for
"Relax Next Generation," pronounced "relaxing"].
This design work is now being conducted within the OASIS TREX
Technical Committee, where a (first) specification is expected by
July 1, 2001. The OASIS TC may also be renamed 'RELAX NG' to reflect
the new name of the unified TREX/RELAX language. The RELAX NG
development team plans to submit the OASIS specification to ISO,
given the importance of ISO standards in Europe. RELAX NG progress
may be tracked via the mailing
list archives or by comparing the issues
list with the original TREX tutorial and specification. [Full
context]
[May 17, 2001]
W3C Publishes Last Call Working Draft for XML Inclusions
(XInclude).
The W3C XML Core Working Group has announced the release of a
last call working draft specification for XML
Inclusions (XInclude) Version 1.0. The working draft
document "specifies a processing model and syntax for general
purpose inclusion. Inclusion is accomplished by merging a number of
XML Infosets into a single composite Infoset. Specification of the
XML documents (infosets) to be merged and control over the merging
process is expressed in XML-friendly syntax (elements, attributes,
URI References)." The working group invites comment on this
specification during the last call review period which begins May
16, 2001 and ends June 5, 2001. According to background information
supplied by Arnaud Le Hors and Paul Grosso (Chairs of the XML Core
WG), the XML Core WG "is not aware of an issue with this WD
document; the list of issues that have been raised and resolved
while elaborating the specification can be found in the accompanying
issues document." [Full
context]
[May 17, 2001]
SRML Proposed as an XML Rule Representation for Java Rule
Engines.
A communiqué from Margaret
Thorpe (ILOG, S.A.) announces the development SRML (Simple Rule
Markup Language) which the designers believe may be of wider
interest. SRML describes a "generic rule language consisting of
the subset of language constructs common to the popular
forward-chaining rule engines. Because it does not use constructs
specific to a proprietary vendor language, rules specified using
this XML DTD can easily be translated and executed on any conforming
rule engine, making it useful as an interlingua for rule exchange
between Java rule engines. The Java Rule Engine API specified in
JSR-000094 defines generic API support for parsing rulesets that are
represented using XML, but does not define a rule language standard.
Thus, while JSR-094 will prescribe that some kind of 'content
language' is made available to the engine via XML, and will provide
some kind of 'content handler' to process these XML rule documents,
a rule language is still required in order to provide a Java rule
engine with rules via this API." A preliminary XML DTD
describing the structure of SRML has been made available by the
authors. As defined in this DTD, a ruleset "is the
root element of the SRML XML document, and it consists of a list of rules.
Rules have a condition part [conditionPart] and an action
part [actionPart], and the condition part must have at
least one condition. Conditions are composed of test
expressions, and can be simple conditions or not conditions."
Etc. [Full
context]
[May 14, 2001]
W3C Publishes XML Information Set Specification as a W3C
Candidate Recommendation.
The XML Information Set specification ('Infoset') has been
advanced to the status of W3C Candidate Recommendation, and
implementation experience is being sought by the WG members. The
specification provides an abstract data set via a set of definitions
"for use in other specifications that need to refer to the
information in an XML document." The Infoset "does not
attempt to be exhaustive: the primary criterion for inclusion of an
information item or property has been that of expected usefulness in
future specifications. Nor does it constitute a minimum set of
information that must be returned by an XML processor... The XML
Information Set does not require or favor a specific interface or
class of interfaces. The specification presents the information set
as a modified tree for the sake of clarity and simplicity, but there
is no requirement that the XML Information Set be made available
through a tree structure. Other types of interfaces, including (but
not limited to) event-based and query-based interfaces are also
capable of providing information conforming to the XML
Information." The W3C XML Core Working Group invites comments
on the Infoset CR through 15-June-2001, at which time it expects to
have sufficient evidence of implementation to advance the
specification to a Proposed Recommendation. [Full
context]
[May 14, 2001]
Microsoft Hosts Online XQuery Prototype Application.
A posting from Michael Rys
(Microsoft Program Manager, SQL Server XML Technologies) announces
that the XQuery Prototype demonstrated at XML DevCon in New York has
been placed online for public use. "The goal of the prototype
implementation is to follow the public working drafts of the W3C XML
Query working group while trying to avoid 'inside' knowledge about
how something is supposed to work. The prototype currently
[2001-05-14] follows the February 15, 2001 W3C
XQuery working draft and will be updated to the next working
draft within weeks after the next working draft's publication. The
web site allows you to formulate XQueries and a subset of a proposed
XQuery-compatible data manipulation language and parse and execute
the former, but currently only allows one to parse the later. Since
the demo is provided via a website, we provide a set of predefined
XML documents and disallow the use of user-specified documents for
security reasons. In addition, the site offers a set of compliance
tests that can be used to check the syntax for the XQuery parser.
Since the tests are automatically generated based on the syntax,
some of the statements may not have meaningful semantics... The
prototype is implemented in C# and is currently only available via
the website; future downloadable implementations of XQuery are
planned for later technology previews of some of the XML
technologies." [Full
context]
[May 14, 2001]
Updated IBM Web Services ToolKit Contains Private UDDI Registry.
An updated release of the IBM Web Services ToolKit (WSTK version
2.3) contains a Private UDDI Registry and enhancements to the WSDL
Generation Tool in order to support COM. IBM Web Services ToolKit
provides "a runtime environment as well as demo/examples to
design and execute web-service applications to find one another and
collaborate in business transactions without programming
requirements or human intervention. The distribution includes a full
web services application 'Gourmet2Go', an Aggregation demo of web
services, and some tools that are helpful to develop and deploy Web
Services. Extensive documentation is included in the toolkit to
assist developers with the basic concepts of web services. The
toolkit includes a fully functioning web services client API that
can be used to directly access a UDDI registry." The version
2.3 WSTK release is compliant with the WSDL 1.1 specification; it
also supports SOAP encryption, UDDI4B (a UDDI for Browser), and
Digital Signature handler. [Full
context]
[May 14, 2001]
Final ebXML Specifications Approved in Vienna.
An announcment from UN/CEFACT
and OASIS reports on the
meeting of 11-May-2001 in which ebXML participants approved a core
set of electronic business data specifications. From the
announcement: "ebXML, which began as an 18-month initiative
sponsored by UN/CEFACT and OASIS, is a modular suite of
specifications that enables enterprises of any size and in any
geographical location to conduct business over the Internet. Using
ebXML, companies now have a standard method to exchange business
messages, conduct trading relationships, communicate data in common
terms and define and register business processes. Approved
specifications include ebXML Technical Architecture, Business
Process Specification Schema, Registry Information Model, Registry
Services, ebXML Requirements, Message Service and
Collaboration-Protocol Profile and Agreement. Accepted ebXML
Technical Reports include: Business Process and Business Information
Analysis Overview, Business Process Analysis Worksheets &
Guidelines, E-Commerce Patterns, Catalog of Common Business
Processes, Core Component Overview, Core Component Discovery and
Analysis, Context and Re-Usability of Core Components, Guide to the
Core Components Dictionary, Naming Convention for Core Components,
Document Assembly and Context Rules, Catalogue of Context Drivers,
Core Component Dictionary, Core Component Structure and Technical
Architecture Risk Assessment. Adoption, implementation and
maintenance of the ebXML specifications will be conducted by
UN/CEFACT and OASIS under the auspices of a Memorandum of
Understanding, which was signed by the two organizations in Vienna.
Coordination of this work will be achieved through a joint UN/CEFACT
and OASIS management committee." [Full
context]
[May 10, 2001]
University of Washington Tukwila Data Integration System.
The University of Washington Database Research Group is
developing a 'Tukwila' system which "uses adaptive query
processing techniques to efficiently deal with processing
heterogeneous, XML-based data from across the Internet. The data
integration system depends upon a mediated schema to
represent a particular application domain and data sources are
mapped as views over the mediated schema. The user asks a query over
the mediated schema and the data integration system reformulates
this into a query over the data sources and executes it. The system
then intelligently processes the query, reading data across the
network and responding to data source sizes, network conditions, and
other factors. The Tukwila data integration system is designed to
scale up to the amounts of data transmissible across intranets and
the Internet (tens to hundreds of MBs), with large numbers of data
sources. The Tukwila data integration system is designed to support
adaptivity at its core using a two-pronged approach. A highly
efficient query reformulation algorithm, MiniCon, maps the input
query from the mediated schema to the data sources. Next,
interleaved planning and execution with partial optimization are
used to allow Tukwila to process the reformulated plan, quickly
recovering if decisions were based on inaccurate estimates. The
system provides integrated support for efficient processing of XML
data, based on the x-scan operator. X-scan efficiently
processes non-materialized XML data as it is being received by the
data integration system; it matches regular path expression patterns
from the query, returning results in pipelined fashion as the data
streams across the network. XML provides a common encoding for data
from many different sources; combined with standardization of
schemas (DTDs) across certain domains, it greatly reduces the needs
for wrappers and even query reformulation. The latest versions of
Tukwila are built around an adaptive query processing architecture
for XML, and can seamlessly combine XML and relational data into new
XML content." [Full
context]
[May 10, 2001]
University of Bergen Hosts MLCD Project: Markup Language for Complex
Documents.
The Center for Humanities Information Technology at the
University of Bergen has announced support for a two-year research
project focused on markup for "complex" documents. The
main goal of the MLCD Project (Markup Language for Complex
Documents) is to "lay the theoretical foundation for a better
system of representation for complex textual phenomena than can be
found in today's SGML- and XML-based systems; the project will also
lay a foundation for software development, with an eye to web-based
delivery." Among the issues dealt with in MLCD: the
well-attested phenomenon of "overlap" (non-hierarchical
structures), handled by various methods in SGML and XML, e.g.,
'store both structures, filter into SGML; use the SGML CONCUR
Feature; use marked sections and entity declarations; use [TEI]
milestones [= empty, asynchronous elements]; model as fragmentation;
use stand-off markup'; etc. The MLCD project builds upon published
research from C. M. Sperberg-McQueen and Claus Huitfeldt, especially
"GODDAG,
MECS, and TexMECS (an experimental markup meta-language for complex
documents). MECS (Multi-Element Code System) "was developed by
Claus Huitfeldt in connection with the work of the Wittgenstein
Archive at the University of Bergen. MECS has many similarities to
SGML-based systems, but distinguishes itself from them in that it
has a simpler notation and a well-defined concept of well-formedness
as a property separate from that validity. MECS thus anticipates
many of the ways in which XML has modified the rules of SGML; in
addition, MECS allows non-hierarchical structures in the form of
overlapping elements. The MLCD project intends to define a system
which combines the best of SGML/XML and MECS. A notation for such a
system has already been designed, and a data structure has been
sketched out. The project will work to complete the specification of
the data structure and to develop some method of specifying document
grammars." [Full
context]
[May 08, 2001]
DOMMITT Online Diff and Merge Utility for XML.
Dommitt Inc. (Cupertino, CA) has announced an online utility for
XML diff/merge and XML DTD generation. The tool allows a user to
find changes in XML documents in three easy steps, aided by HTML
forms: "upload the files, view the differences between them,
and generate a DTD to validate the new XML file with changes."
Supporting documentation at the website demonstrates the differences
between the Dommitt online tool and related software tools in the
market (e.g., IBM alphaWorks XML
Diff and Merge utility, Sun's diffmk
perl tool, and DeltaXML).
DOMMITT produces an XML document annotated with edit operations
(insert, delete and update operations). DOMMITT has its roots in
research directed towards diffing snapshots of object databases.
Such snapshots are inherently tree structured. The algorithms
stemming from this research similar to the LCS algorithm used by the
UNIX diff utility, but LCS does not comprehend tree structure. So
variants of this approach were used to adapt the algorithm to this
new domain. DOMMITT Inc. is committed to delivering high-performance
and scalable solutions in the XML space. Making these tools easy to
use is also a priority. DOMMITT is structured to allow peer-to-peer
XML document comparisons. This essentially implies the
computationally intensive diff operation is scaled across machines
that are registered in the peer network." [Full
context]
[May 08, 2001]
IMS Educational Consortium Releases Content Packaging 1.1
Specification.
IMS, "a global consortium of more than 300 educational
institutions, commercial entities and government agencies"
recently announced the publication of the IMS Content
Packaging 1.1 Specification that forms part of the core IMS
standard for exchanging e-learning content. "The IMS Content
Packaging Specification provides the functionality to describe and
package learning materials, such as an individual course or a
collection of courses, into interoperable, distributable packages.
Content Packaging addresses the description, structure, and location
of online learning materials and the definition of some particular
content types. The Content Packaging Specification is aimed
primarily at content producers, learning management system vendors,
computing platform vendors, and learning service providers. Learning
materials described and packaged using the IMS Content Packaging XML
format should be interoperable with any tool that supports the
Specification. Content creators can develop and distribute material
knowing that it can be delivered on any compliant system, thereby
protecting their investment in rich content development. IMS uses
XML as its current binding, and XML-Schema as its primary XML
control document language. Some IMS bindings use parts of other IMS
XML bindings; for example, the Content Packaging specification also
uses the IMS Meta-Data." The
Managed Learning System Research and Development Lab has
announced support for IMS, along with other XML-based educational
standards designed for interoperability. [Full
context]
[May 05, 2001]
Visio 2002 Incorporates XML Support with XML for Visio Format.
Microsoft Corporation has released a new version of its Visio
2002 drawing and diagramming software to manufacturing companies
with new XML capabilities and support for Component Object Model
(COM) add-ins. "Based on Extensible Markup Language syntax, the
XML for Visio format allows developers to create tools for searching
and indexing, revision tracking, and version control. This new
format gives Visio data synergy with new Web tools, better archival
data format, and an open exchange of Visio data with other
XML-enabled applications. XML for Visio is capable of describing
Visio data with as much fidelity as VSD format, and is intended to
provide open access to data in Visio and to enable more open
exchange of Visio data with other applications. In addition, by
using the 'Save as Web Page' feature, users can publish Visio
drawings to the Web and at the same time generate a XML for Visio
file that will permit round-tripping-re-opening the file in Visio
without loss of data. The XML for Visio format requires considerable
complexity; however, developers who have a knowledge of Visio (the
Automation and ShapeSheet models in particular) will have no trouble
understanding the format and, with a reasonable amount of effort,
writing software that uses it. The XML for Visio schema is organized
so that individual shapes and masters can be represented without
requiring the complete document structure. In practice, this means
that Visio can both emit individual objects as XML for Visio data
and read in and merge such objects. XML for Visio is extensible with
SolutionXML; Visio solutions can save data in XML format into the
XML for Visio stream and access it using standard XML tools
externally to Visio, or from within Visio at runtime." [Full
context]
[May 05, 2001]
Visio 2002 Incorporates XML Support with XML for Visio Format.
Microsoft Corporation has released a new version of its Visio
2002 drawing and diagramming software to manufacturing companies
with new XML capabilities and support for Component Object Model
(COM) add-ins. "Based on Extensible Markup Language syntax, the
XML for Visio format allows developers to create tools for searching
and indexing, revision tracking, and version control. This new
format gives Visio data synergy with new Web tools, better archival
data format, and an open exchange of Visio data with other
XML-enabled applications. XML for Visio is capable of describing
Visio data with as much fidelity as VSD format, and is intended to
provide open access to data in Visio and to enable more open
exchange of Visio data with other applications. In addition, by
using the 'Save as Web Page' feature, users can publish Visio
drawings to the Web and at the same time generate a XML for Visio
file that will permit round-tripping-re-opening the file in Visio
without loss of data. The XML for Visio format requires considerable
complexity; however, developers who have a knowledge of Visio (the
Automation and ShapeSheet models in particular) will have no trouble
understanding the format and, with a reasonable amount of effort,
writing software that uses it. The XML for Visio schema is organized
so that individual shapes and masters can be represented without
requiring the complete document structure. In practice, this means
that Visio can both emit individual objects as XML for Visio data
and read in and merge such objects. XML for Visio is extensible with
SolutionXML; Visio solutions can save data in XML format into the
XML for Visio stream and access it using standard XML tools
externally to Visio, or from within Visio at runtime." [Full
context]
[May 05, 2001]
XSLToolbox Supports XML Application Interoperability.
Joshua Lubell (NIST) has
announced the creation of an open source toolkit designed to
"help developers avoid the drudgery of writing the complicated
XSLT transforms often needed to integrate XML applications."
The software has been developed within NIST's Manufacturing
Systems Integration Division (MSID). The 'XSLToolbox' toolkit
"currently contains two tools, both written in XSLT: (1) APEX
is an application for transforming XML documents as specified by
architectural forms; (2) ATTS is a stylesheet generator for adding
default attribute values to XML documents. Unlike some other XSLT
libraries, the XSLToolbox is specifically geared toward XML data
exchange between applications rather than conversion of XML to
human-readable data formats." APEX as a generic architecture
engine processes "architecture support attributes which
describe how its elements, attributes, and data correspond to their
architectural counterparts governed by the architecture's
architectural forms. [This allows the designer to:] extend XML
vocabularies without breaking existing applications; to create
architecture-specific document views, retaining only relevant markup
and character data while hiding all other content; and to promote
data sharing between user communities with inconsistent
terminologies by enabling the substitution of identifier names and
by allowing simple document transformations. Input to APEX consists
of an XML document plus stylesheet parameters identifying an
architecture used by the document. APEX produces as output an
architectural document, i.e., an XML document containing only
the markup and data defined by the architecture specified. ATTS
provides an XSLT meta-stylesheet for assigning default attribute
values to an XML document. ATTS is intended for XML applications
lacking a convenient method, such as a DTD (document type
definition), for specifying attribute defaults. Since the attributes
that control architectural processing are usually fixed, ATTS can be
used to supply their values. ATTS is therefore handy for DTD-less
applications using APEX." [Full
context]
[May 04, 2001]
Experimental Implementation for W3C XML Encryption Specification.
A posting from Takeshi Imamura
(Tokyo Research Laboratory, IBM Research) reports on the
availability of an experimental implementation of [W3C]
XML Encryption Syntax and Processing Version 1.0. Support is
implemented in the updated version of the XML Security Suite from
alphaWorks. The IBM XML Security Suite "provides security
features such as digital signature, element-wise encryption, and
access control to Internet business-to-business transactions. The
new experimental support for the W3C XML Encryption proposal enables
one to encrypt/decrypt arbitrary binary data, an XML element, or its
content. The XML Schema definition of XACL [XML Access Control
Language] syntax is introduced." Also featured in the IBM
implementation is W3C/IETF XML-Signature
support conforming to the new Candidate Recommendation of
19-April-2001. [Full
context]
[May 04, 2001]
Proposed URN Namespace for Public Identifiers.
A posting from Norman Walsh
contains the text of an IETF Network Working Group Internet-Draft
which the authors believe "resolves all outstanding issues with
respect to the request for a 'publicid' NID." The draft A
URN Namespace for Public Identifiers ('draft-urn-publicid-03,
May 4, 2001) is authored by Norman
Walsh (Sun Microsystems, Inc.), John
Cowan (Reuters Health Information), and Paul
Grosso (Arbortext, Inc.). The draft "describes a URN
namespace that is designed to allow Public Identifiers to be
expressed in URI syntax." From the document Introduction:
"XML external entities have two identifiers: a system
identifier and a public identifier. The system identifier is a URI,
by definition, but the public identifier is simply a string.
Historically, the system identifier of an external entity has been a
local, or system-specific identifier while the public identifier has
been a more global, persistent name. Unfortunately, public
identifiers do not fit neatly into the existing web architecture
because they are not legal URIs. Many new specifications (XSLT, XML
Schema, etc.) have the implicit or explicit requirement that all
external identifiers be URIs. The purpose of this namespace is to
allow public identifiers to be encoded in URNs in a reliable,
comparable way. This document describes a scheme for representing
public identifiers as URNs by introducing a public identifier
namespace, 'publicid'. This namespace specification is for a formal
namespace." [Full
context]
[May 03, 2001]
W3C XML Schema Published as a W3C Recommendation.
The World Wide Web Consortium (W3C) has announced the publication
of the W3C XML Schema specification as a W3C Recommendation. A W3C
'Recommendation' "indicates that a specification is stable,
contributes to Web interoperability, and has been reviewed by the
W3C Membership, who are in favor of supporting its adoption by
academic, industry, and research communities. XML Schemas define
shared markup vocabularies, the structure of XML documents which use
those vocabularies, and provide hooks to associate semantics with
them. With over two years of development and testing through
implementation, XML Schema provides an essential piece for XML to
reach its full potential. The XML Schema specification consists of
three parts. One part defines a set of simple datatypes, which can
be associated with XML element types and attributes; this allows XML
software to do a better job of managing dates, numbers, and other
special forms of information. The second part of the specification
proposes methods for describing the structure and constraining the
contents of XML documents, and defines the rules governing
schema-validation of documents. The third part is a primer, which
explains what schemas are, how they differ from DTDs, and how
someone builds a schema. XML Schema introduces new levels of
flexibility that may accelerate the adoption of XML for significant
industrial use. For example, a schema author can build a schema that
borrows from a previous schema, but overrides it where new unique
features are needed. XML Schema allows the author to determine which
parts of a document may be validated, or identify parts of a
document where a schema may apply. XML Schema also provides a way
for users of ecommerce systems to choose which XML Schema they use
to validate elements in a given namespace, thus providing better
assurance in ecommerce transactions and greater security against
unauthorized changes to validation rules. Further, as XML Schema are
XML documents themselves, they may be managed by XML authoring
tools, or through XSLT." [Full
context]
[May 03, 2001]
papiNet NA Established as Standards Initiative for the Paper
Supply Chain.
A joint announcement from the American Forest and Paper
Association (AF&PA) and the Graphic Communications Association
(GCA) describes the creation of a new paper supply chain initiative
involving a leadership structure and standards process for the North
American pulp, paper and paperboard industry. This 'papiNet NA'
initiative is part of an international effort to "develop
e-business processes and standards for the total paper supply
chain" and represents collaborative effort between the European
Paper Consortium (EPC) and papiNet NA (the North American Group).
"The goal is a single set of unified, international XML-based
e-business standards designed to improve the efficiency and accuracy
of transactions throughout the supply chain, while reducing the cost
of operations. The work of papiNet NA will support the ongoing
papiNet standards effort begun in 2000, which focuses on
publication, fine and packaging grades of paper, paperboard and
their supply chains. Five standards have been developed and are in
the implementation stage, including purchase order, order
confirmation, call-off (delivery schedule), delivery message and
invoice." Relevant XML DTDs are accessible from the papiNet web
site. [Full
context]
[May 03, 2001]
Sun XML Datatypes Library Supports W3C XML Schema.
A communiqué from Eduardo
Gutentag reports on the availability of the 'Sun XML Datatypes
Library'. Developed by Kohsuke
Kawaguchi, the datatypes library is Sun's implementation of
W3C's XML Schema Part 2 intended for use with applications that
incorporate XML Schema Part 2. It can be used from any Java code to
validate strings with datatypes and to convert strings into Java
objects. The preview version 1 of 'April 2001' implements the
proposed recommendation version of the W3C XML Schema Part 2
Datatypes. The distribution of the XML Datatypes Library includes a
sample class file src/com/sun/tranquilo/datatype/CommandLineTester.java
provided "as a guide for implementing your own Java classes
with the Datatypes Library." Documented examples include
validating a string with an integer datatype, deriving a new type
from an existing DataType object, and diagnosing errors. The library
distribution includes software developed by the Apache Software
Foundation; its use requires JDK 1.3. [Full
context]
More Information on Robin
Cover's XML pages at xml.coverpages.org
IMPORTANT NOTE REGARDING CONTENT OF
LINKED WEB SITES |