Tim Bray, one of XML's creators,
talks about HTML's successor
Interview by Wolf-Rüdiger Hansen - Director of IT Press Relations at
Software AG;
extract from Software Report no. 47, March 1999
The Web is the greatest system ever invented for delivering information to
humans. Spanning the globe, it has become a standard means for consumers and enterprises
to access, transmit and distribute all kinds of information. This information is based on
the use of HTML (hypertext markup language), which has been a huge success. But the limits
of HTML have become visible and a new standard is on the rise: XML (extensible markup
language).
Tim Bray is one of the fathers of XML and co-editor of W3C's (World Wide Web
Consortium) XML specification. W3C is commited to designing open Web standards. In this
interview, Tim Bray discusses the future of XML.
SR: Why do we need XML?
The Web today is all about communicating to humans, which is
important. But for the next phase, including electronic commerce, we need
machine-to-machine communication, and we need it to be reliable, efficient, and
internationalized. That's what XML is all about.
SR: Is XML a programming language?
No. XML is a data format. One of the main points of XML is that it is
equally usable with any programming language, although modern object-oriented languages
like Java seem to be an especially good fit. XML is also equally usable in any human
language, because it's based on the Unicode standard for international texts, and equally
usable on any operating system, because nobody owns it.
SR: What are the hard problems with XML?
XML is pretty easy to create, and pretty easy to display, but we have a big problem
with storing and managing large quantities of it in a clean and efficient way. Therefore,
I am very interested in Software AG's plans to deliver a DBMS based on a pure XML engine.
I think that an increasing amount of business information will be based on XML so we'll be
needing this kind of repository technology.
SR: Which role will all the existing database products play with respect to XML?
Most commercial databases of today are based on the relational SQL
model. This is a wonderful model for the rows and columns of business data, but XML has a
much more complex structure that includes sequence, hierarchy, and hypermedia. We have
already learned that it's very difficult to deal with XML in a relational repository. I
think that there is a huge opportunity for a new generation of database management systems
that are designed for XML.
SR: Will XML databases and conventional SQL databases coexists for a long time or
will they be substituted by XML products?
Obviously, the arrival of XML won't make existing databases go away.
Once we've solved the hard problem of how to store and manage XML well, we have to solve
the other hard problem of how to integrate XML with our existing databases. I am not sure
it's going to be possible to give a useful relational view of XML data, but it certainly
would be nice to have an XML view of relational data.
SR: What are the existing SQL DBMS suppliers doing about XML?
The only real announcement is from Oracle, in their 8i product. They
use an architecture where part of the XML goes into tables and part of it goes into
Context, their full-text engine. We'll have to wait and see how this works out in
practice, but it makes me a bit nervous that the data is split into two repositories and
this is user-visible. I'm expecting that IBM will do something in this area; they have
been putting huge amounts of work into Java, and they seem to be very committed to XML.
SR: Do we need to worry about Microsoft dominating the XML world the way it
dominates the desktop?
Yes, of course. Microsoft is smart and fast and very, very rich. If
they can see a way to take control of the XML world I am sure that they will do it.
Fortunately, I think that will be quite difficult, since XML is quite simple and is 100
percent open. Anybody with brains and energy has a chance to play. I'm also happy to
report that the World Wide Web Consortium, the organization that sponsored the development
of XML, has a process that seems quite successful at balancing the interests of all the
users and vendors.
SR: When we have XML databases, how will we query them? Can SQL do the job?
I think we need a new XML-oriented query language. SQL as it stands
today is more or less completely useless with XML. I suppose there is some hope with the
extension facility in SQL3 of building an XML query module. But I am not optimistic. SQL
is already insanely over-complicated, and one of the main advantages of XML is that it's
intrinsically simple; this is why the market opportunity is so big.
Tim Bray is an independent consultant based in Vancouver, British Columbia, Canada.
For more information, see www.textuality.com. Tim Bray can be contacted at tbray@textuality.com. |