Factor/GSoC/2009/Improve XML library


Doug Coleman

Skills required

  • Some prior knowledge of XML
  • Experience with parsing is a plus

Technical outline


Factor's XML parser does not yet pass the standard conformance tests because its support for parsing DTDs is inadequate. Minor modifications should let it pass the XML 1.0 and XML 1.1 conformance test suites, as well as the Namespaces in XML 1.0 and 1.1 conformance tests.


The Factor XML parser is non-validating. It might be useful to implement either DTD validation or validation based on another XML schema language, such as RELAX NG or W3C XML Schema.
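
To illustrate what a validation pass adds on top of a non-validating parse, here is a minimal sketch in Python (not Factor). The `ALLOWED_CHILDREN` table and the `validate` function are hypothetical stand-ins for the `<!ELEMENT>` declarations of a DTD. The sketch only checks which child element names are permitted, not their order or cardinality, so it covers far less than real DTD validation.

```python
import xml.etree.ElementTree as ET

# Hypothetical content model: element name -> set of permitted child names.
# A real DTD would also constrain order, repetition, attributes and text.
ALLOWED_CHILDREN = {
    "library": {"book"},
    "book": {"title", "author"},
    "title": set(),
    "author": set(),
}

def validate(element, allowed=ALLOWED_CHILDREN, path=""):
    """Return a list of error strings for the subtree rooted at element."""
    here = f"{path}/{element.tag}"
    if element.tag not in allowed:
        return [f"{here}: undeclared element"]
    errors = []
    for child in element:
        if child.tag not in allowed[element.tag]:
            errors.append(f"{here}: child <{child.tag}> not allowed")
        else:
            errors.extend(validate(child, allowed, here))
    return errors

# ElementTree itself is non-validating, so the parse below succeeds
# and the structural problem is only reported by validate().
doc = ET.fromstring(
    "<library><book><title>T</title><isbn>1</isbn></book></library>"
)
for problem in validate(doc):
    print(problem)
```

Separating parsing from validation in this way means the existing parser can stay unchanged, with validation offered as an optional pass over the parsed tree.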


The XML parser is currently not very fast. A new lexer based on Factor's regexp library could improve performance substantially. Other kinds of parsing abstractions might also be useful.
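
The idea of a regexp-based lexer can be sketched as follows. This is an illustration in Python using its `re` module, not Factor's regexp library, and the token names and the `lex` function are assumptions made for the example. It handles only a small subset of XML (tags, quoted attributes, comments and text) and ignores DTDs, CDATA sections, processing instructions and entity references.

```python
import re

# One alternation per token kind; the regex engine does the scanning.
TOKEN_RE = re.compile(
    r"""
    (?P<comment><!--.*?-->)
  | (?P<close></(?P<close_name>[A-Za-z_][\w.\-:]*)\s*>)
  | (?P<open><(?P<open_name>[A-Za-z_][\w.\-:]*)
        (?P<attrs>(?:\s+[A-Za-z_][\w.\-:]*\s*=\s*(?:"[^"]*"|'[^']*'))*)
        \s*(?P<selfclose>/?)>)
  | (?P<text>[^<]+)
    """,
    re.VERBOSE | re.DOTALL,
)

# Splits the raw attribute run of an opening tag into name/value pairs.
ATTR_RE = re.compile(r"""([A-Za-z_][\w.\-:]*)\s*=\s*(?:"([^"]*)"|'([^']*)')""")

def lex(source):
    """Turn an XML string into a flat list of token tuples."""
    tokens = []
    pos = 0
    while pos < len(source):
        m = TOKEN_RE.match(source, pos)
        if m is None:
            raise ValueError(f"unexpected input at offset {pos}")
        if m.group("comment") is not None:
            # Strip the leading "<!--" and the trailing "-->".
            tokens.append(("comment", m.group("comment")[4:-3]))
        elif m.group("close") is not None:
            tokens.append(("close", m.group("close_name")))
        elif m.group("open") is not None:
            attrs = {}
            for a in ATTR_RE.finditer(m.group("attrs")):
                value = a.group(2) if a.group(2) is not None else a.group(3)
                attrs[a.group(1)] = value
            name = m.group("open_name")
            tokens.append(("open", name, attrs))
            if m.group("selfclose"):
                # Report <b/> as an open token followed by a close token.
                tokens.append(("close", name))
        else:
            tokens.append(("text", m.group("text")))
        pos = m.end()
    return tokens

for token in lex('<a x="1"><b/>hi<!-- c --></a>'):
    print(token)
```

A parser built on top of such a token stream only has to match opening and closing tags, which keeps the character-level work inside the regexp engine. Whether this is actually faster depends on the regexp implementation, so it would need to be measured.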

Benefit to the student

The student gains deep knowledge of XML and has the opportunity to learn about the theory of parsing.

Benefit to the project

XML is pervasive, so Factor applications often have to deal with it. Improvements to the XML library will benefit every application that uses XML.

This revision created on Fri, 13 Mar 2009 17:17:33 by littledan