Space-separated vs Tag-separated list parsing

martin hill mch at roe.ac.uk
Thu Dec 15 08:15:55 PST 2005


As I understand it, when a schema is not available, or is not applicable, or the
parser doesn't refer to it for whatever reason,  the parser *should* (indeed,
*must*) read space-separated lists as a single string.  It's up to you then to
do your own tokenizing.

Quoting Alasdair Allan <aa at astro.ex.ac.uk>:

> 
> Ray Plante wrote:
> > I'd be curious to hear which parser(s) are not processing lists 
> > properly.
> 
> The are others, but LibXML2 springs to mind. See the following dump of 
> the data structure it generates post-parse, as you can see that 
> crd:Value2 ends up in a single tag.
> 
> [snip]

-- 
Software Engineer
Astrogrid, ROE (www.astrogrid.org)
Mob: +44 7901 55 24 66
Fax: +44 131 668 82 64



More information about the dm mailing list