role of the registry
Anita M. S. Richards
a.m.s.richards at manchester.ac.uk
Thu May 10 10:57:12 PDT 2007
>
> The registry was conceived as a resource location service. The formative
> document describes resource metadata.The name of the working group is
> Resource Registry. There has been debate for a long time on how far to push
> the registry in the direction of detailed service-level metadata.
.....
> This is not unique to the registry, but it is clear in the case of
> the registry that even the simple stuff is not done well or completely.
If you mean that people don;t use it 'well or completely' then I agree,
but there is a difference between implimenting it wrongly, and
implimenting it correctly but superficially.
As a data provider, I want a tool to help me register the content,
including coarse coverage, and basic curation and access information. I
will also provide tools/services particularly useful for these data and I
want to be able to describe them, too. If
I published tables, then the column names and metadata _might_ be
appropriate. On the other hand, if I have 100 tables each with 100
columns, it might not be.
At present, almost all this is covered in the human language RM document,
although tool publishing is a little rough. I am distressed to learn from
engineers that the schemata may not be consistent with this, and even more
distressed if I am told that in order to understand the model, I have to
read the schemata (as mentioned in another posting). I am probably fairly
typical of data providers in that, given a nice XML editor - or better
still, a form interface - and some instructions and examples, I can just
about fill in a document, but not evaluate it critically and we should not
expect users to have to read schemata!
Even if you don't agree that column names are useful, they are optional,
like much else in the RM document. Much as I rant about data providers
who dont include proper coverage information, I agree with it being
optional because ultimately the providers have to decide what is
appropriate for their data sets.
> I have never been convinced by the arguments for including things like table
> column names and UCDs in the registry. The use case recently cited, that of
> dynamically building SEDs by locating catalogs with relevant columns would,
> I think, never be trusted by astronomers doing research.
On the contrary, Bob, it is invaluable at present via Vizier or AstroGrid
or even Google _because other more accurate metadata aren't always
available_. UCDs in particular are useful. It depends what you are
doing, but for finding potentially interesting data it is certainly
useful. You can always check more thoroughly as needed...
> I suggest that everyone take a deep breath, step away from their desks for a
> while, and try to recall what the VO is really about. Key issues are data
> discovery, interoperability, and a low barrier to participation.
Exactly, which is why in my view, we should prioritise
a) making sure that the human-language documentation and the schemata are
consistent;
b) encourating affiliated VOs to develop tools to make it easy to register
resources;
c) if necessary, improving the provision for registering tools
and finally
d) make sure that things are labelled as 'must', 'should', 'may' or
equivalent so that the buy-in is very simple but people who do want to
provide more information can do so, especially where there are tools to
use it (as is the case for column names and especially UCDs!).
thanks
a
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
Dr. Anita M. S. Richards, AstroGrid Astronomer
MERLIN/VLBI National Facility, University of Manchester,
Jodrell Bank Observatory, Macclesfield, Cheshire SK11 9DL, U.K.
tel +44 (0)1477 572683 (direct); 571321 (switchboard); 571618 (fax).
More information about the dal
mailing list