table metadata and the registry

Ray Plante rplante at poplar.ncsa.uiuc.edu
Tue May 8 13:44:36 PDT 2007


Hi Tony,

On Tue, 8 May 2007, Tony Linde wrote:
> 3. we had already agreed to have fine- and coarse-grained registries back in 
> Victoria: I'm not sure why this has come up again (this in response to your 
> comments below about forcing you to do registries our way or getting detailed 
> metadata from your services!). As long as these registries are identifiable 
> and applications can tie themselves to the one they need, what is the 
> problem?
>
> Am I missing anything else?

I tried to explain how a fine-grained regsitry like AstroGrid affects a 
coarse-grained registry in the NVO.  It comes down to how much information 
appears in the records we exchange in harvesting.  If you export 
VOResource records through your harvesting interface that include table 
metadata, we are obligated to preserve that information and make it 
available to users.  That was part of the Victoria deal.  Furthermore, if 
the only way you get table metadata from NVO services is through our 
harvesting interface, then we have to provide this information in order to 
make them usable by the AstroGrid query builder.

Do you see how we have effectively been forced into storing table metadata 
in our registry?

Now, is there a way for you to get at the table metadata--regardless of 
where it came from, integrate it into your registry, and use it as you do 
now, without NVO having to collect and manage this information within our 
registry as well?  This is what I am trying to answer (with a 'yes').

Do you agree that we should try to find a way to do this?

> 1. You've proposed returning a limited subset but in the same format as 
> VOResource table schema. Why do you stop at that level? Surely everything 
> within Capability describes the service. If we're going to pick an arbitrary 
> level, why not Capability or, better, VOResource? What is the issue with the 
> service returning *all* its metadata?

I am currently picking on table metadata, because
   *  this is what critics are specifically pointing to as an example of
        information that is too fine-grained.
   *  it is not included among the RM metadata, our core set intended for
        registries.
   *  it is part of a schema and resource types that are too important to
        ignore completely.

If we can come up with the way to meet the above-mentioned goal, then we 
may have a model for dealing with other controversial information.  This 
might also include detailed detailed coverage descriptions.  What we 
really need is a shared understanding of what is fine-grained so that 
if/when we define new resource metadata and new extensions, we not only 
have a way to evaluate information that might be considered fine-grained, 
but also have a recipe for accessing that information if we can't agree 
whether to include it in the extension.

> 2. I have no problem how the metadata is returned but only want the most 
> efficient method. I would have thought it was more efficient to have all 
> services implement a single standard interface aligned to the way the service 
> is implemented: soap method if a soap service; url if a cgi service.

I suggested a URL as the handle because this is the simplest way to 
retrieve information when inputs are not required.  It can be associated 
with a kind of resource, be it a service or data collection, but it can be 
made as complex as necessary (e.g. to generate the information 
dynamically) under the covers.

> That's fine, but let's make sure the applications and services that rely on 
> registry information *now* can continue to work while these techniques are 
> being developed.

My suggestion had this in mind from the beginning.  You are currently 
collecting table metadata explicitly within VOResource records.  To 
implement my idea, we make a place in the schema to hold the new URL.  We 
begin populating the new metadatum.  We begin using the URL, and then we 
begin deprecating the explicit inclusion of table metadata.  There is 
no interruption in the support for you query builder, and all during 
this transition, we see an increase in the availability of table metadata 
because the URL for Cone Searches, SIAs, and SkyNodes can be 
auto-generated.

cheers,
Ray



More information about the registry mailing list