VizieR repository

Sebastien Derriere derriere at newb6.u-strasbg.fr
Mon Nov 24 10:55:32 PST 2003


  Dear harvesters :)

  I have opened a preliminary OAI-PMH interface to harvest the contents
of VizieR at the following address:

http://vizier.u-strasbg.fr/cgi-bin/registry/vizier/oai.pl

  You can test it with the OAI explorer, for example:
http://oai.dlib.vt.edu/cgi-bin/Explorer/oai2.0/testoai

  The repository handles two metadataPrefixes:
- the standard oai_dc for basic Dublin Core
- ivo_vr for metadata formatted in the VOResource-v0.9 schema

  I have adopted the following conventions. Your comments on these are
welcome:

* The AuthorityID has been set to "CDS". Is this the correct choice, or
should it look more like a domain name? Or should there be different
 AuthorityIDs for VizieR, Simbad, Aladin, ... etc in the case of CDS ???

* I have defined as 'Sets' the top-level categories in VizieR, and
the items correspond to the different 'Tables'. Should the items be
'Catalogues' rather than 'Tables'? (there are approx. 4000 catalogues,
and 10,000 tables - 1 cat. can contain several tables).
  Is there a way to handle different levels of granularity (sets,
subsets)?

* I still had problems with the allowed syntax for ResourceKey,
therefore I currently use a modified version of VOResource-v0.9.xsd
with the following:
   <xs:simpleType name="ResourceKeyType">
     <xs:restriction base="xs:string">
       <xs:pattern
value="[\w\d\-_\.!~\*'\(\)\+]+(/[\w\d\-_\.!~\*'\(\)\+]+)*"/>
     </xs:restriction>
   </xs:simpleType>

  This is to allow for the '+' char in the ResourceKey, as we have for
example
in VizieR tables from A&A coded as J/A+A/...
  In the IVOA Identifiers v0.2, the '+' was not forbidden, but the
RegExpr
in VOResource-v0.9.xsd didn't allow fot it.

* How do we deal with obsolete items in a registry?

  Sometimes, an item becomes obsolete in a repository,
  but it is very rare that it is acually deleted.
  Therefore, the status can still be 'active' while the
  item is obsolete.

  For example, the first 2MASS release is still active
  in VizieR (because there have been published papers
  referring to objects therein, and it is useful to have
  the original data available), but there is a flag to
  indicate that use of this catalogue is now deprecated
  because is it made obsolete by the new 2MASS catalogue.

  Maybe we should intrduce a new value for 'RelationshipType'
  in the VOResourceRelType schema, that would be 'deprecated-by'
  to indicate the link between the obsolete item and the new item
  that makes the other one obsolete.

* Not all the possible metadata are filled yet. At least the
required ones are. How are we supposed to describe the coverage
in this kind of OAI-PHM XML ? Should it be a second element in
the <metadata> section, formatted accordingly to the coverage
schema?

* Last question for now, how do we deal with mirrors/clones?
Should we copy this repository on all vizier clones, so that people 
can harvest on any vizier site?

Sebastien.
-- 
    _______
   /  ~   /, Sebastien Derriere   mailto:derriere at astro.u-strasbg.fr
  / ~~~~ //  Observatoire de Strasbourg    Phone +33 (0) 390 242 444
 /______//   11, rue de l'universite     Telefax +33 (0) 390 242 417
(______(/    F-67000 Strasbourg  France



More information about the registry mailing list