VEP-007: datalink-core#detached-header
Laurent Michel
laurent.michel at astro.unistra.fr
Wed Sep 15 13:30:01 CEST 2021
Hello,
I agree with you Anne, except for the last sentence :-)
In the Datalink context, I think it is important to keep seperate words for #meta-data and #documentation.
#documentation relates to textual information on #this. A good example is given by the Vizier READMEs that provide the
scientific contexte of #this.
#metadata relates to a documentation that is required to process #this, e.g. WCS.
One is supposed to be read by the user whereas the other is meant to by used for some processing.
Laurent
Le 15/09/2021 à 13:03, Anne Catherine Raugh a écrit :
> Hello,
>
> Well, thinking about this at 6am without having had coffee yet...
>
> In the context of data, "documentation" and "metadata" are very nearly synonymous. The term "machine-readable" meant something
> quite different when I started in this business (back when dinosaurs roamed the Earth providing tech support), but now it seems
> to have evolved to mean what we would have called "machine-actionable". Even still that's not as clear a line as we might like.A
> PDF document, for example, is machine-readable: it takes a computer to turn it into something that is human-readable and it is
> also possible to extract text, tables, and other items from a PDF file. But I don't think anyone wants to see PDF headers for
> data. Machine-readable headers are often human-readable as well, by design (like FITS). (There are binary headers out there in
> some formats that are /not/ human-readable, but I suspect they do not show up in IVOA repositories as data files.)
>
> So, from my still-learning and yet aging perspective, I would expect that "documentation" would be the broadest term, that
> "metadata" would be a subtype of documentation with the stipulation that it is documentation provided in a format that conforms
> to a published standard that makes it machine-readable. "Header" would then be a subtype of "metadata" that is tightly bound to
> #this, with the further restriction that it provides some metadata that is unique to #this (otherwise why carry it along?). If
> we have a need to identify documentation that is not machine-readable (PDFs would qualify in the IVOA context), then there is
> probably another subtype of #documentation lurking in the wings. If we require that all documentation be machine-readable, then
> "documentation" and "metadata" are synonymous in the IVOA context, and there is no need for two terms.
>
> Coffee time...
>
> -Anne.
>
>
> On Wed, Sep 15, 2021 at 5:26 AM BONNAREL FRANCOIS <francois.bonnarel at astro.unistra.fr
> <mailto:francois.bonnarel at astro.unistra.fr>> wrote:
>
> Hi all,
> I know that VEP-007 will be discussed today and I will not
> participate.
> I got no answer to this email except recently from Markus in a
> private discussion.
> I am strongly in favor of the new term itself.
> My small concern (with a proposal) is that as it is stated now
> in "https://volute.g-vo.org/svn/trunk/projects/semantics/veps/VEP-007.txt
> <https://volute.g-vo.org/svn/trunk/projects/semantics/veps/VEP-007.txt>"
> The head-term of #detached-header is "documentation".
> I think that a head term like #metadata would be better.
> This term doesn't exist up to now and I think it is lacking.
> #metadata is different from #documentation in the sense that the
> first one is machine readable while the other one is human readable
> So usage by clients would be very different. Other example for
> #metadata would be #ivoa-provenance-metadata and #ivoa-obscore
> If you think this is a good idea I would like to add this term
> either in the same VEP or as another VEP with the small modification of
> head-term in VEP-007 itself
> Cheers
> François
>
> Le 16/06/2021 à 09:20, BONNAREL FRANCOIS a écrit :
> > Hi all,
> > Very interesting proposal indeed.
> > I'm just wondering if we could take the opportunity to create a new
> > #metadata branch .
> > So that "#detached-header" would be a child of this head term #metadata
> > Other children terms in the future could be #provenance_record (to
> > attach ivoa provenance serailisation to #this), #obscore_record (to
> > attche obscore metadata to #this when the dataset was not discovered
> > via an ObsCoredelivering service) etc ....
> > Cheers
> > François
> > Le 16/06/2021 à 09:13, Baptiste Cecconi a écrit :
> >> Hi all,
> >>
> >> I received an direct comment from Anne Raugh, and have updated the
> >> Rationale of VEP-007 accordingly. I copy the new content of the
> >> Rationale section below:
> >>
> >>> Rationale:
> >>> In some formats and archives, the metadata required to decode the
> >>> content of #this is in a separate file. In the case of NASA/PDS
> >>> data products, for example, the PDS label file contains the decoding
> >>> metadata. In some archives, the FITS header may be stored separately
> >>> as a plain text file, next to a data file consisting of a binary
> >>> stream of bytes. Clients would use the content type (MIME type)
> >>> (e.g.: application/x-pds4-label+xml, or text/x-pds3-label) to enable
> >>> the processing.
> >>>
> >> Cheers
> >> Baptiste
> >
> >
>
--
English version: https://www.deepl.com/translator
--
jesuischarlie/Tunis/Paris/Bruxelles/Berlin
Laurent Michel
SSC XMM-Newton
Tél : +33 (0)3 68 85 24 37
Fax : +33 (0)3 )3 68 85 24 32
Université de Strasbourg <http://www.unistra.fr>
Observatoire Astronomique
11 Rue de l'Université
F - 67200 Strasbourg
More information about the semantics
mailing list