VEP-007: datalink-core#detached-header
Markus Demleitner
msdemlei at ari.uni-heidelberg.de
Wed Sep 15 12:00:25 CEST 2021
Hi François,
On Wed, Sep 15, 2021 at 11:25:14AM +0200, BONNAREL FRANCOIS wrote:
> I know that VEP-007 will be discussed today and I will not
> participate.
Oh, dang. So, what about VEP-006? François, you seemed to still
have concerns with that, too -- or can that go on now?
> I am strongly in favor of the new term itself.
> My small concern (with a proposal) is that as it is stated now in
> "https://volute.g-vo.org/svn/trunk/projects/semantics/veps/VEP-007.txt"
> The head-term of #detached-header is "documentation".
> I think that a head term like #metadata would be better.
> This term doesn't exist up to now and I think it is lacking.
> #metadata is different from #documentation in the sense that the
> first one is machine readable while the other one is human readable
> So usage by clients would be very different. Other example for
> #metadata would be #ivoa-provenance-metadata and #ivoa-obscore
> If you think this is a good idea I would like to add this term either
> in the same VEP or as another VEP with the small modification of head-term
> in VEP-007 itself
First, I'm strongly against introducing terms on, effectively, a whim
-- experience has shown many times it's almost certain that'll be
trouble once we actually need a related concept.
Instead, we should rather
(a) have a clear need for the term, which for datalink/core means
(for now): A link that either doesn't fit any of the existing
concepts at all or requires splitting an existing concept because the
link needs a different treatment (by a machine) from other members of
that existing concept.
(b) have a clear and plausible ("digital") *pragmatics* for that
concept. What we're doing here is machine-readable semantics, so to
make a term worthwhile, there must be at least an idea (even better:
implementation) of what a machine should do with the information that
the link belongs to the new concept (rather than an old one).
Having said that, François has a point in that documentation's
definition currently says "Extra information on the item in
*human-readable* text form" (emphasis mine). I think what we're
learning with this item is that "human-readable" was an error here;
the examples "processing logs" or "weather reports" already make that
clear: These are a lot more useful when they come in a structured
format, which very often will not have "text form" (unless you have a
very liberal idea of text).
Hence, I'd say we need to fix the definition #documentation to read
Extra information aiding in understanding or interpreting the item.
Such information can range from processing logs to weather reports
to technical documents on instruments to related publications.
I give you that one of these days separating "human-readable" and
"machine-readable" (or structured, as for #detached-header)
documentation might be useful (so users could hide away stuff that'll
only give gobbledigook unless they have a matching client); that
could then approach what I think your *#metadata would comprise.
But let's wait until we have cases where there's so much
#documentation that that's actually worth it. For now, I'm sure
even in interactive use most people would actually like to see a FITS
header when they open #documentation.
François (and, of course, everyone else), would you be ok with a VEP
fixing #documentation's definition as above? Since it just widens
the extension and probably won't conflict strongly with other
concepts in datalink/core, I'd be rather confident it's technically
sane.
-- Markus
PS: talking about discussions that petered out over the holiday season,
I'd like to use this opportunity to remind everyone of
http://mail.ivoa.net/pipermail/semantics/2021-July/002829.html, where
I'm raising several points that I'm rather sure need to be addressed
before VEP-009 can go on.
More information about the semantics
mailing list