Datalink vocabulary extension: sibling?

François Bonnarel francois.bonnarel at astro.unistra.fr
Mon Apr 20 12:51:15 CEST 2020


Dear all,

We have to go forward on this.

I agree that #see-also covers nearly everything we can imagine in the 
contexte of Datalink as was my initial proposal of "#associated_data" in 
VEP001.

I think it's true that something like what Markus defined as sharing the 
same "progenitor with #this" and that he called #sibling is needed.

But )

   -   Due to the   "non intuitive" terminology, I suggest a synonymous 
: #co-generated.  This is directly inspired by Provenance dataModel, a 
new IVOA recommendation dated  April the 11th this year. If you share a 
progenitor with somebody else, this means you have been *generated *by 
the same *activity* with this guy and that this activity *used *the 
same*entity,* you progenitor.

   - This doesn't cover everything we had in mind originally with 
"associated data" in the sense of VizieR, sepcially when #this is a 
source in a catalogue and not a dataset. Several proposals have been 
made. Pat proposed "contains" and "followup" and I Personnaly proposed 
""Observation_Result_of_source" and also "cross-correlated". none of 
these seemed to be generic enough to satisfy everybody. But I do think 
we need some term for linking something to #thing which is scientific 
data (that is not a process, not a preview, not a calibration file, no 
auxiliary data and not additional medata) and which is neither 
progenitor nor derived and nor sibling/co-generated.

       To illustrate this look at the VizieR example below.  this is a 
catalog of Spitzer galaxies with link to their optical counterpart 
image: "Optical imaging for the Spitzer Survey of Stellar Structure in 
Galaxies. Data release and notes on interacting galaxies.".

As explained in the abstract 
(https://cdsarc.unistra.fr/viz-bin/cat/J/A%2bA/569/A91) a link is 
provided to opticals images of various origins for all galaxies 
extracted from Spitzer IR information. This kind of relation between the 
Spitzer galaxy and its optical counterpart image is not covered by 
sibling/co-generated, derived, or progenitor.

      I propose something like "#other" or "#alternate" ( the latter was 
already proposed by Markus ..... in 2015 !!! semantics session during 
interop In Sydney)

      In addition i think #derived,#progenitor, #co-generated (#sibling 
? ) and #alternante (#other ?) should share the same "head term"  
"#science".

Cheers

François




Le 23/03/2020 à 13:33, Markus Demleitner a écrit :
> Dear Apps,
>
> Over in DAL, we've tried define a new concept in the datalink
> vocabulary -- see
> http://mail.ivoa.net/pipermail/dal/2020-February/008261.html.
>
> The use case that started this is that the Gaia astrometry satellite
> produces spectra and time series for a subset of the objects it
> measures, and the idea is to use datalink to link the files to
> catalogue rows using datalink.
>
> The question essentially is: What is "the" relationship between the
> astrometric record and the spectra (the quotes are there because
> there are of course many such relationships, but we'd like to find
> one that's at the same time characteristic and generalisable)?
>
> And more importantly: What are other cases like that, and what would
> applications like to see so they can display these extra links at the
> right moment (and not display them when inappropriate)?  That's why
> I'm here: I'd be grateful for application writers' thoughts about our
> (Semantics) problem.
>
>
> The thread above somewhat petered out when considering two
> alternatives:
>
> #sibling -- essentially, this marks artefacts that came up from the
> same basic measurement as #this; it nicely complements #progenitor
> and #derivation that we already have in datalink/core.  Against it it
> has been argued that it's not very intuitive to people who don't
> think in provenance terms too much.  And that it perhaps is a concept
> that's not very useful in practice (i.e., doesn't let clients do
> things users would like them to do).
>
> #see-also -- the idea is that it's "data people might be interested
> in when looking at #this".  The trouble is that once I try to have a
> clear definition for what this actually means I fail.  You see,
> essentially all datalink terms would fall under #see-also, and I
> couldn't find a good criterion to exclude them.
>
> So, if we do something along the lines see-also, it would have to be
> see-also-if-X, where X could be,  perhaps, "scientifically exploiting
> #this" or "you have found something interesting in #this and would
> like to learn more about it" or whatever.  I think you'll agree that
> these proposals are... uncovincing.
>
> After this, I'm essentially back to #sibling, which is well-defined
> and (IMHO) easy to explain, which, as concepts go, is great.  It's
> also been useful *for publishing* in several other data collection,
> such as when people did line maps for a spectral cube.
>
> But is it useful *when consuming*, i.e., when doing user interfaces?
> If you think it's not, what concept *would* be useful to you?  Do you
> perhaps have an idea how a see-also-if-X concept could usefully be
> built?
>
> Thanks,
>
>            Markus


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.ivoa.net/pipermail/apps/attachments/20200420/67a69210/attachment-0001.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: opabbbnlpbennghi.png
Type: image/png
Size: 293348 bytes
Desc: not available
URL: <http://mail.ivoa.net/pipermail/apps/attachments/20200420/67a69210/attachment-0001.png>


More information about the apps mailing list