Product-type as a SKOS vocabulary
BONNAREL FRANCOIS
francois.bonnarel at astro.unistra.fr
Fri Jan 14 19:11:47 CET 2022
Dear Markus
Thanks for your answer
On 10/01/2022 at 10:48, Markus Demleitner wrote:
> Dear François,
>
> On Wed, Jan 05, 2022 at 04:50:59PM +0100, BONNAREL FRANCOIS wrote:
>> If we want to organize data products in a network, we have to think about
>> which properties are able to characterize them and see which are the
>> most common cases.
> As usual, I'd like to come from use cases; it is certainly valuable
> to have a formal and solid understanding of the full domain, but in
> order to come up with vocabularies (or, eventually, ontologies)
> that people can work with, I am sure we will need to agree which
> parts of a full mapping ought to enter into a given semantic
> resource -- and which might be dispensable *for that specific*
> semantic resource.
>
> For product-type, I think we have two basic use cases:
>
> (A) discovery of artefacts relevant to a defined research project;
> that's the obscore use case where people would, for instance, look
> for spectrally resolved data and want to filter out images and time
> series that are not spectrally resolved.
>
> (B) routing of some artefact to the proper (SAMP) client; that's the
> datalink use case where a user might simply double-click a datalink
> row, and it'll open in TOPCAT when it's a table, except if it's
> really a spectrum, in which case it would preferably go to Splat, and
> it would go to Aladin if it's an image (or perhaps ds9 or whatever,
> depending on the user session).
>
> Taking François' schema:
>
>> I see at least four types of properties:
>> 1) What are the independent variables (in the sense of functional
>> dependencies of variables on others)? For example, if time is
>> independent we have a TimeSeries; if the spectral coordinate is
>> independent we have a spectrum.
>> 2) What are the dependent variables? In the case of a TimeSeries, if it's
>> a photometric quantity, it could be a lightcurve; if it's radial velocity,
>> it is a velocity curve.
>> 3) Are the independent variables sparsely or regularly sampled?
>> 4) The organization: is this a table (where the different quantities
>> of a given measurement are explicitly recorded) or a bitmap (where the
>> position of a dependent measurement in the measurement array is a function
>> of the independent variables)?
> I think (1) and (2) are obviously relevant to both cases.
>
> For (3) I'm less certain. Use case (A), I would claim, actually
> requires making this ignorable. If I'm looking for a spectrum of a
> source, I'm happy to find anything spectrally resolved, and I'd
> rather like to avoid having to remember to somehow include both
> sparse and non-sparse datasets. For use case (B), it is conceivable
> that certain clients can only deal with regularly sampled data (e.g.,
> an image, which I'd like to send to Aladin) and others only with
> sparse data (e.g. spatially resolved events, which might better be
> dealt with in TOPCAT). Is this really something we'd want to (be
> able to) automatically handle? I'm currently leaning towards a
> tentative "no", but I could certainly be convinced I'm wrong here.
>
> Item (4) is, I think, closely related to (3), though it's perhaps still
> a bit more technical and farther removed from scientific content. My
> example here would be IRAF-type spectra (which are 1D bitmaps) versus
> IVOA SDM spectra (which are tabular). And my conclusion would be
> rather analogous to item (3), also because SAMP offers no way to
> tell one from the other at this point.
>
> So, I think what we should come up with are (plausible) stories of
> data usage. These could guide us on whether (3) and (4) need to be
> reflected in product-type -- and how we don't spoil (A) if we want to
> have them for (B).
I may try to present two slides on such possible stories on Monday if there
is some time left in the meeting Semantics is organizing.
Otherwise, later.
Cheers
François
>
> -- Markus
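
To make the two use cases concrete, here is a minimal Python sketch of how
properties (1) to (4) could feed discovery (A) and routing (B). Everything
in it is an assumption made for illustration: the Product class, its field
names, the axis labels ("spectral", "spatial", "time") and the routing rules
are hypothetical and are not taken from product-type, obscore, datalink or
SAMP. The client names are only the examples mentioned in the thread.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Product:
    """Hypothetical description of a data product by properties (1)-(4)."""
    independent: frozenset  # (1) independent axes, e.g. {"spectral"}
    dependent: str          # (2) dependent quantity, e.g. "flux"
    regular: bool           # (3) True if regularly sampled, False if sparse
    tabular: bool           # (4) True for a table, False for a bitmap


def is_spectrally_resolved(p):
    """Use case (A): deliberately ignores properties (3) and (4)."""
    return "spectral" in p.independent


def preferred_client(p):
    """Use case (B): one possible routing rule (an assumption, not a standard)."""
    if p.independent == frozenset({"spectral"}):
        return "Splat"    # a spectrum, whether tabular or a 1D bitmap
    if "spatial" in p.independent and p.regular:
        return "Aladin"   # regularly sampled spatial data, i.e. an image
    return "TOPCAT"       # everything else is treated as a table


products = {
    "iraf_spectrum": Product(frozenset({"spectral"}), "flux", True, False),
    "sdm_spectrum": Product(frozenset({"spectral"}), "flux", False, True),
    "image": Product(frozenset({"spatial"}), "flux", True, False),
    "events": Product(frozenset({"spatial"}), "counts", False, True),
    "lightcurve": Product(frozenset({"time"}), "flux", False, True),
}

# (A) discovery: both spectra match, regardless of sampling or organization
found = sorted(n for n, p in products.items() if is_spectrally_resolved(p))

# (B) routing: here sampling matters, to tell an image from an event list
routes = {n: preferred_client(p) for n, p in products.items()}
```

In this sketch only the routing function looks at property (3), which is one
way of having it available for (B) without spoiling (A).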