[Heig] vocabulary update: proposal for dataproduct_type update for high energy data : event-list definition and event-bundle
Bruno Khelifi
khelifi at apc.in2p3.fr
Wed Apr 30 10:11:37 CEST 2025
Hello Markus,
Le 30/04/2025 à 08:57, Markus Demleitner via heig a écrit :
> Hi Bruno,
>
> On Tue, Apr 29, 2025 at 03:27:35PM +0200, Bruno Khelifi via semantics wrote:
>> Le 25/04/2025 à 09:29, Markus Demleitner via heig a écrit :
>>> (1) used-in: I really, *really* would like to see actual, published
>>> data here (always, in all VEPs; it's a pain if we go into all the
> [...]
>>> used-in: dataset ivo://csc.harvard.edu/scsr2?some-obs-id onhttp://cda.cfa.harvard.edu/csctap
>>>
>> In the VHE domain, CTAO, KM3NeT, SWGO wish to publish their current and
>> future data in VO. The current running experiments MAGIC and HESS are
>> working toward the publication of their legacy data into the VO.
>> And this is also in this context that such extensions are of interest for
>> us.
> Is anything already online in obscore or via datalink? It would
> really be great if there was something out there when the VEP
> officially goes up?
Not yet. We are preparing a HEIG note to present our needs and their
justifications. Mathieu Servillat, Ian Evans and al. will make some R&D
to be sure that it works well... even if DataLink is well secured;)
>>> (2) Relationship: That's an operational field, i.e., I need to create
> [...]
>>> user side: If I'm looking for #event-bundle, do I want to see
>>> #event-list, too? If I'm looking for #event-list, do I want to see
>>> #event-bundle, too? Whatever ought to encompass the other is the
>>> wider term.
>> event-bundle is e.g. event-list + response (a set of IRFs)
>> (+provenance+readme+etc)
>> In this context, one would need to use DataLink to access to all files
> Well... but does
>
> #event-list #skos:broader #event-bundle
>
> hold or is it the other way round?
As a simple physicist, I do not know that means #skos:broader. In words,
an event-list is contained into an event-bundle, because
event-bundle=event-list+IRFs+provenance+etc
>
>> First, our IRFs are not just accessory, they are mandatory for HE (photons
>> and neutrinos) to make physics (for data analysis specialists:
> I don't doubt this, I was just wondering if, from a *product-type*
> point of view there is a sufficient point in telling #event-bundle
> and #event-list apart: Do you ever want to discover one or the other?
> Would it perhaps be necessary to tell the two apart for the purpose
> of choosing an appropriate SAMP client?
>
> A good answer to that would also help answering the relationship
> question.
As I said, having event-list w/o IRFs is so poor and does not allow to
make science. One needs to discover the event-bundle, and as consequence
define properly its contents, ie event-list and response.
>
>> In addition, one has use cases where these IRFs can be used without
>> event-list: the case of simulations. Also, one can have one set of IRFs for
>> many observations. So a data producer could use a dedicated entry in ObsCore
>> for that.
> Ah-ha! But this would mean that you would need IRFs as an
> independent product type, no? How else would you annotate these
> stand-alone IRFs?
The UCs to use IRFs by its own and to be able to discover them are:
- make source simulations for a given instrument
- compute sensitivity of an instrument
- make predictive performance curves for an instrument
- share (large) IRFs valid for a wide range of observations (in the
spirit of the CALDB of X-rays) - very useful for IceCube, KM3NeT, HAWC,
SWGO (and maybe AUGER, etc)
This is quite important already, I think. I have to think a bit more to
check if the list is complete.
>
> If that's solid reasoning (and frankly, I only have a fairly vague
> idea of what I'm talking about here), then we could introduce some
> concept of *#ancillary into product-type (which feels a bit wrong
> because we have that concept in datalink/core, too, but I think it's
> justifiable), and then *#irf as a narrower term. #event-bundle would
> then be narrower than both #irf and #event-list.
ancillary is so vague that it can contains everything... This is a vague
stuff, a container somehow. Here, we propose to describe correctly some
of its possible elements: response, ARF, RMF, PSF, Edisp, Aeff,
Bkg(=background). All these elements are critical for us and they
deserve a proper description such that the HE experiments (from X-rays
to PeV, in photons and neutrino). Indeed, there are specific elements,
like a phot.mag.distMod, phys.ionizParam.coll or src.calib.guideStar.
Each astronomical domain has its own peculiarities, applicable only for
its own data releases. Narrow elements are common in IVOA. Are these
elements too narrow? We are just covering 12 orders of magnitude in the
electro-magnetic spectrum, which is not so narrow;)
>
> The consequence would be that searches for #event-list yield
> #event-bundle, too, but searches for #event-bundle wouldn't. Does
> that sound right to you? If so, could you try to write a definition
> for *#irf (and perhaps come up with a less cryptic label)? I think
> we should amend the #event-bundle VEP with that on top, if this is
> what we're going to do.
The data producers could have the freedom to organise their releases as
they wish. The 2 UCs associated to the data release organisation are:
- a list of event-bundle (using DataLink to access to all downloadable
files) - typical case for Imaging Atmospheric Cherenkov telescopes
- or separate entries for the event-list and the response - a possible
case for Water Cherenkov Detectors and neutrinos (or cosmic-rays
experiments)
- or dedicated entries about the response that is "representative" of an
instrument (UC: proposal preparation, estimation of instrument
performance for a given science use case, instrument sensitivity -
Zenodo is great, but all the IVOA technology permits such important use
cases)
Maybe one can discuss about that this afternoon!
Thank you for your feedback,
Bruno
>
> Thanks,
>
> Markus
>
--
Bruno Khelifi
Physicist at CNRS (laboratory APC, Paris)
Phone: +33.1.57.27.61.58 - Fax: +33.1.57.27.60.71
APC, IN2P3/CNRS - Universite de Paris Cite
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.ivoa.net/pipermail/semantics/attachments/20250430/c78eb676/attachment.htm>
More information about the semantics
mailing list