[Heig] vocabulary update: proposal for dataproduct_type update for high energy data : event-list definition and event-bundle

Bruno Khelifi khelifi at apc.in2p3.fr
Wed Apr 30 10:11:37 CEST 2025


Hello Markus,


Le 30/04/2025 à 08:57, Markus Demleitner via heig a écrit :
> Hi Bruno,
>
> On Tue, Apr 29, 2025 at 03:27:35PM +0200, Bruno Khelifi via semantics wrote:
>> Le 25/04/2025 à 09:29, Markus Demleitner via heig a écrit :
>>> (1) used-in: I really, *really* would like to see actual, published
>>> data here (always, in all VEPs; it's a pain if we go into all the
> [...]
>>>     used-in: dataset ivo://csc.harvard.edu/scsr2?some-obs-id onhttp://cda.cfa.harvard.edu/csctap
>>>
>> In the VHE domain, CTAO, KM3NeT, SWGO wish to publish their current and
>> future data in VO. The current running experiments MAGIC and HESS are
>> working toward the publication of their legacy data into the VO.
>> And this is also in this context that such extensions are of interest for
>> us.
> Is anything already online in obscore or via datalink?  It would
> really be great if there was something out there when the VEP
> officially goes up?
Not yet. We are preparing a HEIG note to present our needs and their 
justifications. Mathieu Servillat, Ian Evans and al. will make some R&D 
to be sure that it works well... even if DataLink is well secured;)
>>> (2) Relationship: That's an operational field, i.e., I need to create
> [...]
>>> user side: If I'm looking for #event-bundle, do I want to see
>>> #event-list, too?  If I'm looking for #event-list, do I want to see
>>> #event-bundle, too?  Whatever ought to encompass the other is the
>>> wider term.
>> event-bundle is e.g. event-list + response (a set of IRFs)
>> (+provenance+readme+etc)
>> In this context, one would need to use DataLink to access to all files
> Well... but does
>
> #event-list #skos:broader #event-bundle
>
> hold or is it the other way round?
As a simple physicist, I do not know that means #skos:broader. In words, 
an event-list is contained into an event-bundle, because 
event-bundle=event-list+IRFs+provenance+etc
>
>> First, our IRFs are not just accessory, they are mandatory for HE (photons
>> and neutrinos) to make physics (for data analysis specialists:
> I don't doubt this, I was just wondering if, from a *product-type*
> point of view there is a sufficient point in telling #event-bundle
> and #event-list apart: Do you ever want to discover one or the other?
> Would it perhaps be necessary to tell the two apart for the purpose
> of choosing an appropriate SAMP client?
>
> A good answer to that would also help answering the relationship
> question.
As I said, having event-list w/o IRFs is so poor and does not allow to 
make science. One needs to discover the event-bundle, and as consequence 
define properly its contents, ie event-list and response.
>
>> In addition, one has use cases where these IRFs can be used without
>> event-list: the case of simulations. Also, one can have one set of IRFs for
>> many observations. So a data producer could use a dedicated entry in ObsCore
>> for that.
> Ah-ha!  But this would mean that you would need IRFs as an
> independent product type, no?  How else would you annotate these
> stand-alone IRFs?
The UCs to use IRFs by its own and to be able to discover them are:
- make source simulations for a given instrument
- compute sensitivity of an instrument
- make predictive performance curves for an instrument
- share (large) IRFs valid for a wide range of observations (in the 
spirit of the CALDB of X-rays) - very useful for IceCube, KM3NeT, HAWC, 
SWGO (and maybe AUGER, etc)
This is quite important already, I think. I have to think a bit more to 
check if the list is complete.
>
> If that's solid reasoning (and frankly, I only have a fairly vague
> idea of what I'm talking about here), then we could introduce some
> concept of *#ancillary into product-type (which feels a bit wrong
> because we have that concept in datalink/core, too, but I think it's
> justifiable), and then *#irf as a narrower term.  #event-bundle would
> then be narrower than both #irf and #event-list.
ancillary is so vague that it can contains everything... This is a vague 
stuff, a container somehow. Here, we propose to describe correctly some 
of its possible elements: response, ARF, RMF, PSF, Edisp, Aeff, 
Bkg(=background). All these elements are critical for us and they 
deserve a proper description such that the HE experiments (from X-rays 
to PeV, in photons and neutrino). Indeed, there are specific elements, 
like a phot.mag.distMod, phys.ionizParam.coll or src.calib.guideStar. 
Each astronomical domain has its own peculiarities, applicable only for 
its own data releases. Narrow elements are common in IVOA. Are these 
elements too narrow? We are just covering 12 orders of magnitude in the 
electro-magnetic spectrum, which is not so narrow;)


>
> The consequence would be that searches for #event-list yield
> #event-bundle, too, but searches for #event-bundle wouldn't.  Does
> that sound right to you?  If so, could you try to write a definition
> for *#irf (and perhaps come up with a less cryptic label)?  I think
> we should amend the #event-bundle VEP with that on top, if this is
> what we're going to do.
The data producers could have the freedom to organise their releases as 
they wish. The 2 UCs associated to the data release organisation are:
- a list of event-bundle (using DataLink to access to all downloadable 
files) - typical case for Imaging Atmospheric Cherenkov telescopes
- or separate entries for the event-list and the response - a possible 
case for Water Cherenkov Detectors and neutrinos (or cosmic-rays 
experiments)

- or dedicated entries about the response that is "representative" of an 
instrument (UC: proposal preparation, estimation of instrument 
performance for a given science use case, instrument sensitivity - 
Zenodo is great, but all the IVOA technology permits such important use 
cases)


Maybe one can discuss about that this afternoon!

Thank you for your feedback,

Bruno

>
> Thanks,
>
>             Markus
>
-- 

                          Bruno Khelifi
                   Physicist at CNRS (laboratory APC, Paris)
       Phone: +33.1.57.27.61.58 - Fax: +33.1.57.27.60.71
               APC, IN2P3/CNRS - Universite de Paris Cite
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.ivoa.net/pipermail/semantics/attachments/20250430/c78eb676/attachment.htm>


More information about the semantics mailing list