[Heig] validation of UCD proposal by the semantics WG : results of the semantics meeting for UCD in Strasbourg 6-7 of May

Mireille Louys mireille.louys at unistra.fr
Wed May 20 19:12:24 CEST 2026


Hi everyone,

The discussion about the UCD terms proposed in the note is summarized on 
the request for modification page for UCD:
https://wiki.ivoa.net/twiki/bin/view/IVOA/UCDList_1-7_RFM

The terms are described in the VEP-UCD description files available from 
their specific link from the page above.
All the VEP-UCD files are also available at 
https://voparis-gitlab.obspm.fr/vespa/ivoa-standards/semantics/vep-ucd

We need to update the UCD section following the decisions taken .

There is another topic with semantics : the analysis products vocabulary .

I attach here a draft version of a VEP for analysis data product type :
What is needed are

- a revision of the definitions in order to encompass various kinds of 
HE experiments
- file examples for the Used-in section
- clarification of the #region data product

Thanks for helping for this.

Mireille

-- 
--
Mireille Louys, MCF (Assistant Professor)
Centre de données Astronomiques (CDS)       Equipe Images, ICube
Observatoire de Strasbourg                  Telecom Physique Strasbourg
11, rue de l' Université                    300, Bd Sebastien Brandt CS 10413
F-67000 Strasbourg                          F-67412  Illkirch Cedex
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.ivoa.net/pipermail/heig/attachments/20260520/617d6df3/attachment.htm>
-------------- next part --------------
Vocabulary: http://www.ivoa.net/rdf/analysis-product-type
Author: Bruno Khelifi <khelifi at in2p3.fr> ,<mireille.louys at unistra.fr>
Date: 2026-02-04/ updated 2026-04-22

New Term: draws

Action: Addition

Label: draws

Description: Probabilistic dataset containing a collection of samples (draws)
	generated from a probability distribution.

Relationships: none

Used-in: % todo : provide a link to an example dataset 
?? example corner plot gammapy ?? 
by high energy photon and neutrino experiments, and by cosmological observatories

Rationale: 

High-energy photon and neutrino experiments must employ statistical methods to derive 
final products  like #spectrum, #sed, #light-curve or #image in physical units. The 
underlying reason is that instrument responses are inherently non-invertible. By 
computation of probabilities for random variables associated with spectral, spatial, 
and/or temporal models, these final products can be derived.

In a frequentist approach, the best parameter estimates correspond to the maximum 
likelihood probability among all possible realizations of the random variables.
When priors are applied, the estimate is derived from the maximum of the posterior
probability. In Bayesian inference, the best estimate is associated with the 50th 
percentile (median) of the posterior draws.

This dataset maps the likelihood or probability landscapes across a space phase of 
possible values of the random variables. The collection of probabilities enables the
computation of quantiles, confidence intervals, confidence limits, and thus uncertainties,
upper limits, and lower limits. This collection is particularly critical in cases of 
non-Gaussian degeneracies or when dealing with a large number of parameters.

Discussion : 
++ The term is highly generic and applicable to any statistical framework, whether frequentist or Bayesian. It is worth noting that "draws" is a term typically associated with Bayesian statistics, whereas "samples" is more generic.
Note that 'samples', initially considered, can also be used for moon rocks samples, or other laboratory physical samples which would be outside of the HEIG scope here and misleading. 



=======================================

Vocabulary: http://www.ivoa.net/rdf/product-type
Author: Bruno Khelifi <khelifi at in2p3.fr>
Date: 2026-02-04

New Term: pdf

Action: Addition

Label: Probability Density Function of a quantity

Description: Probability density function of a quantity, for example the Bayesian
	marginal probability density function associated to the spectral index of
	a spectrum

Relationships: none

Used-in: --> please provide an example 
by high energy photon and neutrino experiments, and by cosmological observatories

Rationale:
When statistical analyses are in used to derive final products like #spectrum, #sed, 
#light-curve or #image in physical units, the probability density function (PDF) associated
to a random variable can be derived. This PSF can be the probability, the posterior or even
the prior of a random variable.

This is very useful when the distribution is highly asymmetrical or multi-modal. If the 
variable is for exemple the size of an object, the knowledge of asymmetry of this PDF is 
obviously more useful than symmetric errors.

Note that this PDF can be "differential" (e.g. a probability at a given value), "integral" or
"average" (when bins are used for the random variable). The serialization of this data production
should then contain accurate metadata information.

When statistical analyses are employed to derive final products such as #spectrum, #sed, #light-curve
or #image in physical units, the probability density function (PDF) associated with a random variable
can be derived. This PDF may represent the probability, posterior, or prior distribution of the random variable.

Discussion:
This approach is particularly valuable when the distribution is highly asymmetric or multimodal. For example, if the variable represents the size of an object, knowledge of the asymmetry in the PDF is significantly more informative than symmetric error estimates.

It is important to note that the PDF can be "differential" (e.g., the probability at a specific value), "integral",
or "averaged" (when bins are used for the random variable). Consequently, the serialisation of this data product
must include precise metadata to ensure clarity and reproducibility.

M.L: --> parameters to describe for PDF to be retrieved : 
probability_type = differential/integral/averaged

=======================================

Vocabulary: http://www.ivoa.net/rdf/product-type
Author: Bruno Khelifi <khelifi at in2p3.fr>
Date: 2026-02-04, update 2026-04-22

New Term: region

Action: Addition

Label: Region

Description: dataset that encodes (one or more) regions of parameter space, for example 
a spatial region or a region of phase space covered by a dataset. The set of dimensions
represented by the region can be arbitrary

Relationships: none

Used-in: %todo: provide a real example 
Rationale: 
%todo: clarify the role and dimensionality of this dataset kind
It seems that the spatial coverage of the observation is given as an extra data product (like an excess_map, or an error_map ?) in Chandra. 

Discussion:
Not clear how universal this can be in the High Energy domain. 
Some data collections like XMM, SVOM, etc. may store this information in a FITS file extension , or a S_MOC extension.
If the dataset is multidimensional, it does not fit into the tree proposed in "http://www.ivoa.net/rdf/product-type", which is based on the number and kind of data axes 






More information about the heig mailing list