HEIG Note discussion: Obscore and dataproduct_type and various classes of datasets in High energy data
Markus Demleitner
msdemlei at ari.uni-heidelberg.de
Sun Sep 7 09:02:41 CEST 2025
Dear DM list,
On Wed, Sep 03, 2025 at 01:10:34PM -0400, Igor Chilingarian via dm wrote:
> I've looked at the attached diagram. The main issue I see here is that a
> typical X-ray dataset (from Chandra and XMM-Newton) being an "event-list"
> can in fact be considered as image, datacube, spectrum, and time-series all
> at the same time. And there was a disperser (e.g. grating) involved, it
> will make it even more complex. Therefore, I don't really understand how
> you can build any kind of hierarchical relationship between a high-energy
> dataset and existing types of a dataset in the model. Any ideas?
Disclaimer: I am somewhat jumping into this discussion and doubt that
I understand what it actually about. It *sounds* as if the following
might be pertinent, though, so indulge me:
The way http://www.ivoa.net/rdf/product-type is built ("SKOS"), a
concept can have multiple "wider" terms (which is, here, an
approximate is-a relationship). In that sense, we *could* make an
event-list narrower than image, datacube, spectrum or whatever.
Whether we *should* do that is another matter, and for now I doubt
it.
The way to approach this is the consideration: "If someone looks for
images (specifically), would they want to find the dataset in
question?", and I am rather sure that for an event list, that's always
a no, even though you can generate an image from it.
There are the generic concepts *-resolved-dataset in
<https://www.ivoa.net/rdf/product-type/>, and these could be used to
derive more specific sorts of event lists. For instance, we *could*
have event-list-with-positions, which would be narrower than both
#spectrally-resolved-dataset and #temporally-resolved-dataset (and
#event-list, of course). But again, I have doubts we should do that.
For one, you'd have to convince me that there is a strong discovery
case here as long as we don't have hundreds of different event-list
collections. And the combinatorial explosion of potentially ending
up with concepts like event-list-with-positions-and-polarization
scares me, too.
So, my hunch is that if we need this sort information in discovery,
we ought to touch the obscore schema rather than the vocabulary. For
array-like-data, we have the _xel columns that (mainly) address this
use case. They won't work for event lists because they don't have
discrete values on their axes. But we could have a columns with a
list of UCDs of the columns of the dependent axes in the event list,
say.
Again apologies if all this misses the point.
-- Markus
More information about the dm
mailing list