SSA and SDM inconsistencies: the AIDA meeting minutes

Bruno Rino brino at eso.org
Thu Mar 25 07:56:32 PDT 2010


Hello all,

At the EuroVO AIDA a small group of people interested in updating the
SSA and SpectrumDM document gathered. These are the minutes.

We set forth the following goal: To create 1.1 versions (of SSA and
SpectrumDM) that attempt to fix the inconsistencies between the two
documents. Clarifications and small additions should be added in
versions 1.2. Big changes, possibly not backwards compatible, are
postponed to 2.0 versions.


The rationale for this split is to have versions 1.1 approved quickly
and replace the current 1.0. Versions 1.2 will take significantly
longer, gathering much input from the community.

In both cases (1.1. and 1.2) the assumption is that no existing
application should break. This means that when a fix creates potential
breakage, the potentially affected applications should be consulted.

Below, the result of our discussion on what must be changed in order to
reach version 1.1. We were lucky enough to have both the DM and DAL
Working Group leads; we concluded that after a short period of time
after circulating these meeting minutes to the relevant lists, Working
Drafts should be produced, ahead of the May interop in Victoria.

Participants:
Bruno Rino
Keith Noddle
Mireille Louys
Alberto Micol
Igor Chilingarian
Jesus Salgado
François Bonnarel


1. The SSA data model is derived, but decoupled, from the SpectrumDM.

The acknowledged divergences are:
- "required" flags (Mandatory, Recommended, Optional) are different
- the SSA data model contains service related metadata, that have no
meaning for the SpectrumDM
- the SpectrumDM contains metadata related to data analysis (Data.*)
that are of no interest for data discovery (which is the purpose of SSA)
- in SSA, the "Spectrum." prefix found in the SpectrumDM utypes was
dropped, and in some cases a "Dataset." prefix was added.

Even if those divergences are discrepancies, they are not going to be
fixed in a 1.x document. Instead, SSA Section 2.2 ("Data Model") must be
changed to reflect these divergences. It should state explicitly that a
reader of the SSA should only refer to the SpectrumDM to seek
clarification about the meaning of metadata fields. The specification of
required fields, the UCDs, and even the utype syntax for setting up a
SSA server are to be read from the SSA document. Metadata defined in the
SpectrumDM, but not listed in the SSA, are not relevant for SSA service
interface.

This is a compromise towards reaching rapidly a stable revision of the
documents. We would much prefer to have a single source for the
definition of the datamodel, which the SSA protocol would just extend.
But we believe this is too large of a task to achieve while maintaining
backwards compatibility, on a reasonable time-scale.



2. The use of "*" and ".." in UCDs

This must be eliminated.

These characters are always used in the context of "em.*" or "em...".
Our understanding is that these characters are placeholders, which a
data provider must fill in, according to its requirements. A list of all
possible values and meanings must be provided instead, using the
following primary UCDs:
em.wl
em.freq
em.energy



3. The use of "*" in utypes

This must be eliminated. The correct utypes to use are the ones in the
"Query Response" section of SSA, but without the "*":
Char.SpatialAxis.SamplingPrecision.SampleExtent
Char.SpatialAxis.SamplingPrecision.FillFactor
Char.SpectralAxis.SamplingPrecision.SampleExtent
Char.SpectralAxis.SamplingPrecision.FillFactor
Char.TimeAxis.SamplingPrecision.SampleExtent
Char.TimeAxis.SamplingPrecision.FillFactor


4. Missing UCDs in SSA

The SSA is correct, the SpectrumDM should not have a UCD for the
following elements (the UCDs provided in the SpectrumDM on those
elements are either wrong or confusing):
*utype*                 *UCD to be removed*
Dataset.TimeSI          time;arith.zp
Char.SpatialAxis.Name   meta.id
Char.SpatialAxis.Ucd    meta.ucd
Char.SpatialAxis.Unit   meta.unit


5. Misc. typos

The SpectrumDM (on the FITS serialization section) should fix the
following utypes:
Spectrum.Curation.ContactName  ->  Spectrum.Curation.Contact.Name
Spectrum.Curation.ContactEmail  ->  Spectrum.Curation.Contact.Email
Spectrum.Char.SpatialAxis.Accuracy.StatErr  ->
Spectrum.Char.SpatialAxis.Accuracy.StatError

The SSA should fix the following UCD:
em;spec.binSize  ->   em;spect.binSize

Extra spaces in UCDs and utypes are typos and should be removed


6. Dimensional analysis typo:

In the SpectrumDM, change (from 10-10 to 1E-10) the way to express
exponents within the dimensional analysis elements

Section 3.2 should read:

Pedro Osuna and Jesus Salgado have proposed a representation in the
spirit of dimensional analysis, using the symbols M, L, T to signify kg,
m, s respectively and omitting the ** for powers, so that
10**3 Jy Hz
which is equivalent to
10**-23 kg s**-2
is written compactly as
1E-23MT-2

and the example in section 9.4:

SPECSDIM= '1E-10 L'             / Spectral SIDim
FLUXSDIM= '1E+7 ML-1T-3'       / Flux SDim



6. Wrong UCDs

Spectrum.Char.SpectralAxis.Coverage.Location.Value has a wrong UCD of
instr.bandpass.
It should become the following list (in accordance to point 2 above):
em.wl;instr.bandpass
em.freq;instr.bandpass
em.energy;instr.bandpass



7. Inconsistencies within the SSA itself

Add the Dataset.Deleted utype to Appendix D.

Remove the Data.* utypes from Appendix D.

Add the remaining missing utypes present in Appendix D to section 4.2
(the list is too long and too boring to show here)



8. Inconsistencies within the SpectrumDM itself

Add a comment to the FITS serialisation stating that it does not cover
the whole of the SpectrumDM utypes


Note:
The "consolidation" activities detailed in points 7 and 8 should also
make sure the order by which the utypes are listed is consistent
throughout all documents.






More information about the dal mailing list