comments on the ObsTAP-2011-02-27 draft
Igor Chilingarian
chil at sai.msu.ru
Tue Mar 8 09:19:28 PST 2011
Hi all,
I decided finally to put my comments on the draft on the mailing list instead
of discussing them privately among the authors. If there is any volunteer to
put them on the Twiki page, please do it.
These apply to the 2011-02-27 version of the draft. They were made on Mar/1,
and Mireille has already taken into account cosmetic changes in the document
(typos etc.) So, here I will list more serious points and will omit all those
ubiquitous typos (hundreds!!!) in the examples originated from the usage of MS
Word with the "auto-replace" feature turned on by one of the authors.
Page 14. Section 3.2, paragraph 2 (or table caption -- unclear?)
add "in some cases" after "though it could be nillable"
Page 14. Table 1
Among all the listed columns, there is only one which must be unique, it is
"obs_publisher_did". So, I would recommend to highlight it somehow in the
table and mention that it can serve as a primary key.
Personally, I'm not happy with some column names, in particular with the fact
that "spatial" starts with s_, the same letter as "spectral". I know that this
has been discussed for years and some of us are already used to these column
names, but still I would suggest to use the top-level UCD in column names
instead of a single character, like it is done for the spectral axis ("em").
I mean: "pos_ra" "pos_dec", ...; "time_min", "time_max".
All Section 3.3. I think we should explicitly mention whether it is mandatory
to fill every given DM element (i.e. if it is "nillable" in the
SQL-serialization). Although this information is provided below, I think it
would be important to mention this in every 3.3.x section.
Page 15. Section 3.3.1 What data product type will be appropriate to describe
multi-object spectroscopy? Presently, a large fraction of all spectra provided
by modern facilities come as multi-object (or multi-slit). Well, when the
calib_level=2, normally one would expect the spectra to be extracted from the
dataset and published one-by-one, then the "spectrum" is fine (we have to
think what Observation ID to attach). But in case of lower calib_level values,
the reduced spectra obtained during the same exposure are often presented in
some specific formats (e.g. Euro3D, ESO FLAMES) just like "all in one". I
would definitely disagree to call it "other". Juan De is proposing to use a
fine-grained data product type, but this we have to discuss in more detail
Page 17. Section 3.3.3, par.2
We should again mention that only obs_publisher_did is unique.
Page 18. Section 3.3.4, par.1
"A spectrum could be represented in the VO-compliant Spectrum format or in
some instrument-specific FITS binary table format".
"FITS binary table" should be dropped. Because this way we reject all the
spectra in the "standard" IRAF-reduced format, i.e. 1D-image; we
possibly reject Euro3D, and other instrument-specific file types (ESO FLAMES)
Page 21. Section 4.5.
We should *highlight* that DID is unique and say explicitly that it should be
used as a primary key.
Page 21. Section 4.6.
The last paragraph (Access URLs...) is too vague. It doesn't bring any
useful information, only frightens developers. Has to be clarified.
Page 21. Section 4.8.
How to put "unknown" access_estsize? As NULL? Has to be specified
Page 23. Section 4.15
The statement that "In all cases, t_exptime is generally used as an indicator
of the relative sensitivity (depth) within a single data collection" is not
true. In case of multiple targeted observations (e.g. a survey of stars of
certain types) the exposure time is usually selected on the case-by-case basis
in order to achieve more or less the same signal-to-noise ratios for every
target of interest.
Page 23. Section 4.16
"For products with no sampling along the time axis, the "t_resolution could be
set to the exposure time." This is inconsistent with the Characterisation DM.
I would recall on a long discussion during the CharDM (v.1) development a few
years ago regarding the "resolution" value in case then a given "axis"
contains only one point. I tried to argue for putting some resolution RefVal
(i.e. lambda/dlambda for a broadband filter), but this idea met strong
objections, in particular by Alberto, so finally we decided to put "N/A" or
"NULL". I am not sure whether it was a right decision, but now we *must
comply* to what is already done. In this case, the query to discover the
time-resolved observations will be:
WHERE t_resolution IS NOT NULL
Page 29. Use case 1.6
I have serious doubts that this query will work. "|" should be replaced with
"abs", but "abs" does not exist for the "timestamp" datatype (although MJD
can be treated as "double precision". This is an example to discuss and check.
Pages 37-38. Section B1.2. Calibration level.
The first paragraph states about "80%" of data collections, while the last one
is more optimistic saying "simple enough to cover all regimes".
Page 39. Section B3.3.
"HI cube" or "CO cube" is not a title, it's more a data type. The title would
be something like "NGC224 HI cube 01"
Page 41. Section B5.2
Some examples are quite bad, in particular, "VOTable". I think we should be
more explicit about this. MIME types should be indicated as preferred, then
VOTable will become text/xml or whatever like this.
Page 45. Section B7.1
Example is needed. What about "Telescope" or "Facility"? To me it looks quite
strange if we give the instrument name, but do not identify the telescope it
is operated at.
Page 47. Table 6.
Identify obs_publisher_did as a primary key, highlight all mandatory elements
(by colour??)
Page 51. Section C.2
My suggestion would be to put the *tested and working* bunch of SQL scripts in
order to create and fill the database schema. This will facilitate the life of
people implementing the service. I believe, Mireille has already put this
point on the Twiki somewhere.
With best regards,
Igor
More information about the dal
mailing list