[obs-tap] draft summary: topic 1 *Data product type*
Petr Skoda
skoda at sunstel.asu.cas.cz
Mon Mar 28 02:23:25 PDT 2011
Hi all
as I have explained earlier, I am against the fixed enumerated tree
so from the practical point of view the proposal of the original 7 types
AND "other" seems to be reasonable for now.
The understanding what is "other" should be left to the user using some
free-form description - (I think it may be partially answered by
the nature of "observable" axis (o_ucd) which should already come from
controllable vocabulary of UCDs
AND the "dataproduct_subtype" should allow to describe the real nature of
published data in free text or (if not) there should be another field
allowing this. As a data provider I have to describe the nature of my
data even if I cannot find proper o_ucd or subtype from given list (if
required by standard).
Take an example of series of so called line profile periodograms - i.e
on x axis is a wavelength in a line profile on the y axis the frequency
and the various levels of gray mark the regions in line profile where the
changes have given frequency. This is quite common in asteroseismology
publications and people provide this for many lines (and stars).
The problem is with time coverages (as it is a combination of many
observations taken in given times .....) - It has a nature of timeseries.
Before going to other "wanted" tables in VO may be the list of emission
strenghts, equivalent width, radial velocities etc ...
Thinking in a practical terms when astronomer wants to search something,
the discovery process has 2 phases:
first he asks the question IF there is observation of (CLASSES of) OBJECTS
of given TYPE (e.g does it exist the spectrum of this beast ?).
So the search goes using very rough criteria.
My personal experience from searching stars with emission lines in VO has
reduced my enthusiasm in "fully queryable interface". Its difficult find
(even in surveys like SDSS, 2MASS) enough data corresponding your wishes
and so you have to do only very basic pre-selection and then go to refine
queries on given sets. But in practice you want to get first some samples
of ANY data from the service to get some idea abut its quality, structure
etc .. and DESCRIPTION (you have to go to web page of the project or to
published article to understand what is the data characterization etc ...)
So having TAP to return from many servers some basic data and some kind of
README (in free text ) in a first moment would be enough for phase 1.
Then you may be interested in real queryies using ADQL do "data mine" what
you want - but I suppose the number of servers asked will be already
limited and so mor specific parameters may be used.
Maybe the idea of force all servers to understand the same query fields
may really be limited to few mandatory parameters - in this context the
"other" + free form subtype should be sufficient.
>> ------------------------------------------------------------------
>> topic 1 *Data product type*
>>
>> For the moment, I keep prefering the current list of simple values
>> proposed in the draft.
>>
>> I would accept 'other' as a valid value to cover outliers cases ,
>> should be described better in a future revision of the Observation Core
>> components data model.
More information about the dm
mailing list