content, format, ctype, or xtype ?
Arnold Rots
arots at head.cfa.harvard.edu
Thu May 14 12:24:47 PDT 2009
I would definitely argue that, while you are at it, the list of
"standard columns" should include both decimal (RA,Dec) AND Galactic
(l,b).
Although a minority of users (but a significant minority) searches in
l,b the conversion of bounds is extremely nasty, while the addition of
the two extra columns is trivial.
- Arnold
Douglas Tody wrote:
> Hi Alberto -
>
> Yes, this is exactly the issue I was alluding to. The creators of
> these tables often go to a lot of trouble to get them just the way they
> are, and the community is used to seeing the data that way. Plus one
> would like to have table presentation automated and straightforward.
> These considerations lead one to trying to change the table as little
> as possible.
>
> In the past here we have usually dealt with this by adding additional,
> more standard columns for key items such as RA, DEC in standard units.
> Evidently Vizier does the same. Perhaps something along these lines
> is the solution: if we have metadata such as the ID main or primary
> RA, DEC (used by default for spatial searching) these could have
> standard UCDs and units and probably be indexed. There are only a
> few of these needed, as you say. When it comes to arbitrary table
> columns and the complexities of reference frames I do not think we
> want to change the external data.
>
> I agree that some standards or recommendations along these lines would
> be a good thing to have.
>
> - Doug
>
>
> On Thu, 14 May 2009, Alberto Micol wrote:
>
> >> On 13 May 2009, at 21:37, Doug Tody wrote:
> >>
> >> On Wed, 13 May 2009, Patrick Dowler wrote:
> >>
> >> So.... UCD?
> >>
> >> It looks like I object to everything but ucd: it allows one to say
> >> "this is a
> >> time". Maybe restriction is enough:
> >>
> >> MJD: DOUBLE <-> datatype="double" ucd="time"
> >> ISO8601: TIMESTAMP <-> datatype="char" ucd="time"
> >> STC-S: REGION or POINT <-> datatype="char" ucd="pos" ?
> >>
> >>
> >> While I agree with Alberto that in "strong" interfaces with formal
> >> parameters and data models we often want to constrain the units and
> >> representation, I question whether this is a good idea when we merely
> >> want to expose arbitrary external data (such as tables). In this case
> >> it is probably best, as well as simplest, to make as few changes to the
> >> external data as possible. Hence unless we are populating the fields
> >> of a well defined VO data model we should pass through whatever is
> >> in the external data table, and merely aim to describe it accurately.
> >
> > That would mean that ALL clients would have to know how to
> > interpret/translate/parse
> > ALL possible combination of units/ucds/utypes. I would not call this "best
> > and simplest"
> > from a client point of view.
> >
> > And if we consider the non negligeable fact, very well presented by Fabien
> > Chereau at the
> > last Interop in Baltimore, that coding a client that can make sense of the
> > different VOTables
> > returned by various data provides is a nightmare, then I think that the
> > solution to expose
> > to the VO external data as they are, it is actually (from a pragmatic point
> > of view)
> > a vo-stopper.
> > /* Fabien showed how Virgo handles SIA/SSA responses: a mixture of guesses
> > translated in complicated if/then/else playing with field names, ucds,
> > utypes, and units
> > and most of the time even that fails.
> > */
> >
> >
> > If, on the other hand, the server already implements the translation of a
> > time field
> > internally stored as "number of seconds since 1-JAN-19xx", into the VO
> > standard (e.g.) iso8601,
> > all clients will know what to expect, without having to take any assumption.
> > Only at the server side the correct knowledge to translate the internal
> > values
> > to the VO standard might be available. A client instead might take wrong
> > assumptions.
> >
> > Notice that I am only asking to standardise units and representations, not
> > reference frames,
> > and only for few well identified cases.
> >
> > So far we have spoken of:
> >
> > - angles (decimal degrees) [RA is an angle]
> > - timestamps/datetimes (iso8601?)
> >
> > and I would add
> >
> > - time intervals (like exptime, period; in seconds?),
> > - wavelengths (meters?),
> > - frequencies (Ghz?),
> > - energies (keV?).
> > - magnitudes (in mag, and not in eg dmag)
> >
> > more?
> >
> > It should not be a big deal to standardise those most used concepts. The rest
> > untouched.
> >
> > Alberto
> > I appreciate the problem of exposing the original data as they were expressed
> > by the authors of a catalog.
> > That is indeed important, and I am even thinking whether we should pass the
> > original
> > data untouched (as a display-string attribute?) along with the standard
> > representation of it.
> > Again, we are only talking of few quantities here...
> > Vizier already took this kind of approach when it was decided that the
> > original catalogs are stored
> > and served as they were conceived by the authors, but extra columns (e.g.
> > RA,DEC FK5 J2000)
> > are computed, stored, and served to allow homogeneous access throughout the
> > entire collection
> > (the MAIN ucd was invented, I think, for this).
> >
>
--------------------------------------------------------------------------
Arnold H. Rots Chandra X-ray Science Center
Smithsonian Astrophysical Observatory tel: +1 617 496 7701
60 Garden Street, MS 67 fax: +1 617 495 7356
Cambridge, MA 02138 arots at head.cfa.harvard.edu
USA http://hea-www.harvard.edu/~arots/
--------------------------------------------------------------------------
More information about the dal
mailing list