Scope of TAP parameter-based queries
Douglas Tody
dtody at nrao.edu
Tue Feb 24 11:50:40 PST 2009
I should add, all of this should be available via either a synchronous
or asynchronous interface (but TAP already provides that).
On Tue, 24 Feb 2009, Douglas Tody wrote:
> On Tue, 24 Feb 2009, Patrick Dowler wrote:
>
>> For now, there are many specific issues with P?? (the name included) that
>> we
>> can address in separate topics. I will start a few of those today, starting
>> with "scope of PQL".
>
> As as starting point here are the capabilities which NVO wants from
> simple parameter-based queries in TAP:
>
> o Simple table/DBMS metadata queries using the same query
> interface as is used for table data queries (i.e. based upon
> the SQL information schema approach).
>
> Use of the same query interface means that all this code can
> be reused on both the client and server side. Nice features
> such as multiple output formats fall out without having to
> do any extra work. The approach is scalable to very large
> tablesets, i.e., individual tables can be queried. We want
> this capability for our science users who will write software
> which uses TAP. Science users generally do not want to deal
> with the big block of XML metadata which VOSI will return.
> Mostly they just want a list of tables, or a list of the
> columns of a single table, in text, csv, or votable format
> depending upon what they want to do with it.
>
> o Cone search replacement. Something about as simple to use
> as classic SCS, but much more powerful. Hence we continue
> to support simple cone (circular) regions, but add optional
> support for more general spatial regions (REGION parameter).
> It should be possible to specify the table to be queried
> (FROM parameter), and optionally the subset of fields to be
> returned (SELECT parameter). Simple range based filter-type
> expressions of individual tables (mainly astronomical catalogs)
> should be supported (WHERE parameter). We should keep this
> capability fairly simple as ADQL will be available for the
> more complex cases.
>
> o Multi-position queries (aka "multicone"). A user supplied
> table of positions is used to generalize the cone search to
> multiple positions, providing scalability. A single table is
> returned containing the rows found for each input position,
> tagged by the position ID. Requires the UPLOAD capability (URL
> based) as well as inline table upload capability. REGION can
> be used to arbitrarly mask the table of input positions, e.g.,
> to allow a very large table to be used as the input position
> table in a cross match. It should be possible to further
> constrain the output by specifying simple range constraints on
> individual table fields. In queries of large catalogs this can
> greatly reduce the amount of data to be returned to the client.
>
> The multi-position query can be very useful as-is, but will
> often be only the first stage of a more complex multi-stage
> query, e.g., employing a custom multiparametric cross-matching
> algorithm on the client side to further refine the data,
> after doing a rough selection using the multi-position query.
> Distributed queries are easily supported as well, with minimal
> requirements on each individual TAP node.
>
> o Simple filter type queries of individual astronomical catalogs.
> This is really a variant on the cone search, but eliminates
> the requirement that a spatial constraint be used. The only
> constraint would be simple range type constraints on individual
> table fields.
>
> o Query for table modifications (MTIME parameter). This is a
> special case, used to query a remote service for what has
> changed in a given time interval, e.g., to automatically
> maintain a replica of a catalog. It is much easier to do this
> with a parameter-based query than with something like ADQL,
> due to the higher level of abstraction. The service is free
> to maintain time-of-modify/delete/update information any way
> they want, internal to the service/DBMS.
>
> o Another subtle point is that DBMS views should be supported.
> This will provide a very powerful way to extend the simple
> parameter-based query, allowing complex SQL-based operations
> on the server side. Unlike ADQL which permits general client
> defined expressions, the operations provided would be those
> defined by the data provider. But often the data provider
> will be the one in the best position to define standard views
> of their data.
>
> I think that is all we identified here. All of this needs to be
> fully specified in a TAP-specific manner in order to permit services
> to be implemented rigorously, and to define the interface well enough
> that clients can use it for actual access.
>
> - Doug
>
More information about the dal
mailing list