Dal architecture concepts

François Bonnarel francois.bonnarel at astro.unistra.fr
Mon Feb 8 13:10:10 CET 2016


Dear all,

    Discussion around SODA has some underlying DAL architecture 
conceptions. This a summary of my view of it.

The DAL landscape today is very modular. We have created or are 
developping a pretty large number of different protocols during the 
thirteen last years and the most recent ones work in complementarity on 
some definite aspects and also have to collaborate for higher order and 
more complex fonctionalities.

DALI, for example, is gavering a set of common definitions across 
protocols.

But the main differenciation in the DAL landscape comes with TAP 
emerging beside the so called "Simple Access protocols". TAP and ADQL 
coupled with the simplified observation datamodel ObscOre allow to 
develop the standard generic dataset discovery type of services: the 
so-called "ObsTAP".

SIAV1.0 and SSA have two functionalities : data discovery and  some 
hints of data access (cuting-out, resampling, regridding, mosaicking)

On another side the idea of linking discovered or known datasets to 
additional resources has been  in the air since a long time. Obviously 
there is a need to give access to additional information, other data 
formats, additional processing  without duplicating the lines dedicated 
to a specific dataset in the Query response. The concept of DataLinking 
is born from that need.

For cubes, and later for time series or event list, the Simple protocols 
were insuficient for search and selection on some axes. On the other 
side ObsTAP services are not designed at all for data access and only 
allow full retrieval.

DataLink was the right way to link datasets discovered by ObsTAP to 
various resources and specifically to an AccessData service able to 
provide all kind of cutout and selection facilities (and more access 
functionalities later).

Simple access protocols were long time providing  both discovery and 
accessData methods with a similar interface. Discovery with a simple  
HTTP interface still have advantages  over ObsTAp for some use cases and 
some data providers and was decided at some point to develop SIAV2.0 
version for that.

The current architecture decided at Hawaï interop after year-long 
discussions is the following :
     ObsTAP services and SIAV2 services are parallel discovery (and 
description) pathes for cube datasets The query response is based on 
Obscore datamodel in both cases. Full metadata Resource going beyond 
ObsCore will come later in SIAV2.
      AccessData (now called SODA) is independant and accessible from 
the query responses through one of the DataLink mechanisms. The first 
version is oriented towards Cuting-out and Selection on teh axes. More 
sophisticated access data operations will come later.


Implicit things or still in discussion

      SODA should also be accessible directly and not only from Data 
Discovery services.
       DataLink link resource is a very generic dataset-to-resource 
bounding mechanism. It must stay generic and independant from the 
services it relates which may be operated by various providers.
       SODA services are the best candidates to their auto description.
It is better to rely on them than on DataLink to describe dataset 
specific metadata.
       SODA, ObsTAP and SIAV2 share the same datamodel(s) that should 
reflect in a consistency effort (see my previous email).
       Custom parameters and custom services are interesting features, 
but they are not in our main requirement for this year for SODA at first 
steps. However it is nice and usefull in general  that DataLink is 
providing a way to relate datasets to such services.
        Standard parameters don't really require machine readable 
description because they are defined in the spec and related to the 
underlying model.
     But it can be usefull to give them a description as similar as 
possible to the custom description. This will make easier automatic 
functionality discovery and managment by the clients. However It should 
be done in a way not destroying consistency of the architecture at the 
price of a lack of interoperability.

       Cheers
François


More information about the dal mailing list