DataLink issues
François Bonnarel
francois.bonnarel at astro.unistra.fr
Mon Sep 10 10:49:26 PDT 2012
HI Doug, all,
Le 07/09/2012 05:08, Douglas Tody a écrit :
> On Thu, 6 Sep 2012, François Bonnarel wrote:
>
>> Dear all,
>> Last interop confirmed there was interest in the DataLink
>> concept , defined as a "way to find related resources to datasets via
>> a web service"
>> This is needed either as a complement to Data services (ObsTap,
>> Tap, S*A service) or to SimDal...
>> A general agreement was made on input/output (PublisherDID/ VOtable
>> with links) and on a general concept for the structure of the links
>> "records"
>>
>> There are still a couple of issues, however:
>>
>> -- Is "DataLink" a real full DAL service with recording in
>> the IVOA registries ? Or is just a web service refered in the main
>> services query responses ?
>> The first case requires dataset ID to be a real PublisherDID. For
>> the latter case an internal DatasetID could be sufficient....
>
> The basic concept of DataLink is that we start with a Dataset (PubDID)
> or maybe an Observation (ala TAP obs_id) and go get a list of data links
> pointing to associated data files, services, or other resources, e.g. a
> batch job for custom reprocessing of the dataset.
>
> It would seem that, given a DAL query response (TAP, SIA, etc.) it
> should be possible to directly query for the data links for a given
> dataset or possibly obs_id, without having to try to infer the existence
> of an associated datalink service via a registry query. So the DAL
> query response should point to whatever web service or service operation
> is used to get datalinks. If for the given service, a PubDID or obs_id
> is global and persistent then there is no reason this datalink service
> could not also be registered as a service in its own right, separate
> from any shortcut links from related DAL query responses, but this need
> not be required to get datalinks for an existing DAL service query
> response.
OK I agree.
>
>> -- One of our use cases is "related access to another DAL
>> service". Another one "Internal access to complex datasets
>> (archives, MEFS ....)....
>> The first use case can be some "AccessData" method of the DAL service
>> performing some dynamic transformation on the dataset. The actual
>> transformation is
>> driven by the parameters values of the method. Probably "AccessData"
>> URL of the service, without any parameter, could answer with a
>> VOTABLE describing available parameters for the dataset...
>> For the internal access to complex datasets, the May 2012
>> DataLink draft proposed a little model for internal structure
>> "mappable" on response VOTABLE FIELDs
>> This has been criticized as an "ad hoc" solution for our MEF or tar
>> archives examples...
>> Another solution could be "extended URI" (Norman Gray
>> proposal). But the rightmost part of the URI is not interpretable
>> anyway...
>> A new proposal (Laurent Michel) could be that the link to
>> the complex dataset provides a list of parameters allowing to
>> extract the various subparts of the dataset.
>> Something rather similar to the "AccessData" link behavior, eventually
>
> I think DataLink is a quite different mechanism from AccessData.
>
I agree. Probably the misunderstanding here comes from my bad english.
When I write "The first use case can be some "AccessData" method of the
DAL service" I mean that the acref in the link can be the URL root for a
DAL service/accessData method.
Suppose we have an image description in an Obstap query response.
Datalink can provide a link to either SIA URL with query method for more
metadata or SIA URL /accessdata method for cutouts or resampling
> DataLink is a sematic Web type of capability, providing semantically
> complex links (more complex than just URLs) which can be followed given
> a starting object. Although data links have more complex semantics I
> think the basic mechanism should be kept fairly simple - given the right
> software one might be able to just "click" on a link and something
> happens. Or at least one gets a list of associated data objects or
> other resources, with more semantic detail than we get from just a Web
> resource.
>
> AccessData is very different. This provides precise, client-driven,
> quantitative access to a given dataset. So for example the client can
> say give me this exact cutout, slice, or other subset of the dataset,
> expressed in pixel (in the case of an image) or WCS coordinates. This
> is very different than a datalink as datalinks are predefined by the
> data provider (here is a list of the standard datalinks we define)
> whereas accessData provides the client with direct, sample/pixel level
> access to a given dataset, to allow quantitative analysis without having
> to download the full dataset. The one place where they can come
> together is where a datalink URL can invoke a specific accessData
> operation on a dataset, e.g. to provide some standard view of the
> dataset.
The only point is : do we force the link with static , predefined values
of the parameters or do we let the AccessData method open for a given
dataset...
This can be done by AccessData answering by a VOTABLE describing
available transformation parameters for the considered dataset
Regards
François
>
> - Doug
>
>
>> Your comments are welcomed. We have to modify the May draft to go
>> forward at netx interop
>>
>> Best regards
>> François
>>
More information about the dal
mailing list