Prototype SODA and DataLink Service Descriptor at CADC

Patrick Dowler pdowler.cadc at gmail.com
Tue Mar 1 01:08:04 CET 2016


TL;DR - CADC implemented a prototype SODA async service that essentially
collects a bunch of sync cutout requests into a single job (batch); it is
operationally more efficient and not much more complex to implement.

Long version...

Picking up with the IRIS image example, I will now show how I have
implemented SODA async: what it does and why one might want something like
this.

the links:

http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/caom2ops/datalink?ID=caom:IRIS/f212h000/IRAS-25um

includes this SODA async descriptor with the same input params as the SODA
sync descriptor.
<RESOURCE type="meta" ID="soda-b4044ef4-4884-4ee5-9c99-1b0f66b0ad45"
utype="adhoc:service">
    <PARAM name="resourceIdentifier" datatype="char" arraysize="28"
           value="ivo://cadc.nrc.ca/soda#async" />
    <PARAM name="standardID" datatype="char" arraysize="33"
           value="ivo://ivoa.net/std/SODA#async-1.0" />
    <PARAM name="accessURL" datatype="char" arraysize="*"
           value="http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/caom2ops/async"
/>
    <GROUP name="inputParams">
      <PARAM name="ID" datatype="char" ucd="" arraysize="*"
value="ad:IRIS/I212B2H0" />
      <PARAM name="POS" datatype="char" ucd="obs.field" arraysize="*"
value="" />
      <PARAM name="CIRC" datatype="double" ucd="obs.field" unit="deg"
xtype="circle" arraysize="3" value="">
        <VALUES>
          <MAX value="140.63049941314583 0.2007826788236291
8.778341996040131" />
        </VALUES>
      </PARAM>
      <PARAM name="POLY" datatype="double" ucd="obs.field" unit="deg"
xtype="polygon" arraysize="*" value="">
        <VALUES>
          <MAX value="146.83673628162285 -6.408995958971017
134.3808989000012 -6.370135804011464 134.4242625446688 6.007430601323759
146.8700273500712 5.96918267453771" />
        </VALUES>
      </PARAM>
    </GROUP>
  </RESOURCE>

Our SODA async service allows for multiple of any params. While this
includes the ID parameter I don't see any particular value in that and if a
client is using SODA via a service descriptor, then (as above) they'll have
only one opaque ID in hand anyway and would have to go to considerable
effort to put together multiple IDs. More on this below.

So, one can create an async (UWS) job with multiple cutouts (eg search for
images that cover a cluster of galaxies and then cutout around all the
galaxies in the image). I'll show this example with curl because it is more
elaborate:

input file I can use with curl to post multiple positional cutouts (note
the mix of POS, CIRC, and POLY):

===multicut.txt===
POS=circle 140 0 0.1&
POS=circle 66 10 20&
CIRC=140 0 0.1&
CIRC=140 1 0.1&
CIRC=70 0 0.2&
POLY=140 0 141 1 141 1 141 0&
POLY=20 20 21 20 21 21 20 21&
ID=ad:IRIS/I212B2H0
===

curl -v --data @multicut.txt
http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/caom2ops/async

Look for the Location header and get the job, eg:

curl
http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/caom2ops/async/p1zhcks2z1gfybq0
<?xml version="1.0" encoding="UTF-8"?>
<uws:job xmlns:uws="http://www.ivoa.net/xml/UWS/v1.0" xmlns:xlink="
http://www.w3.org/1999/xlink">
  <uws:jobId>p1zhcks2z1gfybq0</uws:jobId>
  <uws:runId />
  <uws:ownerId xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:nil="true" />
  <uws:phase>PENDING</uws:phase>
  <uws:quote>2016-02-29T21:57:43.641</uws:quote>
  <uws:startTime xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:nil="true" />
  <uws:endTime xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:nil="true" />
  <uws:executionDuration>120</uws:executionDuration>
  <uws:destruction>2016-02-29T22:55:43.641</uws:destruction>
  <uws:parameters>
    <uws:parameter id="POS">circle 66 10 20</uws:parameter>
    <uws:parameter id="POS">circle 140 0 0.1</uws:parameter>
    <uws:parameter id="ID">ad:IRIS/I212B2H0</uws:parameter>
    <uws:parameter id="POLY">20 20 21 20 21 21 20 21</uws:parameter>
    <uws:parameter id="POLY">140 0 141 1 141 1 141 0</uws:parameter>
    <uws:parameter id="CIRC">70 0 0.2</uws:parameter>
    <uws:parameter id="CIRC">140 1 0.1</uws:parameter>
    <uws:parameter id="CIRC">140 0 0.1</uws:parameter>
  </uws:parameters>
  <uws:results />
</uws:job>

Ok, we have a PENDING job with a bunch of positional cutout params (and one
ID value). Now to run the job:

curl -d 'PHASE=RUN'
http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/caom2ops/async/p1zhcks2z1gfybq0/phase

... wait a bit ... and get the job again:

curl
http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/caom2ops/async/p1zhcks2z1gfybq0
<?xml version="1.0" encoding="UTF-8"?>
<uws:job xmlns:uws="http://www.ivoa.net/xml/UWS/v1.0" xmlns:xlink="
http://www.w3.org/1999/xlink">
  <uws:jobId>p1zhcks2z1gfybq0</uws:jobId>
  <uws:runId />
  <uws:ownerId xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:nil="true" />
  <uws:phase>COMPLETED</uws:phase>
  <uws:quote>2016-02-29T21:57:43.641</uws:quote>
  <uws:startTime>2016-02-29T21:58:05.928</uws:startTime>
  <uws:endTime>2016-02-29T21:58:05.995</uws:endTime>
  <uws:executionDuration>120</uws:executionDuration>
  <uws:destruction>2016-02-29T22:55:43.641</uws:destruction>
  <uws:parameters>
    <uws:parameter id="POS">circle 66 10 20</uws:parameter>
    <uws:parameter id="POS">circle 140 0 0.1</uws:parameter>
    <uws:parameter id="ID">ad:IRIS/I212B2H0</uws:parameter>
    <uws:parameter id="POLY">20 20 21 20 21 21 20 21</uws:parameter>
    <uws:parameter id="POLY">140 0 141 1 141 1 141 0</uws:parameter>
    <uws:parameter id="CIRC">70 0 0.2</uws:parameter>
    <uws:parameter id="CIRC">140 1 0.1</uws:parameter>
    <uws:parameter id="CIRC">140 0 0.1</uws:parameter>
  </uws:parameters>
  <uws:results>
    <uws:result id="ok-7" xlink:href="
http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/data/pub/IRIS/I212B2H0?runid=p1zhcks2z1gfybq0&amp;cutout=%5B0%5D%5B235%3A275%2C258%3A298%2C*%5D"
/>
    <uws:result id="warn-6" xlink:href="
http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/caom2ops/soda-echo/NDAwfHRleHQvcGxhaW58Tm9Db250ZW50OiBhZDpJUklTL0kyMTJCMkgwIHZzIFBPTFk9MjAgMjAgMjEgMjAgMjEgMjEgMjAgMjE="
/>
    <uws:result id="ok-5" xlink:href="
http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/data/pub/IRIS/I212B2H0?runid=p1zhcks2z1gfybq0&amp;cutout=%5B0%5D%5B271%3A279%2C254%3A262%2C*%5D"
/>
    <uws:result id="ok-4" xlink:href="
http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/data/pub/IRIS/I212B2H0?runid=p1zhcks2z1gfybq0&amp;cutout=%5B0%5D%5B271%3A279%2C294%3A302%2C*%5D"
/>
    <uws:result id="warn-3" xlink:href="
http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/caom2ops/soda-echo/NDAwfHRleHQvcGxhaW58Tm9Db250ZW50OiBhZDpJUklTL0kyMTJCMkgwIHZzIENJUkM9NzAgMCAwLjI="
/>
    <uws:result id="ok-2" xlink:href="
http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/data/pub/IRIS/I212B2H0?runid=p1zhcks2z1gfybq0&amp;cutout=%5B0%5D%5B271%3A279%2C254%3A262%2C*%5D"
/>
    <uws:result id="warn-1" xlink:href="
http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/caom2ops/soda-echo/NDAwfHRleHQvcGxhaW58Tm9Db250ZW50OiBhZDpJUklTL0kyMTJCMkgwIHZzIFBPUz1jaXJjbGUgNjYgMTAgMjA="
/>
  </uws:results>
</uws:job>

In there you see 7 results (1 per positional cutout in this case. I've
named them with "ok" when the result is a FITS cutout, with "warn" if it
isn't a FITS cutout but not a failure (eg the specified position doesn't
include any pixels), and (if any had failed) with "error". The number is
there because result names are unique. If you GET any of those URLs you
will get the same output as having made that single request via SODA sync
(eg same FITS cutout or same error message (code and message), eg: warn-1
from above gives:

HTTP/1.1 400 Bad Request
NoContent: ad:IRIS/I212B2H0 vs POS=circle 66 10 20

You will see that I'm using a special resource (soda-echo) to generate the
error messages; this is based on my eariler comment that UWS doesn't
support partial success/partial fail very well and this is how one can do
it with what is available in UWS. I think this is better than trying to
have 5 results and an error summary or document (for the other 2) in a job
that reached the COMPLETED phase. And it keeps sync and async less
different.
And yeah: the soda-echo URLs contain everything in the URL (base64 encoded
for extra opaqueness) so I don't need server-side state :-)

Note: if the input is detected to be invalid the whole job fails (e.g.
invalid ID).

** Why would I want SODA async **

As implemented here, SODA async is a simple way to collect a bunch of sync
requests into a batch. In our implementation, this is more efficient
because we only perform one query (per ID) to get the metadata necessary
for all the computations. If the user was authenticated, this would also
mean a single call to various other resources instead of many. I could
imagine a storage system such as an off-the-shelf object store that
required implementation of a cutout service to retrieve the data file to a
processing area before performing the cutouts; in that kind of scenario
async would permit one retrieval, many cutouts performed, results stored in
some temporary storage area, and result URLs in the job pointing at those
resulting files. We do not currently stage any results because we can run
astronomy code inside the storage system, but I could see that changing in
the future...

** What else? **

While multiple cutouts per ID can be much more efficient, I cannot see any
reason to think that creating a job with multiple IDs would be any better
than one job per ID. OK, I guess if you were bringing up a VM to process
one job you might want it to do moer work, but there are lots of other ways
to get rid of that overhead. So, it seems like I would agree with ID being
single-valued in SODA, which by extension means that it would be
nice/necessary for DataLink service descriptors to describe the
multiplicity of input parameters.

As implemented, if you were run a SODA async job with 2 CIRC and 2 BAND
(eg) will cause 4 cutouts to be computed (each combination). That may or
may not be what users want, but they cannot really pick how to combine the
multiple params for different axes: that requires some sort of structure.
Markus suggested that this kind of thing should be done via a table upload
with one row per desired combination of params. That could be done and
since the params and values are defined consistently with VOTable usage it
wouldn't be too hard to specify and implement, but the barrier to using
this for techy astronomers would be a lot higher without some client tools
they would actually use...

Finally, I will let others think about the use of POS vs. the more explicit
CIRC and POLY with metadata in the custom service descriptors from the
datalink service... and also whether we want to emphasise data-specific
service descriptors and convey useful metadata vs. generic service
descriptiors and getting the metadata some other way (a new capability such
as {metadata} from the DAL architecture that was hinted in SIA-2.0).




On 29 February 2016 at 16:06, Patrick Dowler <pdowler.cadc at gmail.com> wrote:

>
> TL;DR - CADC implemented a prototype SODA services and use DataLink
> service descriptors to convey data-specific metadata and parameter info.
> Here we provide a cube example of the links, data-specific service
> descriptors, and SIDA sync cutouts. Since it is a cube, positional and
> energy (BAND) cutouts are enabled.
>
> cube dataset: ID=caom:CGPS/MA1_DRAO-ST/HI-line
>
> * ObsCore-1.1 metadata *
>
>
> http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/sia/v2query?ID=caom:CGPS/MA1_DRAO-ST/HI-line
>
> note: due to a bug the em_exl column is null in the ObsCore output; we've
> fixed the bug but
> not (yet) the content
>
> * cube links *
>
>
> http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/caom2ops/datalink?ID=caom:CGPS/MA1_DRAO-ST/HI-line
>
>
> The link-specific SODA descriptors (the ID attributes have UUIDs in them)
> e.g.:
>
> <RESOURCE type="meta" ID="soda-efd4e3f2-1edb-4172-8d7c-a502769a3fc1"
> utype="adhoc:service">
>     <PARAM name="resourceIdentifier" datatype="char" arraysize="27"
>            value="ivo://cadc.nrc.ca/soda#sync" />
>     <PARAM name="standardID" datatype="char" arraysize="32"
>            value="ivo://ivoa.net/std/SODA#sync-1.0" />
>     <PARAM name="accessURL" datatype="char" arraysize="*"
>            value="
> http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/caom2ops/sync" />
>     <GROUP name="inputParams">
>       <PARAM name="ID" datatype="char" ucd="" arraysize="*"
> value="ad:CGPS/cgps_ma1_hi_line_image" />
>       <PARAM name="POS" datatype="char" ucd="obs.field" arraysize="*"
> value="" />
>       <PARAM name="CIRC" datatype="double" ucd="obs.field" unit="deg"
> xtype="circle" arraysize="3" value="">
>         <VALUES>
>           <MAX value="25.055691919221182 61.31670742370859
> 3.620294640836429" />
>         </VALUES>
>       </PARAM>
>       <PARAM name="POLY" datatype="double" ucd="obs.field" unit="deg"
> xtype="polygon" arraysize="*" value="">
>         <VALUES>
>           <MAX value="28.91706938461008 58.26098083062218
> 19.210513573678497 59.14944023257296 20.380237118242576 64.23953415946808
> 31.706454362389877 63.197249950105615" />
>         </VALUES>
>       </PARAM>
>       <PARAM name="BAND" datatype="double" ucd="em.wl;stat.interval"
> unit="m" xtype="interval" arraysize="2" value="">
>         <VALUES>
>           <MAX value="0.21094492014316196 0.21110274396161874" />
>         </VALUES>
>       </PARAM>
>     </GROUP>
>   </RESOURCE>
>
> TIME and POL are missing because this is a GLON-GLAT-VELO-POL fits cube
> (but the polarization axis has only one bin with I), so cutouts on those
> axes are not possible.
>
> POS is listed there because positional cutout will work.
>
> CIRC and POLY are listed with the minimum spanning circle and polygon
> bounds "max extent" as described earlier.
>
> The FITS file itself is in GLON-GLAT-VELO so the the implementation has to
> take care of all transformations; if you go so far as to download the
> result, the FITS file will still be in GLON-GLAT-VELO.
>
> For these URLs, I recommend just "curl -v" so you can see the HTTP
> headers, but you can download the data if you want to :-)
>
>
> * position cutout: POS *
>
> http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/caom2ops/sync?ID=ad:CGPS/cgps_ma1_hi_line_image\&POS=circle%2025.0%2060.0%201.0
> <http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/caom2ops/sync?ID=ad:CGPS/cgps_ma1_hi_line_image%5C&POS=circle%2025.0%2060.0%201.0>
>
> decoding the redirect url: cutout=[0][229:696,20:488,*,*]
>
> * position cutout: CIRC *
>
> http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/caom2ops/sync?ID=ad:CGPS/cgps_ma1_hi_line_image\&CIRC=25.0%2060.0%201.0
> <http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/caom2ops/sync?ID=ad:CGPS/cgps_ma1_hi_line_image%5C&CIRC=25.0%2060.0%201.0>
>
> decoding the redirect url: cutout=[0][229:696,20:488,*,*]
>
> * position cutout: POLY *
>
> http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/caom2ops/sync?ID=ad:CGPS/cgps_ma1_hi_line_image\&POLY=24.0%2060.0%2025.0%2060.0%2025.0%2061.0%2024.0%2061.0
> <http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/caom2ops/sync?ID=ad:CGPS/cgps_ma1_hi_line_image%5C&POLY=24.0%2060.0%2025.0%2060.0%2025.0%2061.0%2024.0%2061.0>
>
> decoding the redirect url: cutout=[0][468:601,234:449,*,*]
>
> * energy cutout: BAND *
>
> http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/caom2ops/sync?ID=ad:CGPS/cgps_ma1_hi_line_image\&BAND=0.21102%200.21104
> <http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/caom2ops/sync?ID=ad:CGPS/cgps_ma1_hi_line_image%5C&BAND=0.21102%200.21104>
>
> decoding the redirect URL: cutout=[0][*,*,108:143,*]
>
>
>
> On 29 February 2016 at 16:05, Patrick Dowler <pdowler.cadc at gmail.com>
> wrote:
>
>>
>> TL;DR - CADC implemented a prototype SODA services and use DataLink
>> service descriptors to convey data-specific metadata and parameter info.
>> The implementation demonstrates that with POS you cannot convey metadata so
>> we introduced new positional cutout paramseters (CIRC and POLY) that
>> conform to WD-DALI-1.1 xtypes (circle and polygon) and allow us to convey
>> useful parameter metadata as a result.
>>
>> Before anyone panics: we also show that POS, CIRC, and POLY can co-exist
>> :-)
>>
>> image dataset: ID=caom:IRIS/f212h000/IRAS-25um
>>
>> * ObsCore-1.1 metadata *
>>
>>
>> http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/sia/v2query?ID=caom:IRIS/f212h000/IRAS-25um
>>
>> So, this is a dataproduct_type=image, calib_level=2, s_xel1 & s_xel2 say
>> it is 500x500 (pixels)
>>
>> * image links *
>>
>>
>> http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/caom2ops/datalink?ID=caom:IRIS/f212h000/IRAS-25um
>>
>> Note the generic SODA service descriptors (not linked!):
>>
>> <RESOURCE type="meta" ID="soda-sync" utype="adhoc:service">
>>   <PARAM name="resourceIdentifier" datatype="char" arraysize="27"
>>          value="ivo://cadc.nrc.ca/soda#sync" />
>>   <PARAM name="standardID" datatype="char" arraysize="*"
>>          value="ivo://ivoa.net/std/SODA#sync-1.0" />
>>   <PARAM name="accessURL" datatype="char" arraysize="*"
>>          value="http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/caom2ops/sync"
>> />
>>     <GROUP name="inputParams">
>>       <PARAM name="ID" datatype="char" ref="fileURIRef" arraysize="*"
>> value="" />
>>       <PARAM name="POS" datatype="char" ucd="obs.field" arraysize="*"
>> value="" />
>>       <PARAM name="CIRC" datatype="double" ucd="obs.field" unit="deg"
>> xtype="circle" arraysize="3"
>>              value="" />
>>       <PARAM name="POLY" datatype="double" ucd="obs.field" unit="deg"
>> xtype="polygon" arraysize="*"
>>              value="" />
>>       <PARAM name="BAND" datatype="double" ucd="em.wl;stat.interval"
>> unit="m" xtype="interval" arraysize="2"
>>             value="" />
>>       <PARAM name="TIME" datatype="double" ucd="time;stat.interval"
>> unit="d" xtype="interval" arraysize="2"
>>              value="" />
>>       <PARAM name="POL" datatype="char" ucd="phys.polarization.stokes"
>> arraysize="2*" value="" />
>>     </GROUP>
>>
>> </RESOURCE>
>> <RESOURCE type="meta" ID="soda-async" utype="adhoc:service">
>>     <PARAM name="standardID" datatype="char" arraysize="*"
>>            value="ivo://ivoa.net/std/SODA#async-1.0" />
>>     ... same params as above
>> </RESOURCE>
>>
>> params: ID, POS, CIRC, POLY, BAND, TIME, POL
>>
>> Above and below you will see a resourceIdentifier param; this is there to
>> support the use of a runtime registry lookup to generate the accessURL.
>> Doing it this way allows us to generate URLs to development, test, or
>> production servers depending on the work environment... our DataLink and
>> SODA services are not actually registered but my intent is to make these
>> resolvable by registering the services in the near future.
>>
>> Likewise, the standardID values for SODA are not (yet) resolvable but
>> they will be...
>>
>> The link-specific SODA descriptors (the ID attributes have UUIDs in them)
>> e.g.:
>>
>> <RESOURCE type="meta" ID="soda-cbb62ed5-c2c9-4dd9-aed6-46d7d5173dca"
>> utype="adhoc:service">
>>     <PARAM name="resourceIdentifier" datatype="char" arraysize="27"
>>            value="ivo://cadc.nrc.ca/soda#sync" />
>>     <PARAM name="standardID" datatype="char" arraysize="32"
>>            value="ivo://ivoa.net/std/SODA#sync-1.0" />
>>     <PARAM name="accessURL" datatype="char" arraysize="*"
>>            value="
>> http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/caom2ops/sync" />
>>     <GROUP name="inputParams">
>>       <PARAM name="ID" datatype="char" ucd="" arraysize="*"
>> value="ad:IRIS/I212B2H0" />
>>       <PARAM name="POS" datatype="char" ucd="obs.field" arraysize="*"
>> value="" />
>>       <PARAM name="CIRC" datatype="double" ucd="obs.field" unit="deg"
>> xtype="circle" arraysize="3"
>>              value="">
>>         <VALUES>
>>           <MAX value="140.63049941314583 0.2007826788236291
>> 8.778341996040131" />
>>         </VALUES>
>>       </PARAM>
>>       <PARAM name="POLY" datatype="double" ucd="obs.field" unit="deg"
>> xtype="polygon" arraysize="*"
>>              value="">
>>         <VALUES>
>>           <MAX value="146.83673628162285 -6.408995958971017
>> 134.3808989000012 -6.370135804011464 134.4242625446688 6.007430601323759
>> 146.8700273500712 5.96918267453771" />
>>         </VALUES>
>>       </PARAM>
>>     </GROUP>
>> </RESOURCE>
>>
>> The value for the ID parameter is specified in the value attribute
>> because this is a file-specific (at CADC) service descriptor and it needs
>> this file. This *is not* the same kind of ID that one uses to call the
>> DataLink service (that is a publisher_did which we will be changing into a
>> resolvable ivo-id once I'm happy with the registration of our collections;
>> this is a file identifier from our storage system). I could have used a ref
>> attribute to the links table (since the file URI is there for our other
>> services and for the generic soda service descriptors) but once you have
>> link-specific descriptiorsd anyway this seems tidier (e.g. I'll eventually
>> be able to remove the custom fileURIref column from our links table).
>>
>> BAND, TIME, and POL are missing because this is a 2D image so cutouts on
>> those axes are not possible.
>>
>> POS is listed there because positional cutout is possible, but I don't
>> see a sane way to convey sensible values to help someone use POS; the
>> client has to know the extent (from data discovery) or get it in some other
>> way (metadata capability).
>>
>> For CIRC and POLY the service includes a "maximum sensible extent" with
>> which to perform cutouts. The value attribute of the MAX element is a
>> string and my interpretation of the intent is that the client should
>> interpret it as the same "type" of thing as the PARAM in which it is found.
>> It feels like the MAX extent conveys useful and sensible information, but I
>> didn't see anything useful to put in MIN. Is this an abuse of MAX? Maybe
>> (in the sense that MAX usually implies that there is an ordering) but given
>> the implied type consistency that is already there people are interpreting
>> this now and when I showed this to a few techy astronomers that understand
>> VOTable they interrpetted this as I meant it. (The CIRC MAX is the minimum
>> spanning circle; the POLY MAX is the polygon boundary -- so using those
>> values would get all the pixels.)
>>
>> Right now, CIRC and POLY are my own custom parameters and they should not
>> bother a strict client that used this descriptor because of the standardID.
>> I chose different parameter names so i could be more explicit about the
>> value metadata (datatype, arraysize, xtype, units, ucd -- WD-DALI-1.1)
>> *and* so I could provide the "maximum value" of the exact same type. In the
>> SODA service this is very straightforward to implement (I use the same
>> Format classes for reading and writing VOTables and for parsing and
>> validating SODA params).
>>
>> For these URLs, I recommend just "curl -v" so you can see the HTTP
>> headers, but you can download the data if you want to :-)
>>
>> * image cutout: POS *
>>
>> http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/caom2ops/sync?ID=ad:IRIS/I212B2H0\&POS=circle%20140.5%200.0%200.5
>> <http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/caom2ops/sync?ID=ad:IRIS/I212B2H0%5C&POS=circle%20140.5%200.0%200.5>
>>
>> decoding the redirect url: cutout=[0][235:275,238:278,*]
>>
>> * image cutout: CIRC *
>>
>> http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/caom2ops/sync?ID=ad:IRIS/I212B2H0\&CIRC=140.5%200.0%200.5
>> <http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/caom2ops/sync?ID=ad:IRIS/I212B2H0%5C&CIRC=140.5%200.0%200.5>
>>
>> decoding the redirect url: cutout=[0][235:275,238:278,*]
>>
>> * image cutout: POLY *
>>
>> http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/caom2ops/sync?ID=ad:IRIS/I212B2H0\&POLY=140%200.0%20140.5%200.0%20140.5%200.5%20140.0%200.0
>> <http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/caom2ops/sync?ID=ad:IRIS/I212B2H0%5C&POLY=140%200.0%20140.5%200.0%20140.5%200.5%20140.0%200.0>
>>
>> decoding the redirect url: cutout=[0][255:275,258:278,*]
>>
>> If successful, these SODA requests to /caom2ops/sync respond with an
>> error message or a redirect to a URL with a pixel cutout using cfitsio
>> syntax. That is completely an implementation detail of our archive
>> infrastructure and not part of the prototype per se.
>>
>> If they fail (easy to do, just mess with the params) the response (after
>> redirect) is text/plain with a suitable HTTP status code.
>>
>>
>>
>>
>>
>> On 29 February 2016 at 16:03, Patrick Dowler <pdowler.cadc at gmail.com>
>> wrote:
>>
>>>
>>> I have finally finished and deployed our latest prototype SODA services
>>> and augmented our DataLink service to provide service descriptors to enable
>>> use of SODA. This works spans several services so, following Markus'
>>> "gripes" appoach I will try to separate things into separate messages, but
>>> I'll just make the messages replies to this one so they will be a single
>>> thread and I promise to put the TL;DR at the top of each :-)
>>>
>>> So, coming up:
>>>
>>> 1. description of datalink service descriptor output and soda sync
>>> cutout of a 2D image
>>>
>>> 2. less wordy description of datalink service descriptor output and soda
>>> sync cutout of a 3D cube
>>>
>>> 3. description of datalink service descriptor output and soda async
>>> cutout(s) of a 2D image
>>>
>>> It is quite a lot to look at, but I would like to point out here that
>>> implementing the whole end-to-end usage forced me to reconsider some
>>> earlier decisions and refine things to make them more clear and more
>>> useful. Although DataLink and SODA are loosely coupled in a technical
>>> sense, they do need to get along and work together and each has some effect
>>> or influence on decisions one takes while implementing the other.
>>>
>>> more to follow...
>>>
>>> --
>>> Patrick Dowler
>>> Canadian Astronomy Data Centre
>>> Victoria, BC, Canada
>>>
>>
>>
>>
>> --
>> Patrick Dowler
>> Canadian Astronomy Data Centre
>> Victoria, BC, Canada
>>
>
>
>
> --
> Patrick Dowler
> Canadian Astronomy Data Centre
> Victoria, BC, Canada
>



-- 
Patrick Dowler
Canadian Astronomy Data Centre
Victoria, BC, Canada
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.ivoa.net/pipermail/dal/attachments/20160229/248af863/attachment-0001.html>


More information about the dal mailing list