[obs-tap]access_format

Mireille Louys mireille.louys at unistra.fr
Tue Apr 19 08:10:59 PDT 2011


Dear all,

Here is the short text I suggest to have for access_format.

4.7.	Access Format (access_format)
The access_format column emphasizes information about  the format of  
the data product if downloaded as a file. The values should describe  
(in increasing detail) the overall file format as well as the  
structure of the data within the file.
This data model fields is important to evaluate for data discovery and  
data retrieval.
MIME types can be used for that in existing protocols ( like http).  
However, when dealing with observations as in ObsTAP service, more  
information about the astronomical arrangement of data into predefined  
formats is very useful . For instance we want to distinguish between  
various formats like aedm (ALMA) , evla, MUSE multi-extension fits  
files( IFU) etc?
Providing this information speeds up the interpretation step for  
client application consuming these files on one hand , and improves  
data selection in the discovery step on the other hand.
Some data collection offer data sets are as multi-files and distribute  
them as  as directory or  tar format
Here we consider a list of possible MIME-type strings for various  
types of observational datasets
This could be extended or modified after implementation experience.

MIME-type	    Shortname	     Definition
application/fits	  fits
application/xml	          xml
application/x-votable+xml votable
image/fits  	          fits.image any multidimensional
                                     regularly sampled data cube, etc?
image/jpeg  	          jpeg
application/x-aedm 	  aedm	    ALMA Export Data Model
application/x-ms  	  ms	    Radio measurement set
application/x-euro3D 	  euro3D    imaging spectroscopy
application/pdf 	  pdf	     graphs stored as document: filter curve,
                                      footprint, spectrum profile, etc.
application/dvi	          dvi	     graph
text/csv	          csv	     comma separated value
text/tsv	          tsv	     tab separate value
text/html  	          html	     texte in HTML format
application/x-tar	  tar	     multiple files archive in tar format
application/x-directory+text dir     multiple files set as text list

Compression may be applied at different levels:
?	after the data file is built
?	after binding a bunch of files into an archive file (like in .gzip,
         .7zip, .gz, .tar.gz, etc.)
?	directly on the file content (jpeg, hcompress in fits images,  
multi-resolution compression (.MRC files as in MR/1 application)
In this case extension file name conveys the information directly on  
the file content.
No suffix means there is no compression applied.
Example of combined access format could be a concatenation of mime  
short name , with compression suffix.





More information about the dm mailing list