VOUnit encodings

Norman Gray norman at astro.gla.ac.uk
Fri Jul 27 12:53:54 CEST 2018


Stuart, hello.

On 26 Jul 2018, at 13:45, Chalk, Stuart wrote:

>  I am surprised that there is a tolerance for unknown units.  Do you 
> think that the community would ever re-evaluate this decision?  It 
> would be much better if the parsers would flag an unknown unit as such 
> and provide suggested units.

I think a reevaluation of that would be unlikely, now.  As I recall, the 
issues included:

   * The VOUnits activity had a fairly modest goal, to codify and tidy 
current practice: there were three or four 'unit specs' in circulation, 
none of which were completely precise.  For that reason, it didn't aim 
to add any restrictions that weren't absolutely necessary.

   * I don't think the IVOA felt it was in a position where it could, or 
even should, demand people rework their data releases.  'IVOA tools 
reject my data because of the unit strings?!'  Big wow.

   * Having a specific list of approved units would mean that there'd be 
a never-ending process of adding to that list.

   * Consider the unit 'jupiterMass'.  If that's the natural mass scale 
for the data you're releasing, then it's arguably inconvenient to you 
_and your users_ to have to use this instead in units of 10^{silly}.  
Also, is 1 'jupiterMass' just an alias for some number of kg, or is it a 
calibrated/derived/notional mass in your particular dataset?  Both are 
reasonable, but only one could be the standard.  And does that also 
imply defining uranusMass, neptuneMass, nibiruMass, ...?

There are multiple responses to those, but those were the points of 
discussion.  The conclusion was to require 'vounit-compatible' unit 
strings to have a standard syntax, but to permit unknown units within 
that syntax, even though they may be misparsed (my pet example: a 
'furlong' is a femto-'urlong'), as long as it is possible for the 
unknown units to be algorithmically identified.

> This becomes an important issue when you take into consideration the 
> current move toward FAIR data (https://www.go-fair.org/) where the 
> expectation is that FAIR data can be reused by other researchers and 
> thus needs (in part) to have standardized units.

It's opening up a larger discussion, but having units in a standardised 
syntax, and even better using standard units, is a good thing, and 
certainly removes one of the possible barriers; however using someone's 
data without reading the release notes or other documentation is 
probably reckless.  Any decent data-release documentation is going to 
discuss the units being used in the data.

I should have mentioned earlier that there's significant overlap between 
the qudt.org set of units and the VOUnits ones (though we did have to 
add some astronomy units which weren't in the _long_ list of QUDT ones).

All the best,

Norman


-- 
Norman Gray  :  https://nxg.me.uk
SUPA School of Physics and Astronomy, University of Glasgow, UK


More information about the semantics mailing list