IAU thesaurus in RDF (an update)

Frederic V. Hessman Hessman at Astro.physik.Uni-Goettingen.de
Mon Oct 8 02:50:50 PDT 2007


Andrew Drake wrote:
> I don't know much about we are supposed to use UCDs with VOEvents
> but you seem to be missing all of the lensing ontology.
>
> lensing
> lensing_weak
> lensing_strong
> microlensing
> microlensing_caustic_crossing
> microlensing_cusp_crossing
> quasars_lensing

You're right - the IAU Thesaurus just had the generic terms  
"gravitational_lenses" and "microlensing".  "Gravitational_lensing"  
would be a better term since it's the process rather than an object  
which is important and the look-and-feel and "findability" of the  
tokens is similar (the SV hierarchical organization of our origianl  
proposal was a lot easier to deal with, as we expected, but.....)    
I'll change/add the following:

	"gravitational_lenses"	->	"gravitational_lensing"
	"microlensing"	->	"gravitational_microlensing"
	(NEW)	->	"planetary_microlensing"
	(NEW)	->	"quasar_microlensing"
	(NEW)	->	"strong_gravitational_lensing"
	(NEW)	->	"weak_gravitational_lensing"

Since terms like "cautic crossing" are generic to gravitational  
lensing, I'll add

	(NEW)	->	"caustics"
	(NEW)	->	"caustic_crossing"
	(NEW)	->	"cusps"
	(NEW)	->	"cusp_crossing"

> microlensing_binary_source
> microlensing_binary_lens
> microlensing_multiple_source
> microlensing_multiple_lenses
> microlensing_anomalous

Ugh - this is getting pretty detailed (see discussion of complexity  
later).  How about at least

	(NEW)	->	"gravitationally_lensed_sources"
	(NEW)	->	"gravitational_lenses"

The former is a good new broader term for things already present like  
"luminous_arcs" and "Einstein_rings".  While we're at it....

	(NEW)	->	"gravitational_lensing_shear"

The IAU thesaurus already has "anomalies" for generic unexpectedness.

> variable_stars_periodic

	Oops : there's "multi-periodic_variable_stars" but not  
"periodic_variable_stars".  Even Shobbrook^2 didn't think of  
everything classical.

> variable_stars_aperiodic

	There's "irregular_variable_stars" : is this "aperiodic" enough?  Is  
"semi-regular" to regular for "aperiodic"?

> asteroid_occultation

	(NEW)	->	"asteroid_occultations"
	(NEW)	->	"lunar_occultations"

> solar_eclipse
> lunar_eclipse

	Both exists already in exactly this form, but should be made plural  
(IAU thesaurus things are always plural if appropriate at all), just  
like "annular_eclipses".

	"solar_eclipse"	->	"solar_eclipses"
	"lunar_eclipse"	->	"lunar_eclipses"

> planetary_eclipse
> planetary_transit

	(NEW)	->	"planetary_eclipses", ALT "planetary transits"

> sun_spots (probably better seperate from starspots).

	No need: we've already got "sunspots".   A quick Google-survey  
indicates that "sunspots" is rarely written as "sun spots" (the  
latter are skin discolorings).  I suppose an ALT doesn't hurt.

> Perhaps most of these should be broken down into their actual  
> constituents rather than discribe types of events for actual UCDs.

There is/was a general feeling that the tokens shouldn't contain too  
much ontological baggage and the abandonment of any UCD-grammer (as  
originally proposed in the SV-document) means that the means of  
combining tokens has yet to be defined.  Thus, replacing something  
like the SV-like "microlensing_planetary" either means extended IAU- 
style normal phrases like

	(NEW)	->	planetary_microlensing"

or the assumption that - somewhere down the line - we'll decide how  
these tokens are to be combined, e.g.

	<rdf:Bag>
		<rdf:li resource="IAU:planets"/>
		<rdf:li resource="IAU:gravitational_lensing"/>
	</rdf:Bag>

(the equivalent N3 syntax would be.....?).   Once the semantics WG  
accepts the idea of IVOA-compatible vocabularies, it'll be time to  
move on to this topic - one which directly impacts other groups like  
VOEvent, since the ability to combine tokens to express information  
isn't an abstract ontological  exercise but a very down-to-earth need  
whose soltuion which should also be VO-wide if possible.

I think the RDF listings like that above is a reasonable and quite  
generically useful way of describing lists, sequences, etc.  Extended  
SKOS even offers NOT, AND, and OR (what does the N3 syntax suggest/ 
permit??).

Practically, I'd say we have to have too many tokens at first (so  
that we have tokens at all) but try to contain ourselves a bit and  
quickly decide how the IVOA wants us to combine them later.  The  
present IAU/IVOA list is already too large to be handled easily (see  
examples above where Andrew bravely went through the list and still  
didn't catch things which weren't missing after all).


Douglas Burke wrote:
>> Douglas implicitly raises a good point: the text file isn't much  
>> simpler than n3 and n3 is a lot less verbose than SKOS/OWL -  
>> should the IVOA suggest vocabularies are published in n3?
>
> I do much prefer to look at RDF/N3 rather than RDF/XML...

I certainly don't have any 'druthers about the format.  The advantage  
of N3 would be that it could replace the pure text format without  
much loss in human-readability.  On the other hand, I don't know if  
N3 is widely-enough accepted as an official format that the IVOA  
could simply stipulate using N3 as THE format for vocabularies.   A  
low buy-in cost for groups publishing their vocabularies sounds like  
a  good idea.  On the other hand, if the N3 syntax is too  
restrictive.....

> I have a question about the format of the identifiers used.
>
> If you look at the N3 form that cwm generates
>  - e.g. http://hea-www.harvard.edu/~dburke/playground/iau-djb.n3 -  
> then most terms are fine, in that they are represented as IAU:foo,  
> but we do find
>
> <http://www.Astro.physik.Uni-Goettingen.DE/~hessman/rdf/IAU#He 
> +_ionization_zone>
>
> which I guess comes about because of the "+" character. In itself  
> it's just ugly, and not a real issue, but I've come across cases  
> where certain parsers [*] didn't like the fragments to contain the  
> "." character, or start with a number, so I wonder if we should  
> take care in how we create the fragments, even if my reading of  
> http://gbiv.com/protocols/uri/rfc/rfc3986.html#collected-abnf is  
> that we can use these characters.
>
> [* whatever parser it is that longwell, http://simile.mit.edu/wiki/ 
> Longwell, uses]

Ugh - a pretty stupid contraint on token formats.  I rather liked the  
idea of using "_" to make the token effectively the same as the  
preferred label but this handiness will go away if we can't use other  
handy characters like "+" (let's not get into a unicode discussion).   
Otherwise, we'll need explicit computer-readable and human-readable  
tokens.  If the standard says we are allowed to use "+", then I vote  
for using it and notifying the parser-authors to fix their bug.


Ed Shaya wrote:
>> How about micro-OWLness as a start?  Again, if someone would tell  
>> me explicitly what OWL-ish entries you like......
>
> I don't understand what you are asking for here.  Property terms?

I meant that I didn't know enough OWL to know what different parts  
would be needed in a complete token entry ("complete" in the sense of  
containing all the information already present in the IAU thesaurus):  
all the ClassThis, subClassThat, and propertyBlah stuff.


Thanks to all again for their help.

Rick

------------------------------------------------------------------------ 
------------------------
Dr. Frederic V. Hessman     Hessman at Astro.physik.Uni-Goettingen.DE
Institut für Astrophysik          Tel.  +49-551-39-5052
Friedrich-Hund-Platz 1         Fax +49-551-39-5043
37077 Goettingen                 Room F04-133
http://www.Astro.physik.Uni-Goettingen.de/~hessman
------------------------------------------------------------------------ 
-------------------------
MONET: a MOnitoring NEtwork of Telescopes
http://monet.Uni-Goettingen.de
------------------------------------------------------------------------ 
-------------------------


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.ivoa.net/pipermail/voevent/attachments/20071008/b6a32f42/attachment-0003.html>


More information about the voevent mailing list