IVOA Thesaurus
Alasdair Gray
agray at dcs.gla.ac.uk
Tue Oct 30 09:28:25 PDT 2007
Hi,
>> First of all I would like to talk about top level concepts. I found
>> that there were 2,646 top level concepts. I feel this very high,
>> particularly as there are only about 2,850 concepts in total. This
>> is also particularly bothersome if you analyse the IAU Thesaurus
>> which only has 516 top level concepts.
> Strange, my python script claims that there are 2972 tokens in the
> newest version (which now includes all the elements and so had to
> grow a little bit!) and only 968 top concepts. The script simply
> looks for BT's and NT's, which we know are not yet internally
> consistent, so maybe this is a good time to clean up the ontological
> connections. Alasdair undoubtably used some smarter tools than my
> python script and the RDF hasTopConcept list should be taken with a
> grain of salt...
I'm afraid to say that I did not use anything particularly clever, just
a few greps with wc -l. Also I counted the hasTopConcept to find the
number of top level concepts. Rick, I've not looked closely at your
script, but why is it generating so many top level concepts while
counting a smaller number?
> A more formal question: I've been shoving UF's into aliases (a
> process which has not been completed, since I'm mostly doing this
> myself on spare time), but SKOS doesn't seem to have a means of
> expressing U's?
The U and UF should be modelled as skos:altLabel and skos:prefLabel
respectively. I'll illustrate this with an example from the original IAU
thesaurus:
acceleration of gravity
U
GRAVITY
SURFACE GRAVITY
Should result in both the gravity concept and surface gravity concept
having the declaration
<skos:altLabel>Acceleration of gravity</skos:altLabel>
Specifically, there should not be a concept for acceleration of gravity.
I hope this clears the issue up for you Rick,
Alasdair
-----Original Message-----
From: owner-semantics at eso.org [mailto:owner-semantics at eso.org] On Behalf
Of Frederic V. Hessman
Sent: 30 October 2007 09:55
To: IVOA semantics
Subject: Re: IVOA Thesaurus
Alasdair Gray noted:
> First of all I would like to talk about top level concepts. I found
> that there were 2,646 top level concepts. I feel this very high,
> particularly as there are only about 2,850 concepts in total. This
> is also particularly bothersome if you analyse the IAU Thesaurus
> which only has 516 top level concepts.
Strange, my python script claims that there are 2972 tokens in the
newest version (which now includes all the elements and so had to
grow a little bit!) and only 968 top concepts. The script simply
looks for BT's and NT's, which we know are not yet internally
consistent, so maybe this is a good time to clean up the ontological
connections. Alasdair undoubtably used some smarter tools than my
python script and the RDF hasTopConcept list should be taken with a
grain of salt...
Latest changes:
- Alasdair's latest corrections have been implemented (mostly
typos)
- all elements, their official symbols as aliases plus their
links
to appropriate other tokens; "mercury_manganese_stars" makes more
sense when you know there is something called "element_mercury" (as
opposed to the token "Mercury") and "manganese".
- a few solar terms needed by the solar VOEvent community
The new documentation is a bit easier to use - just one top-level
page at
http://www.astro.physik.uni-goettingen.de/~hessman/rdf/IVOAT/
with links to the introduction, dictionary, and lists of tokens,
labels, and aliases. The alphabetic link buttons at the bottom
aren't perfect but should enable you to browse more conveniently (the
direct links between the list items and the dictionary are now broken
- next thing to be fixed).
A more formal question: I've been shoving UF's into aliases (a
process which has not been completed, since I'm mostly doing this
myself on spare time), but SKOS doesn't seem to have a means of
expressing U's?
Rick
--
Alasdair J G Gray
Research Associate: Explicator Project
http://explicator.dcs.gla.ac.uk
Computer Science, University of Glasgow
0141 330 6292
More information about the semantics
mailing list