ADQL DISTANCE argument?

Patrick Dowler pdowler.cadc at gmail.com
Thu Feb 20 16:38:45 CET 2020


I agree with the basics:

DISTANCE(<point>, <point>)
DISTANCE(<number>, <number>, <number>, <number>)  -- for convenience
plus suitable conversion/restriction when a column reference is used in
place of a value

[1] the prose is probably there because CENTROID returns a <point>, eg
DISTANCE(CENTROID(foo), ...)

--
Patrick Dowler
Canadian Astronomy Data Centre
Victoria, BC, Canada


On Wed, 19 Feb 2020 at 07:16, Markus Demleitner <
msdemlei at ari.uni-heidelberg.de> wrote:

> Dear DAL,
>
> Current ADQL says:
>
>   Functions like AREA, COORD1, COORD2 and DISTANCE accept a
>   geometry and return a calculated numeric value.
>
>   The specification defines two versions of the DISTANCE function, one
>   that accepts two geometries, and one that accepts four separate numeric
>   values, both forms return a numeric value.
>
> Both statements would indicate that DISTANCE should accept general
> geometries, i.e., including circles and polygons.
>
> The later definition of DISTANCE then says
>
>   The specification defines two versions of the DISTANCE function,
>   one that accepts two POINT values, and a second that accepts four
>   separate numeric values.
>
> -- which is clear enough, had it not been for the previous statement,
> and the later statement
>
>   If the geometric arguments are expressed ...
>
> which might again be understood as saying the arguments can be more
> general.
>
> Finally, the grammar says, for the geometry case:
>
>   DISTANCE <left_paren> <coord_value> <comma>
>     <coord_value> <right_paren>
>
> where
>
>   <coord_value> ::= <point> | <column_reference>
>
> I *think* all this works out to say that over and above the grammar,
> for distance there's the additional constraint that column_reference
> must be POINT-typed.[1]
>
> Being general here is a pain in the neck (actually, that's why I ran
> into this question).  For one, you'll need to define distance
> much more carefully for such geometries, and if (as I think we ought
> to) we chose "minimum of distances of between all points in arg 1 and
> arg 2", I doubt we'll see many correct implementations of that.  Also
> I'll want to map a lot of DISTANCE calls into contains(point,
> circle) statements (because that's much easier on the query planner),
> and that's a pain if one of the points could actually be, say, a
> polygon.
>
> So... do we agree that DISTANCE only accept POINT-s?
>
> If so, I'd suggest to just drop the sentence:
>
>   Functions like AREA, COORD1, COORD2 and DISTANCE accept a
>   geometry and return a calculated numeric value.
>
> Then change
>
>   The specification defines two versions of the DISTANCE function,
>   one that accepts two geometries, and one that accepts four
>   separate...
>
> to
>
>   This specification defines two versions of the DISTANCE function,
>   one that accepts two POINTs, and one that accepts four
>   separate...
>
> And then add in 4.2.16 in some appropriate location something like
>
>   Note that when <column reference>s[2] are passed into DISTANCE, the
>   operation is only defined for POINT-typed values.  Behaviour for
>   other geometries is undefined at this point (but may be defined
>   later).
>
> Would anyone veto a PR to this effect?  Would anyone prefer something
> completely different?  Would anyone volunteer for doing the PR?
>
>            -- Markus
>
> [1] Incidentally, the grammar rules are incompatible with the
> statement in the 4.2.16 that "[t]he DISTANCE function may be applied
> to any expression that returns a geometric POINT value"; I see why it
> was put in, but unless we fix the grammar, we should remove the
> prose.
>
> [2] or <geometry_value_expression>s, depending on how you think about
> [1]
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.ivoa.net/pipermail/dal/attachments/20200220/72042dce/attachment.html>


More information about the dal mailing list