SED Data Model working draft (on behalf of D. Tody)

Omar Laurino olaurino at head.cfa.harvard.edu
Thu Oct 25 06:47:54 PDT 2012


Doug, are you suggesting that we go back to Spectrum.* utypes? That seems
the natural conclusion to your point:

    Dataset.X    generic Dataset metadata
>     Sed.X        SED-specific metadata
>     Target.X     Target metadat
>     DataId.X     Dataset identification metadata


Spectrum.X Spectrum-specific metadata

Omar.



On Thu, Oct 25, 2012 at 11:27 AM, Douglas Tody <dtody at nrao.edu> wrote:

> On Thu, 25 Oct 2012, Mark Cresitello-Dittmar wrote:
>
>  Doug,
>>
>> On 10/24/2012 8:45 AM, Douglas Tody wrote:
>>
>>>
>>> Much of this discussion concerns the aggregate SED and segments. In
>>> this most recent SEDDM it is proposed that a segment of the aggregate
>>> SED is an externally viable, self contained dataset, which is merely
>>> stored in the serialized aggregate SED, with the votable or FITS MEF
>>> serving as a container (pointing to an actual external dataset with a
>>> URI is also permitted and might be desirable for large datasets).
>>> Usually this externally viable dataset could indeed be an object derived
>>> from SpectralDM and compliant with the VO models, but if we merely have
>>> a simple container of segments this does not have to be the case. Since
>>> the aggregate SED is used for SED building or editing it could be
>>> desirable to include or reference native datasets which have not been
>>> converted to a VO model.
>>>
>> I just want to be clear.. you want to allow the Segment to contain EITHER
>> a reference to an external dataset OR an actual in-line serialization..
>> right?
>>
>
> Yes, this was the intention.  It might be something large, or otherwise
> awkward to copy/include.
>
>
>  If instead we want to require that segments of the aggregate SED must be
>>> derived from and compatible with the SpectralDM then the model might
>>> indeed want to be changed as you both describe.  Even in this case
>>> however it is possible to merely provide a container for aggregating
>>> externally viable, standalone datasets, giving us the flexibility to
>>> reference external data in some native format.
>>>
>>> ** So the issue to be decided here is whether we permit the aggregate
>>> SED to reference native datasets or restrict it to only data classes
>>> derived from SDM **
>>>
>>>
>> I can see the interest in allowing the external dataset to be relatively
>> undefined.
>> The same 'issue' was discussed for PhotDM regarding the TransmissionCurve.
>>
>>
>>>   "Uniform (or rebinned)"  I think implies a synonym relationship, while
>>>> the description that follows shows that they have a different level of
>>>> processing.  Also, it says that rebinned handles overlaps.  Does that mean
>>>> that Uniform does NOT.. so the single Uniform sequence can have overlapping
>>>> data?
>>>>
>>>
>>> Yes, it is possible for segments in the uniform SED to overlap, e.g., a
>>> spectrum may overlap another spectrum or photometry point from another
>>> observation (segment).  Rebinning further normalizes the data to combine
>>> any overlapped regions, which may or may not be desirable.  In both
>>> cases we have a uniform SED however.  (This business of uniform vs
>>> rebinning originated with Jonathan).
>>>
>>>
>> I think I'm missing something in the picture then... the Uniform SED has
>> multiple Segments?
>> From the description, I'm getting:
>>    + Uniform    = Char+Data
>>    + Aggregate = Char+Segment[]
>>
>
> The uniform SED has all the segments from the aggregate SED transformed
> to common spectral/flux units and collected in the Data portion of the
> SED object.  Rebinning to avoid overlaps is optional.  In general there
> can be overlaps due to overlapping spectral coverage of the original
> segments, but since Data represents the spectral coord as a vector this
> can be handled.
>
>
>  Section 3.5.1:
>>>>  + Utypes containing SED?  Isn't this contrary to the changes we did to
>>>> remove the 'Spectrum' node from Spectral utypes?
>>>>
>>>
>>> We decided earlier that we would have a core SpectralDM defining classes
>>> which are common to all the models - Spectrum, SED, TimeSeries, etc.  So
>>> Dataset, Char, DataID, etc. Utypes do not start with "Spectrum.",
>>> "SED." and so on, and are the same for all classes of data derived from
>>> the model.  We would then extend the core model by adding a class which
>>> is for stuff which really is specific to the particular class of data
>>> and not part of the generic core, inherited model.  Hence for TimeSeries
>>> for example, all the usual SDM stuff is there unchanged, but we add a
>>> new class TimeSeries which contains metadata specific to time series
>>> data, e.g., information about the period or periods, trend removal,
>>> whatever.  For SED we have a similar construct.  Most of the SDM is
>>> inherited unchanged, the Utypes are generic, but stuff which is really
>>> specific to SED, e.g., nSegments, is put into a SED.xx element.
>>>
>>
>> The way I see this, we are extending BaseSPS to the desired dataset
>> (Spectrum, TimeSeries)
>> and If we want to add content, they can go into a complex object
>> (TimeProps for example),
>> or directly at the top level.  The UType convention imposed on
>> Spectral/Spectrum/**PhotometryPoint
>> is to NOT include the top node in the UType.. so these tags do not appear
>> before the
>> top level objects and attributes (Spectrum.Dataset).
>>
>> To do so with SED is breaking a convention you advocated heavily for the
>> other BaseSPS extensions.
>>
>
> Well if necessary we could change the class name of the "complex object"
> to something other than "SED"/ "TimeSeries" / "Image" etc.  ("SedProps"
> in your example) and accomplish the same goal.  But I don't see this as
> in conflict with removing the top node name from the Utypes since these
> objects are defined as specific to the class of data and hence not
> sharable.  These really are SED or TimeSeries etc.  specific Utypes.  It
> is not that much different than e.g., Target, which contains only
> Target-specific Utypes.  If we have a Sed.X class for SED-specific
> metadata, but were including a "Sed" prefix in all Utypes (e.g.,
> "Sed.Target.Name"), then applying the same convention we would have
> "Sed.Sed.X" for the Utypes, which is not what is being suggested here.
>
> So the proposal is
>
>   SED object
>     Dataset.X    generic Dataset metadata
>     Sed.X        SED-specific metadata
>     Target.X     Target metadat
>     DataId.X     Dataset identification metadata
>        etc.
>
> But we could instead have SedProps or whatever for the class name if this
> scheme is breaking some rule.
>
>         - Doug
>
>  Mark
>>
>>


-- 
Omar Laurino
Smithsonian Astrophysical Observatory
Harvard-Smithsonian Center for Astrophysics
100 Acorn Park Dr. R-376 MS-81
02140 Cambridge, MA
(617) 495-7227
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.ivoa.net/pipermail/dm/attachments/20121025/f873b0cf/attachment-0001.html>


More information about the dm mailing list