SED Data Model working draft (on behalf of D. Tody)
Douglas Tody
dtody at nrao.edu
Thu Oct 25 07:10:28 PDT 2012
No, I am not suggesting this. - Doug
On Thu, 25 Oct 2012, Omar Laurino wrote:
> Doug, are you suggesting that we go back to Spectrum.* utypes? That seems the
> natural conclusion to your point:
> Dataset.X generic Dataset metadata
> Sed.X SED-specific metadata
> Target.X Target metadat
> DataId.X Dataset identification metadata
>
>
> Spectrum.X Spectrum-specific metadata
>
> Omar.
>
>
>
> On Thu, Oct 25, 2012 at 11:27 AM, Douglas Tody <dtody at nrao.edu> wrote:
> On Thu, 25 Oct 2012, Mark Cresitello-Dittmar wrote:
>
> Doug,
>
> On 10/24/2012 8:45 AM, Douglas Tody wrote:
>
> Much of this discussion concerns the
> aggregate SED and segments. In
> this most recent SEDDM it is proposed that a
> segment of the aggregate
> SED is an externally viable, self contained
> dataset, which is merely
> stored in the serialized aggregate SED, with
> the votable or FITS MEF
> serving as a container (pointing to an
> actual external dataset with a
> URI is also permitted and might be desirable
> for large datasets).
> Usually this externally viable dataset could
> indeed be an object derived
> from SpectralDM and compliant with the VO
> models, but if we merely have
> a simple container of segments this does not
> have to be the case. Since
> the aggregate SED is used for SED building
> or editing it could be
> desirable to include or reference native
> datasets which have not been
> converted to a VO model.
>
> I just want to be clear.. you want to allow the Segment
> to contain EITHER
> a reference to an external dataset OR an actual in-line
> serialization.. right?
>
>
> Yes, this was the intention. It might be something large, or otherwise
> awkward to copy/include.
>
> If instead we want to require that segments of the
> aggregate SED must be
> derived from and compatible with the SpectralDM
> then the model might
> indeed want to be changed as you both describe.
> Even in this case
> however it is possible to merely provide a
> container for aggregating
> externally viable, standalone datasets, giving us
> the flexibility to
> reference external data in some native format.
>
> ** So the issue to be decided here is whether we
> permit the aggregate
> SED to reference native datasets or restrict it to
> only data classes
> derived from SDM **
>
>
> I can see the interest in allowing the external dataset to be
> relatively undefined.
> The same 'issue' was discussed for PhotDM regarding the
> TransmissionCurve.
>
>
> "Uniform (or rebinned)" I think
> implies a synonym relationship, while
> the description that follows shows
> that they have a different level of
> processing. Also, it says that
> rebinned handles overlaps. Does that
> mean that Uniform does NOT.. so the
> single Uniform sequence can have
> overlapping data?
>
>
> Yes, it is possible for segments in the uniform
> SED to overlap, e.g., a
> spectrum may overlap another spectrum or
> photometry point from another
> observation (segment). Rebinning further
> normalizes the data to combine
> any overlapped regions, which may or may not be
> desirable. In both
> cases we have a uniform SED however. (This
> business of uniform vs
> rebinning originated with Jonathan).
>
>
> I think I'm missing something in the picture then... the
> Uniform SED has multiple Segments?
> >From the description, I'm getting:
> + Uniform = Char+Data
> + Aggregate = Char+Segment[]
>
>
> The uniform SED has all the segments from the aggregate SED transformed
> to common spectral/flux units and collected in the Data portion of the
> SED object. Rebinning to avoid overlaps is optional. In general there
> can be overlaps due to overlapping spectral coverage of the original
> segments, but since Data represents the spectral coord as a vector this
> can be handled.
>
> Section 3.5.1:
> + Utypes containing SED? Isn't this
> contrary to the changes we did to
> remove the 'Spectrum' node from
> Spectral utypes?
>
>
> We decided earlier that we would have a core
> SpectralDM defining classes
> which are common to all the models - Spectrum,
> SED, TimeSeries, etc. So
> Dataset, Char, DataID, etc. Utypes do not start
> with "Spectrum.",
> "SED." and so on, and are the same for all classes
> of data derived from
> the model. We would then extend the core model by
> adding a class which
> is for stuff which really is specific to the
> particular class of data
> and not part of the generic core, inherited model.
> Hence for TimeSeries
> for example, all the usual SDM stuff is there
> unchanged, but we add a
> new class TimeSeries which contains metadata
> specific to time series
> data, e.g., information about the period or
> periods, trend removal,
> whatever. For SED we have a similar construct.
> Most of the SDM is
> inherited unchanged, the Utypes are generic, but
> stuff which is really
> specific to SED, e.g., nSegments, is put into a
> SED.xx element.
>
>
> The way I see this, we are extending BaseSPS to the desired
> dataset (Spectrum, TimeSeries)
> and If we want to add content, they can go into a complex
> object (TimeProps for example),
> or directly at the top level. The UType convention imposed on
> Spectral/Spectrum/PhotometryPoint
> is to NOT include the top node in the UType.. so these tags do
> not appear before the
> top level objects and attributes (Spectrum.Dataset).
>
> To do so with SED is breaking a convention you advocated
> heavily for the other BaseSPS extensions.
>
>
> Well if necessary we could change the class name of the "complex object"
> to something other than "SED"/ "TimeSeries" / "Image" etc. ("SedProps"
> in your example) and accomplish the same goal. But I don't see this as
> in conflict with removing the top node name from the Utypes since these
> objects are defined as specific to the class of data and hence not
> sharable. These really are SED or TimeSeries etc. specific Utypes. It
> is not that much different than e.g., Target, which contains only
> Target-specific Utypes. If we have a Sed.X class for SED-specific
> metadata, but were including a "Sed" prefix in all Utypes (e.g.,
> "Sed.Target.Name"), then applying the same convention we would have
> "Sed.Sed.X" for the Utypes, which is not what is being suggested here.
>
> So the proposal is
>
> SED object
> Dataset.X generic Dataset metadata
> Sed.X SED-specific metadata
> Target.X Target metadat
> DataId.X Dataset identification metadata
> etc.
>
> But we could instead have SedProps or whatever for the class name if this
> scheme is breaking some rule.
>
> - Doug
>
> Mark
>
>
>
>
> --
> Omar Laurino
> Smithsonian Astrophysical Observatory
> Harvard-Smithsonian Center for Astrophysics100 Acorn Park Dr. R-376 MS-81
> 02140 Cambridge, MA
> (617) 495-7227
>
>
>
More information about the dm
mailing list