Schema Comparison (47 vs 21) Score = 0.614

Date Last Updated
2010-07-23 12:48:32+0000

Note: This page is automatically generated. Edits will be lost.

This document presents a summary of an automated comparison of two DiGIR federation schemas. Various attributes of the documents are compared and given a score of 0…1. The overall comparison score is (sum of attribute scores) / (maximum possible score). In general, scores over 0.9 seem to indicate that the two federation schemas are likely to be different versions of the same schema.

The following schema attributes are compared:

  • Schema namespace
  • Imported namespaces
  • Document level annotation
  • Number of concepts
  • Definitions of each concept

Concepts between schemas are matched by comparing each concept in schema A with every concept in schema B. The concepts that match with the highest score are presented in the Concept Definitions section below, and that score is added to the total for the document.

Schema A (ID = 47)
http://digir.net/schema/conceptual/darwin/manis/1.21/darwin2.xsd
Schema B (ID = 21)
http://www.iobis.org/obis/obis.xsd

Namespace Comparison (score = 0.293)

Schema A NS
http://digir.net/schema/conceptual/darwin/2003/1.0
Schema B NS
http://www.iobis.org/obis

Schema Annotation (score = 0.425)

Schema A:

 $Id: darwin2.xsd,v 1.21 2003/06/17 11:14:24 John Wieczorek Exp $  XML Schema
draft Darwin Core Version 2
(http://tsadev.speciesanalyst.net/documentation/ow.asp?DarwinCoreV2) content
model.  Uses and extends data elements from the DiGIR (http://digir.net)
protocol.

Schema B:

 $Id: obis.xsd,v 1.1 2005/07/10 edited by Lissa Jerry$  XML Schema describing
the OBIS Schema (http://www.iobis.org/FAQschema1.shtml) content model.  Uses and
extends data elements from the DiGIR (http://digir.sourceforge.net) protocol and
Darwin Core
V2(http://tsadev.speciesanalyst.net/documentation/ow.asp?DarwinCoreV2).

Imported Namespaces (score = 0.000):

Schema A:

http://digir.net/schema/protocol/2003/1.0
http://digir.net/schema/protocol/2003/1.0

Schema B:

http://digir.net/schema/conceptual/darwin/2003/1.0

Concept Defintions (score = 0.683): %s

Best matches for each concept. Note that where the number of concepts defined in each schema is different, only the smaller set is used in the comparison since these represent the best matches out of all possible combinations.

Schema A Schema B Score
BasisOfRecord (xsd:string)
Nillable?: true
Sub Grp: digir:searchableReturnableData

A description indicating whether the record represents an observation, tissue sample, living organism, voucher specimen, germplasm/seed, genetic information, etc.
RecordURL (xsd:string)
Nillable?: true
Sub Grp: digir:searchableReturnableData

Gives the web address of the page where more information on this particular record (not on the whole dataset) can be found.
0.660
Subspecies (xsd:string)
Nillable?: true
Sub Grp: digir:searchableReturnableData

The phylogenetic subspecific epithet of the cataloged item.
Source (xsd:string)
Nillable?: true
Sub Grp: digir:searchableReturnableData

Indicates who gave the record to the data provider. Can indicate a literature citation, an electronic dataset, etc. Is used to provide credit.
0.622
VerbatimElevation (xsd:string)
Nillable?: true
Sub Grp: digir:searchableReturnableData

A text representation of the Elevation in its original format in the source database.
Citation (xsd:string)
Nillable?: true
Sub Grp: digir:searchableReturnableData

Indicates how this record should be attributed if used
0.623
Genus (xsd:string)
Nillable?: true
Sub Grp: digir:searchableReturnableData

The phylogenetic genus to which the cataloged item belongs.
Subgenus (xsd:string)
Nillable?: true
Sub Grp: digir:searchableReturnableData

The subgenus name of the organism
0.745
YearCollected (xsd:gYear)
Nillable?: true
Sub Grp: digir:searchableReturnableData

The four digit year in the Common Era calendar in which the cataloged item was collected.
StartYearCollected (xsd:gYear)
Nillable?: true
Sub Grp: digir:searchableReturnableData

For samples/observations/record events that were taken over time this gives the start year of the collecting event. The full year should be expressed (e.g. 1972 must be expressed as “1972” not “72”). Must always be a four digit integer
0.816
YearCollected (xsd:gYear)
Nillable?: true
Sub Grp: digir:searchableReturnableData

The four digit year in the Common Era calendar in which the cataloged item was collected.
EndYearCollected (xsd:gYear)
Nillable?: true
Sub Grp: digir:searchableReturnableData

For samples/observations/record events that were taken over time this gives the end year of the collecting event. The full year should be expressed (e.g. 1972 must be expressed as “1972” not “72”). Must always be a four digit integer
0.851
MonthCollected ()
Nillable?: true
Sub Grp: digir:searchableReturnableData

The two digit month of year in the Common Era calendar during which the cataloged item was collected from the field.
StartMonthCollected ()
Nillable?: true
Sub Grp: digir:searchableReturnableData

For samples/observations/record events that were taken over time this gives the start month of the collecting event. Possible values range from 01…12 inclusive
0.837
MonthCollected ()
Nillable?: true
Sub Grp: digir:searchableReturnableData

The two digit month of year in the Common Era calendar during which the cataloged item was collected from the field.
EndMonthCollected ()
Nillable?: true
Sub Grp: digir:searchableReturnableData

For samples/observations/record events that were taken over time this gives the end month of the collecting event. Possible values range from 01…12 inclusive
0.870
DayCollected ()
Nillable?: true
Sub Grp: digir:searchableReturnableData

The two digit day of the month in the Common Era calendar during which the cataloged item was collected from the field.
StartDayCollected ()
Nillable?: true
Sub Grp: digir:searchableReturnableData

For samples/observations/record events that were taken over time this gives the start day of the collecting event. Possible value ranges from 01..31 inclusive
0.823
DayCollected ()
Nillable?: true
Sub Grp: digir:searchableReturnableData

The two digit day of the month in the Common Era calendar during which the cataloged item was collected from the field.
EndDayCollected ()
Nillable?: true
Sub Grp: digir:searchableReturnableData

For samples/observations/record events that were taken over time this gives the end day of the collecting event. Possible value ranges from 01..31 inclusive
0.860
JulianDay (xsd:string)
Nillable?: true
Sub Grp: digir:searchableReturnableData

The ordinal day of the year (i.e., the number of days since December 31 of the previous year; January 1 is Julian Day 1) on which the cataloged item was collected. May be derived from the YearCollected, MonthCollected, and DayCollected by the provider.
StartJulianDay ()
Nillable?: true
Sub Grp: digir:searchableReturnableData

For samples/observations/record events that were taken over time this gives the start ordinal day of the year for the collecting event; i.e., the number of days since January 1 of the same year. (January 1 is Julian Day 1.). Should be an integer from one to 365, i.e. of the form (([0–3][0–9][0–9)
([0–9][0–9) 0.672
JulianDay (xsd:string)
Nillable?: true
Sub Grp: digir:searchableReturnableData

The ordinal day of the year (i.e., the number of days since December 31 of the previous year; January 1 is Julian Day 1) on which the cataloged item was collected. May be derived from the YearCollected, MonthCollected, and DayCollected by the provider.
EndJulianDay ()
Nillable?: true
Sub Grp: digir:searchableReturnableData

For samples/observations/record events that were taken over time this gives the end ordinal day of the year for the collecting event; i.e., the number of days since January 1 of the same year. (January 1 is Julian Day 1.). Should be an integer from one to 365, i.e. of the form (([0–3][0–9][0–9)
([0–9][0–9) 0.717
TimeCollected ()
Nillable?: true
Sub Grp: digir:searchableReturnableData

The time of day the cataloged item was collected, expressed as decimal hours from midnight, local time (e.g., 12.0 = noon, 13.5 = 1:30pm).
StartTimeofDay ()
Nillable?: true
Sub Grp: digir:searchableReturnableData

For samples/observations/record events that were taken over time this gives the start time of day of the collecting event expressed as decimal hours from midnight local time (e.g. 12.0 = mid day, 13.5 = 1:30pm)
0.600
TimeCollected ()
Nillable?: true
Sub Grp: digir:searchableReturnableData

The time of day the cataloged item was collected, expressed as decimal hours from midnight, local time (e.g., 12.0 = noon, 13.5 = 1:30pm).
EndTimeofDay ()
Nillable?: true
Sub Grp: digir:searchableReturnableData

For samples/observations/record events that were taken over time this gives the end time of day of the collecting event expressed as decimal hours from midnight local time (e.g. 12.0 = mid day, 13.5 = 1:30pm)
0.619
VerbatimLongitude (xsd:string)
Nillable?: true
Sub Grp: digir:searchableReturnableData

A text representation of the Longitude data in its original format in the source database.
TimeZone (xsd:string)
Nillable?: true
Sub Grp: digir:searchableReturnableData

Indicates the time zone for the Time of Day measurements
0.595
DecimalLongitude ()
Nillable?: true
Sub Grp: digir:searchableReturnableData

The longitude of the location from which the cataloged item was collected, expressed in decimal degrees.
StartLongitude ()
Nillable?: true
Sub Grp: digir:searchableReturnableData

For samples/observations/record events better represented as line features rather than point features (e.g. extended trawls or transects) this indicates the starting longitude location from which the specimen was collected. Express in decimal degrees (East & North = +; West & South = -). GPS-derived data must use the WGS 84 geodetic reference system (http://www.wgs84.com/).
0.734
DecimalLongitude ()
Nillable?: true
Sub Grp: digir:searchableReturnableData

The longitude of the location from which the cataloged item was collected, expressed in decimal degrees.
EndLongitude ()
Nillable?: true
Sub Grp: digir:searchableReturnableData

For samples/observations/record events better represented as line features rather than point features (e.g. extended trawls or transects) this indicates the starting longitude location from which the specimen was collected. Express in decimal degrees (East & North = +; West & South = -). GPS-derived data must use the WGS 84 geodetic reference system (http://www.wgs84.com/).
0.734
DecimalLatitude ()
Nillable?: true
Sub Grp: digir:searchableReturnableData

The latitude of the location from which the cataloged item was collected, expressed in decimal degrees.
StartLatitude ()
Nillable?: true
Sub Grp: digir:searchableReturnableData

For samples/observations/record events better represented as line features rather than point features (e.g. extended trawls or transects) this indicates the starting latitude location from which the specimen was collected or in which the sample/observation/record event occurred. This value should be expressed in decimal degrees (East & North = +; West & South = -). GPS-derived data must use the WGS 84 geodetic reference system (http://www.wgs84.com/).
0.716
DecimalLatitude ()
Nillable?: true
Sub Grp: digir:searchableReturnableData

The latitude of the location from which the cataloged item was collected, expressed in decimal degrees.
EndLatitude ()
Nillable?: true
Sub Grp: digir:searchableReturnableData

For samples/observations/record events better represented as line features rather than point features (e.g. extended trawls or transects) this indicates the starting latitude location from which the specimen was collected or in which the sample/observation/record event occurred. This value should be expressed in decimal degrees (East & North = +; West & South = -). GPS-derived data must use the WGS 84 geodetic reference system (http://www.wgs84.com/).
0.715
CoordinateUncertaintyInMeters (xsd:decimal)
Nillable?: true
Sub Grp: digir:searchableReturnableData

The upper limit of the distance (in meters) from the given latitude and longitude describing a circle within which the whole of the described locality must lie. Use NULL where the uncertainty is unknown, cannot be estimated, or is not applicable.
Start_EndCoordinatePrecision (xsd:decimal)
Nillable?: true
Sub Grp: digir:searchableReturnableData

An estimate of how tightly the locality was specified in the Start/End Latitude and Longitude fields; expressed as a distance, in meters, that corresponds to a radius around the latitude-longitude coordinates. Use NULL where precision is unknown, cannot be estimated, or is not applicable.
0.603
VerbatimDepth (xsd:string)
Nillable?: true
Sub Grp: digir:searchableReturnableData

A text representation of the Depth in its original format in the source database.
DepthRange (xsd:string)
Nillable?: true
Sub Grp: digir:searchableReturnableData

For data sets that have the depth range expressed in one field (e.g. “150–200 m”) it can be entered here as free text. Separate, numeric Minimum and Maximum Depth fields are the preferred format; the Depth Range option is included for legacy data sets.
0.591
TypeStatus (xsd:string)
Nillable?: true
Sub Grp: digir:searchableReturnableData

A list of one or more nomenclatural types that the cataloged item represents (e.g., “holotype of Ctenomys sociabilis. Pearson O. P., and M. I. Christie. 1985. Historia Natural, 5(37):388.”).
Temperature (xsd:decimal)
Nillable?: true
Sub Grp: digir:searchableReturnableData

The temperature recorded with the collection/record event. Is assumed to be taken at the collection depth. Expressed in degrees Celsius.
0.615
TypeStatus (xsd:string)
Nillable?: true
Sub Grp: digir:searchableReturnableData

A list of one or more nomenclatural types that the cataloged item represents (e.g., “holotype of Ctenomys sociabilis. Pearson O. P., and M. I. Christie. 1985. Historia Natural, 5(37):388.”).
LifeStage (xsd:string)
Nillable?: true
Sub Grp: digir:searchableReturnableData

Indicates the life stage present. Will require developing a controlled vocabulary. Can include multiple stages for a lot with multiple individuals.
0.583
IndividualCount (xsd:nonNegativeInteger)
Nillable?: true
Sub Grp: digir:searchableReturnableData

The number of individuals present in the lot or container referred to by the catalog number. Not an estimate of abundance or density at the collecting locality.
ObservedIndividualCount (xsd:nonNegativeInteger)
Nillable?: true
Sub Grp: digir:searchableReturnableData

The number of individuals (abundance) found in a collection/record event.
0.820
Species (xsd:string)
Nillable?: true
Sub Grp: digir:searchableReturnableData

The phylogenetic specific epithet of the cataloged item.
SampleSize (xsd:string)
Nillable?: true
Sub Grp: digir:searchableReturnableData

Sample_size: the size of the sample from which the collection/observation was drawn. It can be a volume (e.g. for a phytoplankton sample), a linear distance (e.g. for a visual transect or net haul), a surface area (e.g. for a benthic core), etc. This field must also include the units, e.g. 200 mfor a transect, or 0.25 m2 for a benthic grab (use to denote a superscript). Note that when multiple collections/observations are reported from the same physical sample, a code identifying the sample can be placed in the Field_Number field to allow all collections/observations from a single sample to be connected.
0.639
TimeCollected ()
Nillable?: true
Sub Grp: digir:searchableReturnableData

The time of day the cataloged item was collected, expressed as decimal hours from midnight, local time (e.g., 12.0 = noon, 13.5 = 1:30pm).
ObservedWeight ()
Nillable?: true
Sub Grp: digir:searchableReturnableData

The total biomass found in a collection/record event. Expressed as kg.
0.488
VerbatimLatitude (xsd:string)
Nillable?: true
Sub Grp: digir:searchableReturnableData

A text representation of the Latitude data in its original format in the source database.
GMLFeature (xsd:string)
Nillable?: true
Sub Grp: digir:searchableReturnableData

Geographic Markup Language(GML) description of the feature for representing complex shapes such as lines and polygons, per Open GIS Consortium (OGC) standards – http://www.opengis.net/gml/01–029/GML2.html.
0.568