Class: Instance
What do the instances that comprise the dataset represent (e.g., documents, photos, people, countries)?
URI: data_sheets_schema:Instance
erDiagram
Instance {
string representation
string instance_type
string data_type
integer counts
string label
string id
string name
string description
}
Software {
string version
string license
string url
string id
string name
string description
}
MissingInfo {
stringList missing
stringList why_missing
string id
string name
string description
}
SamplingStrategy {
stringList is_sample
stringList is_random
stringList source_data
stringList is_representative
stringList representative_verification
stringList why_not_representative
stringList strategies
string id
string name
string description
}
Instance ||--}o SamplingStrategy : "sampling_strategies"
Instance ||--}o MissingInfo : "missing_information"
Instance ||--}o Software : "used_software"
MissingInfo ||--}o Software : "used_software"
SamplingStrategy ||--}o Software : "used_software"
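The diagram above can be read as plain data structures. The following is a minimal sketch using Python dataclasses, not code generated by the LinkML toolchain; the class and field names follow the diagram, while the Python typing choices (`Optional` for single-valued slots, empty lists for multivalued ones) and the example values are assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Software:
    id: str  # the only required slot
    name: Optional[str] = None
    description: Optional[str] = None
    version: Optional[str] = None
    license: Optional[str] = None
    url: Optional[str] = None


@dataclass
class MissingInfo:
    id: str
    name: Optional[str] = None
    description: Optional[str] = None
    missing: List[str] = field(default_factory=list)
    why_missing: List[str] = field(default_factory=list)
    used_software: List[Software] = field(default_factory=list)


@dataclass
class SamplingStrategy:
    id: str
    name: Optional[str] = None
    description: Optional[str] = None
    is_sample: List[str] = field(default_factory=list)
    is_random: List[str] = field(default_factory=list)
    source_data: List[str] = field(default_factory=list)
    is_representative: List[str] = field(default_factory=list)
    representative_verification: List[str] = field(default_factory=list)
    why_not_representative: List[str] = field(default_factory=list)
    strategies: List[str] = field(default_factory=list)
    used_software: List[Software] = field(default_factory=list)


@dataclass
class Instance:
    id: str
    name: Optional[str] = None
    description: Optional[str] = None
    representation: Optional[str] = None
    instance_type: Optional[str] = None
    data_type: Optional[str] = None
    counts: Optional[int] = None
    label: Optional[str] = None
    # The three "zero or more" relationships from the diagram.
    sampling_strategies: List[SamplingStrategy] = field(default_factory=list)
    missing_information: List[MissingInfo] = field(default_factory=list)
    used_software: List[Software] = field(default_factory=list)


# Hypothetical example values, not taken from any real dataset.
example = Instance(
    id="example:instances",
    representation="documents",
    counts=1200,
    sampling_strategies=[SamplingStrategy(id="example:sampling")],
)
```

Each multivalued relationship defaults to an empty list, which matches the "zero or more" end of the diagram's relationship lines.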
Inheritance
- NamedThing
  - DatasetProperty
    - Instance
Slots
Name | Cardinality and Range | Description | Inheritance |
---|---|---|---|
representation | 0..1 String | | direct |
instance_type | 0..1 String | Are there multiple types of instances (e.g., movies, users, and ratings; people and interactions between them; nodes and edges)? | direct |
data_type | 0..1 String | What data does each instance consist of? “Raw” data (e.g., unprocessed text or images) or features? In either case, please provide a description. | direct |
counts | 0..1 Integer | How many instances are there in total (of each type, if appropriate)? | direct |
label | 0..1 String | Is there a label or target associated with each instance? | direct |
sampling_strategies | * SamplingStrategy | | direct |
missing_information | * MissingInfo | | direct |
used_software | * Software | What software was used as part of this dataset property? | DatasetProperty |
id | 1 String | the unique name of the dataset | NamedThing |
name | 0..1 String | | NamedThing |
description | 0..1 String | human readable description of the information | NamedThing |
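The cardinalities in the table (`1`, `0..1`, `*`) can be checked mechanically. The sketch below is a hand-written check over a plain dictionary, not the LinkML validator; the `SLOTS` table and the `check_instance` helper are hypothetical names introduced here, and nested objects are only checked for being dictionaries.

```python
from typing import Any, Dict, List, Tuple

# slot name -> (required, multivalued, expected Python type)
# Hypothetical lookup table transcribed from the Slots table.
SLOTS: Dict[str, Tuple[bool, bool, type]] = {
    "representation": (False, False, str),
    "instance_type": (False, False, str),
    "data_type": (False, False, str),
    "counts": (False, False, int),
    "label": (False, False, str),
    "sampling_strategies": (False, True, dict),
    "missing_information": (False, True, dict),
    "used_software": (False, True, dict),
    "id": (True, False, str),
    "name": (False, False, str),
    "description": (False, False, str),
}


def check_instance(record: Dict[str, Any]) -> List[str]:
    """Return a list of cardinality and type problems (empty if none)."""
    problems: List[str] = []
    for slot, (required, multivalued, py_type) in SLOTS.items():
        value = record.get(slot)
        if value is None:
            if required:
                problems.append(f"missing required slot: {slot}")
            continue
        if multivalued:
            if not isinstance(value, list):
                problems.append(f"{slot} must be a list")
                continue
            items = value
        else:
            items = [value]
        for item in items:
            # bool is a subclass of int in Python, so reject it explicitly.
            if isinstance(item, bool) or not isinstance(item, py_type):
                problems.append(f"{slot} expects {py_type.__name__}")
                break
    return problems


ok = check_instance({"id": "example:instances", "counts": 1200})
bad = check_instance({"counts": "many", "used_software": {"id": "example:tool"}})
```

Here `ok` is empty, while `bad` reports the missing `id`, the non-integer `counts`, and the `used_software` value that is not a list.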
Usages
used by | used in | type | used |
---|---|---|---|
Dataset | instances | range | Instance |
DataSubset | instances | range | Instance |
Identifier and Mapping Information
Schema Source
- from schema: https://w3id.org/bridge2ai/data-sheets-schema
Mappings
Mapping Type | Mapped Value |
---|---|
self | data_sheets_schema:Instance |
native | data_sheets_schema:Instance |
LinkML Source
Direct
name: Instance
description: What do the instances that comprise the dataset represent (e.g., documents,
  photos, people, countries)?
in_subset:
- Composition
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
is_a: DatasetProperty
attributes:
  representation:
    name: representation
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    domain_of:
    - Instance
    range: string
  instance_type:
    name: instance_type
    description: Are there multiple types of instances (e.g., movies, users, and ratings;
      people and interactions between them; nodes and edges)?
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    domain_of:
    - Instance
    range: string
  data_type:
    name: data_type
    description: What data does each instance consist of? “Raw” data (e.g., unprocessed
      text or images) or features? In either case, please provide a description.
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    domain_of:
    - Instance
    range: string
  counts:
    name: counts
    description: How many instances are there in total (of each type, if appropriate)?
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    domain_of:
    - Instance
    range: integer
  label:
    name: label
    description: Is there a label or target associated with each instance?
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    domain_of:
    - Instance
    range: string
  sampling_strategies:
    name: sampling_strategies
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    multivalued: true
    domain_of:
    - Dataset
    - Instance
    range: SamplingStrategy
  missing_information:
    name: missing_information
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    multivalued: true
    domain_of:
    - Instance
    range: MissingInfo
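The Direct source lists only the seven attributes declared on Instance itself; the remaining slots are inherited through `is_a`. The following sketch shows one way the full slot set could be derived by walking the `is_a` chain. It is an illustration under stated assumptions, not the LinkML toolchain's own algorithm: the `CLASSES` table and `induced_slots` function are hypothetical, and the slot lists for NamedThing and DatasetProperty include only the slots this page attributes to them.

```python
from typing import Dict, List, Optional, TypedDict


class ClassDef(TypedDict):
    is_a: Optional[str]
    slots: List[str]


# Hypothetical, hand-transcribed view of the three classes in the hierarchy.
CLASSES: Dict[str, ClassDef] = {
    "NamedThing": {"is_a": None, "slots": ["id", "name", "description"]},
    "DatasetProperty": {"is_a": "NamedThing", "slots": ["used_software"]},
    "Instance": {
        "is_a": "DatasetProperty",
        "slots": [
            "representation",
            "instance_type",
            "data_type",
            "counts",
            "label",
            "sampling_strategies",
            "missing_information",
        ],
    },
}


def induced_slots(class_name: str) -> Dict[str, str]:
    """Map each slot available on class_name to the class that declares it."""
    result: Dict[str, str] = {}
    current: Optional[str] = class_name
    while current is not None:
        # Most specific class first, so a redeclared slot keeps its nearest owner.
        for slot in CLASSES[current]["slots"]:
            result.setdefault(slot, current)
        current = CLASSES[current]["is_a"]
    return result


slots = induced_slots("Instance")
```

The resulting mapping has eleven entries, with the four inherited slots (`used_software`, `id`, `name`, `description`) following the seven direct ones.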
Induced
name: Instance
description: What do the instances that comprise the dataset represent (e.g., documents,
  photos, people, countries)?
in_subset:
- Composition
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
is_a: DatasetProperty
attributes:
  representation:
    name: representation
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    alias: representation
    owner: Instance
    domain_of:
    - Instance
    range: string
  instance_type:
    name: instance_type
    description: Are there multiple types of instances (e.g., movies, users, and ratings;
      people and interactions between them; nodes and edges)?
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    alias: instance_type
    owner: Instance
    domain_of:
    - Instance
    range: string
  data_type:
    name: data_type
    description: What data does each instance consist of? “Raw” data (e.g., unprocessed
      text or images) or features? In either case, please provide a description.
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    alias: data_type
    owner: Instance
    domain_of:
    - Instance
    range: string
  counts:
    name: counts
    description: How many instances are there in total (of each type, if appropriate)?
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    alias: counts
    owner: Instance
    domain_of:
    - Instance
    range: integer
  label:
    name: label
    description: Is there a label or target associated with each instance?
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    alias: label
    owner: Instance
    domain_of:
    - Instance
    range: string
  sampling_strategies:
    name: sampling_strategies
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    multivalued: true
    alias: sampling_strategies
    owner: Instance
    domain_of:
    - Dataset
    - Instance
    range: SamplingStrategy
  missing_information:
    name: missing_information
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    multivalued: true
    alias: missing_information
    owner: Instance
    domain_of:
    - Instance
    range: MissingInfo
  used_software:
    name: used_software
    description: What software was used as part of this dataset property?
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    multivalued: true
    alias: used_software
    owner: Instance
    domain_of:
    - DatasetProperty
    range: Software
  id:
    name: id
    description: the unique name of the dataset
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    exact_mappings:
    - schema:name
    rank: 1000
    slot_uri: dcterms:identifier
    identifier: true
    alias: id
    owner: Instance
    domain_of:
    - NamedThing
    - Information
    range: string
    required: true
  name:
    name: name
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    slot_uri: schema:name
    alias: name
    owner: Instance
    domain_of:
    - NamedThing
    range: string
  description:
    name: description
    description: human readable description of the information
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    slot_uri: dcterms:description
    alias: description
    owner: Instance
    domain_of:
    - NamedThing
    - Information
    - Relationships
    - Splits
    - DataAnomaly
    - Confidentiality
    - Deidentification
    - SensitiveElement
    - InstanceAcquisition
    - CollectionMechanism
    - DataCollector
    - CollectionTimeframe
    - EthicalReview
    - DirectCollection
    - CollectionNotification
    - CollectionConsent
    - ConsentRevocation
    - DataProtectionImpact
    - PreprocessingStrategy
    - CleaningStrategy
    - LabelingStrategy
    - RawData
    - ExistingUse
    - UseRepository
    - OtherTask
    - FutureUseImpact
    - DiscouragedUse
    - ThirdPartySharing
    - DistributionFormat
    - DistributionDate
    - LicenseAndUseTerms
    - IPRestrictions
    - ExportControlRegulatoryRestrictions
    - Maintainer
    - Erratum
    - UpdatePlan
    - RetentionLimits
    - VersionAccess
    - ExtensionMechanism
    range: string
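In the Induced source, three slots carry an explicit `slot_uri` (`id`, `name`, `description`); the others list none. The sketch below rekeys a record by slot CURIE to show how those mappings might be applied. It assumes that a slot without an explicit `slot_uri` falls back to the schema's default prefix, taken here to be `data_sheets_schema` from the class URI at the top of this page; that fallback, and the helper names `slot_curie` and `rekey_by_curie`, are assumptions for illustration only.

```python
from typing import Any, Dict

# Explicit slot_uri values copied from the Induced source.
SLOT_URIS: Dict[str, str] = {
    "id": "dcterms:identifier",
    "name": "schema:name",
    "description": "dcterms:description",
}

# Assumed default prefix for slots that declare no slot_uri.
DEFAULT_PREFIX = "data_sheets_schema"


def slot_curie(slot: str) -> str:
    """Return the CURIE for a slot, falling back to the assumed default prefix."""
    return SLOT_URIS.get(slot, f"{DEFAULT_PREFIX}:{slot}")


def rekey_by_curie(record: Dict[str, Any]) -> Dict[str, Any]:
    """Return a copy of record whose keys are slot CURIEs instead of slot names."""
    return {slot_curie(slot): value for slot, value in record.items()}


# Hypothetical record, not taken from any real dataset.
rekeyed = rekey_by_curie({"id": "example:instances", "counts": 1200})
```

Note that `id` also lists `schema:name` under `exact_mappings`; the sketch uses only `slot_uri`, since that is the property that assigns the slot its own URI.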