Class: PreprocessingStrategy
Was any preprocessing of the data done (e.g., discretization or bucketing, tokenization, SIFT feature extraction)?
URI: data_sheets_schema:PreprocessingStrategy
erDiagram
PreprocessingStrategy {
    stringList description  
    string id  
    string name  
}
Software {
    string version  
    string license  
    string url  
    string id  
    string name  
    string description  
}
PreprocessingStrategy ||--}o Software : "used_software"
Inheritance
- NamedThing
    - DatasetProperty
        - PreprocessingStrategy
Slots
| Name | Cardinality and Range | Description | Inheritance |
|---|---|---|---|
| description | * String | | direct |
| used_software | * Software | What software was used as part of this dataset property? | DatasetProperty |
| id | 1 String | the unique name of the dataset | NamedThing |
| name | 0..1 String | | NamedThing |
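
To make the cardinalities in the slot table concrete, here is a minimal sketch in plain Python dataclasses. It is not the Python code that LinkML generates for this schema. The `Software` fields follow the diagram above, but treating every `Software` field except `id` as optional is an assumption, and the identifiers and values in the example instance are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Software:
    # Field names follow the diagram; optionality is an assumption.
    id: str
    name: Optional[str] = None
    description: Optional[str] = None
    version: Optional[str] = None
    license: Optional[str] = None
    url: Optional[str] = None


@dataclass
class PreprocessingStrategy:
    id: str                                                       # 1 String
    name: Optional[str] = None                                    # 0..1 String
    description: List[str] = field(default_factory=list)          # * String
    used_software: List[Software] = field(default_factory=list)   # * Software

    def __post_init__(self) -> None:
        # `id` is the only required slot in the table.
        if not self.id:
            raise ValueError("id is required (cardinality 1)")


# Hypothetical instance: identifiers and values are illustrative only.
strategy = PreprocessingStrategy(
    id="example:tokenization",
    name="Tokenization",
    description=["Text was lowercased.", "Text was split into word tokens."],
    used_software=[
        Software(id="example:tokenizer", name="ExampleTokenizer", version="1.0")
    ],
)
```

Note that `description` is a list here because the slot is multivalued on this class, whereas `Software.description` is a single string in the diagram.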
Usages
| used by | used in | type | used | 
|---|---|---|---|
| Dataset | preprocessing_strategies | range | PreprocessingStrategy | 
| DataSubset | preprocessing_strategies | range | PreprocessingStrategy | 
Identifier and Mapping Information
Schema Source
- from schema: https://w3id.org/bridge2ai/data-sheets-schema
Mappings
| Mapping Type | Mapped Value | 
|---|---|
| self | data_sheets_schema:PreprocessingStrategy | 
| native | data_sheets_schema:PreprocessingStrategy | 
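
The mapped values above are CURIEs (a prefix, a colon, then a local name). As a rough sketch of how such a value expands to a full URI: the prefix map below is an assumption, since this page does not state what `data_sheets_schema` expands to, and `expand_curie` is a hypothetical helper, not a LinkML API.

```python
from typing import Dict

# Assumed prefix map: the real expansion is defined in the schema's
# `prefixes` section, which is not shown on this page.
PREFIXES: Dict[str, str] = {
    "data_sheets_schema": "https://w3id.org/bridge2ai/data-sheets-schema/",
}


def expand_curie(curie: str, prefixes: Dict[str, str]) -> str:
    """Expand 'prefix:local' into a full URI using the given prefix map."""
    prefix, sep, local = curie.partition(":")
    if not sep or prefix not in prefixes:
        raise KeyError(f"unknown or missing prefix in {curie!r}")
    return prefixes[prefix] + local


uri = expand_curie("data_sheets_schema:PreprocessingStrategy", PREFIXES)
```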
LinkML Source
Direct
name: PreprocessingStrategy
description: Was any preprocessing of the data done (e.g., discretization or bucketing,
  tokenization, SIFT feature extraction)?
in_subset:
- Preprocessing-Cleaning-Labeling
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
is_a: DatasetProperty
attributes:
  description:
    name: description
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    multivalued: true
    domain_of:
    - NamedThing
    - Information
    - Relationships
    - Splits
    - DataAnomaly
    - Confidentiality
    - Deidentification
    - SensitiveElement
    - InstanceAcquisition
    - CollectionMechanism
    - DataCollector
    - CollectionTimeframe
    - EthicalReview
    - DirectCollection
    - CollectionNotification
    - CollectionConsent
    - ConsentRevocation
    - DataProtectionImpact
    - PreprocessingStrategy
    - CleaningStrategy
    - LabelingStrategy
    - RawData
    - ExistingUse
    - UseRepository
    - OtherTask
    - FutureUseImpact
    - DiscouragedUse
    - ThirdPartySharing
    - DistributionFormat
    - DistributionDate
    - LicenseAndUseTerms
    - IPRestrictions
    - ExportControlRegulatoryRestrictions
    - Maintainer
    - Erratum
    - UpdatePlan
    - RetentionLimits
    - VersionAccess
    - ExtensionMechanism
    range: string
Induced
name: PreprocessingStrategy
description: Was any preprocessing of the data done (e.g., discretization or bucketing,
  tokenization, SIFT feature extraction)?
in_subset:
- Preprocessing-Cleaning-Labeling
from_schema: https://w3id.org/bridge2ai/data-sheets-schema
is_a: DatasetProperty
attributes:
  description:
    name: description
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    multivalued: true
    alias: description
    owner: PreprocessingStrategy
    domain_of:
    - NamedThing
    - Information
    - Relationships
    - Splits
    - DataAnomaly
    - Confidentiality
    - Deidentification
    - SensitiveElement
    - InstanceAcquisition
    - CollectionMechanism
    - DataCollector
    - CollectionTimeframe
    - EthicalReview
    - DirectCollection
    - CollectionNotification
    - CollectionConsent
    - ConsentRevocation
    - DataProtectionImpact
    - PreprocessingStrategy
    - CleaningStrategy
    - LabelingStrategy
    - RawData
    - ExistingUse
    - UseRepository
    - OtherTask
    - FutureUseImpact
    - DiscouragedUse
    - ThirdPartySharing
    - DistributionFormat
    - DistributionDate
    - LicenseAndUseTerms
    - IPRestrictions
    - ExportControlRegulatoryRestrictions
    - Maintainer
    - Erratum
    - UpdatePlan
    - RetentionLimits
    - VersionAccess
    - ExtensionMechanism
    range: string
  used_software:
    name: used_software
    description: What software was used as part of this dataset property?
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    multivalued: true
    alias: used_software
    owner: PreprocessingStrategy
    domain_of:
    - DatasetProperty
    range: Software
  id:
    name: id
    description: the unique name of the dataset
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    exact_mappings:
    - schema:name
    rank: 1000
    slot_uri: dcterms:identifier
    identifier: true
    alias: id
    owner: PreprocessingStrategy
    domain_of:
    - NamedThing
    - Information
    range: string
    required: true
  name:
    name: name
    from_schema: https://w3id.org/bridge2ai/data-sheets-schema
    rank: 1000
    slot_uri: schema:name
    alias: name
    owner: PreprocessingStrategy
    domain_of:
    - NamedThing
    range: string
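
The Induced view differs from the Direct view in that it also lists slots inherited through the `is_a` chain (`used_software` from DatasetProperty, `id` and `name` from NamedThing). The following sketch shows one way such a merged view could be computed. The `CLASSES` table is hand-copied and simplified from this page, `induced_attributes` is a hypothetical helper, and this is not how LinkML itself derives the induced definition.

```python
from typing import Dict, List

# Simplified, hand-copied view of the three classes in the inheritance chain.
# Slot placement follows the Inheritance column of the Slots table.
CLASSES: Dict[str, dict] = {
    "NamedThing": {"is_a": None, "attributes": ["id", "name", "description"]},
    "DatasetProperty": {"is_a": "NamedThing", "attributes": ["used_software"]},
    "PreprocessingStrategy": {"is_a": "DatasetProperty", "attributes": ["description"]},
}


def induced_attributes(class_name: str, classes: Dict[str, dict]) -> Dict[str, str]:
    """Map each slot to the most specific class in the chain that declares it."""
    chain: List[str] = []
    current = class_name
    while current is not None:
        chain.append(current)
        current = classes[current]["is_a"]

    defined_on: Dict[str, str] = {}
    # Walk from the root down so that more specific classes override ancestors.
    for cls in reversed(chain):
        for slot in classes[cls]["attributes"]:
            defined_on[slot] = cls
    return defined_on


induced = induced_attributes("PreprocessingStrategy", CLASSES)
```

Here `defined_on` records where a slot is declared, which is a different notion from the `owner` key in the Induced YAML: that key is `PreprocessingStrategy` for every slot because the induced view is computed for this class.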