Training Data Markup Language for Artificial Intelligence

This standard aims to develop the unified modeling language (UML) model and encodings for geospatial machine learning training data. Training data plays a fundamental role in Earth Observation (EO) Artificial Intelligence Machine Learning (AI/ML), especially Deep Learning (DL). It is used to train, validate, and test AI/ML models.

OGC Open Call Requests
Overview

The Training Data Markup Language for Artificial Intelligence (TrainingDML-AI) Standard aims to develop the UML model and encodings for geospatial machine learning training data. Training data plays a fundamental role in Earth Observation (EO) Artificial Intelligence Machine Learning (AI/ML), especially Deep Learning (DL). It is used to train, validate, and test AI/ML models. This Standard defines a UML model and encodings consistent with the OGC Standards baseline to exchange and retrieve the training data in the Web environment.

The TrainingDML-AI Standard provides detailed metadata for formalizing the information model of training data. This includes but is not limited to the following aspects:

  • How to introduce external classification schemes and flexible means for representing ground truth labeling.
  • How the training data is prepared, such as provenance or quality;
  • How to specify different metadata used for different ML tasks such as scene/object/pixel levels;
  • How to differentiate the high-level training data information model and extended information models specific to various ML applications; and

Open Calls and Requests

Members of the public can review draft standards and share feedback to ensure they are practical and widely applicable.

Explore Insights

You’ll find the latest thought leadership from OGC here.  

OverviewTestbed Europe is an emerging Open Geospatial Consortium (OGC) initiative exploring how Europe’s...