Top Qs
Timeline
Chat
Perspective
LinkML
From Wikipedia, the free encyclopedia
Remove ads
The Linked Data Modeling Language, or LinkML, is a data modeling framework based on YAML that aims to "bring semantic web standards to the masses, simplifying the production of FAIR, ontology-ready data."[1] It can be used to specify data schemas that can be used for validating data via formats like JSON-LD and ShEx.
Description
Summarize
Perspective
The Linked data Modeling Language is an object-oriented data modeling framework aiming to simplify the production of FAIR data. It is intended to be used for schematizing a variety of kinds of data, ranging from simple flat checklist-style standards to complex interrelated normalized data utilizing polymorphism/inheritance.[1]
LinkML structure

LinkML is designed to be aligned with the standards used by modern software developers and database engineers, including JSON files, relational databases, document stores and object models in Python, while providing a semantic web underpinning, mapping all elements to Resource Descriptoin Framework (RDF) Uniform Resource Identifiers (URIs). LinkML’s formal RDF-based framework aims at abstracting the complexity and allow semantics to "hide in plain sight".[1]
The basic structure is a schema plus associated metadata (including namespace to URI mapping), a set of classes, plus their attributes. Classes follow object-oriented semantics rather than OWL semantics, allowing classes to also be metaclasses, enabling the modeling of design patterns used in some computational ontologies. Each element in the schema can be assigned URIs from existing vocabularies, allowing for increased interoperability. [1]
The LinkML schema tries to anchor the meaning of free text strings by establishing identity via resolvable URIs. Technically, the framework allows the models using both open and closed world assumptions, and when operating in a closed world provides ways to validate schema instances and their relations, using modeling paradigms like JSON-Schema and SQL-DDL. The language itself reuses terms from other ontologies, such as the Simple Knowledge Organization System Namespace (SKOS). The formalims allow users to extend or reuse existing object definitions while at the mapping data to existing standards where appropriate (for example, a "gene" object in one LinkML schema can be mapped directly to another LinkML schema’s representation of a "gene" via the "skos:exact_match" predicate).[1]
Generators
Tooling referred to as LinkML generators can translate from the schema YAML to a growing number of other formats, including JSON-schema, JSON-LD, SQL DDL, ShEx, GraphQL, Python data classes, Markdown and UML diagrams.[1]The LinkML runtime provides loaders and dumpers to convert instances of the schema between these formats.
Use cases
LinkML is used in a range of projects in biomedical research, including:
- The United States National Microbiome Data Collaborativem for storing data related to microbiome studies.[1][2]
- The Center for Cancer Data Harmonization.[1]
- The NCATS Biomedical Data Translator. [1]
- The Alliance of Genome Resources. [1][3]
- The Monarch Initiative.[4]
LinkML also being discussed as a way to model to store microscopy research data in schemas that allow exports in diverse data formats, including JSON-LD and SQL, particularly in the context of the German NFDI4BIOIMAGE.[5][6][7]
Its usage includes basic research on biocuration and the semantic web, e.g in the context of data integration using Shape Expressions,[8] in the development of standards for mapping ontologies[9] or on integration of LLMs into biocuration practice. [10]
The Biolink Model, a schema for knowledge graphs in the clinical and biomedical sciences, is built upon LinkML.[11]
Remove ads
Notes
This page contains text in CC-BY 4.0 from Sierra Moxon and colleagues, from the publication The Linked Data Modeling Language (LinkML): A General-Purpose Data Modeling Framework Grounded in Machine-Readable Semantics, in the proceedings of the International Conference on Biomedical Ontologies in 2021.[1]
References
External links
Wikiwand - on
Seamless Wikipedia browsing. On steroids.
Remove ads