Publications

5 Publications visible to you, out of a total of 5

Flexible Metadata Structuring for Research Data Management Through the FAIRDOM-SEEK Platform - Implementing Tailored and Complex Metadata Schemes in SEEK

Xiaoming Test

(Show All)

Abstract (Expand)

Modern research projects increasingly require hybrid metadata approaches that balance adherence to domain-overarching, as well as domain-specific community standards with flexibility for project- or … resource-specific metadata. The FAIRDOM-SEEK platform [1] is a widely used research data management system designed to support diverse domains, from systems biology to health research data, by integrating standardized metadata models (e.g., the ISA framework [2]) with customizable extensions. To address this need, we introduce the Extended Metadata feature in SEEK, which allows researchers to extend core metadata schemas with user-defined fields, hierarchies, and semantic annotations while ensuring interoperability with domain-specific standards. We demonstrate this capability through two use cases: 1. NFDI4Health Local Data Hubs (LDH) [3],[4]: In the context of the German National Research Data Infrastructure for Personal Health Data (NFDI4Health [5]), we have developed Local Data Hubs (LDH) based on the SEEK platform. These hubs support federated data structuring and sharing for sensitive health data from clinical trials, epidemiological studies, and public health research and allow to connect local platforms to the central metadata repository of NFDI4Health, the German Health Study Hub. Given the complexity of the NFDI4Health metadata schema (MDS) [6], the SEEK-based LDH software utilizes the Extended Metadata feature to fully represent the schema, allowing for flexible project-defined metadata extensions. 2. FAIR Data Station (FAIR-DS) [7]: Based on the ISA-framework, with the addition of Observation units from MIAPPE [8], the FAIR-DS is a web application that enables users to create and manage metadata according to FAIR principles. Using packages and terms configured through the UI, it generates Excel spreadsheets which are then populated to gather the metadata. FAIR-DS is then used to validate the metadata and generates RDF datasets representing the content. SEEK has been updated to allow Extended Metadata and Sample Types to be configured automatically via these RDF datasets, and also the content can be imported, and updated, in a single action. The Extended Metadata feature allows users to define additional metadata attributes to be tailored to specific data types, ensuring compliance with standards. When creating a resource, users can select an Extended Metadata type from a dropdown menu, dynamically triggering the rendering of associated metadata input forms within the web interface. This enables seamless integration of resource-specific metadata (e.g., clinical trial study metadata) alongside core descriptive fields. Currently, only instance administrators can create, manage (enable/disable), and delete additional attributes for specific resource types (e.g., ISA items such as Investigation, Study, Assay, as well as Projects and Models) based on specific schemas (e.g., the NFDI4Health MDS). Attribute types range from simple (e.g., string, text, date, integer, Boolean) to complex (e.g., controlled vocabularies linked to ontologies, nested hierarchical structures), with validation rules for mandatory or optional fields. Regular expressions are introduced to ensure correct input formatting. Metadata schemas can be created through backend seed files, JSON uploads, or FAIR-DS RDF imports. These schemas are programmatically accessible via the SEEK REST API, enabling automated metadata creation and retrieval. This ensures interoperability with external tools while adhering to FAIR data principles.

Authors: Xiaoming Hu, Stuart Owen, Frank Meineke, Finn Bacall, Carole Goble, Wolfgang Müller, Martin Golebiewski

Date Published: 2025

Publication Type: Conference Paper

DOI: 10.5281/zenodo.16736322

Citation: Zenodo. https://zenodo.org/doi/10.5281/zenodo.16736322.

Created: 11th Nov 2025 at 18:20

FAIR Computational Workflows

Xiaoming Test

Abstract (Expand)

ional workflows describe the complex multi-step methods that are used for data collection, data preparation, analytics, predictive modelling, and simulation that lead to new data products. They can …

Authors: Carole Goble, Sarah Cohen-Boulakia, Stian Soiland-Reyes, Daniel Garijo, Yolanda Gil, Michael R. Crusoe, Kristian Peters, Daniel Schober

Date Published: 2020

Publication Type: Journal Article

DOI: 10.1162/dint_a_00033

Citation: Data Intelligence,2(1-2):108-121

Created: 2nd Dec 2021 at 13:44, Last updated: 11th Mar 2024 at 18:14

Stealthy annotation of experimental biology by spreadsheets

SysMO DB

(Show All)

Abstract

Not specified

Authors: Firstname Lastname, Firstname Lastname, Matthew Horridge, Simon Jupp, Firstname Lastname, Firstname Lastname, Firstname Lastname, Wolfgang Mueller, Robert Stevens, Firstname Lastname

Date Published: 1st Feb 2013

Publication Type: Not specified

DOI: 10.1002/cpe.2941

Citation:

Created: 2nd Jan 2014 at 10:18, Last updated: 24th Mar 2022 at 10:39

Semantic Data and Models Sharing in Systems Biology: The Just Enough Results Model and the SEEK Platform

Refinery NDD

(Show All)

Abstract

sdfsdfsdf

Authors: Katherine Wolstencroft, Stuart Owen, Olga Krebs, Wolfgang Mueller, Quyen Nguyen, Jacky L. Snoep, Carole Goble

Date Published: 2013

Publication Type: Not specified

DOI: 10.1007/978-3-642-41338-4_14

Citation: Lecture Notes in Computer Science 8219 : 212

Created: 4th Oct 2016 at 13:35, Last updated: 24th Mar 2022 at 10:39

Taverna: a tool for building and running workflows of services

SysMO DB

Abstract (Expand)

Taverna is an application that eases the use and integration of the growing number of molecular biology tools and databases available on the web, especially web services. It allows bioinformaticians …

Authors: Duncan Hull, Firstname Lastname, Robert Stevens, Firstname Lastname, Mathew R Pocock, Peter Li, Tom Oinn

Date Published: 18th Jul 2006

Publication Type: Not specified

PubMed ID: 16845108

Citation:

Created: 14th Oct 2010 at 18:16, Last updated: 24th Mar 2022 at 10:39

Publications

Filters ×

Filters