DatAasee - A Metadata-Lake
Christian Himpe

Christian Himpe @gramian

About: Applied Mathematics - Functional Programming - Data Engineering

Location:
Sol, Orion Arm
Joined:
Apr 7, 2020

DatAasee - A Metadata-Lake

Publish Date: May 6
0 0

DatAasee is an open-source metadata-lake, functioning as a metadata aggregator of distributed data-sources in libraries. It instantiates a data space and provides a FAIR-compliant metadata-layer for repositories. Its data architecture is a data-lake which is its own metadata catalog, hence a metadata-lake, with an underlying graph data model.

Internally, DatAasee is realized as a three tier software architecture, where each tier is encapsulated in its own container. The inner data-tier is a multi-model NoSQL database ArcadeDB. The middle logic-tier is an (HTTP-JSON-REST) API-server and semantic-layer around the database realized via Connect (aka Benthos). The outer presentation-layer uses Lowdefy as a prototype frontend which exclusively uses the API. This multi-container application is deployed and orchestrated via Compose.

For in-depth explanations, see the open-access white paper: "DatAasee -- A Metadata-Lake as Metadata Catalog for a Virtual Data-Lake".

Comments 0 total

    Add comment