
Introduction

Schema Builder is SureCentric's AI-assisted JSON-LD schema editor. It is the primary authoring surface for all schemas in SureCentric, an AI-native platform for centralized clinical intelligence that is purpose-built for regulated health sciences enterprises.

SureCentric unifies three traditionally separate technical domains — ontology-driven schema management, dynamic data warehousing, and natural-language AI analytics — into a single, auditable platform. Schema Builder sits at the core of this architecture: it imports OWL content models from SureDrive document archives, generates SQL DDL for the analytics warehouse, and bridges every downstream platform component — from the CARD system for dataset provisioning to DataAgent for NL-to-SQL grounding and SureAgent for agentic workflow orchestration.

Figure: Schema Builder, Schema Mode

It runs as:

  • A standalone web application (npm run dev in SchemaBuilder/meta_configurator/)
  • An iframe-embedded service inside the SureClinical AngularJS web client
  • A headless translation service invoked programmatically via plugins

Vision

Schema Builder provides a pre-configured schema editor, viewer, and generator whose schemas are both machine-readable and human-readable. It supports existing SureDrive-based archives that use OWL/Nuxeo-based schemas and adds the ability to generate JSON and DDL for SQL-based databases.

With JSON-LD as its core schema format, Schema Builder can translate and export to OWL or JSON. JSON exports can be published to a Schema Catalog or Registry that Apache Superset uses to create dataset dashboards, reports, charts, and SQL queries.


The Problem

Clinical organizations operate across fragmented, incompatible schema worlds:

| World                                    | Format        | Used For                               |
|------------------------------------------|---------------|----------------------------------------|
| Document archives (SureDrive, Nuxeo)     | OWL / RDF-XML | Regulatory content models, eTMF, EHR   |
| Relational analytics (Superset, Grafana) | SQL DDL       | Dashboards, reports, SQL Lab           |
| API contracts and validation             | JSON Schema   | REST APIs, data validation             |
| Semantic web and linked data             | JSON-LD       | Ontology integration, knowledge graphs |

Moving a schema between these worlds today requires bespoke conversion tooling, manual DDL authoring, repeated re-entry of the same structural information, and brittle one-way transformations. There is no single canonical representation, no audit trail, and no reproducible pipeline.


The Solution

Schema Builder makes JSON-LD the pivot format — a single canonical representation that is simultaneously:

  • Valid JSON (readable by any JSON tool)
  • Valid JSON-LD 1.1 (W3C-compliant linked data, processed by jsonld.js)
  • The single source of truth for all downstream exports
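To make the first two points concrete, here is a minimal TypeScript sketch. The schema document, the `https://example.org/schema/` prefix, and names such as `sc:ClinicalDocument` and `sc:properties` are hypothetical illustrations, not taken from Schema Builder. The sketch only shows that a JSON-LD document can be read with an ordinary JSON parser and that its `@context` resolves prefixed names to full IRIs. Real JSON-LD 1.1 processing (expansion, compaction) would go through a library such as jsonld.js, which this sketch does not call.

```typescript
// Hypothetical JSON-LD schema document (names and URLs are illustrative only).
const schemaText = `{
  "@context": {
    "sc": "https://example.org/schema/",
    "xsd": "http://www.w3.org/2001/XMLSchema#"
  },
  "@id": "sc:ClinicalDocument",
  "@type": "sc:Schema",
  "sc:properties": [
    { "@id": "sc:title", "sc:range": "xsd:string" },
    { "@id": "sc:signedOn", "sc:range": "xsd:date" }
  ]
}`;

interface SchemaProperty {
  "@id": string;
  "sc:range": string;
}

interface SchemaDoc {
  "@context": Record<string, string>;
  "@id": string;
  "@type": string;
  "sc:properties": SchemaProperty[];
}

// Any JSON tool can read the document; no JSON-LD library is needed for this step.
const doc = JSON.parse(schemaText) as SchemaDoc;

// Expand a compact IRI such as "sc:title" using the prefixes in @context.
// This is a simplified stand-in for real JSON-LD expansion.
function expandCompactIri(compact: string, context: Record<string, string>): string {
  const idx = compact.indexOf(":");
  if (idx < 0) return compact; // no prefix, return unchanged
  const base = context[compact.slice(0, idx)];
  return base === undefined ? compact : base + compact.slice(idx + 1);
}

// Full IRIs for every property declared in the hypothetical schema.
const propertyIris = doc["sc:properties"].map((p) =>
  expandCompactIri(p["@id"], doc["@context"])
);
```

Because the document is plain JSON, generic tooling (diffing, linting, storage) works on it unchanged, while the `@context` carries the linked-data meaning that a JSON-LD processor would use.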

From one JSON-LD schema, Schema Builder generates:

JSON-LD Schema (canonical)
│
├──► OWL / RDF-XML → SureDrive archives, Nuxeo content models
├──► SQL DDL → PostgreSQL / DuckDB CREATE TABLE statements
├──► JSON Schema → API validation, schema registries
└──► SQL INSERT data → Faker.js seed datasets for Superset dashboards

Every schema change propagates through the pipeline. The same schema that governs a SureDrive archive also generates the analytics warehouse tables and seeds the dashboard data.
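As an illustration of the one-source, many-exports idea, the following TypeScript sketch derives both a CREATE TABLE statement and a JSON Schema fragment from the same in-memory schema description. The `FieldDef` and `TableSchema` shapes, the type mappings, and the function names are assumptions made for this example. Schema Builder's actual translation rules are not described here.

```typescript
// Hypothetical, simplified schema model; not Schema Builder's internal format.
type XsdType = "xsd:string" | "xsd:integer" | "xsd:date" | "xsd:boolean";

interface FieldDef {
  name: string; // assumed to already be a safe SQL identifier
  range: XsdType;
  required: boolean;
}

interface TableSchema {
  name: string;
  fields: FieldDef[];
}

// Assumed mapping to column types accepted by both PostgreSQL and DuckDB.
const SQL_TYPES: Record<XsdType, string> = {
  "xsd:string": "TEXT",
  "xsd:integer": "INTEGER",
  "xsd:date": "DATE",
  "xsd:boolean": "BOOLEAN",
};

// Assumed mapping to JSON Schema keywords.
const JSON_TYPES: Record<XsdType, Record<string, string>> = {
  "xsd:string": { type: "string" },
  "xsd:integer": { type: "integer" },
  "xsd:date": { type: "string", format: "date" },
  "xsd:boolean": { type: "boolean" },
};

// Export 1: SQL DDL. No identifier quoting or escaping is attempted here.
function toCreateTable(schema: TableSchema): string {
  const columns = schema.fields.map(
    (f) => `  ${f.name} ${SQL_TYPES[f.range]}${f.required ? " NOT NULL" : ""}`
  );
  return `CREATE TABLE ${schema.name} (\n${columns.join(",\n")}\n);`;
}

// Export 2: a JSON Schema object built from the same field list.
function toJsonSchema(schema: TableSchema): object {
  const properties: Record<string, Record<string, string>> = {};
  for (const f of schema.fields) {
    properties[f.name] = JSON_TYPES[f.range];
  }
  return {
    type: "object",
    properties,
    required: schema.fields.filter((f) => f.required).map((f) => f.name),
  };
}

// Single source definition feeding both exports.
const clinicalDocument: TableSchema = {
  name: "clinical_document",
  fields: [
    { name: "title", range: "xsd:string", required: true },
    { name: "signed_on", range: "xsd:date", required: false },
  ],
};

const ddl = toCreateTable(clinicalDocument);
const jsonSchema = toJsonSchema(clinicalDocument);
```

Adding a field to `clinicalDocument.fields` changes both `ddl` and `jsonSchema` on the next run, which is the propagation property in miniature: the exports are derived, never edited by hand.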