Enterprise GrafDrive

Organize your corporate files into an AI-queryable knowledge graph.

Graffold's GrafDrive crawls filesystems, extracts metadata and table schemas, detects likely duplicates and versions, and gives data teams a reviewable graph of how datasets, owners, folders, and business concepts relate. The resulting knowledge graph is queryable and transparently traceable.

Which supplier file is current?

Dozens of versions across regions with no single source of truth.

From folder tree to harmonization report

Connect β†’ Crawl β†’ Extract β†’ Graph β†’ Review β€” then query and trace every answer back to its source.

1

Connect

Import a bounded folder tree or connect a cloud drive.

2

Crawl

Extract metadata, permissions, and folder structure.

3

Extract

Parse table schemas from CSV/XLSX and document context from PDFs.

4

Graph

Build a graph of files, datasets, columns, owners, and concepts.

5

Review

Approve mappings, resolve duplicates, export harmonization report.

β‘  Source: Corporate Drive
πŸ“ Corporate/
πŸ“ Suppliers/
πŸ“ EMEA/
πŸ“„ supplier_master_v3.xlsx
πŸ“„ supplier_master_v2_OLD.xlsx
πŸ“„ supplier_list_backup(1).csv
πŸ“ APAC/
πŸ“„ vendors_2024_final_FINAL.csv
πŸ“„ vendor_codes_KT.xlsx
πŸ“ NA/
πŸ“„ procurement_suppliers.csv
πŸ“„ Copy of suppliers (2).xlsx
πŸ“ Shared/
πŸ“„ master_supplier_table.xlsx
⚠ 47 files · 4 regions · no single source of truth
β‘‘β‘’ Extracted Metadata
supplier_master_v3.xlsx Β· 847 rows Β· J. Smith
supplier_id company_name region onboard_date
vendors_2024_final_FINAL.csv Β· 312 rows Β· K. Tanaka
vendor_code vendor_name country reg_date
Detected relationships:
0.91 v3.xlsx ↔ vendors_2024.csv
0.87 v3.xlsx ↔ procurement.csv
β‘£β‘€ Knowledge Graph
Dataset: supplier_master
supplier_id β†’ vendor_code
company_name β†’ vendor_name
Rel: likely_duplicate
Owner: J. Smith (EMEA)
Owner: K. Tanaka (APAC)
Canonical: supplier_master
βœ“ 12 families Β· 89 columns Β· 3 canonical tables
βœ“ Queryable Β· Traceable Β· Human-reviewed

Example: Supplier Harmonization

supplier-harmonization
1. Find every supplier onboarding dataset across the company.
2. Group likely duplicates by schema similarity.
3. Compare column schemas across EMEA, APAC, and North America.
4. Identify current versions by modified date and owner.
5. Propose a canonical supplier table with column mappings.
6. Export the evidence report with source citations.
Result: 47 files β†’ 12 dataset families β†’ 3 canonical tables proposed
Evidence: 94 source citations across 6 business units

Built for enterprise data teams

Read-only, permission-aware, and human-reviewed β€” designed for the teams responsible for governing enterprise data.

Read-only source crawl

Graffold never modifies source files.

Permission-aware views

Graph respects source access controls.

No source file mutation

Original files remain untouched.

Every proposal has evidence

Relationships cite source documents.

Human-approved mappings

No canonical assertion without review.

Audit trail for decisions

Every approval/rejection is logged.

Data Governance

Map and reconcile scattered datasets across business units.

Data Platform

Discover file-based data assets outside the warehouse.

Analytics Engineering

Find canonical sources before building new pipelines.

Enterprise Architecture

Visualize how data flows through folder structures.

Operations Analytics

Identify duplicate reports and outdated spreadsheets.

AI Readiness

Assess data landscape before training or fine-tuning.

Get a GrafDrive Audit β†’