AgentData is a semantic layer and data fabric. It connects to your existing databases and data lakes read-only, auto-discovers the business entities inside them (facts and dimensions), maintains a single multilingual semantic model, and serves analytical queries — structured or natural-language — over REST and MCP, so humans and AI agents can ask questions of your data without writing SQL.

Can AI agents query AgentData?

Yes. AgentData exposes a Model Context Protocol (MCP) server at /mcp with tools to list entities, describe an entity, and run metric or natural-language queries, gated by per-user API keys. You can point Claude, ChatGPT or any MCP client at it. A live MCP endpoint is available at https://agentdata.mdm.biskilled.com/mcp/.

🧠 A BiSkilled Product · Live now · Commercial

The semantic layer that lets AI agents query your data — safely.

AgentData connects to your existing databases read-only, auto-discovers your business entities, and builds one multilingual semantic model. Humans and AI agents then ask questions in plain language over REST + MCP — no SQL, no data movement.

Only the model and the question ever reach the LLM — never your row data. Self-hostable and air-gap ready.

Book a Demo Open the Live App ↗ Connect via MCP →

In one sentence

AgentData is a self-hostable semantic layer and data fabric that auto-discovers the business entities inside your databases and serves them to people and AI agents as plain-language questions over REST and MCP — without writing SQL and without your row data ever leaving your environment. It's a commercial data-management product, with integration pipelines on the roadmap.

Why teams reach for AgentData

The gap between your data and the AI agents that want to use it

🔌

"Our data is in five different systems"

Postgres here, Snowflake there, an S3 lake, a legacy SQL Server. AgentData clusters equivalent tables across all of them into one business entity.

🧩

"Modelling our semantic layer by hand takes months"

AgentData profiles every table, classifies it into an entity and role, infers relationships, and emits Cube + dbt-semantic YAML you simply review and approve.

🔐

"We can't send row data to an LLM"

Only the semantic model and the question reach the LLM. SQL runs locally against read-only sources; only the result returns. Run it fully air-gapped if you need to.

🤖

"We want agents to answer data questions"

One validated query path backs both a REST API and an MCP server — so Claude, ChatGPT or your own agents query governed data, not raw tables.

How AgentData works

Two paths: one writes the model, one reads the data — both governed

DISCOVERY — writes the model

Point AgentData at a source and it runs: profile → classify → cluster → relate → emit. Equivalent tables across sources collapse into a single entity (Order, Customer, Product…), relationships are inferred, and a Model Registry is written for human review.

Approve-by-default review, drag-and-drop merges (union / join / SCD), calculated columns, and metrics taught in plain language ("revenue = price × qty − discount").

RETRIEVAL — reads the data

A question runs: plan → validate → dispatch → execute → result. The planner generates SQL from the approved model only, validates it, and executes it locally against your read-only sources.

Conversational memory ("just the top 2 of those"), multilingual questions (ask in Hebrew or any language), and saved-and-approved queries that guide the planner for everyone.

What you get

🗄️

Heterogeneous, read-only sources

One config-driven splitter routes each source to the right adapter and engine — native, Trino or Athena.

PostgreSQL · MySQL · SQL Server · Oracle
Snowflake · Redshift · Synapse
S3 + Glue/Athena · Azure Blob (lake)
Read-only adapters, encrypted credentials

🧠

AI-assisted discovery

Profiles, classifies and clusters tables into business entities and emits standards-based YAML.

Cross-source entity clustering
Relationship inference
Cube + dbt-semantic YAML output
Heuristic fallback with no LLM

💬

Conversational & multilingual

Ask in plain language, follow up, and ask in any language — answers show the plan, the model YAML and the SQL.

Natural-language querying with memory
Any language (e.g. Hebrew, RTL UI)
Save & approve good queries
Named star-schema models in a Graph

🔗

REST + MCP for agents

The same validated retrieval path backs both interfaces — multi-tenant by per-user API key.

REST API for apps and dashboards
MCP server for Claude / ChatGPT / agents
list · describe · query_metric · query_nl
Only confirmed entities are queryable

🔐

On-prem & air-gapped

Keep everything inside your boundary. Pluggable LLM backend, zero external egress when you need it.

Local Ollama / vLLM / TGI (no egress)
AWS Bedrock over PrivateLink
Anthropic cloud option
Model + question only — never row data

📊

Built to operate

Production concerns handled out of the box, from cost to audit.

Per-call LLM cost tracking
Self-healing watchdog
Full audit log
Optional exact cross-source federation (Cube + Trino)

Connect an AI agent in one line

AgentData ships a live, secure Model Context Protocol endpoint. Create a per-user API key in the Connect tab, then point any MCP client at it.

                claude mcp add --transport http agentdata \

                  https://agentdata.mdm.biskilled.com/mcp/ \

                  --header "Authorization: Bearer <YOUR_API_KEY>"

Exposes list_entities, describe_entity, query_metric and query_nl over the same validated path used by the REST API. query_nl works in any language and never hard-fails.

Open the Live App ↗ Book a Demo

AgentData FAQ

Does AgentData move or copy my data?

No. It connects read-only. Only the semantic model and your question reach the LLM — never row data. SQL is generated from the model and executed locally; only the result comes back.

Which databases are supported?

PostgreSQL, MySQL, SQL Server, Oracle, Snowflake, Redshift and Synapse via SQLAlchemy, plus S3 + Glue/Athena and Azure Blob via a lake adapter.

Can it run fully on-premises or air-gapped?

Yes. Point it at a local OpenAI-compatible server (Ollama / vLLM / TGI) for zero external egress, or use AWS Bedrock over PrivateLink. Credentials are Fernet-encrypted at rest; adapters are read-only.

How is it different from a traditional semantic layer or data catalog?

Instead of hand-modelling every cube and metric, AgentData auto-discovers entities across heterogeneous sources, collapses equivalent tables into one entity, infers relationships, and emits Cube + dbt-semantic YAML you review — then serves it to both people and AI agents through one validated path.