Roadmap

This is a draft page and isn’t really a “Roadmap” yet, just mainly some notes about what features and enhancements we need. It needs significant formatting.

Governance

Learn from DataHub about how to document and manage meta data (‘data governance’) artifacts and incorporate appropriate governance capabilities. These are DataHub features we should understand and perhaps push into DataHub:
- Tracing lineage across platforms, datasets, pipelines, charts, etc.
- Context about related entities across lineage
- Capture and maintain institutional knowledge using folksonomic identifiers (tags) and taxonomies
- Asset ownership by users and/or user groups
- Fine-Grained Access Control with Policies
- Metadata quality & usage analytics

Safety and Security Capabilities

String literals for injection-safe SQL generation
- Integration of pg-format

Information Model Evolution (migrations, etc.)

See Atlas open-source schema migration tool and create a SQLa to Atlas schema / DDL file.

See EdgeDB Migrations for some interesting ideas.

Consider generating Flyway and Liquibase migrations.

DDL (Data Definition Language)

There are two types of DDL: seed and evolution (also known as migration).

DQL (Data Query Language)

DML(Data Manipulation Language)

PL (Procedural or Programming Language)

BODY defines PL (stored function or stored procedure) body
CONTRACT defines the header, parameter, etcs.

DCL (Data Control Language)

GRANT: This command gives users access privileges to the database.
- Refer to https://supabase.com/blog/2021/07/01/roles-postgres-hooks for how to manage complex policies such as roles across multiple tenants
REVOKE: This command withdraws the user’s access privileges given by using the GRANT command.

TCL (Transaction Control Language)

COMMIT: Commits a Transaction.
ROLLBACK: Rollbacks a transaction in case of any error occurs.
SAVEPOINT: Sets a savepoint within a transaction.
SET TRANSACTION: Specify characteristics for the transaction.

Dialect/Engine-specific

These engines / dialects are supported:

References:

PostgreSQL Vs MySQL Syntax

Dialect Engines

Universal PostgreSQL wire interface pg-server to as many different engines as possible. When an engine (like DuckDB or osQuery, etc.) do not have native TS/JS support consider wrapping in pg-server.
Universal SqlEngine and SqlEngineInstance interfaces and engine-specific implementations to prepare SQL, send into a specific database driver and return typed rows (array) or object lists as query execution results. All SQL engines support the same query execution results so that results and queries can be mixed/matched across engines.

Engineering and QA (IDE)

Render SQL Notebook output that will allow interactive use through VS Code.

PostgreSQL

Anonymous PL/pgSQL and PL/SQL blocks
Stored procedures definition (namespaced and type-safe)
- Should these be moved to ANSI dialect and not specific to PG only?
Stored functions definition (namespaced and type-safe)
Stored routine definition STABLE and other type-safe modifiers
CALL stored procedure (SqlTextSupplier as a new stored routine object property similar to how a InsertStatementPreparer works. Just like DML is tied to a table, CALL should be tied to stored routine header(s) so that there’s full type-safety integrated into the call)
Domains
Extensions
search_path

Structural Lint Rules

The system generates lint messages:

Missing indexes for primary keys, foreign keys (see https://use-the-index-luke.com/)
Plural vs. singular naming checks
Foreign key column name should be X_id where X is the referenced Fkey column name
- _id attributes that are not foreign keys (might be OK, might be a mistake)
Suggest foreign keys when column name is similar to a table names but fkey is not defined
Integrate advice from Ordering Table Columns in PostgreSQL
Integrate SQLFluff or learn from their rules.

Roadmap

Roadmap

Governance

Safety and Security Capabilities

Information Model Evolution (migrations, etc.)

DDL (Data Definition Language)

DQL (Data Query Language)

DML(Data Manipulation Language)

PL (Procedural or Programming Language)

DCL (Data Control Language)

TCL (Transaction Control Language)

Dialect/Engine-specific

Dialect Engines

Engineering and QA (IDE)

PostgreSQL

Structural Lint Rules

Content Lint and Data Validation Rules

General TODOs

References

Roadmap

Roadmap

Governance

Safety and Security Capabilities

Information Model Evolution (migrations, etc.)

DDL (Data Definition Language)

DQL (Data Query Language)

DML(Data Manipulation Language)

PL (Procedural or Programming Language)

DCL (Data Control Language)

TCL (Transaction Control Language)

Dialect/Engine-specific

Dialect Engines

Engineering and QA (IDE)

PostgreSQL

Structural Lint Rules

Content Lint and Data Validation Rules

General TODOs

Related Code

References