Skip to content

getml.pipeline.dialect

SQL dialects that can be used for the generated code.

One way to productionize a Pipeline is to transpile its features to production-ready SQL code. This SQL code can be run on standard cloud infrastructure. Please also refer to SQLCode.

Example
sql_code = my_pipeline.features.to_sql(
    getml.pipeline.dialect.spark_sql)

# Creates a folder called "my_pipeline"
# which contains the SQL scripts.
sql_code.save("my_pipeline")

bigquery module-attribute

bigquery = _all_dialects[0]

BigQuery is a proprietary database system used by the Google Cloud.

Enterprise edition

This feature is exclusive to the Enterprise edition and is not available in the Community edition. Discover the benefits of the Enterprise edition and compare their features.

For licensing information and technical support, please contact us.

duckdb module-attribute

duckdb = _all_dialects[1]

DuckDB is an columnar database system that is designed for OLAP workloads.

Enterprise edition

This feature is exclusive to the Enterprise edition and is not available in the Community edition. Discover the benefits of the Enterprise edition and compare their features.

For licensing information and technical support, please contact us.

human_readable_sql module-attribute

human_readable_sql = _all_dialects[2]

SQL that is not meant to be executed, but for interpretation by humans.

mysql module-attribute

mysql = _all_dialects[3]

MySQL and its fork MariaDB are among the most popular open-source database systems.

Enterprise edition

This feature is exclusive to the Enterprise edition and is not available in the Community edition. Discover the benefits of the Enterprise edition and compare their features.

For licensing information and technical support, please contact us.

postgres module-attribute

postgres = _all_dialects[4]

The PostgreSQL or postgres dialect is a popular SQL dialect used by PostgreSQL and its many derivatives like Redshift or Greenplum.

Enterprise edition

This feature is exclusive to the Enterprise edition and is not available in the Community edition. Discover the benefits of the Enterprise edition and compare their features.

For licensing information and technical support, please contact us.

spark_sql module-attribute

spark_sql = _all_dialects[5]

Spark SQL is the SQL dialect used by Apache Spark.

Apache Spark is an open-source, distributed, in-memory engine for large-scale data processing and a popular choice for productionizing machine learning pipelines.

Enterprise edition

This feature is exclusive to the Enterprise edition and is not available in the Community edition. Discover the benefits of the Enterprise edition and compare their features.

For licensing information and technical support, please contact us.

sqlite3 module-attribute

sqlite3 = _all_dialects[6]

SQLite3 is a light-weight and widely used database system.

It is recommended for live prediction systems or when the amount of data handled is unlikely to be too large.

Enterprise edition

This feature is exclusive to the Enterprise edition and is not available in the Community edition. Discover the benefits of the Enterprise edition and compare their features.

For licensing information and technical support, please contact us.

tsql module-attribute

tsql = _all_dialects[7]

TSQL or Transact-SQL is the dialect used by most Microsoft databases.

Enterprise edition

This feature is exclusive to the Enterprise edition and is not available in the Community edition. Discover the benefits of the Enterprise edition and compare their features.

For licensing information and technical support, please contact us.