getml.pipeline.dialect

SQL dialects that can be used for the generated code.

One way to productionize a Pipeline is to transpile its features to production-ready SQL code. This SQL code can be run on standard cloud infrastructure. Please also refer to SQLCode.

Example

sql_code = my_pipeline.features.to_sql(
    getml.pipeline.dialect.spark_sql)

# Creates a folder called "my_pipeline"
# which contains the SQL scripts.
sql_code.save("my_pipeline")

bigquery `module-attribute`

bigquery = _all_dialects[0]

BigQuery is a proprietary database system used by the Google Cloud.

Enterprise edition

This feature is exclusive to the Enterprise edition and is not available in the Community edition. Discover the benefits of the Enterprise edition and compare their features.

For licensing information and technical support, please contact us.

duckdb `module-attribute`

duckdb = _all_dialects[1]

DuckDB is an columnar database system that is designed for OLAP workloads.

Enterprise edition

This feature is exclusive to the Enterprise edition and is not available in the Community edition. Discover the benefits of the Enterprise edition and compare their features.

For licensing information and technical support, please contact us.

human_readable_sql `module-attribute`

human_readable_sql = _all_dialects[2]

SQL that is not meant to be executed, but for interpretation by humans.

mysql `module-attribute`

mysql = _all_dialects[3]

MySQL and its fork MariaDB are among the most popular open-source database systems.

Enterprise edition

This feature is exclusive to the Enterprise edition and is not available in the Community edition. Discover the benefits of the Enterprise edition and compare their features.

For licensing information and technical support, please contact us.

postgres `module-attribute`

postgres = _all_dialects[4]

The PostgreSQL or postgres dialect is a popular SQL dialect used by PostgreSQL and its many derivatives like Redshift or Greenplum.

Enterprise edition

This feature is exclusive to the Enterprise edition and is not available in the Community edition. Discover the benefits of the Enterprise edition and compare their features.

For licensing information and technical support, please contact us.

spark_sql `module-attribute`

spark_sql = _all_dialects[5]

Spark SQL is the SQL dialect used by Apache Spark.

Apache Spark is an open-source, distributed, in-memory engine for large-scale data processing and a popular choice for productionizing machine learning pipelines.

Enterprise edition

This feature is exclusive to the Enterprise edition and is not available in the Community edition. Discover the benefits of the Enterprise edition and compare their features.

For licensing information and technical support, please contact us.

sqlite3 `module-attribute`

sqlite3 = _all_dialects[6]

SQLite3 is a light-weight and widely used database system.

It is recommended for live prediction systems or when the amount of data handled is unlikely to be too large.

Enterprise edition

This feature is exclusive to the Enterprise edition and is not available in the Community edition. Discover the benefits of the Enterprise edition and compare their features.

For licensing information and technical support, please contact us.

tsql `module-attribute`

tsql = _all_dialects[7]

TSQL or Transact-SQL is the dialect used by most Microsoft databases.

Enterprise edition

This feature is exclusive to the Enterprise edition and is not available in the Community edition. Discover the benefits of the Enterprise edition and compare their features.

For licensing information and technical support, please contact us.

getml.pipeline.dialect

bigquery module-attribute

duckdb module-attribute

human_readable_sql module-attribute

mysql module-attribute

postgres module-attribute

spark_sql module-attribute

sqlite3 module-attribute

tsql module-attribute

bigquery `module-attribute`

duckdb `module-attribute`

human_readable_sql `module-attribute`

mysql `module-attribute`

postgres `module-attribute`

spark_sql `module-attribute`

sqlite3 `module-attribute`

tsql `module-attribute`