Changelog
1.5.0 Sep 24, 2024
Features
Developer-focused
- Complete rework of the build pipeline (docker and linux native)
- Introduce CCache, conan, vcpkg
- User multi-stage docker builds leveraging buildx and buildkit
- Centralized
VERSION
- Ruff for linting and formatting
- Hatch for python package management
Bug fixes
- Generalization of
Placeholder.join
's on
argument - Improved timestamp handling
- Slicing improvements
- Slicing of
DataFrames
returned wrong results: Remove short circuit for slices with upper bound - Introduce set semantics for slicing of
DataFrame
(return empty collections instead of erroring)
- Fix displaying of parameter lists with values that exceed the presentable width
- Fix displaying of
DataFrames
with one row or less - Fix progress bar output on Google Colab
1.4.0 Oct 17, 2023
1.3.2 Jan 26, 2023
1.3.1 Dec 20, 2022
- Implement
tqdm
for progress bars - Minor bugfixes
1.3.0 Aug 28, 2022
- Use websockets instead of polling
- Size threshold for better visualization of feature code
- Faster reading of memory-mapped data, relevant for all feature learners and predictors
- Introduce CategoryTrimmer as preprocessor
1.2.0 May 20, 2022
- Support for SQL transpilation: TSQL, Postgres, MySQL, BigQuery, Spark
- Support for memory mapping
1.1.0 Nov 21, 2021
- Enhance data processing by introducing Spark (e.g. spark_sql) and Arrow (e.g. from_arrow())
- Integrate Vcpkg for dependency management
- Improve code transpilation for seasonal variables
- Better control of predictor training and hyperparamter optimization through introduction of early stopping (e.g. in ScaleGBMClassifier)
- Introduce TREND aggregation
- Better progress logging
1.0.0 Sep 23, 2021
0.16.0 May 25, 2021
0.15.0 Feb 23, 2021
- Add the Fastprop feature learner
- Overhaul the way RelMT and Relboost generate features, making them more efficient
0.14.0 Jan 18, 2021
- Significant improvement of project management:
- Add custom
__getattr__
and __dir__
methods to DataFrame, enabling column retrieval through autocomplete
0.13.0 Nov 13, 2020
- Introduce new feature learner:
0.12.0 Oct 1, 2020
- Extend dataframe handling: delete(), exists()
- Data set provisioning: load_air_pollution(), load_atherosclerosis(), load_biodegradability(), load_consumer_expenditures(), load_interstate94(), load_loans(), load_occupancy()
- High-level hyperopt handlers: tune_feature_learners(), tune_predictors()
- Improve pipeline functionality: delete(), exists(), Columns
- Introduce preprocessors: EmailDomain, Imputation, Seasonal, Substring
0.11.1 Jul 13, 2020
- Add pipeline functionality: Pipeline, list_pipelines(), Features, Metrics, SQLCode, Scores
- Better control of hyperparameter optimization: burn_in, kernels, optimization
- Handling of time stamps: time
- Improve database I/O: connect_odbc(), copy_table(), list_connections(), read_s3(), sniff_s3()
- Enable S3 access: set_s3_access_key_id(), set_s3_secret_access_key()
- New Feature Learner: MultirelTimeSeries, RelboostTimeSeries [now both integrated in TimeSeries]
0.10.0 Mar 17, 2020
0.9.1 Mar 17, 2020
- Include hotfix for new domain getml.com
0.9 Dec 9, 2019
0.8 Oct 22, 2019
- Rename Autosql to Multirel
- Boolean and categorical columns: Add support for boolean columns and operators, along with enhanced categorical column handling.
- Introduce API improvements: fitting, saving/loading of models, data transformation
- Add support for various aggregation functions such as MEDIAN, VAR, STDDEV, and COUNT_DISTINCT
- Move from closed beta to pip
- Introduce basic hyperopt algorithms: LatinHypercubeSearch, RandomSearch