Skip to content

Blog

New algorithms for relational learning: where deep learning falls short of expectations

Feature learning makes relational data usable for machine learning, unlocking a vast trove of data with business potential for companies.

The idea of storing data in relational structures dates back to the 1970s. Today, relational data forms the backbone of every modern business. Corporate data accumulates in databases, playing a pivotal role in bridging the AI gap identified by decision-makers. However, despite the enthusiasm for innovation, extracting value from relational data with machine learning (ML) is currently only possible through significant effort. This challenge limits even large companies' access to machine learning and business applications with artificial intelligence (AI).

ARMA is obsolete - but whats next?

It's a classic, around since the 70s, used it myself back in uni, but data science needs have clearly evolved.

With TSFresh, Blue Yonder's engineering team set the sails for brute-force feature engineering, combining a large set of 100+ predefined feature templates with traditional feature selection methods. This approach performs well, but their implementations limitations: computational complexity on real-world data and limited support for multivariate time series.

image

LLMs for relational data?

It's a classic, around since the 70s, used it myself back in uni, but data science needs have clearly evolved.

With TSFresh, Blue Yonder's engineering team set the sails for brute-force feature engineering, combining a large set of 100+ predefined feature templates with traditional feature selection methods. This approach performs well, but their implementations limitations: computational complexity on real-world data and limited support for multivariate time series.

image

Boosting Graph Neural Networks with getML: Automated Feature Engineering for Superior Model Performance

Graph Neural Networks (GNNs) excel at modeling complex data but do require feature engineering. This article shows how integrating getML's FastProp automates this process, boosting accuracy to 92.5% on the CORA dataset—surpassing the 90.16% benchmark. This approach simplifies implementation, reduces manual effort, and ensures consistent performance across a wide range of neural architectures, making it a valuable tool for data scientists for building consistent and high-performing models with minimal effort.