[Software Release] What’s New in KNIME Analytics Platform 4.6.0 and KNIME Server 4.15
The following announcement was originally published by KNIME
A sleeker UI, better Python scripting capabilities, new visualisation nodes, and an improved Snowflake integration
KNIME Modern UI Preview (Labs)
KNIME Analytics Platform is getting a makeover. This release includes an extension that previews the new interface. Simply click the “Open KNIME Modern UI Preview” button in the top right corner to check it out.
The biggest changes you’ll see include:
- The upgraded look and feel.
- The node repository now has filtering and advanced searching capabilities. Nodes are also displayed in the repository just like they are in the workflow, with the default ports.
- A new workflow breadcrumb that allows users to browse their workflow content.
This extension is considered a preview and is currently under development. We are keen to provide you a better user experience. Please share your feedback on what’s working and what needs more changes in the KNIME forum. For in-depth technical information, read our documentation.
New Visualisation Nodes in KNIME (Labs)
Brand new visualisation nodes for exploring data and building data apps are available as a preview in the KNIME Views (Labs) extension. These nodes replace four previous visualisation nodes and offer a more consistent experience. We plan to replace even more, upon feedback from the community.
Major features include:
- A live preview of the visualisation next to the configuration dialog.
- Node descriptions are now displayed beside the settings dialog.
- Any settings controlled by flow variables are displayed and automatically indicated with an icon.
Bundled Python Environment
We’re continuing to improve the experience for Python script writers. The KNIME Python (Labs) Extension now contains its own Python Environment so that you can get started with Python scripting in KNIME right away—no additional software installation is needed.
Major features include:
- A live preview of the visualisation next to the configuration dialogue.
- Node descriptions are now displayed beside the settings dialogue.
- Any settings controlled by flow variables are displayed and automatically indicated with an icon.
For configuration, there is a new Python (Labs) preference page that lets you choose the new “Bundled” option, or the previously available “Conda” or “Manual” Python environment. For a full list of included Python packages, see the documentation.
Pure-Python KNIME Nodes (Labs)
This release marks the first time KNIME nodes can be written completely in Python and can be shared within teams, just like other KNIME extensions. This includes node configuration and execution as well as dialog definition and node views.
To help you with designing these nodes, we introduce a Pythonic API and debugging capabilities within KNIME. We also provide the means to deploy pure-Python KNIME nodes—including their Python environment needed for execution—using a locally built update site. See a simple example for defining a full-fledged KNIME node in this demo video.
See this tutorial as well as the API documentation here.
Snowflake H2O Machine Learning Model Push-Down
Business users with little or no coding experience—SQL or otherwise—have been able to gain insights on Snowflake data using KNIME’s intuitive low-code/no-code interface. With this latest release, KNIME Analytics Platform now supports push-down of H2O models directly into Snowflake. This means that users can now build machine learning models using Snowflake data and then even execute these models in Snowflake. This allows you to get predictions on large amounts of data in seconds as the data stays in Snowflake.
For an example workflow, check out this blog post, or to learn more have a look at the Snowflake Extension Guide.
DB Framework Enhancements
With the KNIME Database Framework, you can utilise the processing power of your database by pushing down the execution where the data resides—all by visually building SQL statements within KNIME.
First, we’ve improved the connectivity, using the KNIME Database Framework:
- Getting started with Oracle Databases is now much easier since all required database drivers are integrated and we now support Kerberos-based authentication on the KNIME Server.
- Built-in drivers are updated for improved security and additional functionality for Amazon Redshift, H2, Microsoft Access, MySQL, PostgreSQL, and SQLite.
Additionally, based on popular requests, we’ve extended the visual query and data manipulation capabilities with four new database nodes:
- The DB Concatenate node makes it easy to concatenate any number of database queries into a single query.
- The DB Looping node supports queries that match any value in a list of input values (e.g. IN queries).
- The DB Delete (Filter) node allows you to specify filter criteria to identify rows that should be deleted from a database table.
- Finally, the DB Data Spec Extractor node extracts the database table specification into a KNIME table allowing you to use this information in your analysis.
Microsoft Azure Services
KNIME Analytics Platform allows you to seamlessly reach out to various Microsoft Azure services. With this release, you can now visually interact with serverless and dedicated SQL pools on Azure Synapse Analytics without the need to write any SQL statements using the KNIME database framework.
- Use the extended Microsoft SQL Server Connector node to connect to these pools and perform high throughput data uploading via the DB Loader node.
- Use the KNIME file handling framework to manage your data files in Azure Synapse storage accounts.
This release also expands on the SharePoint Online integration by adding support for creating new or deleting existing lists. Finally, the Microsoft Authentication node has been extended to support custom application IDs and authorization endpoints to meet your security requirements.
Column Expression and Multi-Row Formulas
Column Expressions is an all-purpose tool to compute new columns based on simple expressions. While previously limited to deriving these new values from the current row, a new function column(name, offset) has been added to read values from preceding and subsequent rows, effectively allowing multi-row formulas (find an example on KNIME Hub).
Additionally, new functions and capabilities make it easy to manipulate and create path cells and now also variables in KNIME Analytics Platform. New functions to create standard file system variables and cells allow you to create new paths directly within the nodes.
XGBoost
The XGBoost integration finally moves out of labs with three main updates:
- Row weights give you control over the weight the learners assign to individual rows of your dataset. This can be very useful to remedy class imbalances or to prioritise certain subsets of your data.
- Bit & Byte Vector Support improves the applicability of the nodes in domains like text processing or life sciences where it is common to have a vector representation of the data
- Feature Importance Output of the XGBoost Tree Ensemble Learner nodes is an output table that provides you with various metrics that indicate how important every single feature is to the learned model.
To see example workflows demonstrating this feature, visit the KNIME Hub.
Extended Spark Support
We have added support for Spark 3.1 and 3.2 including support for H2O Sparkling Water. The Create Local Big Data Environment that creates a fully functional big data environment for testing and prototyping locally, now uses Spark 3.2.
Reset Workflows on KNIME Server
It is now possible to reset workflows stored on KNIME Server directly from the KNIME Explorer without opening the workflow first. This new feature will save users a lot of time, especially when dealing with workflows that upload a large amount of data.
About KNIME
Described as the ‘Swiss army knife of analytics’, KNIME is the leading open source solution for data-driven innovation. KNIME is designed for discovering the potential hidden in data, mining for fresh insights, predicting new futures and automating manual tasks. The platform is fast to deploy, east to scale and intuitive to learn. Thanks to the drag-and-drop visual interface, KNIME enables every stakeholder in the data science process to focus on what they do best.