Datalore 2024.4 Help

What's new

What is Datalore On-Premises

Datalore On-Premises is a self-hosted version of Datalore, tailored for teams that want to have a collaborative data science platform set up on their own infrastructure.

Watch Datalore overview or get in touch with our team to learn more about the product.

The page chronologically lists the key features and updates of all implemented Datalore On-Premises releases.

Full description of the latest release is available on the Datalore On-Premises landing page.

2024

Version

Fixes/Improvements

2024.4.1

A security fix that handles XSS vulnerabilities

2024.4

  • SQL cells: option to limit returned rows to 500

  • Enhanced folder uploading procedure

  • Dropdown control: searchable list of values, Select all and Clear oll options

  • Editor UI improvements: better cell visibility, clearer cell status indication, more accessible layout of cell toolbar, and more

  • Quick notebook access from the editor

  • Polars DataFrame support

  • Python `input()` commands now supported for interactive reports

  • Fixed issue of pages not loading for Datalore set up with an HTTP proxy on a non-standard port

  • Fixed issue of database schemas not refreshable for users logged in via OAuth

  • Bundled PostgreSQL server updated to Postgres 15

2024.3

  • Scheduled runs: advanced run data handling

  • Smoother environment setup for R and Scala

  • Early Terminal access during setup

  • Support for ipydatagrid versions 1.3.0 and 1.3.1.

  • Customizable Git pull size limit

  • Correlation chart fixed for Visualize tab

  • Fixed rendering of ipywidgets outputs in the report builder and reports

  • Markdown cell variables now updated in scheduled reports

2024.2.3

SQL cell output: fixed handling of DataFrames containing long strings

2024.2.2

  • VM options support in database connection dialog

  • Improved performance of notebook scheduler

  • Fixed workspace menu display when viewed in Admin mode

2024.2.1

Home page:

  • Improved usability of notebooks files view

  • Fixed folder, trash, and report search

  • Fixed issue of specific workspace names

  • Fixed URLs inside Data section

2024.2

  • New Home page:

    • Links to respective reports in the Notebooks list

    • All workspaces listed together and easily accessible

    • Data section with quick access to all data sources in the workspace

    • Option to enable workspace-level management of SSH keys and Git libraries

  • Editor:

    • Enhanced Ask AI

    • AI-powered Autopilot used to get code suggestions based on a custom notebook objective or notebook content analysis

    • Support for variables in Markdown cells

    • Code syntax highlighting for Markdown cells

  • Data:

    • Windows native authentication for Microsoft SQL Server databases

    • Azure OAuth2 authentication for Snowflake databases

  • Reports: option to restrict downloading or editing copies of report notebooks

  • Administration:

    • Customizable full storage alerts

    • Custom PyPi server configuration support

    • Customizable notebook code import limit

2024.1

  • Editor:

    • Download file interactive control added

    • Option to load HTML outputs form a separate domain for security and performance purposes

    • Optimized performance for large notebooks

    • More edit options for interactive table outputs

    • Vim text editor preinstalled for Terminal

  • Data:

    • configurable mounts for Docker agents

    • Instance-wide data connections created and managed by admins

  • Collaboration: keeping report calculation results within the user session

  • Administration:

    • Option to add custom time intervals to background computation settings

    • Environment tab for advanced environment configuration

    • Group plans option for user group management

2023

Version

Fixes/Improvements

2023.6

  • Editor:

    • New UI design (tool icons and menu layout)

    • Sticky Table of Contents with active section highlighting

    • Soft-wrapping for text outputs

  • Computation: Notebook ID and notebook owner ID as environment variables

  • Administration:

    • Dashboard tab for status monitoring

    • Limitation for viewer type users dropped

  • Data visualization: lets-plot library support for air-gapped environments

  • SQL cell: proactively starting sessions for each data source when opening the report

  • Database connections: ability to import connection parameters in XML format, Okta OAuth authentication for Snowflake

  • Scheduled runs: starting agents without using the agent pool

2023.5.1

A security fix that handles XSS vulnerabilities

2023.5

  • Editor: improved interface for opening recent notebooks from the editor main menu

  • Data:

    • Custom JDBC drivers support

    • Snowflake and BigQuery OAuth authentication method

    • Fixed issue of multiple Git repos with different authentication keys

  • SQL cells:

    • Table name parametrisation

    • Proactive start of SQL sessions to ensure faster execution of SQL cells

  • Collaboration: mentions enabled for comments in reports and notebooks

  • Report builder: dynamic height enabled for more than one cell in a row

  • Admin panel: note field for profile deletion action

  • Interactive reports: notifying users of recalculations without time delay

  • Home page: option to empty all trash

2023.4

  • Comments feature for notebook and report collaboration

  • SQL cell: option to query DataFrames

  • Export to database cell

  • Databases: cloning data sources between workspaces

  • New data source type: SMB/CIFS folders support

  • Collaboration: Viewer user role, report sharing by email invitations, STARTTLS support for sending emails

  • Interactive cells: file uploader control, improved usability of dropdowns

  • Admin panel: improved user list view, background computation management on user level

  • Scheduling: parameterized runs

  • Cells: full width mode for cells, improved display of cell elements/fonts

  • Reports: fixed behaviour of dependent interactive controls

  • Report builder: cells multiselect and multidrag, UX improvements

2023.3.1

A security fix that resolves session authentication vulnerabilities and improves session cookie management

2023.3

  • Data: Vault AppRole authentication method for Athena and other database types

  • Scheduling: multiple schedules for one notebook

  • Report builder: layout improvements, more flexible drag and drop

  • Fixed variable selection in the dropdown interaction control settings

  • Shared notebooks cloned now together with attached files

  • Improved file saving when executing scheduled runs

  • Fixed value setting for the slider interactive control

2023.2

  • Report Builder: Undo/Redo support

  • Search for notebook content option

  • Environment:

    • Secrets deprecated, replaced by Environment variables

    • Native R packages support

  • Fixed agents recovery after plans disablement

  • Fixed issue of ToC not displayed properly after configuring Datalore to a non-default port

  • Improved Scala kernel performance

  • DataFrames download issue fixed for Firefox

  • Swipe left gesture disabled in the editor

  • Admins allowed to access all Datalore workspaces

  • IPywidgets rendering issues fixed

2023.1

  • Support for several API requests

  • Computation:

    1. Fixed issue of stopping computations for scheduled runs

    2. Fixed machine performance in case of Internet outage

    3. Reactive kernel improvements

  • Data sources:

    1. Bucket content now visible to viewers

    2. Ability to create cloud storage data sources on workspace level

  • Improved report builder performance

  • Ability to download workspace content as .zip files

  • Fixed procedure of public link revoking

  • Fixed metric cell issues

  • Fixed issue of ipywidget rendering

  • Improved procedure of data source connection checking

  • Fixed issue of Terminal unresponsiveness after system sleep

2022.3

Version

Fixes/Improvements

Release updates

2022.3.1

  • Data output: Table and Raw tabs improved, table viewing and editing issues fixed

  • Auditing: improved server data logs

  • Statistics tab: support for Kotlin, Scala, and R fixed

  • Code execution: fixed cell linking in output after re-evaluation

  • Reports: fixed display of scheduled runs

  • Scheduling: reports not affected by stopping associated scheduled runs

  • Computation management: fixed scheduled runs started in the background and concluded after re-opening the notebook

Major release details

2022.3

  • Sharing and collaboration: designing and publishing reports with the Report builder tool

  • Editor:

    1. Interactive table output

    2. Metric cells

  • Logging:

    1. Head dump and thread dump download options

    2. Server logs accessible from the Admin panel

  • User management:

    1. Group management interface (including Hub-attached groups)

    2. Option to disable user registration

    3. Option to create user accounts from the Admin panel

    4. Contact system support (search and auto-completion)

  • New landing page

  • Interactivity controls: date picker control added

  • Database integration:

    1. SSH tunneling

    2. BigQuery database connection support

    3. Improved S3 bucket integration interface

  • Background computation: default option support

  • Administration: advanced user activity logging

2022.2

Release updates

Minor version

Fixed/Improved

2022.2.3

New licensing: free license for groups of 1-4 users (not available now)

2022.2.2

  • Databases: URL field input limit extended in database connection interface

  • Collaboration: interactive reports made available to unauthorized users

2022.2.1

  • Login after adding Hub enabled

  • One-by-one execution of multiple SQL cells on the Run all action

  • 5-minute limit for SQL query execution removed

  • Ability to stop database querying by explicitly stopping SQL cell calculation

  • Presto database support via SSL

  • Default database schemas displayed in detailed view

  • Ability to fix database credentials inside the notebook without reattaching

  • Scheduled runs fixed

  • Improved S3 datasource interface to support custom options

Major release details

New features

  • New installation option: possible to install Datalore On-Premises using easy-to-set-up Docker-based configuration

  • Coding and execution:

    1. Scheduled runs for notebooks

    2. Parameterized SQL statements

    3. Computation tab

  • Collaboration: collaborative editing of attached files

  • Databases:

    1. Limiting database schemas for introspection

    2. MS SQL Server support

Improvements

  • Meaningful names of files downloaded from interactive reports

  • Database passwords no longer exposed in Test Connection pod

  • Collaborators allowed to change machine type in shared notebooks

  • More cut-off timer options for background computation

2022.1

Release updates

Minor version

Fixed/Improved

2022.1.4

  • Added instance and storage gift codes

  • Jedi disabled by default to speed up kernel completion

  • Improved kernel stability following non-critical code errors

  • Fixed opening of notebooks with erroneous syntax info

  • Fixed dropdown value update in interactive reports

  • Improved kernel stability when using interactive controls in Reactive mode

2022.1.3

  • Improved overall performance, especially when working with big DataFrames and using Statistics tab or Variable viewer

  • Fixed analysis server

2022.1.2

  • Restarting kernel when detecting out of memory status to ensure uninterrupted cell execution

  • Faster DataFrame rendering

  • Data output download allowed to unauthorized users

2022.1.1

Fixed Spring Core RCE vulnerability

Major release details

New features

  • Statistics tab: used to provide in-depth information on DataFrames

  • Table viewer: editor component for reviewing and editing CSV and TSV files from Attached data

  • Visibility control: cell type for collapsing other cells in interactive reports

  • Сut-off timer for background computation: option for setting the machine to shut down after a specific period of inactivity

  • User activity logging: feature used to track cell execution by a specific user

  • Reactive mode in interactive reports: automatic recalculation following user interaction with the report

Improvements

  • Enhanced reports:

    1. Full width mode

    2. Automatically hidden empty code cells

    3. Checkbox control (for interactive reports only)

  • Variable viewer: providing information on notebook object sizes

  • Notebook structure: ability to use duplicated worksheets

  • Improved editor experience:

    1. Viewport extended when scrolling past the notebook last cell

    2. Better indication of cells queued for processing

2021.3

Release updates

Minor version

Fixed/Improved

2021.3.5

  • Hub updated to 2021.1.14194

  • Database connection sharing options:

    Shared workspace

    • Editors can create and edit database connections, and attach them to the notebooks of the shared workspace.

    • Editors can create and execute SQL cells for any connected database.

    • Viewers can view database connection lists and respective database schemas.

    Home workspace notebook

    Editors can create and execute SQL cells for any connected database.

2021.3.4

  • Hiding worksheet inputs and outputs

  • Improved matrix rendering for the R kernel

  • Latest Lets-Plot library version

  • Visualize tab:

    1. Bigger charts, improved chart settings layout

    2. One-click chart export to a chart cell

    3. R kernel support

2021.3.3

log4j-related security issue in Hub

2021.3.2

  • Rendering of Pandas series with custom indexing

  • Reactive mode for Kotlin

2021.3.1

log4j-related security issue

Major release details

New features

  • Database connection: interface for connecting to databases from the Home page and the editor

  • Native SQL cells: cell type used to retrieve data from connected databases without passing user's credentials to the environment

  • Interactive reports: feature for publishing notebooks as pages that collaborators can dynamically change using interactivity elements

  • Chart cells: cell type used to build customizable visualizations based on DataFrames in your notebooks

  • Interactive controls: dropdown, slider, and text input elements to quickly customize output without manually changing the code

  • Cell visibility control: interface used to select what input and output to hide from viewers

  • More language support: R and Scala available as notebook languages

  • Reactive mode: top-down evaluation with automatic recalculation of cells below the modified one

Improvements

  • Environment: customization via initialization scripts in the init.sh file

  • Expanded Python support: Python 3.6 and later versions available with respective code insight features for each

  • Visualize tab: more plot types added (point, bar, area, correlation)

  • More bucket types: Google Cloud Storage buckets attachable to notebooks

  • Last cell execution info: last execution time shown when hovering over a cell

  • Cell collapsing: ability to collapse specific cell inputs and outputs for better coding experience

Last modified: 27 September 2024