Datameer 1.4

This page lists the new features and significant changes in Datameer 1.4.

Partitioning Data Sets

Users can now create time-based partitions from data stored by Datameer in HDFS (Run Import Job, File Upload or Workbooks). When working with data (Link Data Set, Link External Sheet, Create Export or View Full Data), users can select a subset of the data by either a static date range (from-to date) or a dynamic one (e.g. last week), which improves both performance and usability. Partitions can be reconfigured as needed with a few clicks.
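The difference between a static and a dynamic date range can be sketched in a few lines of Python. This is only an illustration of the idea, not Datameer's implementation: the function names are hypothetical, the partitions are assumed to be daily, and "last week" is assumed to mean the previous Monday-to-Sunday calendar week.

```python
from datetime import date, timedelta


def static_range(start: date, end: date):
    """Fixed from-to range: selects the same days whenever it is evaluated."""
    return start, end


def last_week(today: date):
    """Dynamic range (assumed definition): Monday to Sunday of the week before `today`."""
    this_monday = today - timedelta(days=today.weekday())
    start = this_monday - timedelta(days=7)
    return start, start + timedelta(days=6)


def select_partitions(partition_days, date_range):
    """Keep only the partitions whose day falls inside the inclusive range."""
    start, end = date_range
    return [day for day in partition_days if start <= day <= end]


# Hypothetical daily partitions covering January 2012.
partitions = [date(2012, 1, 1) + timedelta(days=i) for i in range(31)]

fixed = select_partitions(partitions, static_range(date(2012, 1, 10), date(2012, 1, 12)))
dynamic = select_partitions(partitions, last_week(date(2012, 1, 18)))
```

Only the selected partitions have to be read, which is where the performance gain comes from. The dynamic range is recomputed from the current date on each run, so the same configuration keeps selecting the most recent data.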

Multi-way Join

A sheet can now be joined with many lookup tables within a single dialog, without the need for additional sheets.
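Conceptually, a multi-way join resolves several lookups for each row in one step instead of chaining intermediate results. The Python sketch below illustrates this with made-up data. The function name, the column names and the left-outer-join behavior (unmatched keys yield an empty value) are assumptions made for the example, not a description of how Datameer executes the join.

```python
def multi_way_join(rows, lookups):
    """Join each row against several lookup tables in a single pass.

    `lookups` maps a key column in `rows` to a (table, output column) pair.
    Keys without a match produce None (assumed left-outer-join behavior).
    """
    joined = []
    for row in rows:
        out = dict(row)
        for key_column, (table, out_column) in lookups.items():
            out[out_column] = table.get(row[key_column])
        joined.append(out)
    return joined


# Hypothetical sheet and lookup tables.
orders = [
    {"order": 1, "customer_id": "c1", "product_id": "p2"},
    {"order": 2, "customer_id": "c2", "product_id": "p9"},
]
customers = {"c1": "Alice", "c2": "Bob"}
products = {"p1": "Widget", "p2": "Gadget"}

result = multi_way_join(orders, {
    "customer_id": (customers, "customer_name"),
    "product_id": (products, "product_name"),
})
```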

New Field Types

Datameer now supports big integer and big decimal field types. These allow users to work with very large numbers and with very high precision. In addition, the Datameer plug-in SDK allows users to configure custom scale and precision values for big decimals when writing their own input adapter.
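To illustrate what these types provide, the following Python sketch uses the standard `int` and `decimal.Decimal` types as stand-ins. It does not use the Datameer SDK; the helper `with_scale` is a hypothetical name introduced for the example. Precision is the total number of significant digits, and scale is the number of digits after the decimal point.

```python
from decimal import Decimal, localcontext, ROUND_HALF_UP

# Python ints are arbitrary precision, so this value exceeds the 64-bit range exactly.
big_integer = 2**64 + 1

# Binary floating point cannot represent 0.1 exactly; decimal arithmetic can.
float_sum = 0.1 + 0.2
decimal_sum = Decimal("0.1") + Decimal("0.2")


def with_scale(value: Decimal, scale: int) -> Decimal:
    """Round `value` to `scale` digits after the decimal point."""
    return value.quantize(Decimal(1).scaleb(-scale), rounding=ROUND_HALF_UP)


# Precision: the number of significant digits kept in arithmetic results.
with localcontext() as ctx:
    ctx.prec = 30
    third = Decimal(1) / Decimal(3)

rounded = with_scale(Decimal("3.14159265"), 4)
```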

Separate Files for Field Detection

In cases where the first rows of a data file do not contain data representative of the entire data set, users can upload and specify a separate header file, which is then used for field and type detection.

Input Adapters

Datameer now has connectors for data in both PostgreSQL and Greenplum databases.

Enhanced REST API

Several enhancements have been made to the Datameer REST API, making it more flexible when working with third-party applications or scripts.

Security Enhancements

Secure LDAP

Datameer has added support for Secure LDAP (over SSL) as an external authentication mechanism.

Group Filters

Group filters can be applied when importing users from Active Directory or LDAP.

Secure Hadoop Synchronization

With secure impersonation enabled, Datameer leverages existing HDFS permissions and enforces them at the end-user level. Datameer also secures data in its own private folder based on the authenticated user. In this way, permissions for all new and pre-existing user-specific data are completely synchronized between Datameer and Hadoop.

More Supported Platforms / Hadoop Distributions

Datameer now supports Cloudera’s Hadoop CDH3u1 and CDH3u2 as well as IBM’s BigInsights 1.2 (IHC), including GPFS.

See Supported Hadoop Distributions for details.

New Functions

MINUTE - returns only the minute of a date.

GROUPANY - returns an arbitrarily selected value contained in the group.

GROUPFIRST - returns the first value contained within the group.

GROUPLAST - returns the last value contained within the group.
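The behavior of the four functions can be approximated in Python as follows. This is a sketch of the semantics described above, not Datameer formula syntax. The helper names are hypothetical, and "first" and "last" are assumed to refer to the order in which rows appear in the input, which the descriptions above do not specify.

```python
from datetime import datetime


def minute(value: datetime) -> int:
    """MINUTE: only the minute component of a date."""
    return value.minute


def group_values(rows, key, column):
    """Collect the values of `column` per group, preserving input order."""
    groups = {}
    for row in rows:
        groups.setdefault(row[key], []).append(row[column])
    return groups


def group_first(values):
    """GROUPFIRST: first value in the group (input order assumed)."""
    return values[0]


def group_last(values):
    """GROUPLAST: last value in the group (input order assumed)."""
    return values[-1]


def group_any(values):
    """GROUPANY: some member of the group; callers must not rely on which one."""
    return values[0]


# Hypothetical rows for illustration.
rows = [
    {"user": "a", "value": 1},
    {"user": "b", "value": 2},
    {"user": "a", "value": 3},
]
groups = group_values(rows, "user", "value")
first = {k: group_first(v) for k, v in groups.items()}
last = {k: group_last(v) for k, v in groups.items()}
any_value = {k: group_any(v) for k, v in groups.items()}
```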

API Changes

We have made a few important changes to our API. Please see Important API Changes for Developers for more information.

Oracle CLOB Support for Import Jobs

A character large object (CLOB) usually has a very high size limit, on the order of 2 GB or more. Since version 1.4.4, Datameer supports importing CLOB columns as strings.
