Actian Data Platform for Data Scientists: Data Preparation

Actian Data Platform for Data Scientists: Data Preparation

The training provided within this course provides you with knowledge for the various native Platform capabilities to ingest, clean and transform data and access it all in a single data platform.

rate limit

Code not recognized.

About this course

Course Outcome:
You will understand how simple it is to clean, transform and ingest data into a Platform Warehouse using the wide range of native capabilities provided as part of the Actian Data Platform.
Course Style:
The course is provided in a step-by-step fashion to ensure you understand the process to clean and transform and ingest data into Actian Data Platform.
Audience:
For Data Scientists and Data Architects, and similar personas in the organization responsible for business intelligence, analytics, and data visualization.
Prerequisites:
  • No knowledge of the Actian Data Platform is required
  • Some knowledge of the data science workflow is assumed
  • Actian Data Platform login credentials and access to the Warehouse data are assumed to be available to you
Supplementary Resources:

Curriculum158 min

  • Course Objective 1 min
  • Course Admin
  • Prerequisites and Supporting Content 1 min
  • Adding Platform Data Integration IPs to the Warehouse trusted list of IPs 3 min
  • Install the Platform ODBC Data Driver for Windows 7 min
  • Install the Platform ODBC Data Driver for Linux 4 min
  • Install the Platform Data Driver for Apple MAC 5 min
  • Jupyter Notebook: Data Preparation Iris Dataset 1 min
  • Native Data Loading Functionality
  • Section Objective 1 min
  • Data Integration: Load Files
  • Load a local data file 5 min
  • Native Data Integration
  • Overview 6 min
  • Configuration Creation 5 min
  • Simple Mapper 6 min
  • Editing Existing Simple Mapper Maps 4 min
  • Using the Simple Mapper Expression Builder 6 min
  • Files 6 min
  • Jobs 3 min
  • Macros 6 min
  • Agents 4 min
  • Scheduling Jobs wit a CRON Expression 3 min
  • Job Log 3 min
  • Native Data Integration Templates
  • GCP Storage to GCP Platform Warehouse 5 min
  • GCP Delimited Data File to GCP Platform Warehouse 4 min
  • NetSuite to a GCP Platform Warehouse 3 min
  • DataConnect Integrations
  • Section Objective 1 min
  • DataConnect: Design Integrations on Premise, Deploy and Run in Avalanche Connect
  • Create an Integration using the DataConnect Process Wizard 6 min
  • Process and Map Views 2 min
  • Define Macros and Create a Package 5 min
  • Deploy package in the Cloud and Run the Configuration 3 min
  • Update a Package, Deploy and Run the Updated Package 7 min
  • DataConnect Data Profiler
  • Introduction to Data Profiler 5 min
  • Ingesting Data using External Table Functionality
  • Overview 2 min
  • Load data from a Microsoft Azure Blob Store 5 min
  • Load data (CSV and Parquet) from an Amazon S3 Bucket 7 min
  • Load data from a GCP Bucket using Spark via the External Table SQL Statement 5 min
  • Data Preparation using the Iris Dataset
  • Create a Connection and Supporting Functions 3 min
  • Create a Table 2 min
  • Loading data into a Platform Warehouse table 7 min
  • Copying and Enlarging the Data Set 2 min
  • Optional Content
  • Six Essential Data Preparation Steps for Analytics 1 min
  • Feedback
  • Take Course Survey

About this course

Course Outcome:
You will understand how simple it is to clean, transform and ingest data into a Platform Warehouse using the wide range of native capabilities provided as part of the Actian Data Platform.
Course Style:
The course is provided in a step-by-step fashion to ensure you understand the process to clean and transform and ingest data into Actian Data Platform.
Audience:
For Data Scientists and Data Architects, and similar personas in the organization responsible for business intelligence, analytics, and data visualization.
Prerequisites:
  • No knowledge of the Actian Data Platform is required
  • Some knowledge of the data science workflow is assumed
  • Actian Data Platform login credentials and access to the Warehouse data are assumed to be available to you
Supplementary Resources:

Curriculum158 min

  • Course Objective 1 min
  • Course Admin
  • Prerequisites and Supporting Content 1 min
  • Adding Platform Data Integration IPs to the Warehouse trusted list of IPs 3 min
  • Install the Platform ODBC Data Driver for Windows 7 min
  • Install the Platform ODBC Data Driver for Linux 4 min
  • Install the Platform Data Driver for Apple MAC 5 min
  • Jupyter Notebook: Data Preparation Iris Dataset 1 min
  • Native Data Loading Functionality
  • Section Objective 1 min
  • Data Integration: Load Files
  • Load a local data file 5 min
  • Native Data Integration
  • Overview 6 min
  • Configuration Creation 5 min
  • Simple Mapper 6 min
  • Editing Existing Simple Mapper Maps 4 min
  • Using the Simple Mapper Expression Builder 6 min
  • Files 6 min
  • Jobs 3 min
  • Macros 6 min
  • Agents 4 min
  • Scheduling Jobs wit a CRON Expression 3 min
  • Job Log 3 min
  • Native Data Integration Templates
  • GCP Storage to GCP Platform Warehouse 5 min
  • GCP Delimited Data File to GCP Platform Warehouse 4 min
  • NetSuite to a GCP Platform Warehouse 3 min
  • DataConnect Integrations
  • Section Objective 1 min
  • DataConnect: Design Integrations on Premise, Deploy and Run in Avalanche Connect
  • Create an Integration using the DataConnect Process Wizard 6 min
  • Process and Map Views 2 min
  • Define Macros and Create a Package 5 min
  • Deploy package in the Cloud and Run the Configuration 3 min
  • Update a Package, Deploy and Run the Updated Package 7 min
  • DataConnect Data Profiler
  • Introduction to Data Profiler 5 min
  • Ingesting Data using External Table Functionality
  • Overview 2 min
  • Load data from a Microsoft Azure Blob Store 5 min
  • Load data (CSV and Parquet) from an Amazon S3 Bucket 7 min
  • Load data from a GCP Bucket using Spark via the External Table SQL Statement 5 min
  • Data Preparation using the Iris Dataset
  • Create a Connection and Supporting Functions 3 min
  • Create a Table 2 min
  • Loading data into a Platform Warehouse table 7 min
  • Copying and Enlarging the Data Set 2 min
  • Optional Content
  • Six Essential Data Preparation Steps for Analytics 1 min
  • Feedback
  • Take Course Survey