Build a Custom Provider Dataset

This guide walks you through creating Datasets from custom data sources in NinjaCat’s Data Cloud. From establishing Data Source connections to utilizing the AI-powered auto-matching functionality to map your data efficiently. Then ultimately setting up account matching, sync schedule and requesting data.

Suggest Edits

Establish Custom Data Source

Navigate to Settings > Data Sources.
Select your desired provider.
Follow the instructions in the linked articles

📘
After following the steps to create the data sources, come directly back to this article to see instructions on creating mappings. Don't follow the mapping instructions in the other articles.

Google Sheets

Follow the instructions from the article Create a Google Sheets Data Source until you get to the mapping section

Big Query

Follow the instructions in the article Add a Native Data Source, selecting "Big Query" as the desired provider

Snowflake Share

Follow the instructions in the article Setup Snowflake Share until you get to the mapping section

Configure the Data Connector

Navigate to Data Cloud > Connectors
Click "Add Connector" in the top right
Choose the desired custom Provider
Give your new Connector an original, distinct name
Choose the desired authenticated Provider account
1. Click on "Add Credential" under "Credential"
2. Click on the desired data source
Click "Save Connector"

Create a New Dataset

Click on "Add Dataset" under Datasets

Google Sheets

Provide an original, distinct Dataset Name
Select the Google Sheet you want to import data from

Big Query

Provide an original, distinct Dataset Name
Enter the Big Query Project ID for the desired data source
Enter a query that sources your desired data
Click "Next"

Snowflake Share

Provide an original, distinct Dataset Name
Enter a query that sources your desired data

Auto-Mapping and Data Formatting

NinjaCat will automatically map column headers to known dimensions and metrics using AI-powered auto-matching.
Review the mappings:
- Ensure the correct data types (e.g., currency, integer, percentage) are assigned.
- Modify any mappings as needed.
Use the Refresh Columns option if you need to update the data structure without reloading the data source.
Exclude unnecessary columns if needed.

Save and Account Matching

Click Save to finalize the dataset.
Assign an Account Matching column if applicable.
(Upcoming Google Sheets specific feature) Enable Multiple Sheets Connection for mapping data from different sheets within a single dataset.

Set Sync Schedules

Set the refresh time for the data sync.
(Upcoming feature) Configure multiple sync times per day.

Key Features of the New Custom Datasets

Dataset-First Approach: All mapped data from providers is automatically created as a dataset.
Dataset includes sync history with detailed information on each data sync

AI-Powered Auto-Matching: Suggests optimal column mappings to reduce manual work.
Editable and Refreshable Mappings: Modify datasets without reloading the entire data source.
Legacy & New System Coexistence: Users can still use legacy mappings while transitioning to the new system.

Troubleshooting & Common Issues

Issue: Headers not loading correctly
Solution: Refresh the columns or verify access permissions

Issue: Auto-matching errors
Solution: Manually adjust column mappings and re-save

Issue: Missing data after update
Solution: Ensure that new columns are included in the dataset settings

Issue: Unable to select multiple credentials
Solution: The feature is planned for future updates; currently, use one credential per connection

Updated 6 months ago