dss-visual-edit

Dataiku Plugin: Visual Edit

Spin up a visual app to validate and edit data

Applying Dataiku Visual Edit to entity resolution: correcting Matched Entity, marking as Reviewed, adding Comments.

Source code

Use cases

Enable domain experts to…

Features

How to use

Use case description

There are two main types of use cases for the plugin’s Visual Webapp:

  1. Correcting source data
  2. Validating machine-generated data

Let’s focus on the 1st type. We want business users (aka end-users) to edit data based on their domain expertise, and we want to use the edited data for better downstream analytics and reporting. Instead of doing this in Excel, we want end-users to access a web interface. Therefore, we need a front-end for them to see and enter data, and we need to “connect” the data entered via the web front-end to the analytics pipeline.

Create a Visual Edit webapp

Let’s assume that you have a Dataiku DSS project with a dataset whose data you would like end-users to review and edit if corrections are needed. For illustration purposes, we use a blank project to which we upload the t-shirt orders CSV file found in the Basics 101 course of the Dataiku Academy.

Navigate to the Webapps section of your project, which you can find by hovering over the 3rd icon from the project name at the top of the interface. Click the + NEW WEBAPP button, choose VISUAL WEBAPP, then VISUAL EDIT. Give a name to your webapp and click CREATE.

List of available Visual Webapp types in Dataiku, shown when creating a new Visual Webapp, filtered by “visual edit”.

You are then taken to the webapp’s Edit tab:

“Edit” tab of the Dataiku Visual Edit webapp, where parameters were set for the tshirt orders dataset.

“View” tab of the Dataiku Visual Edit webapp showing the tshirt orders dataset. Editable columns are shown in blue.

You can find a description of all end-user features of the data table here. The webapp can be shared with Dataiku users on a Reader license or above.

Use edits in the Flow

3 datasets are created upon starting the webapp (if they don’t already exist), using the same Connection as the original dataset’s. Their names start with the original dataset’s name. Let’s review them by their suffix:

Dataiku Flow showing the tshirt orders dataset along with the 3 datasets created by the Visual Edit webapp, and plugin recipes that connect them.