Skip to main content

Join

Join

The Join transformer allows user to join tables by selecting the datasets , the columns and type of join. for example if there are two datasets based on covid and the user wants to join both of them and the common column is country in both the datasets then state column can be selected as a join column and the join type can be selected according to the requirement.

Join Tables Transform dialog is shown as below:

Join

  1. Enter Title like most of the other transforms. Ensure that the Title is uniquely identifying the step
  2. Enter Description to provide further information for this particular step. This is helpful in ensuring the transform step is readable in a complex workflow, ensuring readability. This step however, is Optional.
  3. Enter Datasources gives an option to the user to select which two tables from the steps view should be joined.As the user selects the Tables it gets updated in the view.
  4. Click on Join Type There are four icons seen in between the table selection. We support left,right ,inner and outer join for now.
  5. Click Create to add the step to the recipe.

JoinFilled

info

The default join if you choose any option would be left join

note
  1. Supported Join types include left, right, inner and outer.
  2. Multiple columns can be selected in both the Tables.
  3. The chain icon seen the join view can be used to see how many columns are selected for the Join.
warning

There are few errors users should be aware of and these errors can be seen in the console:

  1. if there is a datatype mismatch between the two selected columns the you will get an error called : datatype mismatch between left and right dataframes
  2. if either of the selected columns have null values in them then you will get an error saying : Null check failed in right dataframe.
  3. There is a uniqueness check in the join where it searches for unique keys in the column, if there is no unique key in either of the selected columns then you see an error about "One to Many" and "Many to one"relationship errors saying : 1. Column is not unique in left frame 2. Column is not unique in right frame.
  4. If equal number of columns are not selected in the frame then you will get an error saying : Equal number of columns are not selected in both the dataframes.

Get the power of futuristic Data & AI Platform for your enterprise.