Give your merged dataset a descriptive name
Select 2 or more datasets and specify which column to join them on.
The join column must have the same name in all selected datasets.
Join Types Explained:
-
Inner Join: Returns only rows where the join column matches in both datasets.
This is the most restrictive and typically results in the smallest dataset. -
Left Join: Returns all rows from the left (first) dataset and matching rows from the right dataset.
Non-matching rows from the right will have NULL values. -
Right Join: Returns all rows from the right dataset and matching rows from the left dataset.
Non-matching rows from the left will have NULL values. -
Outer Join: Returns all rows from both datasets.
Non-matching rows will have NULL values for the other dataset’s columns.
Tips for Successful Merging:
- Ensure the join column has the exact same name in all datasets you want to merge
- The join column should contain matching values across datasets (e.g., customer IDs, product codes)
- Start with 2 datasets to test, then add more if needed
- If you have duplicate column names (other than the join column), they’ll be automatically renamed with the dataset name