Merge Datasets
Give your merged dataset a descriptive name
Select 2 or more datasets and specify which column to join them on. The join column must have the same name in all selected datasets.
Join Types Explained:
- Inner Join: Returns only rows whose join column value appears in both datasets. This is the most restrictive join and typically produces the smallest dataset.
- Left Join: Returns all rows from the left (first) dataset and matching rows from the right dataset. Left rows with no match will have NULL values in the right dataset’s columns.
- Right Join: Returns all rows from the right dataset and matching rows from the left dataset. Right rows with no match will have NULL values in the left dataset’s columns.
- Outer Join: Returns all rows from both datasets. Non-matching rows will have NULL values for the other dataset’s columns.
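The four join types above can be illustrated with a small, self-contained sketch. This is not the tool's actual implementation: the `merge` function, the sample `customers` and `orders` data, and the `customer_id` column are all hypothetical, and Python's `None` stands in for NULL. It also assumes the two datasets share no column names other than the join column.

```python
def merge(left, right, on, how="inner"):
    """Join two lists of row dicts on a shared column (illustrative only).

    None stands in for NULL. Assumes the only column name the two
    datasets have in common is the join column `on`.
    """
    left_cols = [c for c in (left[0] if left else {}) if c != on]
    right_cols = [c for c in (right[0] if right else {}) if c != on]

    # Index right rows by join value so each left row can find its matches.
    right_index = {}
    for row in right:
        right_index.setdefault(row[on], []).append(row)

    merged, matched_keys = [], set()
    for lrow in left:
        matches = right_index.get(lrow[on], [])
        if matches:
            matched_keys.add(lrow[on])
            merged.extend({**lrow, **rrow} for rrow in matches)
        elif how in ("left", "outer"):
            # Left row with no match: right-side columns become NULL.
            merged.append({**lrow, **{c: None for c in right_cols}})

    if how in ("right", "outer"):
        for rrow in right:
            if rrow[on] not in matched_keys:
                # Right row with no match: left-side columns become NULL.
                merged.append({**{c: None for c in left_cols}, **rrow})
    return merged


# Hypothetical sample data: IDs 2 and 3 appear in both datasets.
customers = [
    {"customer_id": 1, "name": "Ana"},
    {"customer_id": 2, "name": "Ben"},
    {"customer_id": 3, "name": "Cy"},
]
orders = [
    {"customer_id": 2, "total": 20.0},
    {"customer_id": 3, "total": 35.5},
    {"customer_id": 4, "total": 12.0},
]

row_counts = {
    how: len(merge(customers, orders, "customer_id", how))
    for how in ("inner", "left", "right", "outer")
}
print(row_counts)  # {'inner': 2, 'left': 3, 'right': 3, 'outer': 4}
```

With this data, the inner join keeps only customers 2 and 3, the left join adds customer 1 with a NULL `total`, the right join adds order 4 with a NULL `name`, and the outer join keeps all four IDs.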
Tips for Successful Merging:
- Ensure the join column has the exact same name in all datasets you want to merge
- The join column should contain matching values across datasets (e.g., customer IDs, product codes)
- Start with 2 datasets to test, then add more if needed
- If you have duplicate column names (other than the join column), they’ll be automatically renamed with the dataset name
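If you prepare data outside the tool, the tips above can be checked ahead of time with a rough sketch like the following. Everything here is an assumption for illustration: `prepare_for_merge` is a hypothetical helper, and the `column_datasetname` pattern is only one possible renaming scheme, since the exact format the tool applies is not specified here.

```python
from collections import Counter


def prepare_for_merge(datasets, on):
    """Pre-merge checks (illustrative only).

    datasets: dict mapping a dataset name to a list of row dicts.
    Returns (renamed_datasets, shared_keys).
    """
    # The join column must exist under the same name in every dataset.
    missing = [
        name for name, rows in datasets.items()
        if not all(on in row for row in rows)
    ]
    if missing:
        raise ValueError(f"join column {on!r} missing from: {missing}")

    # Join values present in every dataset; an empty set means nothing matches.
    shared_keys = set.intersection(
        *({row[on] for row in rows} for rows in datasets.values())
    )

    # Count how many datasets use each non-join column name.
    columns = {
        name: {c for row in rows for c in row}
        for name, rows in datasets.items()
    }
    usage = Counter(c for cols in columns.values() for c in cols if c != on)

    # Hypothetical scheme: suffix duplicated column names with the dataset name.
    renamed = {
        name: [
            {
                (c if c == on or usage[c] == 1 else f"{c}_{name}"): value
                for c, value in row.items()
            }
            for row in rows
        ]
        for name, rows in datasets.items()
    }
    return renamed, shared_keys


# Hypothetical sample data: both datasets have a "status" column.
datasets = {
    "customers": [{"customer_id": 1, "name": "Ana", "status": "active"}],
    "orders": [{"customer_id": 1, "total": 20.0, "status": "shipped"}],
}
renamed, shared_keys = prepare_for_merge(datasets, "customer_id")
```

Here `shared_keys` shows whether the join column actually contains matching values, and `renamed` keeps both `status` columns distinguishable after the merge.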