Creating A Data Model
What's on This Page:
About Creating A Data Model
After creating a new Employee Journey Analytics project from scratch, the next step is to set up a data model to join two or more of your existing datasets. Within the data model you are able to add datasets, join datasets, filter your data, and create an output dataset that will be used for analysis.
Adding & Editing Sources
The first step when creating your data model is adding a source.
- Navigate to the Data Model tab within your Employee Journey Analytics project.
- Click Add Source.
- Click the checkbox next to all the sources you want to add to your data model. Clicking on a specific source will display details about that source’s data. Uncheck the box next to a source to remove it from your data model.
Qtip: Use the search bar to search for projects by name.Attention: Only projects you have access to will appear in this list. If any projects are not showing up, make sure they have been collaborated with you. - Select all the fields you want to include in your dataset. You can click on different types of data, like “Metadata”, to see only those fields, or use the search bar to search for specific fields.
Qtip: When selecting fields to include in your data model, think about which fields will be important when analyzing your data. Fields that are commonly included are survey questions, important metadata, and the field you plan on using to join your datasets.
- When you have finished selecting fields, click anywhere on your data model to return to it. Your selections will be saved automatically.
- Repeat steps 2-5 for each project you want to add as a data source.
Qtip: If you are adding two or more surveys to your data model that contain the same employee and organization data, you may only want to include one set of this data. Including the same fields twice will create duplicates in your output dataset and dashboard.
Editing a Source
- Input dataset details: This tab shows the source name of the dataset, the number of records in the source, and the number of fields in the source. You can also edit the output name of the source, which is what appears in the data model.
Qtip: Editing the output name will not change the name of the original source.
- Schema: This tab shows the fields you selected when adding the source, as well as their field type.
Qtip: Click Manage fields to add or remove fields from the dataset.
- Preview: This tab shows a preview of the records in this source.
Removing a Source
If you have added a source to your data model but decide you no longer want it, click the x to remove it.
Modifying Data Sources
The elements of your data model can be modified by filtering rows or editing columns. Both of these options affect which data will appear in the outcome dataset. For example, you may want to add a filter to your sources so that only data from the last 12 months is displayed, allowing you to analyze trends over the last year.
Filtering Rows
This section covers the basics of adding a filter within your data model. For more information on filters and building filter conditions, see Filtering Responses.
- Click the plus sign (+) next to the element you want to filter.
- Click Filter rows.
- Click into the Output field to edit the name of the filter if desired.
- Choose whether all conditions need to be met for a record to appear in a filter, or if only some of them need to be met. See All vs. Any for more information.
- Select a data type for the filter. If you want to create a filter based on fields from a survey project, select either Embedded data or Survey metadata. If you want to create a filter based on fields from an Imported Data Project, select Others.
- Select a specific field for the data type you selected.
- Select a condition for the filter.
- Select a value for the filter. In the above screenshot, the joined data is being filtered to only include employees that are in the Seattle office.
- Click the three horizontal dots next to the first condition to add additional conditions or condition sets, or delete the condition.
- If desired, click the Schema tab to view the fields in your dataset, as well as their field type.
- When you have finished setting up your filter, click anywhere on your data model to return to it. Your selections will be saved automatically.
Editing Columns
- Click the plus sign (+) next to the element you want to filter.
- Click Edit columns.
- Click the checkbox next to a column to remove it from the output dataset.
Qtip: Use the search bar to search for specific fields, or click Filter by to filter by specific data types or datasets. - Click the trash icon to remove the selected fields.
- To change the field type of a field, click the dropdown and select a different field type.
Qtip: If you select Date as the field type, you will have to specify a date format. Choose a format from the dropdown and then click Apply.
- If desired, click the Schema tab to view the fields in your dataset, as well as their field type.
- When you have finished editing your columns, click anywhere on your data model to return to it. Your changes will be saved automatically.
Combining Data Sources
Once you have added at least two sources to your data model, you can begin combining them.
- While on the Data Model tab of your project, click the plus sign (+) next to one of the datasets you want to combine.
- Click Join.
- Select a Join Type. This will determine how your two datasets are combined.
The following join types are available:- Inner: The merged dataset will only include matching rows found in both datasets.
- Left outer: The merged dataset will include all rows from the left input and matching rows found in the right input.
- Right outer: The merged dataset will include all rows from the right input and matching rows found in the left input.
- Full outer: The merged dataset will include all rows found in both datasets.
- Select a data source for your right input. Only data sources that have been added to your data model will show up here.
Qtip: When setting up a join, only data sources that have not already been selected in previous joins will appear in the dropdown.
- Set the join condition you want to use to combine your data sources. Click Select one to open the dropdown and select a field from each data source. This field should be something that is the same in both datasets, such as employee email or unique ID.
Qtip: Click the plus sign (+) to add additional join conditions.
- If desired, click the Schema tab to view the fields in your combined dataset, as well as their field type.
- When you have finished setting up your join, click anywhere on your data model to return to it. Your selections will be saved automatically.
Repeat the above steps to combine additional datasets. You can also combine joined datasets to keep creating a larger output dataset, as shown below.
Adding an Output Dataset
Once you have finished combining and modifying your data, it is time to add an output dataset.
- Click the plus sign next to the last element in your data model.
- Select Output dataset.
- If desired, click the Schema tab to view the fields in your dataset, as well as their field type.
Attention: There is a limit of 1,000 columns for your output dataset. - Click anywhere on your data model to return to it.
Previewing Your Data Model
After creating your data model, click Run preview to generate your output dataset.
This may take a while to generate; when the preview is complete you will be taken to a preview of your dataset.