Overview and Quickstart
dataset mapper is a formedix ryze plugin which can be used to generate the mapping file used in data transformations dataset mapper extends a define xml file with mapping commands used to configure an extract, transfer, load (etl) system dataset mapper is used to run dataset conversions, see also the docid\ uirdze0pggao0gidi ofd for an example of how you might map proprietary datasets to sdtm, see docid\ t1w53uucbfozehj4xhs26 dataset mapper can be used to map one variable to another you can use it to map from a source dataset to a target dataset, for example, mapping variables from your edc dataset to a standard sdtm dataset dataset mapper can also be used to perform operations on data, for example, to change strings or to compute other data this quickstart guide describes how to upload a source dataset and map variables to the correct standard for submission enable the plugin for your repository dataset mapper must be enabled by an administrator to be available in your repository to enable the plugin for your repository, go to the plugins section under admin > repository > plugins add your source dataset from your study, click upload datasets from the toolbar in this example, we’ll use the sample demographics dataset above (dm source xml) as a guide choose a name for the asset group that is created (whenever you upload a dataset, a new asset group is created) the uploaded dataset should now appear in the study as a new asset group add your target dataset in this example, we’ll map some erroneous variables in the source dataset to the sdtm ig 3 3 standard this standard must be available in your library, see docid\ y3vlpzi tlsfewtsshk3q click associate library and add the sdtm ig 3 3 standard create a new asset group for the target dataset and open the new asset group from the new asset group, click import assets and import the demographics dataset from sdtm ig 3 3 add the plugin to the target asset group plugins are added at the asset group level of a standard or study, see docid 6usratl xv9izvtc8ien9 for more information from the target asset group, click the asset group properties icon add the dataset mapper plugin to the target asset group map the target dataset to the source dataset before we map individual variables, we must first map the dataset from the target dataset, click edit dataset open the plugin properties tab and go to dataset mapper properties add the source dataset by specifying the source dataset identifier see also, docid 3szqr ruwc7 jm7bzvykb example mappings some mapping functions must be in the same format, for example, when example a mapping one variable to another once the dataset source is mapped, we can map individual variables from the target dataset in this example, we’ll map the target variable “siteid” to the source variable “site” from the target dataset, select the variable “siteid” open the plugin properties tab and click add mapping group from dataset mapper properties enter the select source variable from the source type from the source field add “site” when you run a conversion, the variable in the target dataset is populated with data from the source variable example b concatenating project, siteid, and subjecti to derive the unique subject identifier usubjid in this example we’ll use a function to concatenate these variables select the variable usubjid in the target dataset from dataset mapper properties, click add mapping group for this source type, select function under function, click add function group and and select the function name concatenate click add parameter group and fill in the following source type source variable source projecti click add parameter group again and add source type string source click add parameter group and add source type string click add parameter group and add click add parameter group and add source source type source variable source siteid source type source variable source subjeti when you run a conversion, these three variables are concatenated into one in the target dataset example c converting a date string into iso 8601 format in this example the collected date (brthdtc ) is not in iso 8601 format, you can use a function to map the string into the correct format select the variable brthdtc in the target dataset from dataset mapper properties, click add mapping group for this source type, select function under function, click add function group and and select the function name convert date to iso 8601 click add parameter group and fill in the following source type source variable source brthdtc click add parameter group again and fill in the following source type string source dd mmm yy the string parameter must match the formatting of the source string the brthdtc variable is converted to the correct format when you run the conversion process example d compute dy in this example, the number of days since the study start date is calculated the number of days is calculated by comparing two date variables in the source dataset, reference start date (rfstdtc) and date of collection (mbdat) these dates should be in iso 8601 format for this example add a variable in the target dataset called study day count from dataset mapper properties, click add mapping group for this source type, select function under function, select compute dy click add parameter group and fill in the following source type source variable source rfstdtc click add parameter group again and fill in the following source type source variable source mdbat note both source variables must be in the same format, for example, date (2021 10 17) or datetime (2021 10 17t07 44) if they are not in the same format then you must use another function to change the formatting when the conversion is run, the study day count column displays the number of days between the dates specified in the source tutorial mapping multiple source datasets the following video shows an example of pulling data from two datasets (ae and aepbasic) into one dataset (ae a01) the video shows the typical workflow for working with dataset mapper, including the following viewing the source data mapping individual variables from multiple sources (aeterm) using copy mapping group to extend the mapping to other datasets converting date types to iso 8601 format (aestdtc) running test conversions viewing the converted data common dataset mapping properties the following properties are commonly used when mapping datasets include rows include rows allows you to include rows of data depending on user set conditions for example to include rows where a variable has a specific value include rows uses conditions that affect the input if the condition evaluates to true you can specify more than one condition conditions are logic operations and subsequent conditions apply depending on the previous condition and any logic operators the following example would include medical history data where the mhpresp variable is equal to “yes” sort output rows sort output rows allows you to sort rows in the target dataset or output for example, by sorting data by unique subject identifier you can add more than one sort key so that subsets of data are sorted
