• Dr. CliniData
  • Posts
  • Splitting Domains in SDTMIG: The Secret That Experts Rarely Talk About!

Splitting Domains in SDTMIG: The Secret That Experts Rarely Talk About!

Discover the SDTMIG trick that experts rarely talk about – Simple rules for splitting domains in SDTMIG

🌟 Hey, friends! 🌟

Welcome to this week’s episode of Dr. Clinidata! Hope your week was epic! πŸš€

In clinical trials, when dealing with large datasets, it's sometimes necessary to split a domain into smaller, more manageable datasets. Here’s a quick overview of how and when you can split a domain, following the rules set by SDTM Implementation Guide (SDTMIG).

When Can a Domain Be Split?

  • Based on --CAT: A domain can be split according to the values in the category (--CAT) variable. However, remember, the --CAT value should never be left blank.

  • Findings About (FA) Domain: This domain can be split based on the values of the --OBJ variable. For example, findings about medical history (MH) records can be stored in the FAMH domain.

Key Rules for Splitting Domains:

  1. Consistency in Domain Name: The value of DOMAIN should remain consistent across all split datasets, just as it would have if the domain wasn't split.

  2. Variable Prefix: All variables requiring a domain prefix (e.g., --CAT, --TESTCD) must use the domain name as a prefix (e.g., QSCAT, VSTESTCD).

  3. Unique --SEQ Variable: The --SEQ value should be unique within each USUBJID across all split datasets. Even if there are 100 records for a subject, each must have a unique --SEQ value.

  4. Linking with Relationship Datasets: When linking relationship datasets like SUPPxx, FAxx, or RELREC to the parent domain, ensure that the IDVAR used (e.g., --SEQ, --SPID, --GRPID) allows smooth joining with the parent domain.

  5. Variable Distribution Across Splits: Some permissible variables included in one split dataset don't need to be included in every split.

  6. Naming the Split Dataset: You can use up to 4 characters to name a split dataset (e.g., QSPI for 'Questionnaire on Pain Intensity'). This naming approach helps in maintaining compatibility with supplemental qualifier datasets.

  7. Supplemental Qualifiers for Split Domains: When splitting domains, even the supplemental qualifiers (SUPPxx) must be split. The naming follows the same 1 or 2 extra character approach (e.g., SUPPQSPI).

  8. RELREC Relationships for Split Datasets: If a dataset-level relationship exists for a split domain, the RDOMAIN variable in RELREC may contain the full 4-character split dataset name instead of the usual 2-character domain code.

Got opinions? Click the button and share them. We totally can't wait to hear all about it! πŸ‘‡

THANKS FOR SURVIVING! SEE YOU NEXT WEEK!

Reply

or to participate.