Characteristics of a Good Data File
- Data in a Rectangular Shape, with Appropriate Cases and Varibles
- Variables are Stored in the Data Set Consistently with their Variable Type
- The Data File Contains a Unique Identifier (ID Variable)
- Missing Values are Stored Appropriately
- Categorical Variables Contain a Complete List of Useful Value Labels
- Randomizations and Rotations Have Been Removed
- All Combinations of Piped Inputs Appear as Separate Variables
- Variable Names are Short, Simple, and Informative
- Variable Labels are Short, Unique, Informative, and Indicate Variable Set Structure
- Variables are Correctly Grouped Into Variable Sets
- Rankings Should Have a Single Variable for Each Option
- No Irrelevant Data
- Variables Should Not Contain Multiple Variables (e.g., Comma-separated)
- Metadata is Consistent When Files Are Revised/Updated