Skip to main content
The Data Story Guide Help Center home page
Submit a request Get certified
Sign in
  1. The Data Story Guide
  2. Data Files
  3. Characteristics of a Good Data File

Characteristics of a Good Data File

  • Data in a Rectangular Shape, with Appropriate Cases and Varibles
  • Variables are Stored in the Data Set Consistently with their Variable Type
  • The Data File Contains a Unique Identifier (ID Variable)
  • Missing Values are Stored Appropriately
  • Categorical Variables Contain a Complete List of Useful Value Labels
  • Randomizations and Rotations Have Been Removed
  • All Combinations of Piped Inputs Appear as Separate Variables
  • Variable Names are Short, Simple, and Informative
  • Variable Labels are Short, Unique, Informative, and Indicate Variable Set Structure
  • Variables are Correctly Grouped Into Variable Sets
  • Rankings Should Have a Single Variable for Each Option
  • No Irrelevant Data
  • Variables Should Not Contain Multiple Variables (e.g., Comma-separated)
  • Metadata is Consistent When Files Are Revised/Updated

Lorem ipsum dolor sit amet, consectetur adipiscing elit do eiusmod

Get certified
Powered by Zendesk