After problem statement and defining what are the problem you're try to solve do EDA, then you'll be able to define needed: columns, metrics, schemas