@im I'm hoping this thread will be the main source of discussion for this topic -- it's a huge undertaking but I think the project would benefit from a clean categorization scheme, even if it's not perfect. The only useful data is clean data, after all.