Monday, 13 June 2016
Data is like money
The particular bit of scientific pedantry that gets my back up is the insistence that the word 'data' should be treated as plural. So scientists will pedantically insist on writing 'the data support the hypothesis' rather than 'the data supports the hypothesis.' To every normal person, the scientists' version is clearly wrong. Because language evolves, and the way we use the word 'data' has evolved too.
I would argue that data has become the same kind of singular collective noun as money. The word 'money' usually refers to more than one thing and we use some plural forms with it - so we say 'I have some money' not 'I have a money'. But we also say 'The money is in the bank,' not 'The money are in the bank.'
This makes a huge amount of sense. There are very clear similarities in the way 'money' and 'data' are used as words. But the trouble with being a pedant is that you can stick with an outdated theory far longer than you should. So those who want data to be plural scratch around for a justification and think they have found one. 'Ah,' they say, 'data has to be plural because it is a Latin word, the plural of datum.' But this is rubbish. Classical plural forms are decreasingly used in English, and have never been definitive. If you really wanted to be pedantic about Classical plurals - and even Fowler thought this was silly - the plural of octopus would be octopodes. Data has become a word we use for something that has nothing to do with its Latin roots.
No, you've lost this one, scientists. Data, as a word, should work just like money does, and it's about time you switched away from this clumsy usage.