What is Messy Data?
Inconsistent formats, unnecessary white space, extra characters, typos, etc…
Each row contains exactly the same info:
2015-10-14 | $1,000 | PA |
10/14/2015 | 1000 | Pa. |
10/14/15 | 1,000 | US-PA |
Oct 14, 2015 | 1000 dollars | pennsylvania |
Wed, Oct 14th | US$1000 | Pennsylvania, |
42291 | $1k | Pennsylvania |
Multivalued cells limit ability to manipulate and use the data:
Giuseppe Acerbi, Joseph Acerbi, Signor Acerbi | traveller, literary, naturalist, composer |