Datasets in the public domain often require a fair amount of cleansing and curation before they can be used. By contrast, the datasets used in coursework and tutorials are generally pre-cleaned and far better organised than those practitioners encounter in real-world work.
The data challenges that you may encounter include the following: