Look back to the Web Technologies task view we talked about in the previous section. There are a tremendous number of R packages specifically designed to import data directly from specialized sources on the web. Among these are packages to search for and retrieve the full text of academic articles in the Public Library of Science journals (rplos), search for and download the full text of Wikipedia articles (WikipediR), download data about Berlin from the German government (BerlinData), interface with the Chromosome Counts Database (chromer), download historical financial data (quantmod), and access the information in the PubChem chemistry database (rpubchem).
These examples notwithstanding, given that there are many hundreds of immense repositories of public data, it is far too much to expect the R community to have a package specially built for every single one. Luckily, with the ability to handle many different data formats under our belt, we can just download and import the data from these repositories ourselves. The following are a few of my favorite repositories. Perhaps some of them will have dedicated R packages for handling them by the time you read this.