Remember to
- download
portal_mammals.sqlite.- make sure the copy you are going to use in class does not have the
SpeciesCountstable or view.
- Can use dplyrto access data directly from a database.- No need to export files from the database
- Lets the database do the heavy lifting
        - Faster
- No RAM limits
 
 
- Need to install the dbplyrpackage
Installation
install.packages(c("DBI", "dbplyr", "RSQLite"))
Connect
library(dplyr)
portaldb <- src_sqlite("portal_mammals.sqlite")
Check out database structure
portaldb
src_tbls(portaldb)
tbl(portaldb, "plots")
surveys <- tbl(portaldb, "surveys") %>% collect()
surveys
colnames(surveys)
Write a query
- Write a query to extract counts of each genus and species
count_query <- "SELECT genus, species, COUNT(*)
                FROM surveys
                JOIN species
                USING (species_id)
                GROUP BY genus, species"
tbl(portaldb, sql(count_query))
- Queries and data manipulation functions return similar results with various
headings (Source: SQL)
- Number of rows is unknown as shown by ??
Using dplyr with databases
- Can also use dplyrcommands directly on databases
surveys <- tbl(portaldb, "surveys")
surveys
species <- tbl(portaldb, "species")
species_counts <- inner_join(surveys, species, by = "species_id") %>%
               group_by(genus, species) %>%
               summarize(count = n())
- All of the calculation still happens in the databases
- So outside of RAM calculations are possible
Move the final data into R
- Queries and data manipulation results remain in the external database.
- Some calculations can’t be done in the database.
- Use collect()to load the results into R in a local data frame (tibble).
species_counts <- inner_join(surveys, species, by = "species_id") %>%
               group_by(genus, species) %>%
               summarize(count = n()) %>%
               collect()
Write new information to database
- Can also store tables from R in the database use copy_to()
Show
species_countstable NOT inportal_mammals.sqlite.
copy_to(portaldb, species_counts, temporary=FALSE, 
        name="SpeciesCounts")
portaldb
Show
SpeciesCountstable inportal_mammals.sqlitewith new name.
Do Copy to Database.
