SQLite and sqlite3
Monday, Apr 13
- Understand sqlite3 commands
- Join tables with different join operations
- Create sub queries
- Understand SQL vs NoSQL
sparklyr part I
Wednesday, Apr 15
- Understand what Spark is and what it does
- Connect and interact with Spark through R
dplyr as an interface to Spark DataFrames
Friday, Apr 17
Exercise of the week
Use oil that you created in Monday’s slides to perform one SQL query that returns the mean oil price and mean nuclear tests per year for all years where both oil price and nuclear test data are available.
Find a large dataset (more than 10 million rows). Use it to create a table in a sqlite3 database. Perform a timed search and record the time. Next, create indexes for this table and perform the same search again. How does the search time change?
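As a starting point for the indexing exercise, here is a minimal sketch of the timed-search workflow using Python's built-in sqlite3 module. Everything named here is an assumption made for illustration: the table readings, its columns, the index name idx_readings_station, and the small synthetic dataset that stands in for the 10-million-row dataset you are asked to find. It shows one way to time the same query before and after creating an index, and to check the query plan to see whether the index is actually used.

```python
import sqlite3
import time

# Hypothetical table and column names; a small synthetic table stands in
# for the much larger dataset that the exercise asks for.
N_ROWS = 200_000

con = sqlite3.connect(":memory:")
con.execute("CREATE TABLE readings (id INTEGER, station INTEGER, value REAL)")
con.executemany(
    "INSERT INTO readings VALUES (?, ?, ?)",
    ((i, i % 1000, i * 0.5) for i in range(N_ROWS)),
)
con.commit()

QUERY = "SELECT COUNT(*), AVG(value) FROM readings WHERE station = ?"


def timed_search(station):
    """Run the search once and return (result row, elapsed seconds)."""
    start = time.perf_counter()
    row = con.execute(QUERY, (station,)).fetchone()
    return row, time.perf_counter() - start


def query_plan():
    """Return the human-readable plan SQLite reports for the search."""
    rows = con.execute("EXPLAIN QUERY PLAN " + QUERY, (42,)).fetchall()
    # The last column of each plan row holds the description text.
    return " ".join(r[3] for r in rows)


# Search without an index: SQLite has to scan the whole table.
result_before, secs_before = timed_search(42)
plan_before = query_plan()

# Create an index on the column used in the WHERE clause.
con.execute("CREATE INDEX idx_readings_station ON readings (station)")

# Same search again, now with the index available.
result_after, secs_after = timed_search(42)
plan_after = query_plan()

print(f"no index:   {secs_before:.4f} s  plan: {plan_before}")
print(f"with index: {secs_after:.4f} s  plan: {plan_after}")
```

On a table this small the timing difference may be hard to see, which is why the exercise asks for a much larger dataset. With real data you would likely connect to a database file instead of ":memory:" and repeat each search several times, since a single timing can be noisy.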